Your browser does not support javascript! Please enable it, otherwise web will not work for you.

Senior Data Engineer @ Leewayhertz

Home > Engineering

 Senior Data Engineer

Job Description

Job Description


We are seeking a highly skilled Senior Data Engineer with deep expertise in AWS data services, data wrangling using Python & PySpark, and a solid understanding of data governance, lineage, and quality frameworks. The ideal candidate will have a proven track record of delivering end-to-end data pipelines for logistics, supply chain, enterprise finance, or B2B analytics use cases.

Role & responsibilities

  • Design, build, and optimize ETL pipelines using AWS Glue 3.0+ and PySpark.
  • Implement scalable and secure data lakes using Amazon S3, following bronze/silver/gold zoning.
  • Write performant SQL using AWS Athena (Presto) with CTEs, window functions, and aggregations.
  • Take full ownership from ingestion transformation validation metadata documentation dashboard-ready output.
  • Build pipelines that are not just performant, but audit-ready and metadata-rich from the first version.
  • Integrate classification tags and ownership metadata into all columns using AWS Glue Catalog tagging conventions.
  • Ensure no pipeline moves to QA or BI team without validation logs and field-level metadata completed.
  • Develop job orchestration workflows using AWS Step Functions integrated with EventBridge or CloudWatch.
  • Manage schemas and metadata using AWS Glue Data Catalog.
  • Take full ownership from ingestion transformation validation metadata documentation dashboard-ready output.
  • Ensure no pipeline moves to QA or BI team without validation logs and field-level metadata completed.
  • Enforce data quality using Great Expectations, with checks for null %, ranges, and referential rules.
  • Ensure data lineage with OpenMetadata or Amundsen and add metadata classifications (e.g., PII, KPIs).
  • Collaborate with data scientists on ML pipelines, handling JSON/Parquet I/O and feature engineering.
  • Must understand how to prepare flattened, filterable datasets for BI tools like Sigma, Power BI, or Tableau.
  • Interpret business metrics such as forecasted revenue, margin trends, occupancy/utilization, and volatility.
  • Work with consultants, QA, and business teams to finalize KPIs and logic.
  • Build pipelines that are not just performant, but audit-ready and metadata-rich from the first version.
  • Integrate classification tags and ownership metadata into all columns using AWS Glue Catalog tagging conventions.

Preferred candidate profile

  • Strong hands-on experience with AWS: Glue, S3, Athena, Step Functions, EventBridge, CloudWatch, Glue Data Catalog.
  • Programming skills in Python 3.x, PySpark, and SQL (Athena/Presto).
  • Proficient with Pandas and NumPy for data wrangling, feature extraction, and time series slicing.
  • Strong command over data governance tools like Great Expectations, OpenMetadata / Amundsen.
  • Familiarity with tagging sensitive metadata (PII, KPIs, model inputs).
  • Capable of creating audit logs for QA and rejected data.
  • Experience in feature engineering rolling averages, deltas, and time-window tagging.

BI-readiness with Sigma, with exposure to Power BI / Tableau (nice to have).

Job Classification

Industry: IT Services & Consulting
Functional Area / Department: Production, Manufacturing & Engineering
Role Category: Engineering
Role: Engineering - Other
Employement Type: Full time

Contact Details:

Company: Leewayhertz
Location(s): Noida, Gurugram

+ View Contactajax loader


Keyskills:   AWS Data Ingestion Pyspark Data Quality Data Lineage Metadata Management Redshift Aws Metadata Great Expectation Aws Glue Data Governance Python

 Fraud Alert to job seekers!

₹ 15-30 Lacs P.A

Similar positions

Primary Design Engineer - Substation

  • Hitachi Energy
  • 6 - 11 years
  • Chennai
  • 23 hours ago
₹ Not Disclosed

Piping Design Engineer

  • Quest Global
  • 3 - 6 years
  • Bengaluru
  • 12 hours ago
₹ -8 Lacs P.A.

Integrity Engineer - Corrosion

  • Quest Global
  • 5 - 10 years
  • Bengaluru
  • 14 hours ago
₹ Not Disclosed

Integrity Engineer - Pipeline

  • Quest Global
  • 5 - 10 years
  • Bengaluru
  • 21 hours ago
₹ 50,000 P.A.

Leewayhertz

LeewayHertz Technologies Private Limited