Your browser does not support javascript! Please enable it, otherwise web will not work for you.

Data Engineer @ 7dxperts

Home > Data Science & Machine Learning

 Data Engineer

Job Description

Role & responsibilities

  • 3+ years of experience in Spark, Databricks, Hadoop, Data and ML Engineering.
  • 3+ Years on experience in designing architectures using AWS cloud services & Databricks.
  • Architecture, design and build Big Data Platform (Data Lake / Data Warehouse / Lake house) using Databricks services and integrating with wider AWS cloud services.
  • Knowledge & experience in infrastructure as code and CI/CD pipeline to build and deploy data platform tech stack and solution.
  • Hands-on spark experience in supporting and developing Data Engineering (ETL/ELT) and Machine learning (ML) solutions using Python, Spark, Scala or R languages.
  • Distributed system fundamentals and optimising Spark distributed computing.
  • Experience in setting up batch and streams data pipeline using Databricks DLT, jobs and streams.
  • Understand the concepts and principles of data modelling, Database, tables and can produce, maintain, and update relevant data models across multiple subject areas.
  • Design, build and test medium to complex or large-scale data pipelines (ETL/ELT) based on feeds from multiple systems using a range of different storage technologies and/or access methods, implement data quality validation and to create repeatable and reusable pipelines
  • Experience in designing metadata repositories, understanding range of metadata tools and technologies to implement metadata repositories and working with metadata.
  • Understand the concepts of build automation, implementing automation pipelines to build, test and deploy changes to higher environments.
  • Define and execute test cases, scripts and understand the role of testing and how it works.

Preferred candidate profile

  • Big Data technologies Databricks, Spark, Hadoop, EMR or Hortonworks.
  • Solid hands-on experience in programming languages Python, Spark, SQL, Spark SQL, Spark Streaming, Hive and Presto
  • Experience in different Databricks components and API like notebooks, jobs, DLT, interactive and jobs cluster, SQL warehouse, policies, secrets, dbfs, Hive Metastore, Glue Metastore, Unity Catalog and ML Flow.
  • Knowledge and experience in AWS Lambda, VPC, S3, EC2, API Gateway, IAM users, roles & policies, Cognito, Application Load Balancer, Glue, Redshift, Spectrum, Athena and Kinesis.
  • Experience in using source control tools like git, bit bucket or AWS code commit and automation tools like Jenkins, AWS Code build and Code deploy.
  • Hands-on experience in terraform and Databricks API to automate infrastructure stack.
  • Experience in implementing CI/CD pipeline and ML Ops pipeline using Git, Git actions or Jenkins.
  • Experience in delivering project artifacts like design documents, test cases, traceability matrix and low-level design documents.
  • Build references architectures, how-tos, and demo applications for customers.
  • Ready to complete certifications

Job Classification

Industry: IT Services & Consulting
Functional Area / Department: Data Science & Analytics
Role Category: Data Science & Machine Learning
Role: Data Engineer
Employement Type: Full time

Contact Details:

Company: 7dxperts
Location(s): Bengaluru

+ View Contactajax loader


Keyskills:   Data Engineering Data Bricks Python ML ML Engineering Pyspark MLops Ci Cd Pipeline GIT Machine Learning SQL

 Fraud Alert to job seekers!

₹ 15-20 Lacs P.A

Similar positions

Staff, Data Scientist

  • Walmart
  • 5 - 10 years
  • Bengaluru
  • 3 days ago
₹ Not Disclosed

Data Engineer

  • Tek Ninjas
  • 8 - 13 years
  • Pune
  • 3 days ago
₹ Not Disclosed

Machine Learning Engineer - Python/Tensorflow

  • Vayuz Technologies
  • 4 - 5 years
  • Agra
  • 3 days ago
₹ Not Disclosed

Machine Learning Engineer - Python/Tensorflow

  • Vayuz Technologies
  • 4 - 5 years
  • Mumbai
  • 3 days ago
₹ Not Disclosed

7dxperts

7Dxperts specializes in utilizing all types of dimensions of data to tackle challenging questions in the field of Data/Spatial/ML. Our customers can be engaged for insight projects to prove immediate value, build data-driven solutions to target specific problems, or build capability, operation &...