Your browser does not support javascript! Please enable it, otherwise web will not work for you.

Data Engineer @ InfoVision Inc

Home > Software Development

 Data Engineer

Job Description

We are seeking a skilled Data Engineer with extensive experience in the Cloudera Data Platform (CDP) to join our dynamic team. The ideal candidate will have over four years of experience in designing, developing, and managing data pipelines, and will be proficient in big data technologies. This role requires a deep understanding of data engineering best practices and a passion for optimizing data flow and collection across a diverse range of sources.

Required Skills and Qualifications:

  • Experience: 4+ years of experience in data engineering, with a strong focus on big data technologies.
  • Cloudera Expertise: Proficient in Cloudera Data Platform (CDP) and its ecosystem, including Hadoop, Spark, HDFS, Hive, Impala, and other relevant tools.
  • Programming Languages: Strong programming skills in Python, Scala, or Java.
  • ETL Tools: Experience with ETL tools and processes.
  • Data Warehousing: Knowledge of data warehousing concepts and experience with data modeling.
  • SQL: Advanced SQL skills for querying and manipulating large datasets.
  • Linux/Unix: Proficiency in Linux/Unix shell scripting.
  • Version Control: Familiarity with version control systems like Git.
  • Problem-Solving: Strong analytical and problem-solving skills.
  • Communication: Excellent verbal and written communication skills, with the ability to explain complex technical concepts to non-technical stakeholders.

 

Preferred Qualifications:

  • Cloud Experience: Experience with cloud platforms such as AWS, Azure, or Google Cloud.
  • Data Streaming: Experience with real-time data streaming technologies like Kafka.
  • DevOps: Familiarity with DevOps practices and tools such as Docker, Kubernetes, and CI/CD pipelines.

Education:

  • Bachelors degree in computer science, Information Technology, or a related field.

Main Skill:

Hadoop, Spark,Hive,Impala,Scala,Python,Java,Linux


Roles and Responsibilities
  • Develop and maintain scalable data pipelines using Cloudera Data Platform (CDP) components.
  • Design and implement ETL processes to extract, transform, and load data from various data sources into the data lake or data warehouse.
  • Optimize and troubleshoot data workflows for performance and efficiency.
  • Manage and administer Hadoop clusters within the Cloudera environment.
  • Monitor and ensure the health and performance of the Cloudera platform.
  • Implement data security best practices, including encryption, data masking, and user access control.
  • Work closely with data scientists, analysts, and other stakeholders to understand data requirements and provide the necessary support.
  • Collaborate with cross-functional teams to design and deploy big data solutions that meet business needs.
  • Participate in code reviews, provide feedback, and contribute to team knowledge sharing.
  • Create and maintain comprehensive documentation of data engineering processes, data architecture, and system configurations.
  • Provide support for production data pipelines, including troubleshooting and resolving issues as they arise.
  • Train and mentor junior data engineers, fostering a culture of continuous learning and improvement.
  • Stay up to date with the latest industry trends and technologies related to data engineering and big data.
  • Propose and implement improvements to existing data pipelines and architectures.
  • Explore and integrate new tools and technologies to enhance the capabilities of the data engineering team.

Job Classification

Industry: IT Services & Consulting
Functional Area / Department: Engineering - Software & QA
Role Category: Software Development
Role: Data Engineer
Employement Type: Full time

Contact Details:

Company: InfoVision Inc
Location(s): Hyderabad

+ View Contactajax loader


Keyskills:   hive cloudera continuous integration bigdata frameworks data analytical scala big data technologies ci/cd data pipeline sql java unix shell scripting spark linux design hadoop big data programming communication skills cd python development impala data engineering hdfs aws

 Fraud Alert to job seekers!

₹ Not Disclosed

Similar positions

Engineering Manager

  • Cognizant
  • 11 - 13 years
  • Hyderabad
  • 2 days ago
₹ Not Disclosed

Web Solutions Engineer

  • Google
  • 4 - 9 years
  • Hyderabad
  • 7 days ago
₹ Not Disclosed

Software Engineering Manager, Core Dev Rust

  • Google
  • 8 - 13 years
  • Bengaluru
  • 7 days ago
₹ Not Disclosed

Software Engineer, Silicon Software Platform

  • Google
  • 5 - 10 years
  • Bengaluru
  • 7 days ago
₹ Not Disclosed

InfoVision Inc

Info vision Software Solutions (India) Pvt. Ltd