Any Project specific Prerequisite skills Spark Delta Lake
Detailed JD Understanding of Spark core concepts like RDDs DataFrames DataSets SparkSQL and Spark StreamingExperience with Spark optimization techniquesDeep knowledge of Delta Lake features like time travel schema evolution data partitioningAbility to design and implement data pipelines using Spark and Delta Lake as the data storage layerProficiency in Python Scala Java for Spark development and integrate with ETL processKnowledge of data ingestion techniques from various sources flat files CSV API databaseUnderstanding of data quality best practices and data validation techniquesOther SkillsUnderstanding of data warehouse concepts data modelling techniquesExpertise in Git for code managementFamiliarity with CICD pipelines and containerization technologiesNice to have experience using data integration tools like DataStage Prophecy InformaticaAb Initio
Employement Category:
Employement Type: Full timeIndustry: IT Services & ConsultingRole Category: Application Programming / MaintenanceFunctional Area: Not SpecifiedRole/Responsibilies: Spark delta lake