Keyskills: sql spark python relational databases aws cloudera hive scala pyspark data warehousing emr azure data factory plsql aws cloud hadoop etl big data data lake snowflake talend microsoft azure warehouse data engineering aws glue data bricks sqoop