As a Data Governance Architect, your work is a combination of hands-on contribution, customer engagement, and technical team management. Overall, you ll design, architect, deploy, and maintain big data-based data governance solutions.
More specifically, this will involve:
Technical management across the full life cycle of big data-based data governance projects
from requirement gathering and analysis to platform selection, design of the architecture, and
deployment.
Scaling the solution in a cloud-based infrastructure.
Collaborating with business consultants, data scientists, engineers, and developers to develop data solutions.
Exploring new technologies for creative business problem-solving
Leading and mentoring a team of data governance engineers
What do we expect
10+ years of technical experience with 5+ years in the Hadoop ecosystem and 3+ years in Data
Governance Solutions
Hands-on experience with Data Governance Solutions with a good understanding of the below
Data Catalog
Business Glossary
Business metadata, technical metadata, operational Metadata
Data Quality
Data Profiling
Data Lineage
Hands-on experience with the following technologies:
Hadoop ecosystem - HDFS, Hive, Sqoop, Kafka, ELK Stack, etc
Spark, Scala, Python, and core/advanced Java
Relevant AWS/GCP components required to build big data solutions
Good to know: Databricks, Snowflake
Familiarity working with:
Designing/building large cloud-computing infrastructure solutions (in AWS/GCP)
Data lake design and implementation
Full life cycle of a Hadoop solution
Distributed computing and parallel processing environments
HDFS administration, configuration management, monitoring, debugging, and performance tuning