About the Role
We are looking for a highly skilled Site Reliability Engineer (SRE) to lead the implementation and management of our observability stack across Azure-hosted infrastructure and .NET Core applications. This role focuses on configuring and managing OpenTelemetry, Prometheus, Loki, and Tempo, and on setting up robust alerting across all services, including Azure infrastructure and MSSQL databases.
You will work closely with developers, DevOps, and infrastructure teams to ensure the performance, reliability, and visibility of our .NET Core applications and cloud services.
Key Responsibilities
Observability Platform Implementation:
Required Skills and Experience
Preferred Qualifications
Key skills: Azure, Terraform, Tempo, AZ-400, Loki, Prometheus, .NET, OpenTelemetry
Keka Technologies Pvt Ltd
About Us: Keka has grown super-fast to become the leading HR Tech product, thanks to our people and customers. We are here to transform businesses in India by empowering HR and employees with the right tools, so they can focus on doing their best. We are unstoppable and ar...