Your browser does not support javascript! Please enable it, otherwise web will not work for you.

Site Reliability Engineer IV @ Avalara Technologies

Home > Devops

 Site Reliability Engineer IV

Job Description

What You'll Do

As a member of our Reliability Engineering Product SRE team, you will build production applications with excellent MVRs and SMMs, ensuring customer satisfaction through your expertise in SRE domain skills. We are looking for someone who is passionate about automation, efficiency, and. You will use a bundled tech stack to provide deep visibility into customer, product, and infrastructure interactions. You will have for SLOs, SLIs, Service level agreements, and the golden metrics that move reliability. You will programmatically approach MVRs using coding and scripting languages, while also applying AI/ML-driven insights where applicable.


What Your Responsibilities Will Be
  • Build products with MVRs and reliability standards, ensuring system resilience and scalability.
  • Operate observability tools across multiple cloud providers, incorporating AI-powered anomaly detection to enhance monitoring.
  • Assist development teams in defining SLO/SLI dashboards and alerts, optimizing alerting signals with ML-based noise reduction techniques.
  • Use Go, Python, or Terraform to automate operational tasks and build self-healing mechanisms.
  • Administer Grafana, Prometheus, Loki, and other observability tools, integrating predictive analytics where beneficial.
  • Troubleshoot and support production environments, using AI-assisted diagnostics where applicable for faster cause identification.
  • Automate incident response workflows, applying AIOps to reduce manual toil and improve MTTR.

What You'll Need to be Successful
  • 5 years' experience in a SaaS environment.
  • Bachelor's degree or equivalent experience.
  • Participate in an on-call rotation.
  • Experience with networking (OSI model, TCP/IP, DNS), in cloud environments.
  • Experience with Linux administration, security hardening, and performance tuning.
  • Experience with troubleshooting distributed software failures.
  • Experience with observability principles, including log analysis, tracing, and metrics correlation.
  • Background in infrastructure as code (Terraform, Pulumi) and container orchestration (Kubernetes, ECS, Nomad).
  • Interest in AI-powered automation, including AIOps tools, ML-based alert tuning, and predictive maintenance.
  • Experience with Observability tools like Prometheus,grafana or OpenTelemetry with ML-based anomaly detection.
  • Excellent technical writing skills for documenting architectures, processes, and automation workflows.

Job Classification

Industry: IT Services & Consulting
Functional Area / Department: Engineering - Software & QA
Role Category: DevOps
Role: Site Reliability Engineer
Employement Type: Full time

Contact Details:

Company: Avalara Technologies
Location(s): Pune

+ View Contactajax loader


Keyskills:   Site Reliability Engineering Nomad ECS SaaS DNS TCP/IP AIOps Kubernetes Linux administration

 Fraud Alert to job seekers!

₹ Not Disclosed

Similar positions

Cloud Platform Engineer

  • Accenture
  • 7 - 12 years
  • Hyderabad
  • 3 days ago
₹ Not Disclosed

Azure Devops Engineer

  • Vlink
  • 3 - 8 years
  • Noida, Gurugram
  • 3 days ago
₹ 20-25 Lacs P.A.

Cloud Engineer

  • Cradlepoint
  • 4 - 8 years
  • Noida, Gurugram
  • 3 days ago
₹ Not Disclosed

Senior Cloud DevOps Engineer

  • NICE
  • 4 - 7 years
  • Pune
  • 3 days ago
₹ Not Disclosed

Avalara Technologies

If youre thinking scale, think bigger and dont stop there. At Walmart Global Tech India, we dont just innovate, we enable transformations across stores and different channels for the Walmart experience. \\r\\n \\r\\nA regular day at Walmart Global Tech India means using technology to deliver leadin...