Senior Site Reliability Engineer @ Aviato Consulting

Home > Devops

Senior Site Reliability Engineer

Aviato Consulting
7 - 12 years
Bengaluru
2 days ago
Email to a friend
Report this job

Job Description

Interested candidates can directly apply through our careers page:

https://careers.aviato.consulting/jobs/cm5xeghu601jtqmznz5zmlr38

Aviato Consulting is seeking an experienced Senior Site Reliability Engineer to join our growing team. This isn't just another SRE role; it's an opportunity to own critical infrastructure, drive technical strategy, and shape the reliability culture for major Australian and EU clients, all within a supportive, G-inspired environment built on transparency and collaboration.

What's In It For You?

Learn from the Best: Report directly to and receive mentorship from our Head of SRE, an experienced ex-Google SRE Manager. Gain invaluable insights into scaling, reliability, and leadership honed at one of the world's tech giants.
High-Impact Projects: Take ownership of complex GCP environments for diverse, significant clients across Australia and the EU. Your work directly influences the stability and performance of critical systems.
Drive Innovation, Not Just Tickets: We empower our Senior SREs to think strategically. You'll architect solutions, implement cutting-edge practices (SLOs, error budgets, advanced automation), and proactively improve systems, not just react to issues.
A Culture That Works: Founded by ex-Googlers, we foster a transparent, collaborative, and low-bureaucracy environment where doing the right thing matters. We value SRE principles and give you the autonomy to implement them effectively.
Cutting-Edge Tech: Deepen your expertise with GCP, Kubernetes, Terraform, modern observability tooling (Grafana, Dynatrace, Sentry), and sophisticated CI/CD pipelines.

What You'll Do (Your Impact):

Own & Architect Reliability: Design, implement, and manage highly available, scalable, and resilient architectures on Google Cloud Platform (GCP) for key customer environments.
Lead GCP Expertise: Serve as a subject matter expert for GCP within the team and potentially wider organisation, driving best practices for security, cost optimization, and performance.
Master Kubernetes at Scale: Architect, deploy, secure, and manage production-grade Kubernetes clusters (GKE preferred), ensuring optimal performance and reliability for critical applications (including API platforms like Apigee, though prior Apigee experience isn't mandatory).
Drive Automation & IaC: Lead the design and implementation of robust automation strategies using Terraform, Ansible, and scripting (Python, Go, Bash) for provisioning, configuration management, and CI/CD pipelines (Jenkins, GitHub Actions, etc.).
Elevate Observability: Architect and refine comprehensive monitoring, logging, and alerting strategies using tools like Grafana, Dynatrace, and Sentry to ensure proactive issue detection and rapid response.
Lead Incident Response & Prevention: Spearhead incident management efforts, conduct blameless post-mortems, and drive the implementation of preventative measures to continuously improve system resilience.
Champion SRE Principles: Actively promote and embed SRE best practices (SLOs, SLIs, error budgets) within delivery teams and operational processes.
Mentor & Collaborate: Share your expertise, mentor junior team members (potentially), and collaborate effectively across teams to foster a strong reliability culture.

What You'll Bring (Your Expertise):

Proven SRE Experience: 5+ years of hands-on experience in a Site Reliability Engineering, DevOps, or Cloud Engineering role, with a significant focus on production systems.
Deep GCP Knowledge: Demonstrable, in-depth expertise in designing, deploying, and managing services within Google Cloud Platform (Compute Engine, GKE, Networking, IAM, Cloud SQL/Spanner, Pub/Sub, Monitoring/Logging etc.). GCP certifications are a plus.
Strong Kubernetes Skills: Proven experience managing Kubernetes clusters in production environments (GKE highly desirable). Understanding of networking, security, and operational best practices within Kubernetes.
Infrastructure as Code Mastery: Significant experience using Terraform in complex environments. Proficiency with configuration management tools (Ansible, Puppet, Chef) is beneficial.
Automation & Scripting Prowess: Strong proficiency in scripting languages like Python or Go, with experience in automating operational tasks and building tooling.
Observability Expertise: Experience implementing and leveraging monitoring, logging, and tracing tools (e.g., Prometheus, Grafana, ELK Stack, Dynatrace, Datadog, Sentry).
Problem-Solving Acumen: Strong analytical and troubleshooting skills, with experience leading incident response for critical systems.
Collaboration & Communication: Excellent communication skills and a collaborative mindset, with the ability to explain complex technical concepts clearly. Experience mentoring others is advantageous.
(Desirable): Experience with API Management platforms (Apigee, Kong, etc.), advanced networking concepts, or security hardening in cloud environments.

Technologies We Use (You'll Master):

Cloud: Google Cloud Platform (GCP)
Containerisation & Orchestration: Kubernetes (GKE), Docker
Infrastructure & Automation: Terraform, Ansible
Monitoring & Observability: Grafana, Dynatrace, Sentry, Google Cloud Operations Suite
CI/CD: Jenkins, GitHub Actions, Bamboo (or similar)
Scripting: Python, Go, Bash
Collaboration: JIRA, Confluence, Slack

Job Classification

Industry: IT Services & Consulting
Functional Area / Department: Engineering - Software & QA
Role Category: DevOps
Role: Site Reliability Engineer
Employement Type: Full time

Contact Details:

Company: Aviato Consulting
Location(s): Bengaluru

+ View Contact

Login

Candidates can login here to view contacts and apply.

Sign In Sign Up

Email:

Password:

Password too short

To create your profile, apply for a job or make a registration

Your name (*)

Email (*)

Mobile (*)

Preferred City (* max. 2 w/comma)

Designation / Expected Role

Current / Recent Company (*)

Experience (*)

Expected Salary (*)

Desired Industry (*):

Functional area / Department (*):

Enter Skills (key skills, subjects, technologies & roles to use in search)

Write briefly about yourself, your experience and education (*)

Attach Resume Max 2.38 MB (RTF, PDF, DOC, DOCX formats only parsed)

Please, check the file size and type.

Add social media [ + ]

Create password

I agree with website service terms and conditions

Candidates are expected to provide most recent and accurate profile information, inappropriate content is strictly prohibited!

Keyskills: GCP Site Reliability Engineering Devops Google Cloud Platforms Kubernetes

Fraud Alert to job seekers!

₹ Not Disclosed

Job application

We will notify the employer with your details. You can also attach a resume or a cover letter.

Sign In Sign Up

Email:

Password:

Password too short

To create your profile, apply for a job or make a registration

Your name (*)

Email (*)

Mobile (*)

Preferred City (* max. 2 w/comma)

Designation / Expected Role

Current / Recent Company (*)

Experience (*)

Expected Salary (*)

Desired Industry (*):

Functional area / Department (*):

Enter Skills (key skills, subjects, technologies & roles to use in search)

Write briefly about yourself, your experience and education (*)

Attach ResumeMax 2.38 MB (RTF, PDF, DOC, DOCX formats only parsed)

Please, check the file size and type.

Add social media [ + ]

Create password

I agree with website service terms and conditions

Similar positions

Devops Engineer

GSR Business Services

3 - 7 years

Bengaluru

2 days ago

₹ 5-10 Lacs P.A.

Devops Engineer

Tech Mahindra

4 - 6 years

Bengaluru

2 days ago

₹ -15 Lacs P.A.

Devops Engineer

Engineering Industries

3 - 5 years

Delhi, NCR

1 day ago

₹ Not Disclosed

Kubernetes Platform Engineer

Forbes Global

4 - 6 years

Mumbai

2 days ago

₹ Not Disclosed

Aviato Consulting

Aviato Consulting is a company that helps businesses transform through technology, primarily using Google Cloud. We specialize in app development, cloud foundations, data & analytics, AI & machine learning, application modernization, and security reviews. Founded by ex-Googlers, we aim to ...

Senior Site Reliability Engineer @ Aviato Consulting

Home > Devops