Staff Site Reliability Engineer - Cloud Solutions Team @ SurveyMonkey

Job Description

What we're looking for

As a member of the infrastructure team at SurveyMonkey, you will have a direct impact on designing, engineering, and maintaining our Cloud, Messaging, and Observability Platform. You will shape best practices, deployment processes, and architecture, and support the ongoing operation of our multi-tenant AWS environments. This role is a prime opportunity to build world-class infrastructure, solve complex problems at scale, learn new technologies, and mentor other engineers.

What you'll be working on
  • Architect, build, and operate AWS environments at scale with well-established industry best practices.
  • Automate infrastructure provisioning, DevOps, and/or continuous integration/delivery.
  • Technical Leadership & Mentorship
    • Mentor and guide senior engineers to build technical expertise and drive a culture of excellence in software development.
    • Foster collaboration within the engineering team, ensuring the adoption of best practices in coding, testing, and deployment.
    • Review code and provide constructive feedback to ensure code quality and adherence to architectural principles.
  • Collaboration & Cross-Functional Leadership
    • Collaborate with cross-functional teams (Product, Security, and other Engineering teams) to drive the roadmap and ensure alignment with business objectives.
    • Provide technical leadership in meetings and discussions, influencing key decisions on architecture, design, and implementation.
  • Innovation & Continuous Improvement
    • Propose, evaluate, and integrate new tools and technologies to improve the performance, security, and scalability of the cloud platform.
    • Drive initiatives for optimizing cloud resource usage and reducing operational costs without compromising performance.
    • Write libraries and APIs that provide a simple, unified interface to other developers when they use our monitoring, logging, and event-processing systems.
  • Participate in on-call rotation.
  • Support and partner with other teams on improving our observability systems to monitor site stability and performance.

We'd love to hear from people with:
  • 12+ years of relevant professional experience with cloud platforms such as AWS or Heroku.
  • Extensive experience leading design sessions and evolving well-architected environments in AWS at scale.
  • Extensive experience with Terraform, Docker, Kubernetes, Helm, and scripting (Bash/Python/YAML).
  • Experience with Splunk, OpenTelemetry, CloudWatch, or tools like New Relic, Datadog, Grafana/Prometheus, or ELK (Elasticsearch/Logstash/Kibana).
  • Experience with metrics and logging libraries and aggregators, and with data analysis and visualization tools, specifically Splunk and OpenTelemetry (OTel).
  • Experience instrumenting PHP, Python, Java, and Node.js applications to send metrics, traces, and logs to third-party observability tooling.
  • Experience with GitOps and tools like Argo CD or Flux CD.
  • Interest in instrumentation and optimization of Kubernetes clusters.
  • Ability to listen and partner to understand requirements, troubleshoot problems, or promote the adoption of platforms.
  • Experience with GitHub/GitHub Actions/Jenkins/GitLab in either a software engineering or DevOps environment.
  • Familiarity with databases and caching technologies, including PostgreSQL, MongoDB, Elasticsearch, Memcached, Redis, Kafka, and Debezium.
  • Preferably, experience with secrets management, for example HashiCorp Vault.
  • Preferably, experience working in an agile environment and with Jira.

SurveyMonkey believes in-person collaboration is valuable for building relationships, fostering community, and enhancing our speed and execution in problem-solving and decision-making. As such, this opportunity is hybrid and requires you to work from the SurveyMonkey office in Bengaluru 3 days per week.

Job Classification

Industry: Software Product
Functional Area / Department: Engineering - Software & QA
Role Category: DevOps
Role: Site Reliability Engineer
Employment Type: Full time

Contact Details:

Company: SurveyMonkey
Location(s): Bengaluru


Key skills: Kubernetes, Memcached, Helm, Redis, Docker, scripting, Elasticsearch, Logstash, Java, PostgreSQL, DevOps, design, Heroku, Jenkins, Prometheus, Kibana, MongoDB, Amazon CloudWatch, YAML, Python, GitHub, software development, ELK, New Relic, cloud platforms, Node.js, Grafana, Kafka, Terraform, GitLab, Bash, AWS

₹ Not Disclosed

Similar positions

Devops Engineer

  • GSR Business Services
  • 3 - 7 years
  • Bengaluru
  • 1 day ago
₹ 5-10 Lacs P.A.

Devops Engineer

  • Tech Mahindra
  • 4 - 6 years
  • Bengaluru
  • 1 day ago
₹ -15 Lacs P.A.

Devops Engineer

  • Engineering Industries
  • 3 - 5 years
  • Delhi, NCR
  • 8 hours ago
₹ Not Disclosed

Kubernetes Platform Engineer

  • Forbes Global
  • 4 - 6 years
  • Mumbai
  • 19 hours ago
₹ Not Disclosed

Company Details

SurveyMonkey