
Senior Software Engineer - Infrastructure @ Nvidia



Job Description

NVIDIA is searching for a highly motivated software engineer for the NVIDIA NetQ team, which is building a next-generation network management and telemetry system in the cloud using modern design principles at internet scale. NVIDIA NetQ is a highly scalable, modern network operations toolset that provides visibility, troubleshooting, and validation of your Cumulus fabrics in real time. NetQ utilizes telemetry and delivers actionable insights about the health of your data center network, integrating the fabric into your DevOps ecosystem.

What you'll be doing:

  • Building and maintaining infrastructure components such as NoSQL databases (Cassandra, MongoDB), TSDB, Kafka, etc.

  • Maintaining CI/CD pipelines that automate the build, test, and deployment process, and removing bottlenecks in them. Managing tools and automating repetitive manual workflows with Jenkins, Ansible, Terraform, etc.

  • Enabling security scans and the handling of CVEs for infrastructure components.

  • Enabling triage and handling of production issues to improve system reliability and service for customers.

What we need to see:

  • 5+ years of experience with complex microservices-based architectures and a Bachelor's degree.

  • Highly skilled in Kubernetes and Docker/containerd.

  • Experienced with modern deployment architecture for non-disruptive cloud operations, including blue-green and canary rollouts.

  • Automation expert with hands-on skills in frameworks like Ansible & Terraform.

  • Strong knowledge of NoSQL DB (preferably Cassandra), Kafka/Kafka Streams and Nginx.

  • Expert in AWS, Azure or GCP.

  • A good programming background in languages like Scala or Python.

  • Knowledge of the best practices and discipline of managing a highly available and secure production infrastructure.

Ways to stand out from the crowd:

  • Experience with APM tools like Dynatrace, Datadog, AppDynamics, New Relic, etc.

  • Skills in Linux/Unix Administration.

  • Experience with Prometheus/Grafana.

  • Implemented highly scalable log aggregation systems in the past using the ELK stack or similar.

  • Implemented robust metrics collection and alerting infrastructure.

NVIDIA is widely considered to be one of the technology world's most desirable employers. We have some of the most forward-thinking and hardworking people on the planet working for us. If you're creative, passionate and self-motivated, we want to hear from you! NVIDIA is leading the way in ground-breaking developments in Artificial Intelligence, High-Performance Computing and Visualization. The GPU, our invention, serves as the visual cortex of modern computers and is at the heart of our products and services.

Job Classification

Industry: Electronic Components / Semiconductors
Functional Area / Department: Engineering - Software & QA
Role Category: DevOps
Role: Site Reliability Engineer
Employment Type: Full time

Contact Details:

Company: Nvidia
Location(s): Bengaluru



Key skills: Automation, NoSQL, Linux, Nginx, Cassandra, Network operations, Artificial Intelligence, Scala, Troubleshooting, Python


Salary: Not disclosed
