NVIDIA

Senior Staff Site Reliability Engineer

NVIDIA

Santa Clara, CA, USA
Full-TimeDepends on ExperienceSenior LevelMasters
Job Description

At NVIDIA, we're seeking a highly skilled and experienced Senior Staff Site Reliability Engineer to join our dynamic team. As a leader in the AI and technology industry, we are constantly pushing boundaries and innovating to create cutting-edge products. In this role, you will play a critical role in ensuring the reliability and scalability of our systems, impacting millions of users worldwide. We are looking for someone with a strong technical background, exceptional problem-solving skills, and a passion for driving continuous improvement. If you thrive in a fast-paced, collaborative environment and are ready to take on new challenges, we want to hear from you!

  1. Design and implement highly available and fault-tolerant systems to support millions of users worldwide.
  2. Monitor and maintain the health and performance of our systems, proactively identifying and resolving any issues.
  3. Lead incident response and resolution, including root cause analysis and post-incident reviews.
  4. Collaborate with cross-functional teams to continuously improve system reliability, scalability, and performance.
  5. Develop and maintain automation tools for deployment, monitoring, and maintenance of systems.
  6. Stay updated with the latest technologies and industry best practices to drive innovation and improve efficiency.
  7. Provide technical guidance and mentorship to junior team members.
  8. Participate in on-call rotations and provide 24/7 support for critical systems.
  9. Ensure compliance with security standards and policies.
  10. Create and maintain documentation of systems, processes, and procedures.
  11. Identify opportunities for process improvement and implement changes to increase efficiency and reduce downtime.
  12. Communicate effectively with team members, stakeholders, and management to provide regular updates on system performance and reliability.
Where is this job?
This job is located at Santa Clara, CA, USA
Job Qualifications
  • Extensive Experience With Infrastructure And Software Engineering: A Senior Staff Site Reliability Engineer At Nvidia Should Have A Strong Background In Both Infrastructure And Software Engineering, With A Deep Understanding Of How These Two Areas Intersect And Impact Each Other.

  • Proficiency In Multiple Programming Languages: The Ideal Candidate Should Be Proficient In Multiple Programming Languages, Such As Python, Java, And Shell Scripting, To Build And Maintain Reliable Systems And Automation Tools.

  • Expertise In Cloud Computing: As Nvidia's Products Are Primarily Cloud-Based, The Senior Staff Site Reliability Engineer Should Have A Deep Understanding Of Cloud Computing Platforms, Such As Aws, Azure, And Gcp, And Be Able To Design And Optimize Infrastructure For These Environments.

  • Strong Troubleshooting And Problem-Solving Skills: A Senior Staff Site Reliability Engineer Should Be Able To Quickly Identify And Troubleshoot Complex Issues In A Production Environment, Using Various Tools And Techniques To Resolve Them Effectively.

  • Leadership And Project Management Experience: In Addition To Technical Skills, A Senior Staff Site Reliability Engineer Should Have Experience Leading And Managing Projects, As Well As Mentoring And Training Junior Team Members. They Should Also Have Excellent Communication Skills To Collaborate With Cross-Functional Teams And Stakeholders.

Required Skills
  • Security

  • Virtualization

  • Networking

  • Scripting

  • Automation

  • Cloud Computing

  • Disaster recovery

  • Performance tuning

  • Containerization

  • Linux/UNIX

  • Configuration management

  • Monitoring

Soft Skills
  • Communication

  • Conflict Resolution

  • Emotional Intelligence

  • Leadership

  • Problem Solving

  • Time management

  • creativity

  • Attention to detail

  • Teamwork

  • Adaptability

Compensation

According to JobzMall, the average salary range for a Senior Staff Site Reliability Engineer in Santa Clara, CA, USA is between $170,000 and $200,000 per year. This range can vary based on factors such as experience, education, and specific job duties. Additionally, location and company size may also impact salary range.

Additional Information
NVIDIA is an Equal Opportunity Employer. We celebrate diversity and are committed to creating an inclusive environment for all employees. We do not discriminate based upon race, religion, color, national origin, sex, sexual orientation, gender identity, age, status as a protected veteran, status as an individual with a disability, or other applicable legally protected characteristics.
Required LanguagesEnglish
Job PostedMarch 25th, 2024
Apply BeforeMay 22nd, 2025
This job posting is from a verified source. 
Reposted

Apply with Video Cover Letter Add a warm greeting to your application and stand out!

About NVIDIA

NVIDIA Corp. designs and manufactures computer graphics processors, chipsets, and related multimedia software. The company operates through two segments: Graphics Processing Unit and Tegra Processor. The Graphics Processing Unit segment includes sales of the company's GeForce discrete and chipset products that supports desktop and notebook PCs plus license fees from Intel and sales of memory products. The Tegra Processors segment provides processors that deliver superior visual and multimedia experience on tablets, smart phones and gaming devices while consuming minimal power.

Frequently asked questions

Get interviewed today!

JobzMall is the world‘ s largest video talent marketplace.It‘s ultrafast, fun, and human.

Get Started