NVIDIA

Senior Site Reliability Engineer - Cloud

NVIDIA

Santa Clara, CA, USA
Full-TimeDepends on ExperienceSenior LevelMasters
Job Description

Are you ready to take your career to the next level and join a dynamic team at one of the world's leading technology companies? NVIDIA is seeking a Senior Site Reliability Engineer with a strong background in cloud technologies to help us maintain and improve the reliability of our cutting-edge products and services. As a key member of our team, you will have the opportunity to work with the latest cloud technologies and collaborate with a talented group of engineers to ensure the continuous delivery of our world-class products. If you have a passion for solving complex problems, a deep understanding of cloud infrastructure, and a desire to work in a fast-paced and innovative environment, then we want to hear from you!

  1. Collaborate with a team of engineers to ensure the continuous delivery and reliability of NVIDIA's cutting-edge products and services.
  2. Utilize strong knowledge of cloud technologies to maintain and improve the reliability of our products and services.
  3. Identify and troubleshoot complex problems in a timely and efficient manner.
  4. Develop and implement automated solutions for monitoring, testing, and deploying applications in the cloud.
  5. Work closely with cross-functional teams to identify and address any potential issues with cloud infrastructure.
  6. Continuously evaluate and improve our cloud infrastructure and processes to optimize performance and scalability.
  7. Provide technical guidance and mentorship to junior team members.
  8. Stay up-to-date with the latest cloud technologies and trends, and make recommendations for incorporating them into our systems.
  9. Ensure compliance with company security and privacy policies and procedures.
  10. Collaborate with other teams, such as development and operations, to coordinate efforts and improve overall system performance.
  11. Participate in on-call rotations and respond to system alerts and incidents.
  12. Communicate effectively with team members and stakeholders about project statuses, issues, and improvements.
  13. Analyze system logs and metrics to identify potential areas for improvement.
  14. Develop and maintain documentation for processes, procedures, and system configurations.
  15. Proactively identify potential risks and implement measures to prevent system downtime.
Where is this job?
This job is located at Santa Clara, CA, USA
Job Qualifications
  • Extensive Experience In Cloud Computing Technologies: A Senior Site Reliability Engineer - Cloud At Nvidia Should Possess In-Depth Knowledge And Hands-On Experience With Various Cloud Computing Platforms Such As Aws, Azure, And Google Cloud. They Should Be Able To Design, Deploy, And Manage Highly Available And Scalable Cloud-Based Systems.

  • Strong Understanding Of Devops Principles: The Role Of A Senior Site Reliability Engineer - Cloud Involves Collaboration With Software Development Teams To Ensure Seamless Integration And Deployment Of Applications. Therefore, The Ideal Candidate Should Have A Strong Understanding Of Devops Principles And Practices, Including Continuous Integration And Delivery, Infrastructure As Code, And Automated Testing.

  • Expertise In Infrastructure Automation Tools: As Nvidia's Cloud Infrastructure Continues To Grow, Automation Becomes Crucial For Managing And Scaling Systems Efficiently. A Senior Site Reliability Engineer - Cloud Should Have A Deep Understanding Of Infrastructure Automation Tools Such As Terraform, Puppet, Chef, And Ansible, And Be Able To Leverage Them To Streamline And Automate Processes.

  • Proven Track Record In Troubleshooting And Problem-Solving: Site Reliability Engineering Involves Identifying And Resolving Complex Issues In A Timely Manner To Ensure High Availability And Reliability Of Systems. A Successful Senior Site Reliability Engineer - Cloud Should Have A Solid Track Record Of Troubleshooting And Problem-Solving, Using A Structured Approach To Identify Root Causes And Implement Effective Solutions.

  • Excellent Communication And Teamwork Skills: As A Senior Member Of The Team, A Senior Site Reliability Engineer - Cloud Should Be Able To Communicate Effectively With Cross-Functional Teams, Including Software Engineers, Devops Engineers, And Project Managers. They Should Also Possess Strong Teamwork Skills, As They Will Be Working Closely With Different Teams To Deliver Quality Solutions On Time.

Required Skills
  • Security

  • Troubleshooting

  • DevOps

  • Scripting

  • Automation

  • Cloud Computing

  • Kubernetes

  • Disaster recovery

  • Performance tuning

  • Containerization

  • Monitoring

  • Infrastructure management

Soft Skills
  • Communication

  • Conflict Resolution

  • Emotional Intelligence

  • Leadership

  • Time management

  • creativity

  • flexibility

  • Teamwork

  • Adaptability

  • Problem-Solving

Compensation

According to JobzMall, the average salary range for a Senior Site Reliability Engineer - Cloud in Santa Clara, CA, USA is $150,000 - $185,000 per year. This may vary depending on the specific company, experience level, and other factors.

Additional Information
NVIDIA is an Equal Opportunity Employer. We celebrate diversity and are committed to creating an inclusive environment for all employees. We do not discriminate based upon race, religion, color, national origin, sex, sexual orientation, gender identity, age, status as a protected veteran, status as an individual with a disability, or other applicable legally protected characteristics.
Required LanguagesEnglish
Job PostedJuly 15th, 2024
Apply BeforeJune 21st, 2025
This job posting is from a verified source. 
Reposted

Apply with Video Cover Letter Add a warm greeting to your application and stand out!

About NVIDIA

NVIDIA Corp. designs and manufactures computer graphics processors, chipsets, and related multimedia software. The company operates through two segments: Graphics Processing Unit and Tegra Processor. The Graphics Processing Unit segment includes sales of the company's GeForce discrete and chipset products that supports desktop and notebook PCs plus license fees from Intel and sales of memory products. The Tegra Processors segment provides processors that deliver superior visual and multimedia experience on tablets, smart phones and gaming devices while consuming minimal power.

Frequently asked questions

Get interviewed today!

JobzMall is the world‘ s largest video talent marketplace.It‘s ultrafast, fun, and human.

Get Started