
Senior Site Reliability Engineer
Are you a highly skilled and experienced engineer looking for a new and exciting challenge? Are you passionate about ensuring the reliability and stability of complex systems? Look no further, because EPAM Systems is seeking a Senior Site Reliability Engineer to join our dynamic team!
As a Senior Site Reliability Engineer, you will play a crucial role in maintaining and improving the availability, performance, and scalability of our systems. Your expertise in troubleshooting and optimizing infrastructure and applications will be instrumental in driving continuous improvement and ensuring a seamless user experience for our clients.
We are looking for a dedicated and driven individual who possesses a deep understanding of engineering principles and a strong background in system administration and automation. If you have a proven track record of successfully managing and enhancing large-scale systems and thrive in a collaborative and fast-paced environment, we want to hear from you! Join us at EPAM Systems and take your career to new heights.
- Monitor and maintain the availability, performance, and scalability of complex systems.
- Troubleshoot and resolve any issues related to infrastructure and applications.
- Optimize systems for maximum efficiency and reliability.
- Work closely with cross-functional teams to identify and implement improvements to systems.
- Develop and implement automation processes to streamline system maintenance.
- Stay up-to-date with industry trends and best practices in system administration and automation.
- Collaborate with developers to ensure the seamless integration of new features and updates.
- Proactively identify and mitigate potential risks to system stability and performance.
- Mentor and train junior team members in system administration and automation.
- Participate in on-call rotation to provide 24/7 support for critical systems.
- Communicate effectively with stakeholders and provide regular updates on system status and improvements.
- Develop and maintain documentation for system configurations and processes.
- Adhere to security protocols and ensure the protection of sensitive data.
- Continuously evaluate and implement new tools and technologies to enhance system performance.
- Work collaboratively with other departments to align system goals with overall business objectives.
Extensive experience in designing and implementing highly available and scalable systems.
Proficiency in automation and infrastructure as code tools such as Ansible, Terraform, and Puppet.
In-depth knowledge of containerization technologies such as Docker and Kubernetes.
Strong troubleshooting skills and experience with monitoring and alerting tools.
Excellent communication and collaboration skills, with the ability to work effectively with cross-functional teams.
DevOps
Scripting
Automation
Cloud Infrastructure
Disaster recovery
Performance tuning
Network troubleshooting
Configuration management
Incident response
Problem-Solving
ROOT
Monitoring And Alerting
Communication
Conflict Resolution
Emotional Intelligence
Leadership
Time management
Creativity
Critical thinking
Teamwork
Adaptability
Problem-Solving
According to JobzMall, the average salary range for a Senior Site Reliability Engineer in Orlando, FL, USA is between $107,000 and $156,000 per year. This may vary based on factors such as experience, skills, and the specific company or industry a person is working in. Additionally, bonuses, benefits, and other forms of compensation may also impact the overall salary range.
EPAM Systems, Inc. is a US company that specializes in product development, digital platform engineering, and digital and product design.

