
Lead Site Reliability Engineer
Welcome to Ultimate Kronos Group, a leading provider of cloud-based human capital management solutions. We are looking for a highly skilled and motivated Lead Site Reliability Engineer to join our fast-paced and dynamic team. As the Lead Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our products and services. We are seeking a candidate who is passionate about automation, has a strong technical background, and thrives in a collaborative environment. If you are a self-starter with exceptional problem-solving skills and a drive for continuous improvement, we want to hear from you.
- Develop and implement strategies to improve the reliability, scalability, and performance of our products and services.
- Lead and mentor a team of Site Reliability Engineers to ensure the smooth functioning of our systems.
- Collaborate with cross-functional teams to identify and troubleshoot issues and implement effective solutions.
- Design and implement automated processes to streamline operations and improve efficiency.
- Monitor system performance and proactively identify potential issues before they impact customers.
- Continuously evaluate and improve our infrastructure and processes to ensure maximum uptime and optimal performance.
- Stay updated with industry trends and best practices in site reliability engineering to drive innovation and improvement.
- Communicate effectively with stakeholders and provide regular updates on system performance and improvements.
- Create and maintain documentation for system architecture, processes, and procedures.
- Identify and address security vulnerabilities and ensure compliance with relevant regulations.
Extensive Experience In Managing And Scaling Large-Scale, Highly Available Production Systems.
Strong Understanding Of System Design And Architecture Principles, With A Focus On Reliability And Fault Tolerance.
Proficiency In At Least One Programming Language, Preferably Python Or Java, And Experience With Automation And Configuration Management Tools Such As Ansible, Puppet, Or Chef.
Proven Track Record Of Leading And Mentoring A Team Of Engineers, With Excellent Communication And Collaboration Skills.
In-Depth Knowledge Of Cloud Computing Platforms Such As Aws, Azure, Or Google Cloud, And Experience With Containerization Technologies Such As Docker And Kubernetes.
Network Security
DevOps
Database
Automation
Disaster recovery
Performance tuning
Agile methodologies
Configuration management
Cloud architecture
Incident response
Infrastructure management
Monitoring And Alerting
Communication
Conflict Resolution
Emotional Intelligence
Leadership
Time management
creativity
Attention to detail
Teamwork
Adaptability
Problem-Solving
According to JobzMall, the average salary range for a Lead Site Reliability Engineer in Lowell, MA, USA is $130,000 to $150,000 per year. This can vary depending on factors such as experience, education, and the specific company or industry the engineer is working in. Additionally, bonuses and benefits may also be included in the overall compensation package.
Apply with Video Cover Letter Add a warm greeting to your application and stand out!
Ultimate provides HCM solutions designed to improve the employee experience by putting people first. HR, payroll, talent, time and scheduling, engagement surveys, HR service delivery, and more.

Get interviewed today!
JobzMall is the world‘ s largest video talent marketplace.It‘s ultrafast, fun, and human.
Get Started