Ultimate Kronos Group

Lead Site Reliability Engineer

Lowell, MA, USA
Full-Time | Depends on Experience | Senior Level | Masters
Job Description

Are you an experienced and ambitious Site Reliability Engineer looking for a leadership role? Do you thrive in a fast-paced and dynamic environment? Ultimate Kronos Group is seeking a highly skilled and motivated individual to join our team as a Lead Site Reliability Engineer.

As the Lead Site Reliability Engineer, you will play a crucial role in ensuring the reliability, availability, and performance of our systems. You will have the opportunity to lead a team of talented engineers and drive the development and implementation of best practices and processes.

We are looking for a candidate who is passionate about technology and has a strong background in site reliability engineering. The ideal candidate will have excellent communication skills, a collaborative mindset, and a proven track record of managing complex projects.

Join us and be a part of a growing and innovative organization that values creativity, collaboration, and continuous improvement. If you have a passion for problem-solving and a drive for excellence, we want to hear from you!

  1. Lead a team of engineers in the development and implementation of best practices and processes for ensuring the reliability, availability, and performance of our systems.
  2. Develop and maintain a deep understanding of our systems and infrastructure to identify and resolve potential issues before they impact our customers.
  3. Collaborate with cross-functional teams to design, build, and maintain highly available and scalable systems.
  4. Monitor system performance, proactively identify areas for improvement, and implement solutions to optimize it.
  5. Develop and maintain documentation and runbooks for troubleshooting and resolving system issues.
  6. Drive continuous improvement by identifying and implementing new tools, techniques, and processes to enhance system reliability and performance.
  7. Manage complex projects and ensure timely delivery of solutions.
  8. Mentor and provide guidance to team members, fostering a culture of continuous learning and growth.
  9. Collaborate with other teams to troubleshoot and resolve customer issues related to system performance.
  10. Communicate effectively with stakeholders, providing regular updates on system health and performance.
  11. Stay up to date with industry trends and advancements in site reliability engineering and incorporate them into our systems and processes.
  12. Ensure compliance with security standards and protocols to protect our systems and data.
  13. Collaborate with vendors and third-party providers to ensure the reliability and performance of our systems.
  14. Drive incident response and participate in on-call rotations to ensure 24/7 support for our systems.
  15. Foster a positive and collaborative team environment, promoting a culture of accountability, ownership, and continuous improvement.
Where is this job?
This job is located in Lowell, MA, USA.
Job Qualifications
  • Extensive experience in DevOps and site reliability engineering, with at least 5 years in a lead or senior role.

  • In-depth knowledge of cloud computing platforms such as AWS, Azure, or Google Cloud, and experience with infrastructure automation tools like Terraform or Ansible.

  • Strong hands-on experience with containerization and orchestration technologies like Docker and Kubernetes.

  • Proven track record of managing and maintaining highly available, scalable, and secure production systems.

  • Excellent communication and leadership skills, with the ability to mentor and guide junior team members and collaborate effectively with cross-functional teams.

Required Skills
  • Security

  • Troubleshooting

  • DevOps

  • Agile Methodology

  • Scripting

  • Automation

  • Cloud Computing

  • Disaster Recovery

  • Performance Tuning

  • Load Balancing

  • Monitoring

  • Systems Architecture

Soft Skills
  • Communication

  • Conflict Resolution

  • Emotional Intelligence

  • Leadership

  • Time Management

  • Creativity

  • Teamwork

  • Adaptability

  • Problem-Solving

  • Decision-Making

Compensation

According to JobzMall, the average salary range for a Lead Site Reliability Engineer in Lowell, MA, USA is $138,000 - $162,000 per year. This may vary depending on factors such as experience, skills, and the specific company.

Additional Information
Ultimate Kronos Group is an Equal Opportunity Employer. We celebrate diversity and are committed to creating an inclusive environment for all employees. We do not discriminate based upon race, religion, color, national origin, sex, sexual orientation, gender identity, age, status as a protected veteran, status as an individual with a disability, or other applicable legally protected characteristics.
Required Languages: English
Job Posted: April 1st, 2025
Apply Before: June 21st, 2025
This job posting is from a verified source. 


About Ultimate Kronos Group

Ultimate provides HCM solutions designed to improve the employee experience by putting people first. These solutions cover HR, payroll, talent, time and scheduling, engagement surveys, HR service delivery, and more.
