Are you a highly motivated, tech-savvy individual with a passion for problem-solving and optimizing digital systems? Do you thrive in fast-paced environments and enjoy collaborating with cross-functional teams? If so, Honeywell has an exciting opportunity for you as a Site Reliability Engineer. As a key member of our team, you will be responsible for ensuring the reliability, scalability, and performance of our digital platforms, driving continuous improvement and innovation. With your strong technical skills and close attention to detail, you will play a crucial role in delivering exceptional experiences for our customers. Join us in our mission to make the world a smarter, safer, and more sustainable place.
- Ensure the reliability, scalability, and performance of Honeywell's digital platforms.
- Collaborate with cross-functional teams to identify and troubleshoot issues, and implement solutions in a timely manner.
- Continuously monitor the health and performance of the digital systems, and proactively identify and address potential issues.
- Optimize and automate processes to improve efficiency and reduce downtime.
- Develop and maintain documentation for system configurations, processes, and procedures.
- Stay up-to-date with industry trends and best practices in site reliability engineering.
- Participate in on-call rotations to address critical incidents and ensure 24/7 availability of the digital systems.
- Work closely with development teams to implement new features and enhancements in a reliable and scalable manner.
- Conduct root cause analysis for incidents and provide recommendations for long-term solutions.
- Collaborate with security teams to ensure the security and compliance of the digital systems.
- Implement and maintain monitoring, alerting, and logging systems to proactively identify and resolve issues.
- Drive continuous improvement and innovation by identifying areas for optimization and implementing solutions.
- Contribute to the development and implementation of disaster recovery plans.
- Communicate effectively with stakeholders at all levels to provide updates on system performance and incident resolution.
- Act as a technical mentor and provide guidance to junior team members.
Strong understanding of cloud computing platforms such as AWS, Azure, or Google Cloud Platform.
Proficiency in scripting languages such as Python or Bash for automation and monitoring tasks.
Experience with configuration management tools such as Ansible, Puppet, or Chef.
Knowledge of containerization and orchestration technologies like Docker and Kubernetes.
Familiarity with monitoring and logging tools like Prometheus, ELK Stack, or Splunk for troubleshooting and performance optimization.
Troubleshooting
DevOps
Scripting
Incident Management
Automation
Cloud Computing
Performance optimization
Disaster recovery
Infrastructure management
System Administration
Network Monitoring
Security Hard
Communication
Conflict Resolution
Emotional Intelligence
Leadership
Time management
Creativity
Attention to detail
Teamwork
Adaptability
Problem-Solving
According to JobzMall, the average salary range for a Site Reliability Engineer in Atlanta, GA, USA is between $115,000 and $140,000 per year. This range can vary depending on factors such as the specific company, years of experience, and additional skills and certifications. Some companies may offer higher salaries, particularly for more experienced and highly skilled engineers. Bonuses and other forms of compensation may also be part of a Site Reliability Engineer's total compensation package.
Honeywell International Inc. is an American multinational conglomerate that produces commercial and consumer products, engineering services, and aerospace systems for a wide range of customers, from private consumers to major corporations and governments.
