
Site Reliability Engineer, Edge Services
Are you a highly skilled and motivated individual with a passion for ensuring the availability and performance of large-scale systems? Do you thrive in a fast-paced and dynamic environment, constantly seeking new challenges and opportunities to innovate? If so, ByteDance is looking for a Site Reliability Engineer for our Edge Services team to help us build and maintain a highly available and scalable infrastructure for our global user base. As a key member of our team, you will play a critical role in optimizing the reliability, performance, and efficiency of our services, ensuring an exceptional user experience for millions of users around the world. Join us and be a part of our mission to inspire creativity and bring joy to our users through our innovative technology.
- Develop and maintain a highly available and scalable infrastructure for large-scale systems.
- Monitor and troubleshoot system performance and availability issues to ensure a seamless user experience.
- Collaborate with cross-functional teams to design and implement solutions for improving system reliability and performance.
- Automate deployment processes and develop tools for efficient system management.
- Conduct regular system audits and implement best practices for system security and data protection.
- Continuously evaluate and improve system processes to optimize efficiency and cost-effectiveness.
- Stay up-to-date with industry trends and developments in site reliability engineering to propose and implement innovative solutions.
- Participate in an on-call rotation to provide 24/7 support for critical system issues.
- Identify and mitigate potential risks to system stability and performance.
- Document system configurations, processes, and procedures for knowledge sharing and training purposes.
- Collaborate with other teams to ensure seamless integration of new services and features into the existing infrastructure.
- Mentor and guide junior team members to develop their skills and knowledge in site reliability engineering.
- Communicate effectively with team members and stakeholders to provide updates on system status and any potential issues.
- Adhere to company policies and procedures, as well as industry standards and regulations.
- Contribute to a positive and collaborative work environment, promoting teamwork and a culture of innovation.
In-depth knowledge of cloud computing technologies, such as AWS, Azure, and Google Cloud Platform, and experience in managing and optimizing large-scale distributed systems.
Strong proficiency in scripting and programming languages, including Python, Java, or Go, and experience with automation and configuration management tools like Terraform, Puppet, or Chef.
Extensive experience in troubleshooting and resolving complex system issues, including network, server, and application performance, and a deep understanding of monitoring and logging tools like Prometheus, ELK, or Datadog.
Proven track record of working in a fast-paced, high-availability environment, with a focus on reliability and scalability, and experience in designing and implementing disaster recovery and business continuity plans.
Excellent communication and collaboration skills, with the ability to work closely with cross-functional teams, including developers, infrastructure engineers, and product managers, to identify and address technical challenges and drive continuous improvement.
Security
Networking
Troubleshooting
DevOps
Scripting
Automation
Cloud Computing
Linux Administration
Load Balancing
Monitoring
Scalability
CDN Management
Communication
Conflict Resolution
Customer Service
Emotional Intelligence
Leadership
Time Management
Creativity
Teamwork
Adaptability
Problem-Solving
According to JobzMall, the average salary range for a Site Reliability Engineer, Edge Services in Boston, MA, USA is between $100,000 and $160,000 per year. This range can vary depending on factors such as the company, experience level, and specific job responsibilities.
ByteDance is a technology company operating a range of content platforms that inform, educate, entertain, and inspire people across languages, cultures, and geographies. Dedicated to building global platforms of creation and interaction, ByteDance now has a portfolio of applications available in over 150 markets and 75 languages, including TikTok, Helo, Vigo Video, Douyin, and Huoshan.

