This job might no longer be available.
Senior Site Reliability Engineer - Reliability Response
1 year ago
Every day, tens of millions of people come to Roblox to explore, create, play, learn, and connect with friends in 3D immersive digital experiences– all created by our global community of developers and creators.
At Roblox, we’re building the tools and platform that empower our community to bring any experience that they can imagine to life. Our vision is to reimagine the way people come together, from anywhere in the world, and on any device. We’re on a mission to connect a billion people with optimism and civility, and looking for amazing talent to help us get there.
A career at Roblox means you’ll be working to shape the future of human interaction, solving unique technical challenges at scale, and helping to create safer, more civil shared experiences for everyone.
As a Site Reliability Engineer, you'll help us scale our global infrastructure at a time of incredible growth for our business. At Roblox, you'll have boundless opportunities to shape the future and demonstrate your experience delivering thoughtful solutions in front of a global audience. If you know what it takes to build large-scale infrastructure that can sustain millions of concurrent players year-round, you'll fit right into our experienced and ever-expanding engineering team.
You Will:
- Be an automation expert.
- Automate reporting, and establish regular site reliability reports.
- Establish automation to track the completeness of the postmortem action items
- Automate standard operating procedures to reduce manual toil
- Automate alerting and work with telemetry to build monitoring dashboards
- Monitor critical applications, identify bottlenecks, and improve systems reliability.
- Partner with embedded SREs, Engineering Efficiency, Networking and Security teams
- Contribute as a troubleshooter during site-impacting events
- Investigate the root cause following incident resolution
- Influence, and keep the bar high for managing production changes.
- Serve in the Incident Manager On-Call rotation
- Mentor junior team members
You Are:
- Experienced: you have a BS degree (or equivalent professional experience) in Computer Science or related engineering with at least 4+ years of experience with at least 2+ years in SRE/DevOps
- A systems engineer: you have experience building and deploying distributed services and are excited to work with them globally.
- A programmer: you have experience with any programming languages: Python, Ruby, JavaScript, Go, Perl, or others.
- Experienced troubleshooter: you ask the right questions to solve issues and effectively navigate ambiguity. You can drive the incident resolution and guide people during site outages.
For roles that are based at our headquarters in San Mateo, CA: The starting base pay for this position is as shown below. The actual base pay is dependent upon a variety of job-related factors such as professional background, training, work experience, location, business needs and market demand. Therefore, in some circumstances, the actual salary could fall outside of this expected range. This pay range is subject to change and may be modified in the future. All full-time employees are also eligible for equity compensation and for benefits.
Annual Salary Range
$218,540 — $283,780 USD
You’ll Love:
- Industry-leading compensation package
- Excellent medical, dental, and vision coverage
- A rewarding 401k program
- Flexible vacation policy
- Roflex - Flexible and supportive work policy
- Roblox Admin badge for your avatar
- At Roblox HQ:
- Free catered lunches five times a week and several fully stocked kitchens with unlimited snacks
- Onsite fitness center and fitness program credit
- Annual CalTrain Go Pass
Roblox provides equal employment opportunities to all employees and applicants for employment and prohibits discrimination and harassment of any type without regard to race, color, religion, age, sex, national origin, disability status, genetics, protected veteran status, sexual orientation, gender identity or expression, or any other characteristic protected by federal, state or local laws. This policy applies to all terms and conditions of employment, including recruiting, hiring, placement, promotion, termination, layoff, recall, transfer, leaves of absence, compensation and training.
Create Your Profile — Game companies can contact you with their relevant job openings.