This job might no longer be available.
Lead/Senior Site Reliability Engineer - Backend
2 years ago
We are looking for Senior Site Reliability Engineers who can help build, monitor, and run a live platform that will revolutionize the way we create, play, and share gameplay experiences.
Responsibilities
- Monitor and operate Core platform services
- Troubleshoot operational problems as they arise, test fixes, and perform follow-ups to ensure issues have been correctly resolved
- Part of an on-call rotation to assist finding a resolution during incidents
- Apply your systems knowledge to triage problems and tune resource usage
- Participate in code reviews for projects written by your team
Requirements
- 5+ years of experience monitoring and operating a live environment
- Hands-on experience with container technologies such as Docker and kubernetes
- Experience with managing Linux VMs
- Experience with cloud services and architecture (Azure, GCP, AWS)
- Ability to write and maintain tools written in an language like PowerShell
Pluses
- Experience with several of the following tools: Splunk, fluentd/fluent-bit, Jenkins, Microsoft Orleans, Datadog, nginx, redis
- Familiarity with several different database technologies
- Experience deploying and managing Kubernetes clusters with tools like Helm and Terraform
- Experience with C# or similar programming language
- Experience leading investigations and resolving live environment outages
- Interest and/or experience with blockchain technology
Create Your Profile — Game companies can contact you with their relevant job openings.