
Site Reliability Engineer

Gearbox Software
Frisco, Texas
3 years ago

Site Reliability Engineer (Mid-level)


To further drive our vision of premier stability and rapid feature delivery, we are looking for a mid-level Site Reliability Engineer to join our team. As an SRE, you will be responsible for assisting in the design and implementation of flexible cloud architectures with an automation-first emphasis. You will be challenged along the way to adopt the shared mentality that observability is everything, and to push for that philosophy to be realized throughout the platform. You should be comfortable integrating multiple technologies to form a single, coherent view of platform health, and you should have a firm understanding of cloud and microservice security. When challenged with designing and implementing a new feature in the infrastructure, you are confident in both your design and your implementation, and ready to defend them in a room with other technical minds. You also recognize that the best designs come from collaboration, not dictation, and are willing to bring implementations to the table expecting that collaboration will likely change your initial work.

This position will require you to carry a company-paid mobile device and participate in 24/7 on-call rotations alongside your engineering colleagues.

Responsibilities:

  • Collaborate with our growing team of DevOps/SRE engineers, helping establish best practices in observability, reliability, and security
  • Design and implement software solutions to improve the reliability, observability, and security of our platform services
  • Work alongside senior engineers to organize technical roadmaps into achievable work
  • Assist in observability integration of services throughout the stack
  • Write tooling that aids developers in build management and rapid deployment
  • Mentor junior engineers as needed
  • Participate in after-hours on-call support rotations

Requirements:

  • Minimum of 3 years of extensive hands-on experience with a wide variety of AWS technologies; multi-cloud experience is preferred
  • Minimum of 3 years of experience with containers and infrastructure as code, preferably Docker and Terraform
  • Minimum of 3 years of experience in disciplined software engineering, with a focus on developing and implementing highly scalable, highly available applications
  • Proven experience in Docker and AWS security best practices
  • Deep understanding of observability stack management (monitoring, alerting, structured logging, APM, etc.)
  • Extensive professional experience in one or more of the following languages: Go, Python, Ruby
  • Experience in developing and supporting production systems built on cloud services, using high-availability best practices
  • Hands-on experience developing and maintaining CI/CD pipelines, preferably in git/GitLab
  • Understanding of RESTful and WebSocket-based APIs
  • Bachelor's degree in computer science, related field, or equivalent training and professional experience
  • Excellent teamwork skills, flexibility, and ability to handle multiple tasks
  • Comfortable communicator, able to clearly detail designs and implementations on an individual level and in large group settings

Bonus Points for:

  • Familiarity with a variety of monitoring stacks (InfluxDB, Grafana, SolarWinds, NetFlow, syslog, Wireshark)
  • Familiarity with HashiCorp Consul
  • Familiarity with Datadog
  • Familiarity with Perforce
  • Familiarity with Atlassian products (Opsgenie, Bamboo, Jira, Confluence)
  • Experience working with developers in an agile environment
  • Experience in the games industry, preferably launching multiple online-enabled AAAs
  • Knowledge about Gearbox-owned IPs