Site Reliability Engineer, Service Tech

Job Description:

At Blizzard Entertainment, our Site Reliability Engineers (SREs) use systems expertise combined with software engineering patterns to help define, create, and support the architecture, build systems, orchestration, and operations of services across the business. The team is made up of talented engineers who focus on evangelizing reliability-as-a-feature through monitoring, service-level objectives, automation, everything-as-code, and testing.

Blizzard's games and platforms reach a global audience of passionate gamers. The scale is massive and the challenges are very real, but the wise application of technology is what keeps it all running reliably with minimal oversight. Our Site Reliability Engineers are at the heart of this work, partnering directly with the engineering teams from idea to launch to deliver the most epic (and reliable!) experiences... ever.

As an SRE in Service Tech, you'll...

  • Work with a handful of teams who own services used by our game teams to minimize abuse in our games, as well as others used by our global customer service and localization teams
  • Help your teams improve the monitoring and alerting around their infrastructure and services
  • Work with other SREs to provision and operate Kubernetes clusters for your teams in our cloud regions
  • Work with your teams to deploy modern, resilient services on top of those clusters
  • Improve build/deploy pipelines to make deployments faster and safer
  • Use chaos engineering tools to inject failures into our systems and make sure they degrade gracefully

As an SRE at Blizzard, you may find yourself...

  • Being part of an on-call rotation to help resolve incidents
  • Hosting blameless postmortems to share learnings, discover gaps, embrace transparency, and improve reliability across our services
  • Building positive and collaborative relationships across the company
  • Employing your systems knowledge to triage problems and tune resource usage
  • Championing automation to reduce toil and increase development velocity
  • Helping define and instrument Service-Level Objectives to ensure epic player experiences
  • Leveraging Configuration Management to build and maintain consistency across services
  • Building Terraform configs to manage infrastructure in public and private clouds
  • Supporting and improving build pipelines with Jenkins, Argo, and/or Spinnaker
  • Adopting Containers and Kubernetes for new and existing services
  • Applying everything-as-code methodologies across configuration, infrastructure, orchestration, and elsewhere

You may succeed in this role if you...

  • Love to solve novel and exciting problems
  • Dislike solving the same problems over and over, so you automate or eliminate them
  • Are inspired to make everyone's job easier by improving workflows
  • Are comfortable digging through metrics, logs, and whatever else is available to triage and fix an incident at any time
  • Strive to be better, smarter, and faster tomorrow than you are today
  • Enjoy trying new technologies to improve what we're doing today
  • Naturally spread the philosophies and practices of the DevOps mindset to others
  • Like to collaborate with others to solve problems, share knowledge, and provide feedback
  • Can self-assess the needs of a system or team, and make a case to prioritize that work
  • Relish working with software, network, cloud, and systems engineers to solve problems across all tiers of the stack
  • Help your peers succeed as much as you can

Types of projects you may work on...

  • Managing services and infrastructure supporting Blizzard's real-time business analytics systems
  • Defining the future of running services with Kubernetes
  • Designing and facilitating service migrations to modern patterns with minimal customer impact
  • Supporting our massive global data platforms across multiple clouds
  • Performing and improving service migrations from one cloud/data center to another
  • Integrating monitoring and logging with systems to improve observability and enable Service-Level Objectives
  • Designing and executing stress tests to validate scale expectations vs reality

Areas of Expertise for an SRE at Blizzard

SREs at Blizzard are expected to become experts in the technologies used by the teams they are working with. Below is a non-exhaustive list of technologies SREs may be exposed to:

  • Service-Level Objectives (SLI, SLO, SLA, Error Budget, Burn Rate)
  • Distributed Systems (architectures, microservices, high availability, elections)
  • Real-time analytics (Data streaming, ETL jobs, Airflow)
  • Configuration Management (Puppet, Hiera, Terraform, Terragrunt, Ansible)
  • Container Computing (Docker, Kubernetes, Service Mesh)
  • Cloud Services and Architecture (AWS, GCP, OpenStack)
  • Distributed Message Bus (RabbitMQ, Kafka)
  • Proxies and Load Balancing (Nginx, HAProxy, Envoy)
  • Monitoring (Prometheus, Kibana, Grafana, Elasticsearch, Datadog, New Relic)
  • Logging (Splunk, SysLog, ELK Stack, Linux Journal, grep)
  • Source Control (GitHub Enterprise, Perforce)
  • CI/CD (Jenkins, ArgoCD, Spinnaker)
  • Linux (bash, debugging, tuning)
  • Networking (triaging, packet loss, routing)
  • Programming (Java, Scala, Python, and Go)

Expectations of an SRE at Blizzard:

  • Familiarity with all areas of expertise, general knowledge of 5 areas, and deep knowledge of 2 areas
  • General knowledge of all areas of their partner team's systems
  • Capable of sharing ideas and technology with their peers in a clear and effective way
  • Builds strong relationships with their immediate team and peers
  • Considers others’ interests as well as their own
  • Creates new technical documentation on their own
  • Demonstrates deep understanding of the services they support and their goals
  • Expands knowledge of SRE best practices and anti-patterns

About King

King is a leading interactive entertainment company for the mobile world, with people all around the world playing one or more of our games, including franchises such as Candy Crush, Farm Heroes, Pet Rescue and Bubble Witch.
We have 249 million monthly active users as of fourth quarter 2019 across web, social and mobile platforms. King was acquired by Activision Blizzard Inc. (Nasdaq: ATVI) in February 2016 and operates as an independent unit of the company.

A Great Saga Needs All Sorts of Heroes

Making games is fun. Especially when you do it with people who share the same idea of what makes a good workplace great. We design games for everyone, no matter where they are or who they are, and we employ all sorts of people from all kinds of backgrounds to bring them to life. Truth is, we simply cannot expect diversity in our players and originality in our games without first nurturing it in our people. A great saga needs all sorts of heroes.

Making the World Playful

Making the World Playful is what inspires us to create new experiences and raise the bar. It’s what makes King a place where we can all dream bigger, continue to add innovation to our games, broaden the portfolio, and explore new territories in mid-core and casual. We take the art and science of gaming to the next level through our curiosity for the unexplored, passion for games, respect for each other and love for our players – and we’re not afraid to have fun along the way. In fact, together with our parent company Activision Blizzard and experts around the world, we believe having fun is good for you. There has never been a better time to join us. We're dreaming bigger and see a world of possibilities ahead. If you share our passion, our values, and our hunger to shape the future, join us in Making the World Playful!

Applications need to be in English.
Discover King at careers.king.com