Senior Site Reliability Engineer

Tencent

United States Remote

Full time

Engineering

Mar 6

Responsibilities:

We are seeking a Sr. Site Reliability Engineer with extensive cloud and on prem SRE design and implementation experience.

This senior role will closely work with our internal IT and cloud providers to design the best global SRE architecture and solution in the cloud. This role will also support the studio’s legacy infrastructure and its evolution to the cloud. Our customers include internal or acquired gaming studios, innovative offices/workplaces, various business groups and external customers. The work scope will include understanding the internal customers’ business requirements, collecting the technical requirements, developing reference architecture and prototypes based on leading industry best practice, leading implementation, and deployment for global locations, as well as issue troubleshooting when necessary.

For this SRE job, you will:

  • Design, implement and support operational and reliability aspects of large-scale Cloud-enabled studio with focus on performance at scale, real time monitoring, logging, and alerting
  • Maintain services once they are live by measuring and monitoring availability, latency, and overall system health
  • Scale systems sustainably through mechanisms like automation, and evolve systems by pushing for changes that improve reliability and velocity
  • Practice sustainable incident response and blameless postmortems. Fixing Support Escalation Issues
  • Documenting “Tribal” Knowledge
  • Be part of an on-call rotation to support production systems (in the future)

Based in Palo Alto, CA, this person will work closely with the global IT team, HQ teams. 

Requirements:

  • 7+ years of experience with Infrastructure automation, distributed systems design, experience with design, develop tools for running large scale private or public cloud system in Production
  • In depth knowledge on various cloud technologies. One or two public cloud professional SA certificates
  • Expertise in configuration management with a framework such as Ansible, Terraform, Helm
  • Proficiency with programming languages like Python, Golang, and shell scripting to automate tasks
  • Passion for infrastructure and monitoring as code
  • Can effectively collect, synthesize customer needs and challenges, design/lead the establishment of global IT foundation for all our game studios.
  • Bachelor’s degree (or higher), Computer Science, Mathematics, or related science or engineering major
  • Bilingual preferred (English, Chinese)

The base pay range for this position in California is $114,000 to $228,800 per year.

Actual pay is based on market location and may vary depending on job-related knowledge, skills, and experience. A sign on payment, relocation package, and restricted stock units may be provided as part of the compensation package, as well as other medical, financial, and/or other benefits, dependent on the specific position offered

Apply for this position Back to job

You must be logged in to to apply to this job.

Apply

Your application has been successfully submitted.

Please fix the errors below and resubmit.

Something went wrong. Please try again later or contact us.

Personal Information

Profile

View resume

Details

{{notification.msg}}