Electronic Arts creates next-level entertainment experiences that inspire players and fans around the world. Here, everyone is part of the story. Part of a community that connects across the globe. A team where creativity thrives, new perspectives are invited, and ideas matter. Regardless of your role, team, or location, this is a place where everyone makes play happen. Join us.
***Leveling Up!***
We're leveling up our recruiting system to bring you an even better candidate experience! Job listings and applications will be temporarily unavailable while we make this exciting change. Our new career portal will go live on November 12 at 1 a.m. PST and will offer an easier application process, job alerts, and recommendations.
We exist to inspire the world to play, and we’re looking for the right people to make that happen. As we bring new forms of entertainment to people worldwide through our games, experiences, and new play methods, we need innovative, collaborative, diverse, and adaptable people. We are committed to fostering a diverse and inclusive workplace, and we believe that your unique perspective and experiences will play a crucial role in improving Electronic Arts.
As a Site Reliability Engineer II at EA, you'll have the unique opportunity to work on the backend infrastructure, including cloud web services, that powers the creation of exciting new features for our current and upcoming mobile titles. Your work will directly contribute to our mission of inspiring the world to play by ensuring our games are always available and enjoyable for our players. Our studio is dedicated to building games for Imaginative Creators, allowing players to express their unique personalities through gaming experiences.
As a Site Reliability Engineer II, you'll be a key player in our team, reporting to the Director of DevOps and SRE. Your role will involve close collaboration with the game team’s backend engineers, from prototyping to live operations. You'll work with AWS ECS, EKS, Kubernetes, Helm, and Terraform to configure and deploy services to ECS and EKS clusters running on AWS.
Responsibilities
Develop responsive, resilient, massively scalable, and globally available web services that support millions of players.
Design and implement robust systems that can handle high traffic and ensure a seamless gaming experience for our users.
Creatively blend security best practices and original techniques to secure user data and prevent cheating.
Apply and improve service deployment and troubleshooting strategies that maximize uptime.
Drive the design and implementation of infrastructure configuration and deployment strategies.
Demonstrate excellent problem-solving skills under iteratively changing requirements.
Work with the central tech and game teams to author and review the cloud infrastructure configuration to support game features.
Set up live monitoring and alerting.
Support a great player experience by participating in live service support, incident troubleshooting, and resolution, possibly during non-core hours.
Qualifications
Please note that you do not need to satisfy all requirements to be considered. We encourage you to apply if you can meet most of the requirements and are comfortable having an open conversation about your qualifications.
You are a quick learner, a self-motivated person, detail-oriented, and a team player.
You have a Bachelor's degree in Computer Science, Computer Engineering, or a related field.
You have 4+ years of job experience in a hands-on coding, DevOps, or infrastructure configuration role.
You have experience with the following Cloud platforms: Amazon Web Services (AWS), Google Cloud Platform (GCP), and Microsoft Azure.
You have shipped and supported scalable web services hosted in the cloud using AWS ECS, EKS, Kubernetes, and containerization.
You have fundamental experience managing relational and NoSQL database systems (MySQL, Postgres, Redis, and Couchbase).
You have experience with the Grafana, New Relic, and Datadog observability platforms.
You have owned large systems and features from design to deployment, including live service support.
You develop infrastructure for products that release new updates with zero downtime.
You have experience with load testing, troubleshooting, and optimizing the performance of web services.
You have experience with CI/CD pipeline technology like GitLab CI, Jenkins, and Airflow.
You have knowledge of CDNs like Akamai and AWS CloudFront.
You have experience with source code control systems like SVN and Git.
You have strong hands-on Linux experience.
You have strong knowledge of Bash/shell scripting and Python.
You are willing to work with remote teams in different time zones or outside regular work hours.
You can respond to work emergencies during non-work hours, including weekends and holidays. These are typically emergencies and maintenance malfunctions.
Bonus
You have experience with the following: Jira, Confluence, IaC with Terraform or Pulumi, mobile game server applications, Java, PHP, JavaScript, TypeScript, Node.js, distributed streaming technologies (e.g. Kafka).
Required profile
Experience
Level of experience: Mid-level (2-5 years)
Spoken language(s):
English