Match score not available

Senior Site Reliability Engineer-AWS

Remote: 
Full Remote
Experience: 
Senior (5-10 years)
Work from: 

Tech Mahindra logo
Tech Mahindra XLarge http://www.techmahindra.com
10001 Employees
See all jobs

Job description

JOB DESCRIPTION

Somos a Tech Mahindra, empresa do Grupo Mahindra, uma multinacional Indiana e está presente no Brasil e em +90 países. Somos mais de 120.000 profissionais que nos ajudam a conectar experiências.

A Tech Mahindra representa o mundo conectado, oferecendo experiências de tecnologia da informação inovadoras e centradas no cliente, permitindo que empresas, colaboradores e a sociedade cresçam.

Nós realmente acreditamos que a tecnologia torna isso possível, mas são as pessoas que fazem isso acontecer. Diversidade Cultural, de Gênero e de Habilidades se alinham nos nossos pilares do Rise e nos permite "Diversidade de Pensamentos", que capacita nossos stakeholders a crescer.


AWS -Site Reliabilty Engineer (Senior)


Roles and Responsibilities

As a Senior Site Reliability Engineer, you'll be responsible for building and supporting Cloud infrastructure automation solutions that support OFSE Digital Cloud strategy. You will also be developing improving, deploying, and support Cloud services.

As a Senior Site Reliability Engineer, you will be responsible for:

 

  • Demonstrating best practices pertaining to Cloud DevOps development along with a willingness to continually learn Cloud native technologies.
  • Following security guidelines to develop secure and compliant Cloud services by working with Risk and Security teams.
  • Monitoring configuration management, platform layout, and hosting infrastructure. 
  • Automating deployment of applications and infrastructure 
  • Be able to work independently and in a team, environment managing a range of customers and technical situations. 
  • Providing technical application support for enterprise-level systems  
  • Running our infrastructure with Chef, Ansible, Terraform, GitHub CI/CD, and Kubernetes 
  • Participating in Capacity planning, system performance monitoring, resource utilization trending and incident and change management.  
  • Co-ordinating with Cloud infrastructure partners for Server, Network, Database, service-related incidents, and projects 
  • Deploying application upgrades/patches in production and test environments  
  • Troubleshooting application alerts, Azure and AWS Policy from monitoring tools and code inspection and performing RCAs  
  • Writing tutorials, how-to videos, and other technical articles for the customer community and knowledgebase articles and keep them up to date  
  • Working on critical, complex customer problems that may span multiple services   
  • Participating in 24x7 on-call rotation and working with global teams   
  • Collaborating with cross functional stakeholders   
  • Providing mentorship and guidance to team members 


 

To be successful in this role you will:

 

Skills and Experience .

  • Have bachelor's degree in computer science or “STEM” Majors (Science, Technology, Engineering and Math) 
  • Have 7+ years of experience in application support in cloud preferably AWS. Have prior experience in setting up, running and configuring AWS applications.
  • Have experience in infrastructure optimization in AWS
  • Be an expert in performance monitoring and capacity management of enterprise systems using various tools.
  • Have 2+ years of Hands-on experience with Public Cloud-based applications, technologies and tools, deployment, monitoring, and operations, such as Docker, Kubernetes, Prometheus, Grafana, Kibana, etc. 
  • Have experience in RDBMS and NoSQL database technologies 
  • Have experience in Change management and Incident management process .

English Communication - Advanced



RESPONSIBILITIES AND ASSIGNMENTS

AWS -Site Reliabilty Engineer (Senior)


Roles and Responsibilities

As a Senior Site Reliability Engineer, you'll be responsible for building and supporting Cloud infrastructure automation solutions that support OFSE Digital Cloud strategy. You will also be developing improving, deploying, and support Cloud services.

As a Senior Site Reliability Engineer, you will be responsible for:

 

  • Demonstrating best practices pertaining to Cloud DevOps development along with a willingness to continually learn Cloud native technologies.
  • Following security guidelines to develop secure and compliant Cloud services by working with Risk and Security teams.
  • Monitoring configuration management, platform layout, and hosting infrastructure. 
  • Automating deployment of applications and infrastructure 
  • Be able to work independently and in a team, environment managing a range of customers and technical situations. 
  • Providing technical application support for enterprise-level systems  
  • Running our infrastructure with Chef, Ansible, Terraform, GitHub CI/CD, and Kubernetes 
  • Participating in Capacity planning, system performance monitoring, resource utilization trending and incident and change management.  
  • Co-ordinating with Cloud infrastructure partners for Server, Network, Database, service-related incidents, and projects 
  • Deploying application upgrades/patches in production and test environments  
  • Troubleshooting application alerts, Azure and AWS Policy from monitoring tools and code inspection and performing RCAs  
  • Writing tutorials, how-to videos, and other technical articles for the customer community and knowledgebase articles and keep them up to date  
  • Working on critical, complex customer problems that may span multiple services   
  • Participating in 24x7 on-call rotation and working with global teams   
  • Collaborating with cross functional stakeholders   
  • Providing mentorship and guidance to team members 


 

To be successful in this role you will:

 

Skills and Experience .

  • Have bachelor's degree in computer science or “STEM” Majors (Science, Technology, Engineering and Math) 
  • Have 7+ years of experience in application support in cloud preferably AWS. Have prior experience in setting up, running and configuring AWS applications.
  • Have experience in infrastructure optimization in AWS
  • Be an expert in performance monitoring and capacity management of enterprise systems using various tools.
  • Have 2+ years of Hands-on experience with Public Cloud-based applications, technologies and tools, deployment, monitoring, and operations, such as Docker, Kubernetes, Prometheus, Grafana, Kibana, etc. 
  • Have experience in RDBMS and NoSQL database technologies 
  • Have experience in Change management and Incident management process .

English Communication - Advanced



Quem Somos ?

Somos parte do Grupo da Mahindra, empresa no valor de 21 bilhões de dólares, que emprega mais de 240.000 pessoas em mais de 100 países. O Grupo opera nas principais indústrias que impulsionam o crescimento econômico mundial, desfrutando de uma posição de liderança em tratores, veículos utilitários, after-market, tecnologia da informação e resortes de férias.

Nossas plataformas de inovação e recursos reutilizáveis conectam-se através de uma série de tecnologias para entregar um valor tangível para os nossos clientes. 

A Tech Mahindra representa o mundo conectado, oferecendo serviços e soluções de tecnologia da informação, inovadoras e personalizadas de acordo com a necessidade de cada cliente, permitindo que empresas, parceiros e a sociedade Rise™, trabalhem juntos.


Required profile

Experience

Level of experience: Senior (5-10 years)
Spoken language(s):
Portuguese
Check out the description to know which languages are mandatory.

Other Skills

  • Teamwork
  • Communication
  • Problem Solving

Site Reliability Engineer (SRE) Related jobs