Match score not available

Site Reliability Engineer

Remote: 
Full Remote
Contract: 
Experience: 
Senior (5-10 years)
Work from: 

Offer summary

Qualifications:

Bachelor’s degree in Computer Science or related field, 5+ years experience with AWS or cloud platforms, AWS certification such as Solutions Architect preferred, Knowledge of Gitlab and Jenkins is a plus, Proven experience managing Kubernetes clusters.

Key responsabilities:

  • Architect and manage AWS cloud environments for resilience and cost-efficiency
  • Lead design, deployment, and optimization of Kubernetes clusters using AWS EKS
  • Collaborate to enhance CI/CD pipelines and containerization processes
  • Implement monitoring systems to ensure performance and reliability
  • Manage incidents and optimize cloud infrastructure based on forecasts
Healthmap Solutions logo
Healthmap Solutions SME https://www.healthmapsolutions.com/
201 - 500 Employees
See more Healthmap Solutions offers

Job description

Description
Position at Healthmap Solutions

Company Background

Healthmap Solutions is the future of specialty health management that focuses on progressive diseases, with a particular expertise in kidney health populations. Healthmap Solutions uses clinical big data resources and high-powered analytics to power complex specialty health management programs. Healthmap Solutions is a diverse, growing company committed to our clients and our employees. We are champions for better health, for those who need us most.

Position Summary:
The Site Reliability Engineer (SRE), will be a key player in building and scaling our cloud infrastructure.
 
Responsibilities: 
  • Architect and manage Amazon Web Services (AWS) cloud environments, including EC2, VPC, S3, and other key resources, ensuring resilience, scalability, and cost-efficiency
  • Lead the design, deployment, and optimization of Kubernetes clusters using AWS EKS, leveraging container orchestration to support the scalability of our applications
  • Collaborate closely with our software engineers to streamline and enhance our CI/CD pipelines, infrastructure as code (IaC) practices, and containerization processes
  • Implement and maintain monitoring and alerting systems (Datadog or similar) to ensure performance, reliability, and early detection of potential issues
  • Manage and oversee high-impact incidents, swiftly troubleshooting and collaborating with cross-functional teams to restore services and ensure operational continuity
  • Strategically plan capacity requirements by analyzing, forecasting, and optimizing cloud infrastructure for future growth while maintaining cost-effectiveness
  • Develop and maintain automation tools that minimize manual tasks and elevate operational efficiency
  • Ensure our cloud infrastructure adheres to best practices in security and compliance, safeguarding our platform and services
  • Perform other duties as assigned
Requirements:
  • Bachelor’s degree in Computer Science or a related field, or equivalent practical experience
  • 5+ years of experience with AWS or other cloud platforms and cloud security
  • AWS certification such as Solutions Architect with in-depth knowledge of AWS services like EC2, VPC, Lambda, RDS is a plus
  • Knowledge of Gitlab and Jenkins is a plus
  • Proven experience managing Kubernetes clusters in production, especially with AWS EKS
Skills:
  • Excellent communication skills
  • Strong analytical and problem-solving skills with a drive for continuous improvement
  • Able to work with cutting-edge technologies with desire for continuous learning
  • Strong monitoring capabilities to drive growth and performance 
  • Contribute to a supportive, cross-functional work environment
Travel:
 
Limited Travel, Scheduled per needs of the business


#LI-REMOTE

Americans with Disability Specifications

The physical demands described here are representative of those that must be met by an employee to successfully perform the essential functions of this job. Reasonable accommodations may be made to enable individuals with disabilities to perform the essential functions.

As an Equal Opportunity Employer, we will not discriminate against any job candidate or employee due to age, race, religion, ethnicity, national origin, gender, gender identity/expression, sexual orientation, disability, familial status, veteran status, marital status, parental status, or pregnancy. In our innovative and inclusive workplace, we prohibit discrimination and harassment of any kind.


Required profile

Experience

Level of experience: Senior (5-10 years)
Spoken language(s):
English
Check out the description to know which languages are mandatory.

Other Skills

  • Motivational Skills
  • Verbal Communication Skills
  • Collaboration
  • Analytical Skills

Site Reliability Engineer Related jobs