
Sr Reliability Engineer

Remote: Full Remote
Contract:
Experience: Senior (5-10 years)
Work from:

Offer summary

Qualifications:

Bachelor's degree in Computer Science or equivalent; 5+ years of experience as a Site Reliability Engineer or in a similar role; experience with cloud platforms (AWS, GCP, Azure); proficiency in scripting languages (Python, Bash, Go); strong problem-solving and analytical skills.

Key responsibilities:

  • Build production applications ensuring high reliability metrics
  • Operate observability tools across cloud providers
  • Implement automated solutions for system reliability and performance
  • Develop monitoring systems to identify issues proactively
  • Collaborate with teams to define SLO/SLI dashboards
Avalara
1001 - 5000 Employees

Job description

What you will do

As a member of our Reliability Engineering Product SRE team, you will build production applications that meet the highest level of MVRs and SMMs, and you will ensure customer satisfaction through your SRE expertise. We are seeking someone who is interested in automation and efficiency. You will use a bundled tech stack to show how the customer, product, and infrastructure interact and behave. You will have a keen eye for customer satisfaction as measured by SLOs, SLIs, and SLAs, and you will be expected to know the golden metrics that drive it. You will approach MVRs programmatically, using both programming and scripting languages.

  • Build products with MVRs and reliability standards.
  • Set up and operate observability tools across multiple cloud providers.
  • Create reusable observability components to assist with onboarding to observability tools.
  • Assist development teams in defining SLO/SLI dashboards and alerts.
  • Use Go, Python, or Terraform to automate away manual work.
  • Manage and administer observability tools such as Grafana, Prometheus, and Loki across multiple cloud providers.
  • Onboard feature development teams to platforms.
  • Troubleshoot and support the production environments.
  • Design, build, and implement automated solutions to ensure the reliability, scalability, and performance of Avalara's critical systems.
  • Develop and maintain monitoring and alerting systems to proactively identify and troubleshoot potential issues.
  • Participate in incident response and root cause analysis to resolve production incidents efficiently.
  • Collaborate with development teams throughout the software development lifecycle to integrate SRE best practices.
  • Continuously improve the efficiency and effectiveness of SRE processes and tools.
  • Stay up-to-date with the latest SRE trends and technologies.
  • Participate in on-call rotations to provide 24/7 support for production systems.


What you need to be successful

  • Bachelor's degree in Computer Science or equivalent.
  • 5+ years of experience as a Site Reliability Engineer or in a related role.
  • Proven experience designing, building, and implementing highly reliable and scalable distributed systems.
  • Deep understanding of cloud platforms (AWS, GCP, Azure) and infrastructure as code (Terraform, Ansible, Pulumi).
  • Proficiency in scripting languages (Python, Bash, Go) for automation and tool development.
  • Strong problem-solving and analytical skills with a data-driven approach.
  • Excellent communication and collaboration skills to work effectively with cross-functional teams.
  • Passion for automation and continuous improvement.
  • Ability to participate in an on-call rotation.
  • Networking: A good understanding of the OSI model, TCP/IP, and DNS, particularly as they relate to cloud environments.
  • Linux Fundamentals: Solid experience with the administration, security hardening, and performance tuning of one or more distributions of Linux.
  • Observability: Experience with developing service level indicators and objectives, instrumenting software, and building alerts.
  • Containers: A solid understanding of the underpinnings of container technology such as cgroups and namespaces.
  • Container Orchestration Systems: Experience with the operations, administration, and development of orchestration systems such as Kubernetes, ECS, Mesos, and Nomad.
  • Technical Writing: Most of the services we develop are greenfield, and you will need to build documentation and diagrams for other engineering teams.
  • Customer Satisfaction: Keen eye for customer satisfaction (our customers are other engineering teams and Avalara customers).
  • Passion for Learning: Interest in the broader technology space with a constant desire to expand your understanding.
  • Adaptability: Experience working on a variety of projects. In short, we want people with T-shaped skills.
  • Tools & Technologies we are looking at as part of the skillset: Terraform, Grafana, Prometheus, Loki, Alertmanager, Pushgateway, Prometheus exporters & client libraries, PromQL, LogQL, Fluentd, Fluent Bit, Sumo Logic, Splunk, Tempo, Jaeger, OpenTelemetry, Cortex, etc.
  • Other Common Tools & Technologies expected: AWS, GCP, Oracle Cloud, Terraform, GitLab, Artifactory, Atlassian suite, Git, Kubernetes, Go, C#, Python, Bash, PowerShell, Docker, Windows, Linux, etc.


About the team

You will join a highly skilled and passionate team of SREs dedicated to building and maintaining the infrastructure that powers Avalara's global tax compliance platform. We work in a collaborative and fast-paced environment where innovation and ownership are encouraged. We are committed to continuous learning and professional development, and we offer opportunities to grow your skills and make a significant impact on the company's success.

How we will take care of you

Total Rewards

In addition to a great compensation package, paid time off, and paid parental leave, many Avalara employees are eligible for bonuses.

Health & Wellness

Benefits vary by location but generally include private medical, life, and disability insurance.

Inclusive culture and diversity

Avalara strongly supports diversity, equity, and inclusion, and is committed to integrating them into our business practices and our organizational culture. We also have a total of 8 employee-run resource groups, each with senior leadership and exec sponsorship.

Flexible hybrid working

We support hybrid work and flexible schedules for our employees.

Learn more about our benefits by region here: https://careers.avalara.com/

About Avalara

We’re Avalara. We’re defining the relationship between tax and tech.

We’ve already built an industry-leading cloud compliance platform, processing nearly 40 billion customer API calls and over 5 million tax returns a year.

Last year, we became a billion-dollar business, and our tribe expanded by a cool thousand people - there’s nearly 5,000 of us now. Our growth is real, and we’re not slowing down - not until we’ve achieved our mission - to be part of every transaction in the world.

We’re bright, innovative and disruptive, like the orange we love to wear. It captures our quirky spirit and optimistic mindset. It shows off the culture we’ve designed, that empowers our people to win. Ownership and achievement go hand in hand here. We instill passion in our people through the trust we place in them.

We’ve been different from day one. Join us, and your career will be too.

EEO Statement

We’re an Equal Opportunity Employer. Supporting diversity and inclusion is a cornerstone of our company: we don’t want people to fit into our culture, but to enrich it. All qualified candidates will receive consideration for employment without regard to race, color, creed, religion, age, gender, national origin, disability, sexual orientation, US Veteran status, or any other factor protected by law. If you require any reasonable adjustments during the recruitment process, please let us know.

Required profile

Experience

Level of experience: Senior (5-10 years)
Spoken language(s): English

Other Skills

  • Problem Solving
  • Willingness To Learn
  • Verbal Communication Skills
  • Analytical Skills
  • Adaptability
  • Organizational Skills
