Site Reliability Engineer (SRE)

extra holidays
Remote: Full Remote
Experience: Mid-level (2-5 years)

Offer summary

Qualifications:

  • Familiarity with SRE best practices and observability
  • Experience with containerization, Kubernetes, and Infrastructure as Code
  • Strong background with Google Cloud Platform and distributed systems
  • Passion for blockchains and decentralized technology
  • Excellent communication skills and a collaborative attitude

Key responsibilities:

  • Operate and scale backend infrastructure for the platform
  • Improve observability, monitoring, and alerting systems
  • Develop internal tools and participate in the on-call rotation
  • Document processes and establish CI/CD pipelines
  • Ensure reliability, scalability, and maintainability of systems

Job description

About Us

Do you dream of a future where finance is open, accessible, and user-driven? Are you passionate about Decentralized Exchanges (DEXs) and the potential of DeFi to revolutionize financial markets? If so, then we want to hear from you!

Osmosis is the leading interchain DEX built on the Cosmos ecosystem. We're on a mission to build the future of DeFi, and we're searching for a talented and visionary Site Reliability Engineer (SRE) to join our growing team.

About the Role:

We are looking for a Site Reliability Engineer with a passion for blockchain technology to be responsible for operating and scaling the infrastructure and services that power our platform. You will work closely with the chain development team to ensure that our systems are reliable, scalable, and maintainable.

What you could work on:

  • Operating and optimizing backend infrastructure and services, including Osmosis nodes, testnets, and data services
  • Improving the current observability, monitoring, and alerting systems to provide better insight into system behavior and performance
  • Sharing observability best practices across the organization
  • Developing internal tools that integrate with existing infrastructure (e.g., controllers, node health checks, custom CLIs)
  • Participating actively in the on-call rotation and helping to identify and resolve production issues quickly through effective debugging and troubleshooting
  • Documenting processes, procedures, post-incident reports, and best practices for running services in production, ensuring consistency and quality across the team
  • Establishing and maintaining robust CI/CD pipelines that automate deployment, enabling faster and more reliable releases of new features and updates

You may be a fit for this role if you:

  • Are familiar with SRE best practices and have a passion for observability
  • Have strong experience with containerization and orchestration technologies (Docker, Kubernetes)
  • Have experience in running and operating production workloads
  • Have a strong background in Infrastructure as Code (preferably Terraform)
  • Have experience with Google Cloud Platform
  • Have experience with Cloudflare and Cloudflare Workers
  • Have a strong understanding of distributed systems and how they can be operated at scale
  • Are passionate about blockchains and decentralized technology
  • Have great communication skills and the ability to collaborate with others
  • Have a demonstrated ability to take ownership

Experience that will set you apart:

  • Previous experience working on high-scale or highly critical systems
  • Previous experience working on proof-of-stake (PoS) blockchain projects in production
  • Experience operating Cosmos nodes and relayers
  • Contributions to open-source projects
  • Experience working in remote and globally distributed teams

What We Offer

  • The opportunity to be part of building the future of DeFi
  • Flexible work schedule
  • Travel stipend for in-person conferences and meetups
  • Flexible PTO
  • Competitive compensation packages
  • Work at the forefront of the blockchain sector and within the Cosmos ecosystem

Legal Stuff

Osmosis provides equal employment opportunities to all employees and applicants for employment and prohibits discrimination and harassment of any type without regard to race, color, religion, age, sex, national origin, disability status, genetics, protected veteran status, sexual orientation, gender identity or expression, or any other characteristic protected by federal, state or local laws. This policy applies to all terms and conditions of employment, including recruiting, hiring, placement, promotion, termination, layoff, recall, transfer, leaves of absence, compensation and training.

Required profile

Experience

Level of experience: Mid-level (2-5 years)
Spoken language(s): English

Other Skills

  • Verbal Communication Skills
