Match score not available

Site Reliability Engineer

Remote: 
Full Remote
Contract: 
Salary: 
161 - 161K yearly
Work from: 
New York (USA), United States

Offer summary

Qualifications:

Experience with Terraform, Kubernetes, GCP, Strong understanding of SRE principles.

Key responsabilities:

  • Maintaining engineering systems
  • Enhancing service resilience and scalability
  • Optimizing monitoring tools for teams
  • Developing disaster recovery program
  • Expanding infrastructure globally
float
 logo
float Computer Software / SaaS Startup https://www.float.com/
51 - 200 Employees
See more float offers

Job description

Who We Are

Float is the world’s leading software for teams to plan their time. Launched in 2012, we’ve grown every year since, and remain proudly independent, self-funded and profitable. As a certified B Corporation, we’re committed to making a positive contribution to our team, customers, the environment, and the remote community. We’re a team of 50 working 100% remotely who believe in living our Best Work Life. You’ll. partner with team members globally, including Australia, Mexico, Italy, Nigeria, Canada, and the USA. Hear what our team has to say by browsing our blog, or reading our Glassdoor reviews. Check out what our customers think of Float from our G2 reviews.

We’re on a scale up journey, and we’re seeking people who thrive in this stage, given the autonomy, and the opportunity, to do the best work of their career.

Why We’re Hiring For This Role

The role of Site Reliability Engineers at Float is to increase the autonomy of the product and engineering teams by growing their capabilities to focus on solving problems. SRE makes sure our engineers get scalable infrastructure to build software on top of, making sure pipelines from idea to customer run smoothly and are easily built upon, and we also deal with broad areas of security around our network and defining internal security policy and practices.

Our goals for the Engineering team are to increase the pace with which they deliver improvements for our customers, provide an increasingly sophisticated and reliable service from our teams, and mitigate external threats as we grow.

You will help us tackle those problems by increasing reliability of our services to support larger clients joining Float, and increasing the robust security systems we’ve implemented to continue protecting our growing customer base.

Chris Nash, our Team Lead (SRE & QA), explains the important role you will play within our SRE team. Watch this video.

You’ll be working asynchronously with a bright, dedicated team from across the globe, with a strong focus on taking complex problems and creating solutions that feel simple and intuitive for our customers.

What You’ll Be Responsible For

Early on, you’ll jump right into:

  • Continuing to support the regular maintenance of all the engineering systems supporting Float’s customers
  • Identifying areas requiring support to scale
  • Identifying areas for improving service resilience, ultimately delivering the ability to be resilient within the product and engineering teams themselves
  • Optimizing our monitoring and observability stack, building on the knowledge to create a standard set of tools and configurations for the product and engineering teams
  • Understanding Float’s SLOs in context, and building out SLO patterns and procedures for product and engineering teams

Once you are settled, we expect that you will jump into the following projects:

  • Building a repeatable and trustworthy disaster recovery program using chaos engineering techniques
  • Migrating all of our deployment configurations to a global single source of truth
  • Expanding Float’s infrastructure across multiple regions to create a global network

What You’ll Need To Be Successful

We want you to love your work and believe that these skills will allow you to succeed in the role.

Applying these skills requires:

  • An excellent understanding of how SRE operates as an enabling team
  • A very good understanding of Service Level Objectives
  • Working experience with Terraform, Bash, and a go-to language which ideally would be one of PHP, NodeJS, Python
  • Experience with Kubernetes and GCP would be highly valued

As a fully remote team, we’re looking for someone comfortable with asynchronous communication as the default, which means you have previous remote experience and are comfortable using tools like Slack, Loom, and Linear to communicate as needed. Don’t worry—you will have significant deep work time since we have very few meetings.

Why Join Us

Pay for this role is US $161,475 (Level 3). Here’s a blog post with more information on how we determine our salaries.

We’re a global async remote company with a diverse team of people from all over the world who share a common belief in living our best work life. We believe deeply in the idea of transparency and share our Float Handbook publicly so potential new team members can see first hand our perks & benefits as well as our ways of working. If you feel like you can thrive at Float to do your best work, we would love to hear from you.

Hiring Process For This Role

You’ll find a lot of useful information about our interview process and what it’s like to join our global team on the Float careers page. The hiring process for this role looks like this:

  • Initial First Meet (20 min): You'll meet with Julia Fulton, Talent Manager, to discuss your interest in the role and review your questions about working at Float.
  • Take-Home Assignment: Candidates that move forward will be invited to complete a technical assignment for the engineering team to review. This is a 4-hour assignment. Candidates will receive high-level feedback from the hiring team and those that move forward will proceed to the technical interview stage to discuss results further in more detail.
  • Manager Interview (45 min): You’ll meet with Chris Nash (Team Lead, SRE & QA) and Colin Ross (Director of Engineering) to discuss more about your technical experience. This will be a great opportunity for you to ask any questions and talk about goals for the role.
  • Co-Worker Interview (30 min): You’ll meet with Roberto Menezes (Senior Staff Engineer) and Bogdan Frunza (SRE), to dive deeper into your skills and experience.
  • Founder Interview (30 min): You’ll meet with Glenn, Float’s CEO, to get to know you and see if you have potential to be a great addition to the team.

Note: Industry research shows that women and those in traditionally underrepresented groups generally don’t apply to jobs unless they check all the boxes for the role. If you feel strongly that you have what it takes for this role but don’t check 100% of the boxes—that’s okay—we encourage you to apply anyway and highlight what you can bring to the table.

Required profile

Experience

Industry :
Computer Software / SaaS
Spoken language(s):
English
Check out the description to know which languages are mandatory.

Other Skills

  • Motivational Skills
  • Strong Work Ethic

Site Reliability Engineer (SRE) Related jobs