Senior Site Reliability Engineer

Remote: Full Remote
Experience: Senior (5-10 years)

Offer summary

Qualifications:

  • 5-10 years of experience in site reliability engineering
  • Strong understanding of SRE principles like SLOs and SLIs
  • Hands-on experience with Azure services
  • Proficiency with CI/CD tools like TeamCity
  • Excellent problem-solving and interpersonal skills

Key responsibilities:

  • Manage SLIs and SLOs for platform reliability
  • Troubleshoot operational issues affecting uptime
  • Modernize and optimize infrastructure with CI/CD
  • Implement enhanced monitoring and observability systems
  • Lead incident reviews and process improvements
EventsAir https://www.eventsair.com/
51 - 200 Employees

Job description

A bit about us

EventsAir is a leading provider of award-winning event management software to organizers of in-person, hybrid and virtual events. Our software is used extensively across the APAC region and is growing rapidly around the globe, with an expanding presence in North America and Europe.

 

We are a technology leader with a 30-year heritage in the event-tech industry, and our platform and services are trusted by more than 1100 professional conference organizers, associations, corporates, governments and universities. Capable of supporting everything from small corporate events to large multi-faceted conferences and exhibitions, the EventsAir platform has been used for some of the world's most demanding events, including the Rugby World Cup, London Olympics and Birmingham Commonwealth Games.

 

Having secured major investment from the private equity firm The Riverside Company, EventsAir is continuing to expand and accelerate its global presence in the event-tech marketplace, and we invite you to be part of this extraordinary journey.


Why Join Us 

We push the limits of what is possible every day. We are a down-to-earth bunch. We have fun and are super dedicated. What motivates us to get out of bed every morning? The challenge to develop and create the world's best event management platform.

 

Join our team of dedicated professionals, and together let's redefine the possibilities of event management and leave a dent in the event-tech industry.


The Senior SRE Opportunity

This Site Reliability Engineering role is an exciting opportunity to apply your Software Engineering mindset and skills in conjunction with DevOps and SRE knowledge. You will tackle operations and infrastructure problems, such as managing and optimizing various Azure resources, and make quality-of-life improvements to the developer experience.


You will help ensure that the EventsAir platform scales efficiently and remains reliable, and you will troubleshoot and fix any issues that arise. You’ll also work closely with other teams to improve software development practices and software release and delivery.


This includes efforts to enhance our infrastructure, from integrating new Azure services to optimizing deployment workflows and implementing CI/CD best practices. Your expertise will play a key role in maintaining and improving our systems. 

 

This role is open to candidates anywhere in Australia, and remote working options are available, as some of our teams are not based in Brisbane.


What will you be doing in this role?

Reliability Engineering

  • Manage SLIs and SLOs to drive platform reliability.
  • Identify, mitigate, and manage risks to service stability.
  • Troubleshoot and resolve operational issues affecting infrastructure and service uptime.


Operational Excellence

  • Scale the EventsAir platform to meet increasing demand.
  • Modernize and optimize infrastructure, applying CI/CD methodologies.
  • Lead incident reviews and root cause analysis, and drive process improvements.


Monitoring & Logging

  • Implement and enhance monitoring, logging, and observability across services.
  • Leverage Azure Monitor, tracing, and dashboards to provide real-time insight into system health.
  • Develop and maintain observability systems for better tracking of performance metrics.


What will make you a great fit for this role?

  • You have a deep understanding of software development and are proficient with .NET.
  • You value clear documentation, consistently writing things down to ensure shared understanding and long-term maintainability. 
  • You're adept at making informed decisions around core principles like dependency, extensibility, and compatibility, and may even have specialized expertise in a particular area of software engineering. 
  • You enjoy automating builds, tests, deployments, infrastructure, or operational tasks. 
  • You embrace a "you build it, you run it" culture, taking pride in the quality and reliability of your work. 
  • You are self-motivated, consistently delivering high-quality work on time, while knowing when to seek help or take on new challenges. 
  • You enjoy collaborating with others, pushing one another to find the best solutions through a balance of passion, pragmatism, and empathy. 
  • You are results-driven, focusing on delivering iterative value to customers and adjusting course based on clear, transparent business insights, while encouraging others to do the same.


What do you need for this role?

  • 5-10 years of experience in site reliability engineering and DevOps. 
  • Strong understanding of SRE principles, including SLOs, SLIs, and error budgeting.  
  • Hands-on experience with Azure services and a strong understanding of cloud architecture.  
  • Extensive experience with monitoring, logging, and alerting tools.  
  • Proficiency with CI/CD tools like TeamCity and a deep understanding of deployment automation.  
  • Excellent problem-solving skills, with a proactive approach to identifying and mitigating issues.  
  • Excellent communication and interpersonal skills, with the ability to work effectively across teams. 


What will you benefit from if you become an Airstar?

  • Fantastic flexibility. We offer hybrid working to all employees regardless of role or level! Need to do the school drop-off or pick-up? No problem. Want to attend that school dance show where your kid’s starring center stage? No worries.
  • We don’t like burnout. We offer 3 extra days of annual leave to encourage downtime over the Christmas period and help bridge the gap between the public holidays, giving you time to spend with family and friends.
  • We love socials! Socials at EventsAir are the bomb! We host quarterly catered socials for all to enjoy on our covered, outdoor back deck, with vouchers for those working remotely.
  • Be a champion. We love to champion and promote causes close to our heart. Think: International Women’s Day, R U OKAY? and many more awesome causes.
  • Grow through mobility! Apply for side steps, up steps and secondments through different teams when role needs arise internally!
  • Be Healthy! Grab an apple from the kitchen, take a walk around the block or grab a stand-up desk for the day, even get the flu shot on the house if you want!
  • Learn and develop. Use your personal $1000 L&D budget towards a course, a conference or something to upskill yourself in your current role.
  • Work from almost anywhere. Want a few extra days in that bustling city where you’re on vacation? Or want to head overseas and be around your family for longer? Use the Work from almost anywhere policy to work from anywhere for 2 weeks a year!
  • Enjoy parenting! Use the extra maternity and paternity leave we offer on top of your statutory allowances to spend important time with your new family. Equally, if something unfortunate occurs, use our miscarriage & stillbirth leave policy to assist in the transitional period you are both going through.
  • Struggling? Reach out to our Employee Assistance Program offered to all employees and lock in a counselling session with an expert for solid advice.
  • Recognize and be rewarded. Take part in our recognition program where you can earn points and vouchers for valuable work across your role and the business.


Don’t just be a star, be an Airstar! Apply now through the link!

 

*Recruiters need not approach us - we got this!*

Required profile

Experience

Level of experience: Senior (5-10 years)
Spoken language(s): English

Other Skills

  • Success Driven
  • Verbal Communication Skills
  • Social Skills
  • Collaboration
  • Problem Solving
