Match score not available

Site Reliability Engineer

Remote: 
Full Remote
Salary: 
4 - 50K yearly
Experience: 
Senior (5-10 years)
Work from: 

Offer summary

Qualifications:

8+ years of experience in SRE., Expert in Kubernetes and cloud platforms., Proficient in Infrastructure as Code tools., Certifications such as CKA or RHCE..

Key responsabilities:

  • Architect, deploy, and manage Kubernetes clusters.
  • Drive performance improvements for database systems.
  • Develop and maintain Infrastructure as Code using Terraform.
  • Implement monitoring and alerting systems.
Bfore.ai logo
Bfore.ai Startup https://bfore.ai
51 - 200 Employees
See more Bfore.ai offers

Job description

BforeAI is an innovative and rapidly expanding scale-up dedicated to deterring cybercrime through cutting-edge predictive and pre-emptive technologies. We harness the power of prescriptive AI to revolutionize the way we tackle cyber threats, particularly in the realm of brand protection.

Named by Gartner in 26 reports over the last 2 years, BforeAI is the industry’s fastest, most accurate solution for automated protection against online fraud.

We are like weather forecasts for cyber threats. Join us in the fight for a safer cyberspace!

🚀 Why it’s great to work here

We are a location independent company – no physical office required – and we operate as a fully distributed team. We deeply believe in the value of diversity and inclusivity within our workplace, understanding that these principles lead to a happier team and ultimately a superior product. We offer an intellectually stimulating company environment and you’ll be working with a bright, dedicated team from across the globe.

If you possess a high level of autonomy and self-organization, and feel you can thrive at BforeAI, we’d love to hear from you!

✨ What’s Cool About This Job

As an SRE at BforeAI, you will be a critical part of our technology team, responsible for ensuring the reliability, scalability, and performance of our cloud infrastructure and applications. Your expertise in Kubernetes, Networking, Security, Cloud environments and database optimization will be essential for maintaining our high-traffic, data-intensive systems.

Please note, this job can be anywhere in EMEA - we have to select a country for job boards.

📣 What you’ll be doing

  • Architect, deploy, and manage Kubernetes clusters, ensuring high availability, scalability, and reliability to meet organizational demands.
  • Drive performance improvements for database systems through advanced query optimization, indexing strategies, and efficient caching mechanisms.
  • Develop and maintain Infrastructure as Code (IaC) using tools like Terraform, Ansible, or equivalent technologies to enable consistent, automated, and scalable deployments.
  • Implement and manage robust monitoring and alerting systems to proactively maintain system health and ensure optimal performance.
  • Enforce cloud environment best practices for security, access control, and compliance with regulatory standards.
  • Establish, maintain and be responsible for our Incident management procedures.
  • Partner with engineering teams to support their infrastructure needs, ensuring alignment with SRE practices and system requirements.
  • Make sure our infrastructure and products are resilient and recoverable by establishing and maintaining resiliency and recovery best practices and procedures.
  • Establish and maintain SRE best practices and remove any blocker to enable the reliability of the system.
  • Create and maintain detailed documentation for configurations, processes, and procedures to ensure transparency and knowledge sharing across teams.

💥 You’ll be a great fit if

  • You have 8+ years of experience in SRE, system administration, or similar roles.
  • You are an expert in Kubernetes, including hands-on experience in cluster setup, management, and maintenance with certifications such as Certified Kubernetes Administrator (CKA) and/or certified Kubernetes Security Specialist (CKSS).
  • You are proficient in database performance optimization and administration such as PostgreSQL, MySQL, or similar.
  • You have experience with Infrastructure as Code (IaC) tools such as Terraform (with certification like HashiCorp Terraform Certification), Ansible, or similar.
  • You have experience with monitoring and logging tools such as Splunk, Prometheus, Grafana, Datadog, ELK, Logstash, Fluentd, etc.).
  • You have experience with Incident response tools such as PagerDuty, OpsGenie, etc.
  • You have experience with cloud platforms, such as AWS, Azure, or GCP, ideally supported by an architect-level certification from at least one provider.
  • You have experience in secrets management tools such as Hashicorp Vault, CyberArk Conjur, AWS Secrets manager, etc.
  • You have strong problem-solving and troubleshooting skills.
  • You are a strong communicator with the ability to collaborate across multi-disciplinary global teams.
  • You have RHCSA (Red Hat Certified System Administrator) and/or RHCE (Red Hat Certified Engineer) certification.

Don't meet every single requirement? Don't count yourself out just yet. Studies show some individuals are less likely to apply to jobs unless they meet every qualification. At BforeAI, we're dedicated to building a diverse workplace based on merit, work ethics, and character, and we believe everyone deserves a fair shot at success!

If you're excited about this role but your past experience doesn't align perfectly with every qualification, we hope you’ll still consider applying!

We use an Employee of Record service to facilitate seamless global hiring processes and offer benefits tailored to the country where you will be working! For countries not supported by our EOR partner, talk to us about being a contractor. In all cases, you will need to be authorized to work in the country you’re based in.

  • All salary bands listed are Cost to Company (CTC). Final gross salary offers will be localized based on the candidate's country of residence, accounting for tax structures, and statutory benefits.

💡 Want to know more about BforeAI?

  • About BforeAI
  • BforeAI on YouTube
  • Our Values & Benefits at BforeAI
  • BforeAI Gartner® Recognition in Emerging Tech Reports
  • BforeAI Named in Four 2023 Gartner® Reports
  • Our CPO Luciano Allegro highlights the importance of a proactive cybersecurity approach in a recent TechNewsWorld article
  • Our CEO Luigi Lenguito on Pre-crime Threat Intelligence
  • BforeAI has been named a 2024 Cool Vendor by Gartner for Artificial Intelligence in Banking and Investment Services

Required profile

Experience

Level of experience: Senior (5-10 years)
Spoken language(s):
English
Check out the description to know which languages are mandatory.

Other Skills

  • Collaboration
  • Communication
  • Problem Solving

Site Reliability Engineer (SRE) Related jobs