Basic Function
The Cloud Operations Engineer plays a key role in supporting Lumin Digital’s Operations Center by managing incident response, enhancing proactive monitoring, and driving process automation. This role focuses on improving Incident Command practices, ensuring high service availability, and reducing on-call toil through automation. The ideal candidate excels at collaboration, communication, and cross-functional teamwork while maintaining a strong commitment to continuous improvement.
Essential Functions and Responsibilities:
Perform operational tasks to ensure the reliability and availability of Lumin Digital’s cloud services.
Triage and resolve incidents by gathering logs, identifying root causes, and implementing solutions.
Develop and maintain process automation to reduce manual tasks and increase efficiency.
Implement and manage proactive monitoring of both critical and non-critical services.
Collaborate with cross-functional teams to improve Incident Command practices and workflows.
Identify opportunities to enhance operational processes and drive improvements.
Perform other duties as assigned.
Position Specifications
Education:
Bachelor’s degree or higher in a relevant field or equivalent experience required.
Experience:
3–5 years of experience in cloud operations, site reliability engineering (SRE), DevOps, or related technical roles.
Proven track record of managing and resolving production incidents in a high-availability environment.
Hands-on experience automating operational processes and workflows using scripting languages or automation tools (e.g., Python, Terraform, Ansible).
Experience building, configuring, and maintaining monitoring and alerting systems for cloud infrastructure and applications.
Experience working in cloud-native environments, particularly AWS (preferred) or other major cloud providers (Azure, GCP).
familiarity with CI/CD pipelines, infrastructure as code (IaC), and version control systems (e.g., Git).
Experience collaborating with cross-functional engineering teams to drive incident postmortems and continuous improvement efforts.
Knowledge, Skills, & Abilities:
Strong problem-solving skills with a detail-oriented approach.
Exceptional written and verbal communication skills.
Proven ability to collaborate across teams to achieve shared goals.
Demonstrated cultural alignment with Lumin Digital’s values, including humility, ownership, and integrity.
Experience with monitoring platforms such as CloudWatch, Splunk, Grafana, or Azure Monitor.
Familiarity with automation and orchestration tools.
Ability to work effectively in a cloud environment, with AWS experience preferred.
Proficiency with tools like Atlassian or similar platforms.
Hands-on experience with AWS or other cloud providers.
Experience with process automation tools and techniques.
Travel:
Minimal, generally 12 days or less per year, ~2X team get togethers a year
LIFE AT LUMIN DIGITAL
Lumin Digital is a trailblazer in digital banking solutions, driven by a unique approach to technology, service, and people. We empower credit unions and banks by creating cutting-edge digital experiences that continuously serve, engage, and grow their membership base. Lumin is 100% cloud-native, purpose-built to unlock the full advantages of the cloud for financial institutions and their users.
At Lumin, we thrive on curiosity and innovation. Our culture fosters trust - in our expertise and decisions, respect - for diverse perspectives and talents, and boldness - in pursuing innovative paths. These values guide us, shaping a workplace where collaboration thrives, ideas flourish, and new possibilities are discovered. Focused on continuous improvement and innovation, we encourage our team to explore, experiment, and put new ideas into action, challenging the usual way of doing things.
All qualified applicants, including those with arrest or conviction records, will be considered for employment. Any conditional offer will include a notice regarding the review of the candidate’s criminal history as part of the hiring process.