Senior Data Engineer

Remote: Full Remote

Offer summary

Qualifications:

  • 7-10+ years of experience in data engineering or a closely related role
  • Strong proficiency in Python, particularly for data engineering applications
  • Deep experience with relational and distributed databases, and strong SQL skills
  • Hands-on experience with data pipelines for both batch and streaming workloads

Key responsibilities:

  • Build robust, scalable pipelines to ingest structured and unstructured data
  • Design and maintain data lakes, warehouses, and data marts for analytics and ML
  • Collaborate with cross-functional teams to deliver reliable data products
  • Define and enforce data quality standards and best practices for documentation

Thalamus http://www.thalamusgme.com
11 - 50 Employees

Job description

About Thalamus

Thalamus is the market leader in graduate medical education recruitment technology, empowering over 8,000 residency and fellowship programs at 800+ health systems and all new physicians throughout the US annually. As we expand beyond GME into broader physician recruitment, our unique dataset—spanning the full profiles of residency/fellowship applicants and programs—positions us to revolutionize hiring in healthcare through AI/ML and data-driven insights. This furthers our mission to ensure the right doctor ends up at the right hospital to treat the right patients.

 

About the Role

We're looking for a Senior Data Engineer to help us expand and modernize our data architecture. This role offers a unique opportunity to shape new data storage, pipelining, and analytics solutions from the ground up alongside a collaborative, mission-driven team. You’ll work across engineering, product, design, and CX to make data more accessible, actionable, and impactful.

You should have strong attention to detail and excellent communication skills, be resourceful, customer-focused, and team-oriented, and be able to work independently under time constraints to meet deadlines. You will be comfortable thinking big and diving deep. A proven track record of taking end-to-end ownership and delivering results in a fast-paced, dynamic business environment is strongly preferred. Above all, you should be passionate about working with large data sets and love turning messy data into trusted, high-quality assets that answer business questions and drive change.

What you will achieve

  • Build robust, scalable pipelines to ingest structured and unstructured data from internal and third-party sources

  • Design and maintain data lakes, warehouses, and data marts to support analytics and ML use cases

  • Build and maintain APIs and data services to make curated datasets easily accessible across the organization

  • Collaborate with product managers, application engineers, data scientists, and customer experience teams to deliver reliable, usable data products

  • Define and enforce data quality standards, testing strategies, and validation frameworks

  • Work with the Infrastructure team to optimize data infrastructure for cost, security, and performance

  • Own documentation and development best practices that support collaboration and long-term maintainability

  • Help define Thalamus's evolving data engineering model and drive continuous improvements to our stack

You should have ...

  • 7-10+ years of experience in data engineering or a closely related role

  • Strong proficiency in Python, with a focus on data engineering applications

  • Deep experience with relational and distributed databases, with strong SQL skills for data modeling, transformation, and analysis across platforms like Postgres, Snowflake, or Databricks

  • Hands-on experience building and managing data pipelines for both batch and streaming workloads, working with structured and unstructured data

  • Familiarity with diverse data formats and ingestion methods, including JSON, XML, YAML, log files, and APIs

  • Experience developing data services and APIs using Python web frameworks such as Flask, Django, or FastAPI

  • Proven ability to design and deploy data infrastructure on cloud platforms such as Azure or AWS, including storage, compute, and orchestration tools

Bonus

  • Experience supporting ML pipelines and integrating with data science workflows

  • Familiarity with dbt or similar modern data transformation tools

  • Prior experience in healthcare or regulated data environments

The salary range for this position is $180,000 - $220,000, plus a grant of stock options. Final compensation will be determined based on experience, skills, and geographic location.

 
Our Commitment ...

Thalamus is a mission-driven organization centered on the belief that our company should model what we want of the US healthcare system: that the diversity of providers aligns with patient populations. We believe this is best achieved by building a team with a diversity of backgrounds, cultures, and experiences, including “distance traveled.” Thalamus is an equal opportunity employer. We do not discriminate based upon race, religious creed, color, national origin, ancestry, physical or mental disability, medical condition, genetic information, marital status (including registered domestic partnership status), sex and gender (including pregnancy, childbirth, lactation, and related medical conditions), gender identity and gender expression (including transgender individuals who are transitioning, have transitioned, or are perceived to be transitioning to the gender with which they identify), age, sexual orientation, Civil Air Patrol status, military and veteran status, and any other consideration protected by federal, state, or local law. We encourage those who really want to make an impact and who exemplify our core values to apply for our open positions.

The actual base salary offered will be determined by experience, skills, and work location. This range is for base salary; our total compensation also includes equity and benefits. We welcome you to apply even if your expectations are outside our listed range.

Thalamus is committed to providing reasonable accommodations for qualified individuals with disabilities in our job application procedures and throughout employment. If you need assistance or any accommodation, please let us know.  

Thalamus does not accept unsolicited resumes from recruiters or employment agencies without a fully executed recruitment agreement in place. In the absence of such agreement, Thalamus reserves the right to pursue and hire any candidates without an obligation to pay fees. Agencies are requested not to contact Thalamus hiring managers or employees regarding recruiting services.  

 

*This position is based in the United States, and you must be legally authorized to work in the United States.

Required profile

Spoken language(s):
English

Other Skills

  • Teamwork
  • Detail Oriented
  • Communication
  • Problem Solving
