Thalamus is the market leader in graduate medical education recruitment technology, empowering over 8,000 residency and fellowship programs at 800+ health systems and all new physicians throughout the US annually. As we expand beyond GME into broader physician recruitment, our unique dataset—spanning the full profiles of residency/fellowship applicants and programs—positions us to revolutionize hiring in healthcare through AI/ML and data-driven insights. This furthers our mission to ensure the right doctor ends up at the right hospital to treat the right patients.
We're looking for a Senior Data Engineer to help us expand and modernize our data architecture. This role offers a unique opportunity to shape new data storage, pipelining, and analytics solutions from the ground up alongside a collaborative, mission-driven team. You’ll work across engineering, product, design, and CX to make data more accessible, actionable, and impactful.
You should possess high attention to detail, have excellent communication skills, resourceful, customer focused, team oriented, and have an ability to work independently under time constraints to meet deadlines. You will be comfortable thinking big and diving deep. A proven track record in taking on end-to-end ownership and successfully delivering results in a fast-paced, dynamic business environment is strongly preferred. Above all you should be passionate about working with large data sets and love turning messy data into trusted, high-quality assets to answer business questions and drive change.
What you will achieve
Build robust, scalable pipelines to ingest structured and unstructured data from internal and third-party sources
Design and maintain data lakes, warehouses, and data marts to support analytics and ML use cases
Build and maintain APIs and data services to make curated datasets easily accessible across the organization
Collaborate with product managers, application engineers, data scientists, and customer experience teams to deliver reliable, usable data products
Define and enforce data quality standards, testing strategies, and validation frameworks
Work with the Infrastructure team to optimize data infrastructure for cost, security, and performance
Own documentation and development best practices that support collaboration and long-term maintainability
Help define Thalamus's evolving data engineering model and drive continuous improvements to our stack
You should have ...
7-10+ years of experience in data engineering or a closely related role
Strong proficiency in Python, with a focus on data engineering applications
Deep experience with relational and distributed databases, with strong SQL skills for data modeling, transformation, and analysis across platforms like Postgres, Snowflake, or Databricks
Hands-on experience building and managing data pipelines for both batch and streaming workloads, working with structured and unstructured data
Familiarity with diverse data formats and ingestion methods, including JSON, XML, YAML, log files, and APIs
Experience developing data services and APIs using Python web frameworks such as Flask, Django, or FastAPI
Proven ability to design and deploy data infrastructure on cloud platforms such as Azure or AWS, including storage, compute, and orchestration tools
Bonus
Experience supporting ML pipelines and integrating with data science workflows
Familiarity with dbt or similar modern data transformation tools
Prior experience in healthcare or regulated data environments
The salary range for this position is $180,000 - $220,000 and a grant of stock options. Final compensation will be determined based on experience, skills, and geographic location.
Thalamus is a mission-driven organization centered on the belief that our company should model what we want of the US healthcare system, that the diversity of providers aligns with patient populations. We believe this is best achieved by building a team with a diversity of backgrounds, cultures, and experiences, including “distance traveled.” Thalamus is an equal opportunity employer. We do not discriminate based upon race, religious creed, color, national origin, ancestry, physical or mental disability, medical condition, genetic information, marital status (including registered domestic partnership status), sex and gender (including pregnancy, childbirth, lactation, and related medical conditions), gender identity and gender expression (including transgender individuals who are transitioning, have transitioned, or are perceived to be transitioning to the gender with which they identify), age, sexual orientation, Civil Air Patrol status, military and veteran status, and any other consideration protected by federal, state, or local law. We encourage those who really want to make an impact and who exemplify our core values to apply for our open positions.
Actual base salary offered will be determined by: experience, skills, and work location. This range is for base salary, our total compensation includes equity and benefits. We welcome you to apply even if your expectations are outside our listed range.
Thalamus is committed to providing reasonable accommodations for qualified individuals with disabilities in our job application procedures and throughout employment. If you need assistance or any accommodation, please let us know.
Thalamus does not accept unsolicited resumes from recruiters or employment agencies without a fully executed recruitment agreement in place. In the absence of such agreement, Thalamus reserves the right to pursue and hire any candidates without an obligation to pay fees. Agencies are requested not to contact Thalamus hiring managers or employees regarding recruiting services.
*This position is based in the United States, and you must be legally authorized to work in the United States.
Nagarro
Dabrein Solutions
SynergisticIT
CapTech Consulting
Laba Group