Associate Data Engineer
Cohort AI Inc. • Naperville
Posted: April 1, 2026
Job Description
About the Role
We’re looking for an Associate Data Engineer to join our team and help make data reliable, usable, and impactful.
In this role, you’ll work closely with different teams to bring in new datasets, clean and transform them, and make sure they’re ready for analysis. A big part of your work will involve building and maintaining data pipelines, mapping data into standard formats, and ensuring everything runs smoothly behind the scenes.
This is a great opportunity if you’re early in your data engineering career and enjoy working hands-on with data, solving real-world problems, and learning how large-scale data systems operate, especially in healthcare.
Responsibilities:
Build and maintain data pipelines to ingest, transform, and organize data from multiple sources
Work with clinical, claims, or similar structured datasets and map them into standardized data models
Run data quality checks to ensure accuracy and consistency
Use tools like Databricks, BigQuery, Redshift, dbt, and command-line utilities
Collaborate with cross-functional teams including data, product, and business stakeholders
Help maintain ongoing data refresh processes and troubleshoot pipeline issues when needed
Requirements:
Bachelor’s degree in Computer Science or a related field (or equivalent hands-on experience)
Strong working knowledge of SQL
Familiarity with Python, PySpark, or SparkSQL
Experience with modern data platforms like Databricks, Snowflake, or BigQuery
Comfort working in a remote, collaborative environment
Based in the United States (preferred time zones: Central, Mountain, or Pacific)