Oversee the design and architecture of the Heyday data warehouse, providing a reliable data presentation used by the data analysis team. This includes designing and implementing transformations, creating the analytics data model and administering the data warehouse environment.
ETL Infrastructure Management: Manage data integration using Fivetran, oversee permissions and runtime settings, set up and maintain data ingestion pipelines, and ensure secure data storage solutions in AWS.
ETL Job Design and Optimization: Develop and optimize ETL jobs to improve data handling efficiency, monitor performance, and adjust processes to support the scalability of healthcare data operations.
Data-lake Administration and Optimization: Oversee performance in Snowflake environments, focusing on query optimization, indexing, and tuning to enhance data access and processing.
Data Modeling and Regulatory Compliance: Update and maintain data models for healthcare encounters and claims, ensure compliance with regulatory standards, and execute data migrations and schema updates using DBT.
Collaboration with Data Analysts: Work closely with data analysts to support workflows for business-critical analysis and reporting, enhancing data accessibility and utility across the organization.
Collaboration with the rest of the engineering team – providing data modeling and performance best practices and acting as a go-to person for data questions.
Stakeholder Collaboration: Partner with stakeholders across various departments to gather insights and translate business needs into data-driven solutions that support operational and strategic goals.
Support for Machine Learning Workflows: Provide data infrastructure and processing support for machine learning projects, ensuring data quality and availability for advanced analytics.
Requirements & Skills:
Bachelor’s degree in Computer Science, Data Science, Engineering, or a related field.
Minimum 3 years of experience as a Data Engineer with expertise in Snowflake, DBT, Looker, and Fivetran.
Proficiency in SQL and experience with large-scale database management.
Demonstrable experience in cloud-based ETL processes, particularly in AWS.
Strong understanding of healthcare data sets, and some experience modeling around healthcare claims and clinical data (EHRs) to support healthcare utilization and economics analyses. Familiarity with healthcare data compliance and HIPAA regulations.
Strong collaborative skills and ability to communicate effectively with technical and non-technical stakeholders.