Evaluate the current on-premises (Teradata et al) infrastructure and applications to determine what should be migrated to the cloud.
With analytic leadership, develop a migration strategy and roadmap that outlines the timeline, budget, and key milestones, ensuring high-value, high-usage assets take priority.
Ensure that security measures and compliance requirements are met during migration and deployment.
Plan and execute the migration of data from on-premises systems to the cloud.
Consider data transfer methods, such as bulk data transfer, online data transfer, or using data migration services provided by the cloud provider.
Validate data integrity and consistency post-migration.
Architect and design cloud-based data warehouse solutions that cater to the specific data requirements of healthcare entities, such as electronic health records (EHRs), medical imaging data, patient demographics, clinical workflows, external quality-related data, HRIS, ERP, etc.
Define healthcare-specific data models, schemas, and data flow processes that ensure accurate and efficient data integration.
Devise optimal strategies for what data assets are included in a data lake/lakehouse/warehouse.
Build technology prototypes as part of the design process as well as occasional coding of production deliverables.
Develop, standardize, and oversee ETL/ELT pipelines that extract diverse data sources, transform data into usable formats, and load it into the cloud data warehouse.
Implement data validation and quality checks to maintain the integrity of critical patient information.
Ensure compliance with healthcare regulations such as HIPAA, GDPR, and other data privacy standards when architecting data solutions.
Stay up to date with cloud technologies relevant to healthcare data management, leveraging platforms like AWS, Azure, Google Cloud, Snowflake, Databricks, etc.
Develop architectures that enable end-to-end management of data lineage, data quality, and data stewardship across federated teams.
Facilitate robust data governance practices specific to healthcare data, ensuring data quality, privacy, and regulatory compliance.
Collaborate with cybersecurity experts and data governance groups to establish stringent security measures for protecting patient health information (PHI) and sensitive data.
Design the cloud data warehouse to scale seamlessly as healthcare data volumes increase, accommodating the growing demands of medical records and diagnostics.
Leverage cloud platform features to enable optimal data retrieval.
Implement cloud monitoring and management tools to track the performance, availability, and cost of cloud resources.
Optimize cloud resource utilization to maintain cost-effectiveness while adhering to healthcare data storage and performance requirements.
Monitor resource consumption and recommend resource adjustments based on usage patterns.
Collaborate closely with healthcare data analysts, clinicians, and administrators to understand analytical requirements and design data structures that support evidence-based decision-making and clinical insights.
Enable the creation of analytics dashboards, predictive modeling, and population health management analytic assets.
Work in tandem with healthcare IT teams, data engineers, Operations, Finance, medical professionals, and administrators to align cloud data warehouse solutions with the Cleveland Clinic’s mission and goals.
Provide expert guidance and leadership to other Data Architects across the organization, with a focus on standardizing and improving the discipline. Mentor and create architectural bench strength within the Data and Analytics organization.
Other duties as assigned.
Requirements & Skills:
Bachelor’s degree in Health Informatics, Computer Science, MIS, or a related field. Master’s degree preferred.
10+ experience as a data architect or similar role, with a specific focus on cloud data warehousing within the healthcare sector.
Expert knowledge of healthcare provider data models, Epic preferred.
Previous data cloud/warehouse/architecture deployment for a large enterprise/organization is a must.
Expertise in cloud (AWS, Google Cloud, Snowflake, Azure Data Stack, etc) with knowledge of their healthcare-specific services and compliance measures.
In-depth understanding of healthcare data regulations, including HIPAA, GDPR, and other relevant standards.
Hands-on proficiency in SQL, Python, Java, and/or Scala
Familiarity with healthcare data standards (e.g., HL7, FHIR).
Experience with Data Streaming technologies like Kafka is preferred.
Experience with Machine learning frameworks is preferred.
Strong analytical, problem-solving, and communication skills.
Ability to collaborate effectively with multidisciplinary teams, including medical professionals.
Relevant cloud certifications and healthcare informatics certifications are highly advantageous.
Ability to perform work in a stationary position for extended periods.
Ability to travel throughout the hospital system.
Ability to operate a computer and other office equipment.
Ability to communicate and exchange accurate information.