On a typical day, you will collaborate with partners to design data models, acquisition processes, and applications that address their needs. Drawing on experience with large-scale data processing systems (batch and streaming), you will lead business growth and enhance product experiences. You will also collaborate with Technology Teams, Global Analytical Teams, and Data Scientists across programs.
You’ll take ownership of problems from end to end, from extracting and cleaning data to understanding the systems that generate it. As requirements evolve, improving data quality by adding sources, coding rules, and producing metrics is crucial. Agility and smart risk-taking are important qualities in this industry, where digital innovation must keep pace with partner and customer needs.
Requirements & Skills:
BS in Computer Science or equivalent experience, plus 5+ years as a Data Engineer or in a similar role
Programming skills in Python & Java (good to have)
Design data models for storage and retrieval to meet product requirements
Build scalable data pipelines using Airflow, AWS data services (Redshift, Athena, EMR), and Apache projects (Spark, Flink, Hive, and Kafka)
Familiarity with modern software development practices (Agile, TDD, CI/CD) applied to data engineering
Enhance data quality through internal tools and frameworks that detect DQ issues
Working knowledge of relational databases and SQL query authoring