Data Ingestion and Processing: Design and implement robust data pipelines to collect, clean, transform, and store large volumes of data from diverse sources.
Data Preparation: Preprocess and transform data into formats suitable for machine learning algorithms.
Feature Engineering: Develop and select relevant features that improve the performance of machine learning models.
Data support for Models: Collaborate with Data Scientists and ML Engineering Professionals, and other cross-functional teams to integrate AI solutions into systems and processes. This includes optimizing AI solutions to handle large volumes of data efficiently.
Scalability: Design and implement scalable data architectures to support the growing demands of AI applications.
Performance Optimization: Continuously optimize data processing and model inference for efficiency and speed.
Requirements & Skills:
Strong Programming Skills: Proficiency in Python, Java, or Scala.
Data Engineering Tools: Experience with Big Data (BigQuery, Hive), Experience in processing unstructured and structured data, Understanding of Call center data, media data processing, Clickstream, and real-time data processing. Experience with ETL tools like Databricks or equivalent.
Cloud Platforms: Expertise with GCP cloud environment
Machine Learning Frameworks: Familiarity with TensorFlow, PyTorch, Vertex AI, or other ML frameworks.
Data Modeling: Understanding of data modeling techniques and database design.
Data Pipelines: Experience with data pipeline orchestration tools like Apache Airflow.
Problem Solving: Ability to analyze and solve complex data engineering challenges.
Bachelor’s or master’s degree in computer science, Data Science, or a related field.
Experience with machine learning and AI applications.
5+ years in Technology and Data engineering or a related role
3+ years of experience in the cloud.
Experience with MLOps (Machine Learning Operations) practices.
Knowledge of containerization technologies like Docker and Kubernetes.
Familiarity with data visualization tools.
Strong communication and collaboration skills.
Experience working with the Google Cloud Platform.