CI/CD Pipeline Automation: Establish and maintain CI/CD pipelines using Jenkins or GitHub Actions to automate software build, test, and deployment processes. Ensure efficient and reliable deployment to streamline platform integration for customers.
Design, Deploy, and Manage Infrastructure: Architect, deploy, and manage infrastructure for H2O.ai Cloud across major cloud platforms (AWS, Azure, GCP) and on-premises Kubernetes (K8s) clusters. Set up and configure servers, networks, and storage to support the deployment process and ensure platform scalability.
Monitoring and Performance Optimization: Set up and configure monitoring systems (e.g., Datadog) to track system performance and optimize resource utilization. Troubleshoot issues by analyzing logs, and proactively maintain platform stability.
Disaster Recovery & Security: Implement security best practices to protect customer data and ensure compliance with industry standards and regulations (e.g., SOC 2, HIPAA). Define and implement disaster recovery strategies for critical customer environments.
Requirements & Skills:
Experience: At least 5 years of relevant experience, with the most recent 3 or more years focused on Jenkins CI/CD pipeline development.
CI/CD Tools: Hands-on experience with Jenkins, GitHub Actions, or related CI/CD tools.
Coding Skills: Strong expertise in Groovy, Python, and/or shell scripting. Additional knowledge of Go is a plus.
Kubernetes Expertise: Proven experience deploying applications on Kubernetes and a strong understanding of container orchestration.
Helm Charts: Ability to read and write Helm charts for Kubernetes deployments.
Cloud Platforms: Proficiency in AWS, Azure, and GCP, as well as on-premises infrastructure.
Tools and Automation: Familiarity with the tools used or planned for automating H2O deployments and integrations with databases and platforms such as Hive, Impala, and Cloudera, and with YAML configuration files for automation workflows and deployment orchestration.
Linux Expertise: Familiarity with Ubuntu and CentOS/RHEL environments.
Monitoring and Performance: Experience with tools like Datadog for monitoring and troubleshooting.
Customer-Facing Skills: Strong communication and problem-solving skills to address customer queries and foster adoption of H2O.ai solutions.
Virtualization: Familiarity with technologies like VMware and Vagrant.