Infrastructure Management: Design, build, and manage highly scalable and reliable systems. Handle complex infrastructure of physical and virtual clusters. Manage data generation instances using tools and automation.
Tool & Framework Development: Develop and enhance tools for monitoring, alerting, and telemetry at scale. Create and maintain self-service frameworks to improve overall productivity and quality
Continuous Integration and Deployment: Advocate for the integration of automated tests into CI/CD pipelines, ensuring continuous and seamless testing processes.
Coding and Technical Guidance: Write and maintain high-quality code for Infra as a code and infrastructure monitoring, explore and build the possibilities of AI/ML. Provide technical guidance to junior engineers, and ensure scalable automation solutions.
Requirements & Skills:
Bachelor’s or Masters degree or equivalent in Computer Science or related field.
3-6 years of relevant work experience
Proficiency in programming with Python and/or other object-oriented programming languages.
Experience with Google/AWS/Azure cloud platform or other public cloud technologies
Experience with tools development for test and development teams.
Exposure to stress and scale testing is highly desirable.
Familiarity with continuous integration tools such as Jenkins, configuration management tools like Ansible, and log management tools like ELK.
Knowledge of storage and data protection domains is a plus.
Strong analytical and problem-solving abilities.
Excellent communication and collaboration skills.
Ability to mentor junior engineers and lead automation efforts within a team.