Leading a talented engineering team in PySpark development and promoting the adoption of Apache Spark and the Databricks Data Intelligence Platform among Python users
Overseeing sustained recruitment of top-tier talent, fostering a well-organized and synergistic team structure, and collaborating effectively with internal and external stakeholders
Implementing robust processes to efficiently execute product vision, strategy, and roadmap in alignment with organizational goals and priorities
Driving the integration of Generative AI into Apache Spark to expand the user base and improve the user experience
Lead product development for one of the fastest-growing libraries in the open-source Apache Spark project, as well as the Databricks Data Intelligence Platform
Make company-wide impact by driving Python adoption across the Databricks product portfolio
Deepen your expertise in PySpark, a widely adopted yet still fast-growing product, and in the broader PyData ecosystem
Define, shape, and drive the future of Apache Spark and the Databricks Data Intelligence Platform for Python users, aided by the power of Generative AI
Grow a world-class team of software engineers working on our compute fabric; increase headcount by 5+ engineers in the next 18 months, with continued growth beyond that according to product objectives. Hire top-notch staff-level talent
Ensure consistent delivery against milestones and strong alignment with the field, working “two-in-a-box” with product leadership
Evolve the organizational structure to align with long-term initiatives; build strong “5 ingredients” teams with a good communication architecture
Manage technical debt, including long-term technical architecture decisions, and balance it against the product roadmap
Requirements & Skills:
5+ years of experience working on related systems, such as the PyData ecosystem, Apache Spark, or database internals
Practical experience applying LLM/generative AI models
A passion for database systems, storage systems, distributed systems, language design, or performance optimization
Can ensure the team builds high-quality and reliable infrastructure services. Experience being responsible for testing, quality, and SLAs of a product. Previous experience building and leading teams in a complex technical domain, such as distributed data systems or database internals
Ability to attract, hire, and coach engineers who meet the Databricks hiring standards. Can uplevel existing teams via hiring top-notch senior talent, growing leaders, and helping struggling members. Can gain the trust of the team and guide their careers. Experience managing distributed teams is preferred
Comfort working cross-functionally with product management and directly with customers; ability to deeply understand product and customer personas