Design and implement drivers, and application software that drive next-generation CPU and GPU
Develop, augment, and maintain software libraries for computing on GPU
Optimize code for performance, efficiency, and low latency
Innovate new ideas in conjunction with networking engineers to optimize overall data center efficiency
Hardware Integration:
Board brings up and validation all low-level interactions with hardware
Drivers to monitor the health of the board
Troubleshoot and resolve hardware-software interaction issues
Security:
Design and implement features for verification of trusted software like secure boot
Stay up-to-date on security best practices and vulnerability remediation
Collaboration:
Work effectively in a team environment, collaborating with engineers and peer functional leads from different disciplines to innovate solutions, triage issues, and speed execution
Work with product line management, customers, and sales teams to understand requirements
Leadership:
Mentor and coach team members on the technical skills and approaches to solve problems.
Review and maintain the quality of code going into production.
Requirements & Skills:
Strong programming skills: Proficiency in C, C++, and other low-level programming languages.
Operating system knowledge: Experience with Linux driver kernel development is a must. Candidate should have strong experience in Linux kernel memory management, interrupts, IOMMU
Hardware experience: Familiarity with the main elements of CPU, DPU, memory, NICs, and board monitoring elements is a must
Debugging and testing skills: Ability to identify and resolve software and hardware issues.
Problem-solving skills: Strong analytical and problem-solving abilities
Experience with data center deployments: Prior experience in data center architectures, developing and maintaining software for deployments is a plus
Experience with GPU programming: Prior experience with GPU libraries like CUDA and ROCm is a huge plus