AI GPU Linux Kernel Compute - Senior Staff Engineer (Data Center - CPU - IOMMU - C/C++)
CONFIDENTIAL
Join a publicly traded computer hardware infrastructure company with over $5 Billion in sales, whose stock price has surged over 300% in the last year, spearheading the AI Data Center infrastructure revolution.
We are looking for experienced, hands-on software engineers to play a crucial role in designing and developing software for cutting-edge high-density Data Center Compute Racks. Your expertise will be essential in the development of AI GPU, CPU, and Linux driver Kernel software. Exciting opportunities are available at various senior levels including Senior Engineer, Staff Engineer, Senior Staff Engineer, and Principal Engineer.
Key Responsibilities:
• Design and implement drivers and application software for next-generation CPU and GPU.
• Develop and enhance software libraries for GPU computing.
• Optimize code for performance, efficiency, and low latency.
• Collaborate with networking engineers to innovate solutions enhancing data center efficiency.
• Architect data center management solutions, integrating multiple lower-level drivers, operating systems, and libraries.
Hardware Integration:
• Perform board bring-up and validate all low-level hardware interactions.
• Create drivers to monitor board health.
• Troubleshoot and resolve hardware-software interaction issues.
Requirements:
• Must be a full-time employee working remotely within the USA. Enjoy paid vacations, holidays, a comprehensive benefits program, and annual stock grants.
• 10+ years of embedded software development experience with at least 3+ years in high-density Data Center COMPUTE racks, including AI GPU, CPU, and Linux driver Kernel development.
• Proven experience in GPU programming and familiarity with libraries like CUDA and ROCm.
• Strong programming skills in Embedded C/C++.
• Extensive experience in Linux driver kernel development, focusing on memory management, interrupts, and IOMMU.
• Familiarity with Data Center Deployments and software development for data center architectures.
• Solid understanding of key components such as CPU, DPU, memory, NICs, and Board Monitoring.
• Excellent debugging and testing skills, with a knack for identifying and resolving hardware and software issues.
• Exceptional English communication skills with the ability to collaborate with global teams. Demonstrated ability to communicate requirements and visions through written specifications and verbal presentations in technical forums.
This position is focused on software DEVELOPMENT.
• NO Managers; this is a HANDS-ON individual contributor, coding engineer role.
• NO Testing-only focused engineers.