
NVIDIA is the world leader in accelerated computing, developing breakthroughs that tackle challenges no one else can solve. Our work in AI and digital twins is transforming the world's largest industries and profoundly impacting society. Come join the team and help build the next era of computing!
We're seeking an outstanding Senior HTOL Reliability Engineer to join our Santa Clara lab. This role requires deep device-circuitry knowledge and hands-on hardware development. You will build next-generation HTOL boards and run HTOL processes on advanced ovens. This ensures world-class reliability of the silicon powering the AI era.
What you'll be doing:
Implement and optimize HTOL test programs aligned with JEDEC standards.
Operate and maintain HTOL ovens, ensuring efficient test conditions and high data accuracy.
Build and debug burn-in boards, resolving signal-integrity issues and optimizing thermal performance.
Apply sophisticated thermal management techniques to deliver detailed temperature control and mitigate thermal stress in HTOL environments.
Work alongside lab technicians, build engineers, and reliability engineers to solve technical challenges and continuously improve test processes.
Contribute to multi-functional teams to debug and resolve hardware and software product issues.
Maintain and improve our reliability database, finding opportunities for improvement.
Collaborate with vendors to develop and implement improvements to burn-in boards, HTOL systems, and thermal interface materials.
What we need to see:
Master's or Bachelor's degree in Electrical Engineering or a related field (or equivalent experience).
5+ years of experience in HTOL test system operation and data analysis for semiconductor devices.
Proven expertise in HTOL stress testing, JEDEC standards, and environmental stress tests including Temperature Cycling (TC), Reflow, Thermal Shock, and HAST.
Hands-on experience with MCC HTOL chamber operation, repairs, and preventative maintenance.
Proficiency with oscilloscopes, current probes, and other test equipment for data acquisition and analysis.
Skill in vector debugging, test-script development/modification, and data-analysis tools. ATE experience is a plus.
Programming experience with Python or MATLAB for data analysis and automation.
Excellent communication, teamwork, and problem-solving skills, with strong attention to detail.
Ways to stand out from the crowd:
Experience with dual-die or multi-die configurations and the associated thermal challenges.
Background crafting burn-in boards for high-power GPU or SoC devices.
Familiarity with reliability analytics platforms (JMP) and statistical lifetime modeling (e.g., Weibull, Arrhenius).
Track record driving vendor qualification and component selection for reliability test hardware.
Exposure to AI/ML-based approaches for reliability data analysis or predictive failure modeling.
With highly competitive salaries and a comprehensive benefits package, NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people on the planet working for us. If you're creative and autonomous, with a genuine passion for technology, we want to hear from you.
Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 116,000 USD - 184,000 USD.
You will also be eligible for equity and benefits
Applications for this job will be accepted at least until June 14, 2026.
This posting is for an existing vacancy.
NVIDIA uses AI tools in its recruiting processes.
NVIDIA is committed to fostering an inclusive work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Since its founding in 1993, NVIDIA (NASDAQ: NVDA) has been a pioneer in accelerated computing. The company’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, ignited the era of modern AI and is fueling the creation of the metaverse. NVIDIA is now a full-stack computing company with data-center-scale offerings that are reshaping industry.