Job Description

Are you passionate about programming languages, compiler technology, and GPU performance? Do you want to help shape the future of high-performance kernel development for AI? We are looking for outstanding engineers to buildCUTLASSDSL, a Python-native language for GPU kernel development, along with the MLIR dialects and lowering passes behind it. In this role, you willalsohelp accelerate kernel compilation while delivering performance comparable to CUTLASS C++, enabling efficient hardware-software co-design for NVIDIA's next generation of AI platforms.

Whatyou'llbe doing:

Design, develop, andoptimizeCUTLASSDSL, a Python-native language for high-performance GPU kernel development
Build and advance the MLIR dialects, lowering passes, and code generation flows that power theCUTLASSDSL stack
Drive innovations that improve kernel compilation speed whilemaintainingperformance on par with CUTLASS C++
Collaborate closely with architecture, research, software product teams, and the open-source community to bringcutting-edgeoptimizations into real products

What we need to see:

MS, PhD, or equivalent experience in Computer Science, Software Engineering, or a related field
2+ years ofrelevant work experience
Excellent programming skills in Python and strongproficiencyin C++
Hands-on experience with DSLs, compilers, or code generation systems
Strong command of the MLIR/LLVM stack, including IR design and pass optimization
Strong communicationskills and the ability to thrive in a highly collaborative environment

Ways to stand out from the crowd:

Deep understanding of the CUDA GPU programming model, GPU microarchitecture, and performance analysis and optimization techniques
Familiarity with key high-performance computing abstractions such as Layout, Tile, MMA, and TMA in theCuTeecosystem

About NVIDIA

Since its founding in 1993, NVIDIA (NASDAQ: NVDA) has been a pioneer in accelerated computing. The company’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, ignited the era of modern AI and is fueling the creation of the metaverse. NVIDIA is now a full-stack computing company with data-center-scale offerings that are reshaping industry.

Industry

Hardware & Semiconductors

Company Size

10,000+ employees

Headquarters

Santa Clara, CA

Year Founded

1993

Website

nvidia.com

Social Media

Deep Learning Performance Architect, CUTLASS DSL

Job Description

About NVIDIA