
We are looking for a Senior Software Engineer to help build and optimize large-scale, high-performance GenAI infrastructure and inference systems on Kubernetes.
As AI workloads increasingly move toward Kubernetes-native infrastructure, we are building systems that support distributed inference, performance optimization, reliability, observability, and production-grade deployment at scale.
This role is ideal for an engineer who can reason deeply about systems, performance, tradeoffs, and reliability, and who is comfortable owning difficult technical decisions end-to-end.
You will work across inference serving, distributed systems, optimization, and Kubernetes-native AI infrastructure.
What You’ll Do
Required Qualifications
Nice to Have

NeuReality is a venture-backed deep tech AI startup transforming AI inferencing for data centers globally.
Our mission: to make AI accessible and ubiquitous. We break down the cost and complexity barriers that currently keep over 60% of businesses and governments from adopting enterprise AI, making AI more profitable, more sustainable, and simpler to deploy.
We champion purpose-built inference architecture. Our NR1 AI Inference Solutions, featuring our revolutionary NR1 Chip, integrate seamlessly with any GPU, AI Accelerator, or AI Model to unlock peak performance. As the world's first AI-CPU designed for ultimate cost and energy efficiency, the NR1 Chip redefines AI price/performance, delivering 6.5x more AI m/tokens per dollar and power envelope than legacy x86 CPUs.
From innovative NR Software to generative and agentic AI-ready NR1 Inference Appliances, NeuReality delivers breakthrough AI capabilities that are immediately accessible and economically viable for every business and government.