SambaNova

Runtime Engineer

SambaNova  •  Palo Alto, CA (Onsite)  •  13 days ago

Job Description

The era of pervasive AI has arrived. In this era, organizations will use generative AI to unlock hidden value in their data, accelerate processes, reduce costs, and drive efficiency and innovation, fundamentally transforming their businesses and operations at scale.

SambaNova Suite™ is the first full-stack, generative AI platform, from chip to model, optimized for enterprise and government organizations. Powered by the intelligent SN40L chip, the SambaNova Suite is a fully integrated platform, delivered on-premises or in the cloud, combined with state-of-the-art open-source models that can be easily and securely fine-tuned using customer data for greater accuracy. Once adapted with customer data, customers retain model ownership in perpetuity, so they can turn generative AI into one of their most valuable assets.

The Opportunity

The Runtime team at SambaNova is a seasoned engineering team with a proven track record of delivering cutting-edge system software for AI and machine learning applications in the enterprise and commercial landscape.

Runtime is responsible for the lowest levels of the SambaNova stack, efficiently interacting with the hardware to provide the best application performance and maximize hardware utilization. We handle all aspects of software infrastructure to enable higher level applications, including:

  • High performance user libraries
  • Operating System interface/integration
  • Data model manipulation for scaling
  • Intra-node and inter-node networking and communication
  • Orchestration of partitioned workloads
  • Error monitoring and tools for system management and observability

We build a high performance, distributed and scalable software execution environment for SambaNova DataScale & Cloud platforms to support data-flow applications such as ML training and inference and HPC applications.

We are searching for a software engineer who will work on all parts of the runtime stacks, supporting AI, ML, and scientific applications in high-performance distributed systems. You will participate in building, testing and deploying next-generation high-performance compute systems for AI applications at scale. We expect the candidate to have a strong background in programming, building and testing software in distributed systems, performance tuning of large scale systems, and good teamwork and planning skills.

Role Responsibilities

  • Design and implement new and enhanced features of the runtime stack to support high-performance, scalable ML inference and training applications
  • Provide system software (driver and kernel) support for next-generation silicon
  • Design user-space libraries for high performance and high utilization of HW resources
  • Build user-facing tools (analysis, job and HW management, profiling, debugging, etc.) for DataScale systems
  • Collaborate with other teams, including Hardware, ML Application, Compiler, and DevOps

Basic Qualifications

  • Bachelor’s degree in Computer Science, Computer Engineering, or equivalent, with 3-5 years of industry experience
  • Proficiency in C/C++ and Python
  • Experience with user space libraries, operating systems, and kernel drivers
  • Experience working with highly concurrent and distributed systems, with a focus on performance and scalability

Preferred Qualifications

  • Experience with different types of fabrics, such as PCIe, InfiniBand, and RoCE
  • Experience with fast networking stacks, such as RDMA
  • Good communication skills and enthusiasm to help colleagues

Submission Guidelines
Please note that in order to be considered an applicant for any position at SambaNova Systems, you must submit an application form for each position for which you believe you are qualified.

EEO Policy
SambaNova Systems is an Equal Opportunity/Affirmative Action Employer. All qualified applicants will receive consideration for employment without regard to age (40 and over), color, disability, gender identity, genetic information, marital status, military or veteran status, national origin/ancestry, race, religion, creed, sex (including pregnancy, childbirth, breastfeeding), sexual orientation, or any other applicable status protected by federal, state, or local laws.

Benefits Summary for US-Based, Full-Time Employment Positions
SambaNova offers a competitive total rewards package, including base salary, equity, and benefits. We cover 95% of the premium for employee medical insurance and 77% of the premium for dependents, and we offer a Health Savings Account (HSA) with employer contribution. We also offer Dental, Vision, Short/Long Term Disability, Basic Life, Voluntary Life, and AD&D insurance plans, in addition to Flexible Spending Account (FSA) options such as Health Care, Limited Purpose, and Dependent Care. Our library of well-being benefits available to you and your dependents includes a full subscription to Headspace, Gympass+ membership with access to physical gyms, One Medical membership, counseling services with an Employee Assistance Program, and much more.

About SambaNova

Welcome to SambaNova: Revolutionizing AI Capacity

At SambaNova, we're empowering developers, enterprises, governments, and data centers to unlock their full AI potential. Our full-stack infrastructure, from chips to models, enables lightning-fast performance, low power consumption, and high-efficiency computing.

Our Mission

To give every developer, enterprise, government and data center absolute sovereignty over their own data, models and AI infrastructure – to future-proof the AI workloads that will power and scale tomorrow.

Our Technology

We give our customers the option to use SambaNova through the cloud or on-premises.

Samba Cloud delivers the fastest inference on the largest open-source models, such as Llama 4 and DeepSeek. Developers can get started building in minutes with our OpenAI-compatible APIs. All customers start on the developer tier and, when they need more capacity, can scale into our enterprise tier.

SambaStack is our on-premises offering, which includes the system, the platform, and foundation models. These components combine into a powerful technology stack that delivers unparalleled performance, ease of use, accuracy, data privacy, and the ability to power every use case across the world's largest organizations.

SambaManaged is a modular and ready-to-deploy AI cloud designed to deliver unmatched efficiency for data centers and cloud service providers. This solution allows organizations to quickly deploy advanced AI inference services—without the need for costly infrastructure upgrades or specialized expertise—in as little as 90 days.

At the heart of SambaNova innovation is the Reconfigurable Dataflow Unit (RDU). Purpose-built for AI workloads, the RDU takes advantage of a dataflow architecture and a three-tiered memory design. The three tiers of memory enable the platform to run hundreds of models on a single node and to switch between them in microseconds. In 2023, SambaNova released its fourth-generation RDU chip, the SN40L.

Industry: Hardware & Semiconductors
Company Size: 201-500 employees
Headquarters: Palo Alto, CA
Year Founded: 2017