The Next Chapter

Senior Performance & Infrastructure Engineer - HPC

The Next Chapter  •  Amsterdam, NL (Remote)  •  18 days ago
Apply
AI can make mistakes so check important info. Chat history is never stored.

Job Description

The organization

Our client operates one of the largest GPU infrastructures in the world — 100,000+ GPUs. Their infrastructure doubles in size every year. We’re looking for engineers who love getting deep into Linux systems, pushing hardware and software to their limits, and making the world’s fastest AI and HPC workloads run even faster

The role

You’ll join a small, senior team that works between the hardware and Linux OS layers, solving performance problems that affect tens of thousands of GPUs. This is hands-on, high-impact engineering where microsecond gains matter and every optimization is felt at global scale.

What you’ll do
  • Trace, profile, tune and optimize Linux kernel & subsystems (CPU scheduling, memory management, networking stack) for GPU clusters and InfiniBand fabrics

  • Troubleshoot and resolve complex performance bottlenecks

  • Integrate and validate new GPU hardware & infra (KVM/QEMU, PCIe devices, Kubernetes)

  • Improve monitoring, alerting, and automation for large-scale, distributed systems

  • Occasionally assist customers in optimizing workloads

Your profile

Key requirements (non-negotiable):

  • Solid Linux internals knowledge, with kernel tracing, profiling and tuning experience (eg. perf, ftrace, eBPF, sysctl, kgdb etc.)

  • Excellent programming skills, C or C++ system-level code, with a good grasp of data structures & algorithms

  • Experience in performance optimization (eg. high-load/high-throughput, low-latency, low-jitter, memory bypasses, zero-copy, lock-free, synchronization across large-scale clusters etc.)

  • Scripting or development skills in Go, Python, or similar

Nice-to-haves (not key):

  • Large-scale clusters (GPU or CPU)

  • Virtualization stacks (KVM/QEMU), Slurm, Kubernetes

  • Deep learning frameworks (eg. PyTortch, Tensorflow...)

  • GPU-specific stack (eg. CUDA, NCCL....)

This is for you if you

Love solving deep technical challenges, care about performance downto the microsecond, and want to work on infrastructure that pushes the limits of what’s possible.

What's offered
  • Salary: up to 160k + 25% bonus.

  • Flexible working arrangements.

  • A dynamic and collaborative work environment that values initiative and innovation.

  • Location: Amsterdam or full-remote from anywhere within the EU/EER

The Next Chapter

About The Next Chapter

Dutch-based recruitment agency with a focus on IT, (High) Tech, Science & start/scale-ups. We specialize in connecting with English language technical talent on all levels and a Bsc/Msc/PhD background.

We offer flexibility in pricing and services, tailored to your specific needs: contingency based ("No Hire, No Pay") or for a small retainer and lowered successfee. Another option is our RaaS concept (Recruiter as a Service), whereas we will be your dedicated in-house recruiter. Sourcing, job marketing, selection and process management with optimal candidate experience in mind. Please don't hesitate to reach out to us for more details.

IT & technology recruitment voor (Engelstalige) professionals op HBO/WO niveau, zowel nationaal als internationaal. We bieden onze services als W&S ("No hire, No pay") maar bieden meerdere service- en pricing modellen, afgestemd op je specifieke behoeften. Een alternatief is Recruiter as a Service, waarbij we als volwaardig corporate recruiter rechtstreeks werven. Searchen, sourcen, procesmanagement en alles met een optimale "candidate experience" voor ogen. Neem contact met ons op voor meer informatie.

On our website you will find current job openings as well as useful information, for example about work permit / visa rules for The Netherlands.

Industry
HR & Recruiting
Company Size
1-10 employees
Headquarters
Den Bosch, NL
Year Founded
2021
Social Media