AI Success™

Job Description

We’re looking for an AI Research Engineer who blends frontier research curiosity with engineering discipline. You’ll join the Jan and Tokamak model team, training a state-of-the-art model that can carry out various jobs (including long running deep research tasks) for millions of users.

This role is ideal for someone who thrives in high-performance environments, understands the nuances of training LLMs/VLAs/etc, and is obsessed with fast experimentation and applied usability.

You are a great fit if you are

Intellectually Curious
Productively Combative

Responsibilities

Obsess over model usability for users
Conduct training runs and AI experiments

Scrutinize results and iterate
Continuously productionize checkpoints with the product engineering team

What you'll do

Iterate upon latest RL techniques, e.g. GRPO, DPO, RePO, etc.
Build and maintain modular, scalable training codebases
Develop and maintain efficient data pipelines (synthetic and real)
Ensure training jobs can scale across multiple GPUs and nodes (e.g., FSDP, DDP, NCCL)
Maintain long-term code health: write clean, testable and reproducible code
Contribute to upstream open source dependencies
(Optionally) Publish papers and present findings

Requirements

Deep expertise in Python and training codebases, e.g. PyTorch, or equivalent
Proven experience training deep learning & reinforcement learning models in real-world settings
Experience working with large datasets, complex pipelines
Understanding of training dynamics: what goes wrong, and how to fix it
Familiarity with job launchers, logging tools (e.g., Weights & Biases, TensorBoard), and checkpointing systems
A mindset of engineering rigor applied to research: readable code, thoughtful design, and reproducibility

Bonus Points

Experience with TorchScript, ONNX, or custom inference runtimes
Open Source contributions to PyTorch or ML tooling
Experience working on transformer models, diffusion models, VLMs, or large-scale vision/NLP tasks
Familiarity with batch schedulers (SLURM), cluster environments, and GPU resource management
Ability to collaborate closely with systems engineers or MLOps teams to ensure smooth integration

About Jan

Jan is an Personal AI Assistant that helps you get things done. It features a powerful local mode that keeps your data fully private.

Try Jan: https://jan.ai

Jan models: https://huggingface.co/janhq
Why: https://www.jan.ai/handbook/why/open-superintelligence

About Menlo Research

Menlo Research is an open AI & Robots lab.

We build the brains for robots. It’s time to tell robots what to do!

Industry

IT & Software

Company Size

51-200 employees

Headquarters

Singapore, SG

Year Founded

2023

Website

menlo.ai

Social Media