Ellison Institute of Technology Oxford

HPC Engineer - Generative Biology Institute

Ellison Institute of Technology Oxford  •  Oxford, GB (Hybrid)  •  1 day ago
Apply
AI can make mistakes so check important info. Chat history is never stored.

Job Description

At the Ellison Institute of Technology (EIT), we’re on a mission to translate scientific discovery into real world impact. We bring together visionary scientists, technologists, engineers, researchers, educators and innovators to tackle humanity’s greatest challenges in four transformative areas:

  • Health, Medical Science & Generative Biology
  • Food Security & Sustainable Agriculture
  • Climate Change & Managing CO₂
  • Artificial Intelligence & Robotics

This is ambitious work - work that demands curiosity, courage, and a relentless drive to make a difference. At EIT, you’ll join a community built on excellence, innovation, tenacity, trust, and collaboration, where bold ideas become real-world breakthroughs. Together, we push boundaries, embrace complexity, and create solutions to scale ideas from lab to society. Explore more at www.eit.org

Welcome to the Generative Biology Institute:

Led by Founding Director Jason Chin, the Generative Biology Institute (GBI) at the Ellison Institute of Technology is tackling the key challenges in making biology engineerable, and thereby unlocking the unrivalled power of biology for the benefit of humanity.

The vision of the GBI is to lay the foundations for engineering biology, and unlock its potential for good. To achieve this, we must overcome two key challenges. First, we need the ability to write in the natural language of biology, enabling the rapid and scalable synthesis of entire genomes with precision. Second, we must understand what to write - determining which DNA sequences will generate biological systems that perform the desired functions. Addressing these challenges will allow us to harness the full power of biology to create transformative solutions across health, agriculture, clean energy and more.

GBI will have sustained and substantial funding to support the unique scale and ambition of its ground-breaking vision for engineering biology. GBI researchers will also be supported by cutting-edge technology hubs including mass spectrometry, flow cytometry, sequencing, automation, imaging, and bioprocessing. GBI will also have access to substantial compute resources that can be leveraged to further accelerate progress, including scientific compute, bioinformatics, and machine learning. The environment at GBI will allow researchers to undertake ambitious, long-term, collaborative research, and we will actively support the translation of research to commercial applications, where appropriate.

The Generative Biology Institute commenced operations in 2025, occupying newly renovated bespoke space in the Oxford Science Park. The team will later move to a purpose-made facility in the Oxford Science Park, currently under construction. Once complete, this state-of-the-art facility will include more than 40,000 m² of research laboratory and office space. It will house over 30 groups and up to 600 employees at scale, focused on solving the two critical challenges in making biology engineerable and applying the solutions to addressing the global challenges encapsulated in EIT’s Humane Endeavors.

Your Role:

Working as part of a new Scientific Computing team within GBI, the HPC Engineer will help operate, improve, and scale the data and computing platform that will enable cutting-edge research in engineering biology. This is a broad, hands-on role at the interface of Linux systems, high-performance computing, cloud infrastructure, Kubernetes, Slurm, storage, monitoring, and researcher support. They will help turn emerging researcher needs and operational lessons into robust platform improvements, reusable tooling, and clear runbooks.

This role is particularly suited to someone who enjoys practical systems work, learning new technologies, and collaborating closely with scientists and engineers. We do not expect candidates to have deep experience in every technology listed in this description Instead, we are looking for a strong, scientifically minded systems engineer: someone who can troubleshoot complex environments, communicate clearly with multidisciplinary teams, learn unfamiliar tools quickly, and help build reliable, scalable services that advance GBI’s scientific mission.

Key Responsibilities

  • Operate, maintain, and improve GBI’s hybrid HPC platform, including Linux-based compute environments, Slurm/Slinky workloads, Kubernetes/OKE services, Open OnDemand, GPU and CPU partitions, and shared storage.
  • Help provision, configure, scale, and validate compute, storage, networking, and platform services using infrastructure as code, configuration management, and automation tools such as Terraform, Helm and Ansible.
  • Monitor platform health, capacity, job scheduling, GPU utilisation, storage behaviour, and network performance; investigate issues using tools such as Prometheus and Grafana.
  • Support researchers in using our Scientific Computing Platform, including triaging user issues and translating common pain points into platform improvements.
  • Build and maintain reproducible runtime environments, container images, and workflow-supporting services for scientific computing workloads, including bioinformatics, AI/ML, data processing, and simulation workflows.
  • Contribute to safe rollout and maintenance processes for Slurm images, worker node pools, scheduler configuration, container runtime changes, security updates, and monitoring improvements.
  • Create and maintain clear technical documentation, runbooks, validation checks, and issue/PR notes so the platform can be operated consistently and improved safely by the wider team.

Requirements

Essential Knowledge, Skills and Experience:

  • Bachelor’s or Master’s degree in Computer Science, Computational Biology, Engineering, Physics, Mathematics, or a related discipline, or equivalent practical experience.
  • Hands-on experience supporting or administering Linux-based systems in an HPC, cloud, research, academic, or production environment.
  • Working knowledge of HPC or batch-computing concepts, including schedulers, resource requests, queues/partitions, shared filesystems, and multi-user compute environments; Slurm experience is preferred.
  • Ability to troubleshoot issues across systems, networking, storage, identity, containers, schedulers, and user workloads, and to follow problems through to a reliable operational fix.
  • Experience with scripting, automation, and version-controlled operational changes using tools such as Git, CI/CD, Terraform, Ansible, Helm, or similar.
  • Ability to work closely with multidisciplinary research teams, understand scientific computing needs, and deliver practical services that advance scientific goals.
  • Strong communication and documentation skills, with the ability to explain technical concepts clearly to scientists, engineers, and non-specialist audiences.
  • A proactive, learning-oriented approach suited to a new team building and improving a platform while also operating it day to day.

Desirable Knowledge, Skills and Experience:

  • Experience operating Slurm clusters, Slinky/slurm-operator, Open OnDemand, JupyterLab services, or other researcher-facing HPC portals and access patterns.
  • Experience with Kubernetes or managed Kubernetes platforms such as OCI OKE, EKS, GKE, or AKS, including Helm, Argo CD, operators, services, storage classes, and workload troubleshooting.
  • Experience with cloud infrastructure, particularly OCI, and with infrastructure as code and remote execution models such as Terraform Cloud.
  • Experience with shared and high-performance storage such as Lustre, BeeGFS, GPFS, NFS, OCI File Storage, object storage, or data movement workflows for large scientific datasets.
  • Experience supporting GPU-accelerated workloads, NVIDIA tooling, CUDA-aware environments, DCGM metrics, GPU health monitoring, and/or AI/ML and bioinformatics workloads on shared compute platforms.
  • Experience with containerised HPC and scientific workflow tooling, such as Apptainer/Singularity, Docker/Podman, Pyxis/Enroot, Nextflow, Snakemake, CWL, or WDL.
  • Experience building monitoring and operational dashboards using Prometheus, Grafana, exporter metrics, alerting rules, or capacity and reliability reporting.
  • Familiarity with identity, access, and security controls in Linux or research environments, such as OIDC, Okta ASA/PAM, least-privilege access, and security patching.
  • Experience working in a scientific, academic, life-science, or research computing environment where requirements evolve through close collaboration with researchers.

Benefits

Our Benefits:

  • Salary: Competitive + travel allowance + bonus
  • Enhanced holiday pay
  • Pension
  • Life Assurance
  • Income Protection
  • Private Medical Insurance
  • Hospital Cash Plan
  • Therapy Services
  • Perk Box
  • Electric Car Scheme

Working Together – What It Involves:

  • You must have the right to work permanently in the UK with a willingness to travel as necessary. In certain cases, we can consider sponsorship, and this will be assessed on a case-by-case basis.
  • You will live in, or within easy commuting distance of, Oxford (or be willing to relocate).
  • Hybrid working
Ellison Institute of Technology Oxford

About Ellison Institute of Technology Oxford

Led by a world-class faculty of scientists, technologists, policy makers, economists and entrepreneurs, the Ellison Institute of Technology aims to deliver scaled solutions to humanity's important problems.

By breaking down the silos between science, research, industry and policy, we are reimagining the path from scientific discovery to applied solutions for the benefit of society.

Set for completion in 2027, the EIT Campus in Littlemore will include more than 300,000 sq ft of research laboratories, educational and gathering spaces.

Fuelled by growing ambition and the strength of Oxford’s science ecosystem, EIT is now expanding its footprint to a 2 million sq ft Campus across the western part of The Oxford Science Park.

Designed by Foster + Partners led by Lord Norman Foster, this will become a transformative workplace for up to 7,000 people, with autonomous laboratories, purpose-built laboratories including a plant sciences building and dynamic spaces to spark interdisciplinary collaboration.

Industry
Biotech & Life Sciences
Company Size
201-500 employees
Headquarters
Oxford, GB
Year Founded
2023
Website
eit.org
Social Media