The Data Engineer will play a crucial role in developing and fine-tuning data specifically for our LLMs and machine learning models. This individual will be responsible for the entire data lifecycle, including gathering, cleaning, structuring, and optimizing large, diverse healthcare datasets. The ideal candidate will have a strong background in data engineering principles, experience with big data technologies, and a keen understanding of the unique challenges and requirements of healthcare data.
You will design, build, and maintain scalable data pipelines that source, preprocess, and deliver high-quality, high-volume datasets to our machine learning engineers. This role requires a deep understanding of data engineering best practices coupled with specific knowledge of the data requirements for LLM training and refinement
Requirements
Benefits
Why Join Us?
Joining C the Signs is not just about building AI; it’s about shaping the future of healthcare. If you are a technical leader with an unshakable belief in the power of AI to save lives and the ability to make it happen at scale, this is your opportunity to create a tangible, global impact.
Benefits:

C the Signs is an AI-powered clinical platform that enables the earliest and most accurate detection of cancer across 100+ cancer types.
Founded by NHS doctors Dr Bea Bakshi and Dr Miles Payling, our mission is to make early detection a standard for all, not a privilege for some.
Built within the NHS and integrated with primary care systems, C the Signs combines real-world patient data and advanced AI to predict cancer risk and tumour origin within seconds - empowering clinicians to act earlier and save more lives.
- Detects a patient with cancer every 22 minutes
- Used across 1,500+ GP practices
- Trusted by 10,000+ healthcare professionals
- Proven 99% sensitivity and 99% negative predictive value
- Reducing emergency presentations by over 50%
We exist because time matters. When cancer is found early, there is time - time to choose, time to treat, time to live.