ProCogia

LLM Research Intern

ProCogia  •  Vancouver, CA (Onsite)  •  4 days ago

Job Description

About ProCogia:

We help businesses transform data into real growth!
Our clients operate in high-stakes, highly regulated industries (such as telecom, financial services, and life sciences), where precision, compliance, and measurable outcomes are non-negotiable. We partner with them by embedding expert data science, engineering, and AI talent directly into projects that matter.
We’re a diverse, close-knit team with a shared goal: delivering top-class, end-to-end data solutions. We don’t just analyse data; we push the boundaries of what’s possible, helping clients unlock new value and insights.
When you join ProCogia, you’ll find a supportive, growth-driven environment where your ideas are welcomed and your development is prioritized. We offer competitive salaries, generous benefits, and perks for personal and professional development.
If you’re ready to unleash your potential and work at the cutting edge of data consulting, we’d love to meet you!

Our culture centres on maintaining a high level of cultural equality throughout the company. Our diversity and differences allow us to create innovative and effective data solutions for our clients.

Our Core Values: Trust, Growth, Innovation, Excellence, and Ownership

Responsibilities

  • Assess client-specific data assets and determine the appropriate adaptation strategy — continued pretraining, supervised fine-tuning, or a combination — based on the domain, data volume, and use case requirements
  • Curate, clean, structure, and prepare domain-specific datasets from raw client data for use in model training pipelines
  • Fine-tune large language models in the 70B–100B+ parameter range using techniques such as LoRA, QLoRA, and multi-adapter patterns
  • Perform continued pretraining on open-weight models (Qwen, Llama, and related ecosystems) to embed domain knowledge directly into model weights
  • Manage distributed training workflows across multi-node GPU clusters
  • Design and execute evaluation frameworks to validate domain adaptation quality, factual grounding, and model behavior
  • Support RAG system development where applicable, including vector database integration, chunking strategies, and reranking pipelines
  • Contribute to inference optimization and deployment pipeline integration

Required Qualifications

  • Currently enrolled in, or a recent graduate of, a Bachelor's, Master's, or PhD program in Computer Science, Machine Learning, or a related field
  • Demonstrated hands-on experience fine-tuning large language models, supported by concrete project work, research, or open-source contributions
  • Experience with frontier-scale models (100B+ parameters) or distributed training across multi-node GPU clusters
  • Familiarity with parameter-efficient fine-tuning methods (LoRA, QLoRA) and open-weight model architectures
  • Experience with data curation and preparation workflows for LLM training, including cleaning, formatting, deduplication, and quality filtering
  • Proficiency in Python-based ML frameworks such as PyTorch, HuggingFace Transformers, DeepSpeed, or FSDP
  • Understanding of training compute, memory constraints, and inference trade-offs at scale

Nice to Have

  • Familiarity with RAG architectures or production inference serving frameworks (vLLM, TGI, TensorRT-LLM)
  • Experience in low-resource or multilingual NLP settings
  • Relevant publications, open-source contributions, or documented projects involving LLM training

ProCogia is proud to be an equal-opportunity employer. We are committed to creating a diverse and inclusive workspace. All qualified applicants will receive consideration for employment without regard to race, national origin, gender, gender identity, sexual orientation, protected veteran status, disability, age, or other legally protected status.

About ProCogia

ProCogia is a leading data and AI consultancy, specializing in building infrastructure and delivering tailor-made solutions to empower businesses worldwide. With a passionate team of experts and a technology-agnostic approach, we partner with clients across industries to unlock the game-changing potential of data.

Headquartered in Vancouver, BC, with offices in Seattle, New York, Boston, Toronto, Calgary, India, and Ireland, we serve clients in Telecom, Pharma, Biotechnology, Retail, Technology, and more—driving growth, efficiency, and sustainability.

Purpose:

At ProCogia, we measure success by the value we bring to people’s lives. By being the employer of choice for data and AI experts, empowering decision-makers with impactful insights, and delivering long-term stakeholder value, we aim to positively contribute to businesses and society.

Vision:

• Achieve recognition for customer satisfaction and employee well-being.

• Establish ProCogia as a respected global brand.

• Deliver cutting-edge data and AI lifecycle solutions that enable business success.

• Leverage a diverse global workforce to create impactful, sustainable solutions.

Mission:

1. Set new standards in data, analytics, and AI through our expertise.

2. Build a better tomorrow by empowering smarter decisions with data-driven insights.

3. Nurture the next generation of data and AI experts through education and development programs.

Values:

• Trust: Built on transparency, integrity, and reliability.

• Growth: Committed to continuous learning and adapting to the evolving landscape.

• Innovation: Always challenging the status quo with creative, repeatable solutions.

• Excellence: Surpassing expectations in every partnership.

Industry: IT & Software
Company Size: 51-200 employees
Headquarters: Vancouver, CA
Year Founded: 2013