Apex Group Ltd

AI Infrastructure and Data Sr. Associate

Apex Group Ltd  •  Pune, IN (Hybrid)  •  6 hours ago
Apply
AI can make mistakes so check important info. Chat history is never stored.

Job Description

The Apex Group was established in Bermuda in 2003 and is now one of the world’s largest fund administration and middle office solutions providers.

Our business is unique in its ability to reach globally, service locally and provide cross-jurisdictional services. With our clients at the heart of everything we do, our hard-working team has successfully delivered on an unprecedented growth and transformation journey, and we are now represented by over circa 13,000 employees across 112 offices worldwide.Your career with us should reflect your energy and passion.

That’s why, at Apex Group, we will do more than simply ‘empower’ you. We will work to supercharge your unique skills and experience.

Take the lead and we’ll give you the support you need to be at the top of your game. And we offer you the freedom to be a positive disrupter and turn big ideas into bold, industry-changing realities.

For our business, for clients, and for you

The AI Infrastructure and Data Senior Associate plays a key role in designing and managing AI infrastructure systems at CAIDO Group. This role involves hands-on work with GPU clusters, distributed training frameworks, and data pipelines to ensure scalable and efficient AI operations. The Senior Associate will mentor junior team members while driving infrastructure excellence.

KEY RESPONSIBILITIES

• Design and implement infrastructure frameworks for AI model training and deployment

• Manage GPU cluster operations including monitoring, scaling, and optimization

• Configure and maintain Kubernetes-based orchestration platforms for ML workloads

• Implement and tune distributed training frameworks (PyTorch, TensorFlow)

• Build and maintain high-performance data pipelines for AI applications

• Automate infrastructure provisioning using Terraform, Ansible, or Pulumi

• Monitor cloud costs and optimize resource utilization (FinOps)

• Troubleshoot complex infrastructure issues and implement solutions

• Collaborate with ML engineers to diagnose training bottlenecks and performance issues

• Mentor Associate-level team members and contribute to team knowledge sharing

• Document infrastructure architectures and operational procedures

REQUIRED QUALIFICATIONS

• Bachelor's degree in Computer Science, Engineering, or related field

• 3-5 years of experience in DevOps, cloud infrastructure, or SRE roles

• Strong proficiency in Linux systems administration

• Experience with container orchestration (Kubernetes, Docker)

• Proficiency in Python and Infrastructure as Code tools

• Hands-on experience with cloud platforms (AWS, GCP, or Azure)

• Understanding of machine learning workflows and requirements

• Experience with monitoring and observability tools

• Strong problem-solving and analytical skills

• Excellent communication and collaboration abilities

PREFERRED QUALIFICATIONS

• Experience managing GPU clusters or HPC environments

• Knowledge of distributed training frameworks (PyTorch DDP, DeepSpeed)

• Familiarity with parallel file systems (Lustre, GPFS)

• Certified Kubernetes Administrator (CKA) or equivalent

• Cloud architecture certifications (AWS, GCP, Azure)

• Experience with MLOps practices and tools

WORK DETAILS

• Employment Type: Full-time, Permanent

• Location: Pune, India

• Department: CAIDO Group - Innovation and AI

• Reports To: AI Infrastructure and Data Manager

• Work Type: Office-based with potential hybrid options

Disclaimer Unsolicited CVs sent to Apex (Talent Acquisition Team or Hiring Managers) by recruitment agencies will not be accepted for this position. Apex operates a direct sourcing model and where agency assistance is required, the Talent Acquisition team will engage directly with our exclusive recruitment partners.

Apex Group Ltd

About Apex Group Ltd

We are a single-source financial solutions provider dedicated to driving positive change while supporting the growth and ambitions of asset managers, allocators, financial institutions, and family offices around the world.

Established in Bermuda in 2003, we have continually disrupted the industry through our investment in innovation and talent. Today, we set the pace in fund and asset servicing and stand out for our unique single-source solution and unified cross asset-class platform which supports the entire value chain, harnesses leading innovative technology, and benefits from cross-jurisdictional expertise delivered by a long-standing management team and over 13,000 highly integrated professionals.

As a pioneering data and fintech-enabled company, we are a disruptor driving digital tools into fund and asset servicing. However, our vision to drive positive change extends beyond the industry. The Apex Foundation, a not-for-profit entity, is our passionate commitment to empower sustainable change.

Industry
Finance & Insurance
Company Size
5,001-10,000 employees
Headquarters
Hamilton, BM
Year Founded
Unknown
Social Media