Job Description
While technology is the heart of our business, a global and diverse culture is the heart of our success. We love our people and take pride in offering them a culture built on transparency, diversity, integrity, learning and growth.
If working in an environment that encourages you to innovate and excel, not just in your professional life but in your personal life too, interests you, you would enjoy your career with Quantiphi!
About Quantiphi:
Quantiphi is an award-winning, AI-First global digital engineering company that helps the world’s leading Fortune 1000 organizations transform bold ideas into measurable business impact. We go beyond building innovative AI technologies—we solve the problems that matter most to our clients.
Since our founding in 2013, Quantiphi has built a proven track record of turning complex challenges into meaningful outcomes across industries.
Headquartered in Boston, with more than 4,000 professionals worldwide, we partner with global enterprises to deliver large-scale digital, cloud, and AI-driven transformation. #SolvingWhatMatters.
We are an Elite and Premier partner to Google Cloud, AWS, NVIDIA, Snowflake, and other leading technology platforms, and our work has been recognized across the industry, including:
- 21 Google Cloud Partner of the Year awards in the past 10 years
- 3 AWS AI/ML Partner of the Year awards
- 3 NVIDIA Partner of the Year awards
- 3 Snowflake Partner of the Year awards
- Rated Leaders by Gartner, Forrester, IDC, ISG, Everest Group and other leading analyst firms
Quantiphi delivers first-in-class AI solutions across Life Sciences, Healthcare, Banking, Financial Services, CPG, Manufacturing, Energy, High-Tech, Telecommunications, and other industries, powered by cutting-edge Generative AI and Agentic AI accelerators.
We are also proud to be certified as a Great Place to Work—reflecting our commitment to our people and our culture.
For more details, visit: Website or LinkedIn Page
Role: Generative AI Architect
Experience Level: 8+ years
Employment type: Full Time
Location: Remote (USA)
What you will do:
- We are looking for a Generative AI Architect / Lead to design and deliver enterprise-grade GenAI solutions using AWS Bedrock and AgentCore. This role focuses on building scalable applications leveraging large language models (LLMs), retrieval-augmented generation (RAG), and agentic AI workflows.
- The ideal candidate will be a hands-on architect who can define solution architecture, guide teams, and actively contribute to development while ensuring performance, scalability, and cost efficiency.
Key Responsibilities:
- Design and implement GenAI solutions using AWS Bedrock and AgentCore.
- Define architecture for LLM-based applications, including RAG pipelines and agentic workflows.
- Develop and orchestrate agentic AI workflows, enabling multi-step reasoning, tool usage, and task automation.
- Build and manage RAG pipelines, including embeddings, retrieval mechanisms, and vector databases.
- Integrate LLM capabilities into enterprise applications via APIs and backend services.
- Design and optimize prompt engineering strategies for accuracy, relevance, and performance.
- Work with structured and unstructured data sources to enable knowledge-driven AI applications.
- Ensure model evaluation, monitoring, and optimization for latency, cost, and response quality.
- Collaborate with application, data, and platform teams for end-to-end solution delivery.
- Define best practices for security, governance, and responsible AI usage.
- Troubleshoot and resolve issues in production GenAI systems.
- Provide technical leadership and mentor team members while remaining hands-on.
Basic Qualifications (BQ):
- 8+ years of relevant hands-on technical experience implementing and developing cloud solutions on AWS.
- Hands-on experience with AWS services, including proven experience using AWS SageMaker and Bedrock with different types of data sources, training jobs, and real-time and batch applications.
- Experience designing and implementing agentic AI architectures using frameworks such as LangChain, Strands Agents, etc., enabling autonomous task planning, decision-making, and multi-step reasoning.
- Hands-on experience with Amazon AgentCore for building, deploying, and scaling production-grade agentic AI applications, including agent memory management, tool registry, and observability.
- Architect and deploy scalable AI solutions on AWS, leveraging services like Lambda, Bedrock, Step Functions, S3, API Gateway, and SageMaker.
- Proficiency in working with LLM APIs (e.g., Claude, Nova, and other third-party LLM providers), including API integration and multi-model orchestration strategies.
- Hands-on experience fine-tuning or optimizing large language models (LLMs).
- Familiarity with LLM tool use, prompt templating and context management.
- Strong expertise in Vector Databases, including indexing strategies, embedding generation, similarity search, and integration with RAG architectures.
- Model Evaluation & Optimization: Evaluate LLMs' zero-shot and few-shot capabilities, tune hyperparameters, ensure task generalization, and explore model interpretability for robust web app integration.
- Develop and maintain Model Context Protocol (MCP) implementations to manage state, context windows, memory, and prompt orchestration across distributed agent systems.
- Experience with at least one workflow orchestration tool, such as Airflow, Step Functions, SageMaker Pipelines, or Kubeflow.
- Experience implementing secure, scalable APIs and integrating with 3rd-party data sources and tools.
- Ability to collaborate with cross-functional teams such as Developers, QA, Project Managers, and other stakeholders to understand their requirements and implement solutions.
- Experience with deep learning concepts: Transformers, BERT, attention models, tokenization, and embeddings.
Other Qualifications (OQ):
- Experience with software development, including exposure to frontend and backend frameworks and communication protocols
- Experience working on Infrastructure as Code (IaC) and CI/CD pipelines
- Experience with NLP concepts: syntactic/semantic analysis, NER, etc.
What is in it for you:
- Join one of the world’s fastest-growing AI-first digital engineering companies and make a real impact at scale.
- Lead and collaborate with a high-energy team of talented, driven individuals solving complex, meaningful challenges.
- Work with Fortune 500 companies and disruptive innovators in a research-driven environment with 60+ patents.
- Stay ahead of the curve by gaining hands-on experience with cutting-edge AI, ML, data, and cloud technologies while continuously upskilling.
If you like wild growth and working with happy, enthusiastic over-achievers, you'll enjoy your career with us!