Cloudera

AI Solutions Architect

Cloudera  •  Washington, DC (Remote)  •  7 hours ago
Apply
AI can make mistakes so check important info. Chat history is never stored.

Job Description

Business Area:

Professional Services

Seniority Level:

Mid-Senior level

At Cloudera, we empower people to transform complex data into clear and actionable insights. With as much data under management as the hyperscalers, we're the preferred data partner for the top companies in almost every industry. Powered by the relentless innovation of the open source community, Cloudera advances digital transformation for the world’s largest enterprises.

As an AI Solutions Engineer within Cloudera’s Public Sector Consulting team, you will be the technical architect and execution lead for agencies moving from "data chaos" to "agentic autonomy." You will work directly with government organizations to design, build, and deploy mission-critical AI applications on the Cloudera Data Platform (CDP).

This is not a "theoretical" role. You will be on the front lines of Phase 2 and Phase 3 adoption journeys—helping customers clean legacy data silos, select the right model architectures, and industrialize MLOps pipelines in highly secure, often air-gapped or hybrid-cloud environments.

As the AI Solutions Engineer you will:

1. AI Model Strategy, Selection and Implementation

  • Evaluate and select optimal model architectures (LLMs, SLMs, or traditional ML) based on mission requirements, considering tradeoffs between accuracy, latency, and cost.

  • Guide customers on "Build vs. Buy vs. Fine-tune" decisions, prioritizing open-source models (Llama, Mistral, Falcon) that can run securely within a sovereign data perimeter.

  • Experience building Agentic Workflows (AI agents that can execute API calls and multi-step tasks).

2. End-to-End Data Engineering

  • Design and implement robust data pipelines within CDP to transform "messy" legacy data into AI-ready formats.

  • Develop and optimize Vector Databases and Retrieval-Augmented Generation (RAG) architectures to ground AI responses in verified agency facts.

  • Build Data pipelines with Spark, Nifi, Kafka or other ETL tools.

3. Optimization & Performance Tuning

  • Optimize model inference for production environments using quantization, pruning, and hardware acceleration (NVIDIA GPU orchestration).

  • Implement LLMOps to monitor model performance, detect hallucination rates, and manage model versioning and drift.

4. Public Sector Advisory & Governance

  • Collaborate with the customer’s AI Center of Excellence (CoE) to establish automated guardrails for ethics, bias mitigation, and FedRAMP/IL5 compliance.

  • Translate complex technical AI concepts into mission-value briefings for GS-level stakeholders and agency leadership.

We’re excited about you if you have: (Minimum Qualifications):

  • Experience: 5+ years in Data Engineering, Machine Learning, or Software Engineering, with at least 2 years focused on Generative AI or Deep Learning.

  • Technical Stack: Expertise in Python and deep learning frameworks (PyTorch, TensorFlow, Hugging Face).

    • Hands-on experience with Cloudera (CDP), Spark, or similar big data ecosystems.

    • Proficiency in orchestration tools like LangChain, LlamaIndex, or Haystack.

    • Experience developing visual data representations and dashboards (Django, React, or Angular)

    • Experience using a compiled programming language, preferably one that runs on the JVM (Java, Scala, etc)

  • Data Expertise: Proven ability to build ETL/ELT pipelines and work with both SQL and NoSQL/Vector databases (e.g., Pinecone, Milvus, or PGVector).

  • Public Sector Knowledge: Understanding of government security frameworks (NIST AI RMF, FedRAMP, SRGs, STIGs).

  • Active Top Secret Security Clearance

You may also have: (Preferred Qualifications)

  • Experience fine-tuning of foundational models using techniques such as PEFT (Parameter-Efficient Fine-Tuning) and LoRA to adapt AI to domain-specific government nomenclature.

  • Experience training of specialized models on proprietary datasets while ensuring strict adherence to data privacy and sensitivity labels.

  • Experience installing and operating Cloudera Data Platform

  • Experience installing and operating Kubernetes

  • Experience in Air-Gapped deployments and managing AI workloads in disconnected environments.

  • Advanced degree (MS or PhD) in Computer Science, Data Science, or a related field.

  • Active Counterintelligence (CI) or Full Scope (FS) Poly is highly preferred.

This role is not eligible for immigration sponsorship.

What you can expect from us:

  • Generous PTO Policy

  • Support work life balance with Unplugged Days

  • Flexible WFH Policy

  • Mental & Physical Wellness programs

  • Phone and Internet Reimbursement program

  • Access to Continued Career Development

  • Comprehensive Benefits and Competitive Packages

  • Paid Volunteer Time

  • Employee Resource Groups

EEO/VEVRAA

#LI-MH2

#LI-Remote

Cloudera

About Cloudera

Cloudera is the only data and AI platform company that brings AI to data anywhere: in clouds, data centers, and at the edge. Cloudera delivers 100% of data in all forms–whether it is in Cloudera or anywhere in the entire data estate. The world’s largest organizations rely on Cloudera to fuel insights that boost bottom lines, safeguard against threats, and save lives. Learn more at Cloudera.com.

---------------------------------------------------------------------------------

Recruitment Fraud Alert

It has come to our attention that job seekers have been contacted about fake job opportunities with Cloudera from individuals fraudulently posing as Cloudera employees. These recruiting fraud schemes often include requests for personal information and payments.

Be aware that Cloudera will never request a payment as part of its recruitment process. Additionally, Cloudera will never make a job offer without conducting an interview process. Any information submitted to Cloudera in relation to a job application should only be through our official career portal (https://www.cloudera.com/careers.html). Email communications from Cloudera will come from an email address ending in @cloudera.com.

If you are the target of a recruiting scam, consider filing a report with law enforcement authorities. Cloudera is not responsible for fraudulent job offers and/or any claims, damages, expenses, or other inconvenience connected to recruiting scams.

Industry
IT & Software
Company Size
1,001-5,000 employees
Headquarters
Santa Clara, California
Year Founded
Unknown
Social Media