Prediktive

Cloud Data Engineer

Prediktive  •  United States (Onsite)  •  3 months ago
Apply
AI can make mistakes so check important info. Chat history is never stored.

Job Description

We are looking for a Cloud Data Engineer based in Latin America to work on a long-term project for one of our clients, a Data Analytics and Business Intelligence services company based in Los Angeles.

The person in this role will be part of the new Product Engineering team tasked with designing and building the next generation of Agentic AI-powered products. This person will not just move data but will collaborate with the team to design the data store for our autonomous agents. The Engineer will be responsible for building multi-modal data pipelines that process structured and unstructured data into varied data stores, including vector and graph databases.

Responsibilities

  • Design and build multi-modal data ingestion pipelines to collect and structure data from relational databases and unstructured sources such as PDFs, Word documents, and PNG/SVG diagrams.
  • Develop and optimize the knowledge layer, managing embeddings in vector databases (e.g., Pinecone, ChromaDB, Vertex AI Search) and graph databases (e.g., Neo4j) to enable multi-step agent reasoning across distributed data sources.
  • Integrate autonomous coding agents into daily development workflows to plan, generate, and refactor data infrastructure and microservices.
  • Collaborate with AI Engineers to design and maintain evaluation pipelines and gold datasets used for automated verification, retrieval quality measurement, and confidence scoring of AI outputs.
  • Develop and maintain scalable data services and RESTful APIs using Python (FastAPI/Django) to expose structured and validated data.
  • Deploy, monitor, and optimize data workloads on GCP using services such as Vertex AI and BigQuery, ensuring system reliability, security, and cost efficiency.

Requirements

  • Advanced Level of English.
  • 5+ years of Data Engineering experience, with a proven track record of building production-grade AI data systems.
  • 4+ years of experience working with Python and strong experience working with frameworks and libraries such as FastAPI, Pydantic, SQLAlchemy, and SQL for building scalable data services and microservices.
  • Hands-on experience working with vector databases such as Pinecone or ChromaDB, and building RAG pipelines and GraphRAG patterns.
  • Strong experience working with data orchestration and processing tools such as Apache Airflow, Cloud Composer, or Prefect, as well as BigQuery and DataProc.
  • Hands-on experience working with cloud platforms such as GCP or AWS, deploying and managing scalable data pipelines and data workloads.
  • Demonstrated experience using AI-assisted development tools such as GitHub Copilot, Cursor, Gemini Code Assist, Claude Code, Codex, or similar autonomous coding agents to accelerate software development workflows.

Bonus Points

  • Bachelor’s Degree in Computer Science, Systems Engineering or related fields

What we offer

  • Long term positions.
  • Compensation in USD.
  • Paid time off.
  • Cool clients and products.
  • Work with great engineers.

4tech

Prediktive

About Prediktive

Prediktive is the premier Technology Business Partner powering the growth of tech-enabled companies.

It specializes on the execution of software product development and business strategic programs.

Industry
IT & Software
Company Size
51-200 employees
Headquarters
Silicon Valley, CA
Year Founded
2017
Social Media