Google

Software Engineer III, Multimodal Agentic AI, XR

Google  •  $147k - $211k/yr  •  San Jose, CA (Onsite)  •  5 hours ago
Apply
AI can make mistakes so check important info. Chat history is never stored.

Job Description

Minimum qualifications:

  • Bachelor’s degree or equivalent practical experience.
  • 2 years of experience with software development in Python or C++.
  • 1 year of experience with ML infrastructure (e.g., model deployment, model evaluation, optimization, data processing, debugging).
  • Experience with GenAI techniques (e.g., LLMs, Multi-Modal, Large Vision Models) or with GenAI-related concepts (language modeling, computer vision).

Preferred qualifications:

  • Master's degree or PhD in Computer Science, or a related technical field.
  • 2 years of experience with data structures and algorithms.
  • Experience conducting applied research to enable new functionality and improve the quality and efficiency of large language and multimodal models.
  • Knowledge of machine learning and statistics.

About the job

Google's software engineers develop the next-generation technologies that change how billions of users connect, explore, and interact with information and one another. Our products need to handle information at massive scale, and extend well beyond web search. We're looking for engineers who bring fresh ideas from all areas, including information retrieval, distributed computing, large-scale system design, networking and data storage, security, artificial intelligence, natural language processing, UI design and mobile; the list goes on and is growing every day. As a software engineer, you will work on a specific project critical to Google’s needs with opportunities to switch teams and projects as you and our fast-paced business grow and evolve. We need our engineers to be versatile, display leadership qualities and be enthusiastic to take on new problems across the full-stack as we continue to push technology forward.

Our team is at the forefront of building the next generation of conversational AI. We're developing agentic AI solutions for smart glasses, utilizing Gemini Live and Astra to create a unique and trusted multimodal experience. This technology delivers instant, natural conversational intelligence directly to the user's eye, allowing them to navigate their world more immersively than ever.

In this role, you will design multimodal agentic solutions focused on goal-oriented reasoning tasks. You will enhance and develop new multimodal tools and extensions. You will define and execute the strategy for data, evaluation, and post-tuning of the Gemini model to enhance its impact for smart glasses use cases.

For decades, the computing revolution has reshaped our world driven by
breakthroughs in compute, connectivity, mobile, and now, AI. Google's XR team is at the forefront of the next major leap – the convergence of AI and XR. This is more than just new devices – it's about reimagining how we interact with the world around us. We're building a future where
lightweight XR devices like smart glasses and headsets pair with helpful AI to augment human intelligence, offering personalized, conversational, and contextually aware experiences.Individual pay is determined by factors including job-related skills, experience, and relevant education or training.

US: $147000 - $211000 (USD) + 15% bonus target + bonus + equity + benefits

Learn more about benefits at Google

Responsibilities

  • Design, develop, and deploy scalable and agentic AI solutions for high-value, real-world multimodal conversational AI use cases on smart glasses.
  • Gain an understanding of the Gemini Live and Astra tech stack and infrastructure. Optimize agent architecture/orchestration to ensure efficient deployment and operation at scale, with a focus on inference cost optimization.
  • Take ownership of AI quality for production systems. This includes defining technical metrics, implementing evaluation frameworks, analyzing loss patterns, and driving improvements through data collection and smart data generation and model enhancements.
  • Implement, optimize, and advance AI techniques, with a focus on multimodal conversational quality, multimodal tool use, and multimodal goal-oriented reasoning.
Google

About Google

A problem isn't truly solved until it's solved for all. Googlers build products that help create opportunities for everyone, whether down the street or across the globe. Bring your insight, imagination and a healthy disregard for the impossible. Bring everything that makes you unique. Together, we can build for everyone.

Check out our career opportunities at goo.gle/3DLEokh

Industry
IT & Software
Company Size
10,000+ employees
Headquarters
Mountain View, CA
Year Founded
Unknown
Social Media