
In this role, you will be pivotal in architecting and optimizing the serving stack for models like Gemini in an on-prem cloud environment, addressing exciting challenges to improve speed and efficiency. This is a unique opportunity to go deep, leading system-level design and performance profiling, ensuring Google's LLMs run faster and more cost-effectively than ever before.
Google Cloud accelerates every organization’s ability to digitally transform its business and industry. We deliver enterprise-grade solutions that leverage Google’s cutting-edge technology, and tools that help developers build more sustainably. Customers in more than 200 countries and territories turn to Google Cloud as their trusted partner to enable growth and solve their most critical business problems.
Individual pay is determined by factors including job-related skills, experience, and relevant education or training.
A problem isn't truly solved until it's solved for all. Googlers build products that help create opportunities for everyone, whether down the street or across the globe. Bring your insight, imagination and a healthy disregard for the impossible. Bring everything that makes you unique. Together, we can build for everyone.
Check out our career opportunities at goo.gle/3DLEokh