Google

Staff Software Engineer, GKE AI

Google  •  Warsaw, PL (Onsite)  •  5 days ago

Job Description

Minimum qualifications:

  • Bachelor's degree or equivalent practical experience.
  • 8 years of experience with one or more general-purpose programming languages such as Java, C/C++, Python, Objective-C, JavaScript, or Go.
  • Experience in one or more of the following: test automation, refactoring code, test-driven development, build infrastructure, optimizing software, debugging, building tools and testing frameworks.
  • Experience with coding in data structures, algorithms and software design.
  • Experience in performance debugging of single-node systems.

Preferred qualifications:

  • Ability to manage issues and evolving changes in the areas of software design, integration, and infrastructure.

About the job

Google Cloud's software engineers develop the next-generation technologies that change how billions of users connect, explore, and interact with information and one another. We're looking for engineers who bring fresh ideas from all areas, including information retrieval, distributed computing, large-scale system design, networking and data storage, security, artificial intelligence, natural language processing, UI design and mobile; the list goes on and is growing every day. As a software engineer, you will work on a specific project critical to Google Cloud's needs with opportunities to switch teams and projects as you and our fast-paced business grow and evolve. You will anticipate our customer needs and be empowered to act like an owner, take action and innovate. We need our engineers to be versatile, display leadership qualities and be enthusiastic to take on new problems across the full stack as we continue to push technology forward.

The GKE AI Scalability team is dedicated to engineering Google Kubernetes Engine (GKE) to handle the most extreme AI/ML workloads. We architect solutions for "Mega Clusters", pushing Kubernetes performance and scale to support our largest customers and their needs, up to several million accelerators. Our work directly enables AI research and deployment on Google Cloud.

Google Cloud accelerates every organization’s ability to digitally transform its business and industry. We deliver enterprise-grade solutions that leverage Google’s cutting-edge technology, and tools that help developers build more sustainably. Customers in more than 200 countries and territories turn to Google Cloud as their trusted partner to enable growth and solve their most critical business problems.

Responsibilities

  • Design, develop, and operate software and systems to enhance Google Kubernetes Engine's (GKE) scalability for massive AI/ML workloads.
  • Diagnose and resolve performance bottlenecks across the Kubernetes stack at scale.
  • Collaborate with teams across Google Cloud to deliver highly reliable and performant large-scale cluster solutions.
  • Contribute to the full software development lifecycle, from ideation to production support.
  • Participate in on-call rotations to ensure the stability of large-scale GKE clusters.

About Google

A problem isn't truly solved until it's solved for all. Googlers build products that help create opportunities for everyone, whether down the street or across the globe. Bring your insight, imagination and a healthy disregard for the impossible. Bring everything that makes you unique. Together, we can build for everyone.

Check out our career opportunities at goo.gle/3DLEokh

Industry
IT & Software
Company Size
10,000+ employees
Headquarters
Mountain View, CA