
M365 Copilot inference is a high-impact engineering team advancing applied AI and large-scale machine learning across Microsoft. We design and operate the platform powering Microsoft 365 Copilot experiences.
Our team operates at massive GPU (Graphics Processing Unit) scale across multiple regions and SKUs in global datacenters. We build the core LLM (Large Language Model) API (Application Programming Interface), routing, capacity, and control plane services that turn that fleet into Copilot experiences.
We are hiring a Principal Group Software Engineering Manager to own GPU fleet health, capacity intake and planning, and automated model deployment for Copilot. This is one of the most strategic leadership roles in Copilot: every feature, experiment, and model launch flows through the systems this leader owns. You will lead existing teams, grow the org, and build the control plane that turns capacity management from a manual, ticket-driven process into an automated, self-driven platform.
You will own the end‑to‑end GPU fleet health and capacity platform, establishing a single source of truth with strong observability across hardware, hosts, and workloads to drive utilization and reliability. Design and scale capacity intake, planning, and deployment, reducing model time‑to‑production and meeting SLAs (service level agreements) for priority workloads through automation and data‑driven operations.
Build a unified control plane that connects intake, planning, deployment, and fleet operations, enabling global optimization across cost, latency, compliance, and flexible model scaling (0→1 platform ownership).
Microsoft’s mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.
Responsibilities
Qualifications
Required Qualifications:
Other Requirements:
Ability to meet Microsoft, customer and/or government security screening requirements is required for this role. These requirements include but are not limited to the following specialized security screenings:
Software Engineering M6 - The typical base pay range for this role across the U.S. is USD $165,600 - $296,400 per year. A different range applies to specific work locations within the San Francisco Bay Area and New York City metropolitan area, where the base pay range for this role is USD $220,800 - $331,200 per year.
Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here:
https://careers.microsoft.com/us/en/us-corporate-pay
This position will be open for a minimum of 5 days, with applications accepted on an ongoing basis until the position is filled.
Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, citizenship, color, family or medical care leave, gender identity or expression, genetic information, immigration status, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran or military status, race, ethnicity, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable local laws, regulations and ordinances. If you need assistance with religious accommodations and/or a reasonable accommodation due to a disability during the application process, read more about requesting accommodations.

Every company has a mission. What's ours? To empower every person and every organization to achieve more. We believe technology can and should be a force for good and that meaningful innovation contributes to a brighter world in the future and today. Our culture doesn’t just encourage curiosity; it embraces it. Each day we make progress together by showing up as our authentic selves. We show up with a learn-it-all mentality. We show up cheering on others, knowing their success doesn't diminish our own. We show up every day open to learning our own biases, changing our behavior, and inviting in differences. Because impact matters.
Microsoft operates in 190 countries and is made up of approximately 228,000 passionate employees worldwide.