DigitalOcean

Senior Director, Data Center Facilities

DigitalOcean  •  $234k - $292k/yr  •  Seattle, WA (Hybrid)  •  2 hours ago
Apply
AI can make mistakes so check important info. Chat history is never stored.

Job Description

Dive in and do the best work of your career at DigitalOcean. Journey alongside a strong community of top talent who are relentless in their drive to build the simplest scalable cloud. If you have a growth mindset, naturally like to think big and bold, and are energized by the fast-paced environment of a true industry disruptor, you’ll find your place here. We value winning together—while learning, having fun, and making a profound difference for the dreamers and builders in the world.

The Senior Director of Data Center On-Site Operations is a senior operational leader responsible for driving consistent, scalable, and high-performing execution across DigitalOcean’s global data center footprint. This role leads the on-site operations strategy for existing regional data centers and supports rapid expansion into new locations by building the operating model, leadership structure, processes, standards, and performance systems required to scale effectively.

This leader is accountable for improving operational excellence across data center sites, including execution quality, service reliability, staffing readiness, vendor performance, incident response, maintenance discipline, change execution, and customer-impact prevention. The role is focused on strengthening day-to-day operational performance while creating repeatable processes that allow the organization to grow quickly across both existing and new markets.

As DigitalOcean continues to invest heavily in physical infrastructure and AI/GPU capacity, this leader will play a critical role in preparing on-site operations for higher-density environments, liquid-cooled infrastructure, accelerated deployments, and increasingly demanding customer SLAs. The Senior Director will ensure that on-site teams can support rapid growth while maintaining safety, reliability, consistency, and accountability.

This executive serves as a key escalation leader for site-level operational issues, vendor performance concerns, and execution risks. The role requires strong cross-functional partnership with Infrastructure, Engineering, Design, Construction, Capacity Planning, Procurement, Networking, Product, and executive leadership to ensure data center operations are aligned with business growth, customer commitments, and long-term scalability.

Key Strategic Priorities

  • Operational Excellence and Continuous Improvement: Build and mature a disciplined operating model across all data center locations, with clear standards, repeatable processes, strong metrics, and continuous improvement mechanisms.
  • Scalable Site Operations Model: Develop the staffing models, leadership structures, training programs, procedures, and governance required to support rapid growth in both the number of locations and the complexity of operations.
  • Rapid Data Center Expansion Readiness: Ensure new sites can be operationalized quickly and consistently through standardized launch playbooks, readiness checklists, vendor coordination, staffing plans, and operational acceptance criteria.
  • Performance, Reliability, and SLA Improvement: Improve site-level performance against availability, service delivery, maintenance, change management, incident response, inventory accuracy, deployment execution, and customer-impact prevention.
  • AI/GPU Operational Enablement: Prepare on-site operations to support high-density GPU environments, liquid cooling, complex cabling, accelerated hardware deployments, higher-touch customer requirements, and increased operational risk.

Essential Duties and Responsibilities

Operational Leadership

  • Provides senior leadership for data center on-site operations across DigitalOcean’s existing and expanding global footprint.
  • Leads the development and execution of a scalable operating model for data center sites, including standards for staffing, shift coverage, escalation, maintenance, change management, incident response, vendor oversight, inventory control, deployment support, and customer-impact prevention.
  • Drives operational excellence across all data center locations by identifying gaps, standardizing best practices, improving execution discipline, and ensuring consistent adoption of policies, procedures, and operating standards.
  • Leads site operations teams responsible for 24x7 data center support, ensuring availability, reliability, safety, quality, and service delivery meet or exceed business and customer expectations.
  • Improves on-site execution performance by strengthening accountability, defining clear ownership, and implementing measurable performance standards across locations.

Scalable Growth and Process Development

  • Build repeatable operational processes that allow DigitalOcean to scale from a regional data center footprint to a larger, more complex, and more globally consistent operating environment.
  • Develops standardized site launch playbooks, operational readiness checklists, escalation models, staffing templates, maintenance routines, training plans, and vendor management practices for new and existing sites.
  • Partners with cross-functional teams to ensure that site operations are prepared for data center expansions, new technology deployments, new customer requirements, higher-density racks, GPU infrastructure, and liquid cooling environments.
  • Identifies operational constraints that limit growth and develops practical solutions to improve throughput, reduce execution friction, and accelerate site readiness.
  • Creates a process-development roadmap that supports organizational growth, improves consistency, and reduces dependency on individual tribal knowledge.

Performance Management and Metrics

  • Defines and manages key operational KPIs, SLAs, and performance indicators across the data center operations organization.
  • Uses data to identify trends, performance gaps, recurring issues, staffing constraints, vendor deficiencies, and opportunities for automation or process improvement.
  • Develops operational dashboards and review mechanisms for availability, incident response, maintenance completion, change success, inventory accuracy, deployment throughput, backlog management, staffing readiness, vendor performance, and customer-impact events.
  • Leads operational reviews with senior leadership, providing clear visibility into site health, execution risks, improvement plans, and progress against strategic goals.
  • Benchmarks operational performance against industry standards, internal targets, and business requirements.

Site Reliability, Availability, and Risk Management

  • Has responsibility for meeting or exceeding established service levels for data center operations.
  • Oversees site-level execution supporting critical physical infrastructure, including power, cooling, cabling, space, racks, network infrastructure, server deployment, hardware maintenance, and customer environment support.
  • Ensures that work performed in data center environments is completed safely, correctly, and without impact to internal or external customers.
  • Drives stronger change management discipline for work performed in live production environments, including pre-work planning, risk assessment, approvals, execution quality, rollback readiness, and post-change validation.
  • Evaluates and mitigates operational risks across new and existing sites, including staffing gaps, vendor performance issues, maintenance deficiencies, capacity constraints, process weaknesses, and readiness gaps for new deployments.
  • Improves incident response, root cause analysis, corrective action tracking, and recurrence prevention across data center locations.

Data Center Expansion and Readiness

  • Supports data center expansion efforts by ensuring on-site operational requirements are identified early and integrated into planning, design, construction, commissioning, and turnover processes.
  • Partners with Design, Construction, Capacity Planning, Engineering, Procurement, Networking, and Product teams to translate business demand into operational requirements for space, power, cooling, racks, cabling, staffing, maintenance, security, logistics, and site support.
  • Develops operational acceptance criteria for new sites, new data halls, new pods, and major infrastructure deployments.
  • Ensures new sites are handed over with complete documentation, diagrams, procedures, training, spares, escalation paths, vendor contacts, and support models.
  • Drives readiness for accelerated GPU deployments, high-density rack environments, liquid cooling, and other infrastructure programs tied to business growth.

Vendor and Partner Management

  • Leads site-level vendor management to ensure service providers, contractors, colocation partners, and maintenance vendors meet operational expectations, contractual commitments, SLAs, and safety requirements.
  • Serves as a senior escalation point for vendor performance issues, site delivery concerns, maintenance execution problems, and operational deficiencies.
  • Develops consistent vendor governance practices, including performance reviews, issue tracking, corrective action plans, escalation paths, and service-quality metrics.
  • Partners with Procurement, Legal, and Finance to identify opportunities to streamline vendor services, optimize costs, and improve accountability.
  • Builds strong working relationships with vendor stakeholders while maintaining clear expectations for execution, quality, safety, and delivery.

Team Leadership and Organizational Development

  • Leads, coaches, and develops data center operations managers, site leaders, and technical operations teams.
  • Builds a leadership bench capable of supporting rapid site growth, increased operational complexity, and higher customer expectations.
  • Creates training, mentoring, and skill-development programs that improve technical capability, execution discipline, safety awareness, and leadership readiness.
  • Develops staffing models that support service-level requirements, site complexity, customer commitments, GPU concentration, and operational risk.
  • Fosters a culture of ownership, accountability, collaboration, continuous improvement, safety, customer focus, and operational discipline.
  • Removes organizational barriers that slow execution, reduce accountability, or prevent strong cross-functional collaboration.

Governance, Documentation, and Standards

  • Defines and maintains policies, procedures, standards, and documentation related to data center operations, including performance, capacity, availability, continuity, security, safety, maintenance, and change management.
  • Ensures documentation and diagrams accurately capture critical site information, including physical layout, rack elevations, power paths, cooling configurations, cabling standards, escalation paths, maintenance routines, and operational dependencies.
  • Maintains operational governance routines to ensure standards are being followed across sites and that deviations are identified, reviewed, and corrected.
  • Supports budgeting, forecasting, planning, deployment coordination, incident management, problem management, change management, and operational reporting.
  • Creates and delivers executive-level presentations on site operations performance, operational risks, improvement programs, expansion readiness, and organizational needs.
  • Performs other duties as assigned.

Knowledge, Skills, and Abilities

  • Strong experience leading data center operations in a large enterprise, cloud, colocation, or high-growth infrastructure environment.
  • Demonstrated ability to improve operational performance across multiple data center locations.
  • Experience building scalable operational processes, playbooks, staffing models, governance routines, and performance systems.
  • Strong understanding of data center on-site operations, including power, cooling, cabling, hardware support, maintenance, vendor management, change management, incident response, logistics, and customer-impact prevention.
  • Experience operating against SLAs and improving performance in environments where availability, reliability, and execution quality are mission critical.
  • Ability to lead in a highly complex, fast-moving, matrixed organization with multiple stakeholders and competing priorities.
  • Strong understanding of data center growth challenges, including new site launches, capacity expansion, staffing readiness, vendor coordination, process standardization, and operational handoff.
  • Experience supporting high-density compute environments, GPU infrastructure, liquid cooling, large-scale server deployments, and complex cabling environments is strongly preferred.
  • Strong knowledge of data center critical infrastructure, including UPS systems, generators, transfer switches, power distribution, DC power systems, HVAC, cooling systems, liquid cooling, racks, structured cabling, and physical security.
  • Ability to translate business, server, storage, networking, and customer requirements into practical data center operational needs.
  • Strong vendor management and negotiation skills, with the ability to hold partners accountable while maintaining effective business relationships.
  • Excellent communication skills, including the ability to present complex operational issues clearly to executive, technical, and non-technical audiences.
  • Proven ability to develop credibility, influence without authority, and drive change across diverse teams.
  • Strong decision-making, problem-solving, prioritization, and risk-management capabilities.
  • Ability to develop and communicate relevant department metrics, SLA performance reports, operating reviews, and executive-level summaries.
  • Demonstrated ability to develop teams, coach leaders, mentor technical staff, and build organizational capability.
  • Familiarity with industry standards, certifications, and operational best practices for data center environments.
  • Working knowledge of IT infrastructure, including servers, storage, networking, operating systems, virtualization, and cloud infrastructure.
  • Practical knowledge of electronics, electrical systems, energy management, IP networking, Ethernet, Linux, Windows, virtualized environments, and SAN storage.
  • Strong preference for experience with modern infrastructure trends, including AI workloads, GPU clusters, software-defined networking, automation, and high-density data center designs.

Essential Skills, Knowledge, and Experience

  • Bachelor’s degree in Engineering, Business Administration, Computer Science, Management Information Systems, or a related field; equivalent work experience may be considered.
  • Minimum of 8 years of experience in data center operations, infrastructure operations, colocation operations, cloud infrastructure, or a related technical operations environment.
  • Minimum of 5 to 7 years of senior leadership experience managing operational, technical, or site-support teams in a large enterprise, cloud, or colocation environment.
  • Demonstrated experience managing operations across multiple data center locations or large-scale mission-critical facilities.
  • Proven experience developing and implementing operational improvement roadmaps.
  • Proven experience improving process maturity, operational consistency, team performance, vendor execution, and SLA performance.
  • Experience building scalable processes to support rapid organizational growth, new site launches, and increased operational complexity.
  • Experience managing vendor relationships, service providers, contractors, and third-party data center partners.
  • Experience with budgeting, forecasting, operational planning, performance reporting, and executive communications.
  • Experience with GPU, AI infrastructure, high-density compute, liquid cooling, or large-scale hardware deployment environments is strongly preferred.

Success Measures

The Senior Director of Data Center On-Site Operations will be measured by the ability to:

  • Improve site-level operational performance, reliability, and execution quality.
  • Create scalable processes that support rapid growth across new and existing data center locations.
  • Reduce operational variability between sites through standardization, documentation, governance, and leadership accountability.
  • Improve SLA performance, incident response, change success, maintenance discipline, and customer-impact prevention.
  • Build a stronger leadership bench and staffing model for continued growth.
  • Improve vendor performance and site execution accountability.
  • Accelerate operational readiness for new data centers, new data halls, GPU deployments, liquid cooling, and higher-density infrastructure.
  • Establish a culture of operational excellence, continuous improvement, ownership, and scalable execution.

Compensation Range:

  • $233,600 - $292,000

*This is a hybrid role

JR: 2026-7835

#LI-Hybrid

Why You’ll Like Working for DigitalOcean

  • We innovate with purpose. You’ll be a part of a cutting-edge technology company with an upward trajectory, who are proud to simplify cloud and AI so builders can spend more time creating software that changes the world. As a member of the team, you will be a Shark who thinks big, bold, and scrappy, like an owner with a bias for action and a powerful sense of responsibility for customers, products, employees, and decisions.
  • We prioritize career development. At DO, you’ll do the best work of your career. You will work with some of the smartest and most interesting people in the industry. We are a high-performance organization that will always challenge you to think big. Our organizational development team will provide you with resources to ensure you keep growing. We provide employees with reimbursement for relevant conferences, training, and education. All employees have access to LinkedIn Learning's 10,000+ courses to support their continued growth and development.
  • We care about your well-being. Regardless of your location, we will provide you with a competitive array of benefits to support you from our Employee Assistance Program to Local Employee Meetups to flexible time off policy, to name a few. While the philosophy around our benefits is the same worldwide, specific benefits may vary based on local regulations and preferences.
  • We reward our employees. The salary range for this position is based on market data, relevant years of experience, and skills. You may qualify for a bonus in addition to base salary; bonus amounts are determined based on company and individual performance. We also provide equity compensation to eligible employees, including equity grants upon hire and the option to participate in our Employee Stock Purchase Program.
  • DigitalOcean is an equal-opportunity employer. We do not discriminate on the basis of race, religion, color, ancestry, national origin, caste, sex, sexual orientation, gender, gender identity or expression, age, disability, medical condition, pregnancy, genetic makeup, marital status, or military service.

Application Limit: You may apply to a maximum of 3 positions within any 180-day period. This policy promotes better role-candidate matching and encourages thoughtful applications where your qualifications align most strongly.

DigitalOcean

About DigitalOcean

DigitalOcean simplifies cloud computing so businesses can spend more time creating software that changes the world. With its mission-critical infrastructure and fully managed offerings, DigitalOcean helps developers at startups and growing digital businesses rapidly build, deploy and scale, whether creating a digital presence or building digital products. DigitalOcean combines the power of simplicity, security, community and customer support so customers can spend less time managing their infrastructure and more time building innovative applications that drive business growth.

Industry
IT & Software
Company Size
1,001-5,000 employees
Headquarters
Broomfield, Colorado
Year Founded
2012
Social Media