Talent.com
No longer accepting applications
Director, Cloud Site Operations

Director, Cloud Site Operations

CrusoeSan Francisco, California, United States
23 hours ago
Job type
  • Full-time
Job description

Crusoe's mission is to accelerate the abundance of energy and intelligence. We’re crafting the engine that powers a world where people can create ambitiously with AI — without sacrificing scale, speed, or sustainability.

Be a part of the AI revolution with sustainable technology at Crusoe. Here, you'll drive meaningful innovation, make a tangible impact, and join a team that’s setting the pace for responsible, transformative cloud infrastructure.

About This Role :

Crusoe is building a clean cloud for AI and high-performance computing. As we expand our global footprint of GPU-optimized data centers, we are seeking a Director of Cloud Site Operations to lead day-to-day operations across our domestic and international sites.

This leader will ensure that Crusoe’s cloud platform—powered by 100% clean energy—operates at the highest standards of availability, efficiency, and sustainability. The Director will oversee distributed teams, enforce operational discipline, and drive innovation in how we run and scale next-generation AI data centers.

What You’ll Be Working On :

Operational Leadership

Lead 24 / 7 operations across Crusoe’s global fleet of GPU-focused cloud data centers.

Ensure world-class uptime, performance, and resiliency while maintaining sustainability goals.

Standardize operational playbooks and enforce best practices for safety, security, and compliance.

Drive continuous improvement in efficiency (MTTR, PUE, MW utilization, NRC / OpEx per MW).

Site Management & Readiness

Manage hardware uptime and operational readiness for large-scale GPU clusters (H200, B200, GB200, MI300X, MI355X, GB300, etc.).

Ensure observability into performance and readiness across diverse geographies (U.S., Europe, Asia).

Team Leadership & Development

Lead and develop a distributed global team of site operations managers, engineers, and technicians.

Build a safety-first culture focused on reliability, execution, and accountability.

Implement scalable staffing and shift models to support rapid growth and international operations.

Vendor & Partner Management

Manage strategic relationships with colocation partners, OEMs, and service providers.

Ensure SLAs are exceeded while balancing cost, quality, and sustainability.

Partner closely with engineering, capacity planning, and product teams to align operational readiness with business growth.

Risk, Compliance & Security

Ensure global adherence to compliance frameworks (ISO, SOC, Uptime Institute, ASHRAE, etc.).

Oversee physical and operational security, incident response, and root cause analysis.

Maintain operational excellence in high-density, liquid-cooled GPU environments.

Executive Reporting & Strategy

Provide leadership updates on global site performance, capacity growth, and incident management.

Contribute to long-term site strategy, expansion roadmaps, and scaling models to support 300k+ GPU growth.

Serve as a thought leader for sustainable AI infrastructure, ensuring Crusoe remains at the forefront of clean compute.

What You’ll Bring to the Team :

10+ years of experience in data center or cloud infrastructure operations, including 5+ years in senior leadership.

Proven success managing global, multi-site operations for cloud or hyperscale environments.

Deep knowledge of critical power and cooling systems, including liquid cooling for high-density GPU clusters.

Experience building and scaling global teams in high-growth, mission-critical environments.

Strong executive communication and cross-functional leadership skills.

Willingness to travel internationally (25–40%).

Preferred Experience :

Background in cloud service providers, hyperscalers, or large-scale colocation environments.

Experience with GPU / AI workloads and HPC-optimized facilities.

Familiarity with clean energy integration (geothermal, hydro, solar + storage) in data center operations.

Expertise in incident management, root cause analysis, and building resilient systems at scale.

Benefits :

Industry competitive pay

Restricted Stock Units in a fast growing, well-funded technology company

Health insurance package options that include HDHP and PPO, vision, and dental for you and your dependents

Employer contributions to HSA accounts

Paid Parental Leave

Paid life insurance, short-term and long-term disability

Teladoc

401(k) with a 100% match up to 4% of salary

Generous paid time off and holiday schedule

Cell phone reimbursement

Tuition reimbursement

Subscription to the Calm app

MetLife Legal

Company paid commuter benefit; $300 per month

Compensation Range

Compensation will be paid in the range of $206,000 – $258,000. Restricted Stock Units are included in all offers. Compensation is determined by the applicant’s education, experience, knowledge, skills, and abilities, as well as internal equity and alignment with market data.

Crusoe is an Equal Opportunity Employer. Employment decisions are made without regard to race, color, religion, disability, genetic information, pregnancy, citizenship, marital status, sex / gender, sexual preference / orientation, gender identity, age, veteran status, national origin, or any other status protected by law or regulation.

#J-18808-Ljbffr

Create a job alert for this search

Site Director • San Francisco, California, United States

Related jobs
  • Promoted
Senior Director, Site Operations, NA

Senior Director, Site Operations, NA

Vantage Data CentersSanta Clara, CA, United States
Full-time
Vantage Data Centers powers, cools, protects and connects the technology of the world's well-known hyperscalers, cloud providers and large enterprises. Developing and operating across North America,...Show moreLast updated: 21 days ago
  • Promoted
Director, Cloud Engineering

Director, Cloud Engineering

ExelixisAlameda, CA, United States
Full-time
The Director, IT Product Management - Cloud Platform, will lead a portfolio of cloud products critical to Exelixis's success and ambition to launch innovative medicines for patients.Operating withi...Show moreLast updated: 20 days ago
  • Promoted
Director- DevOps

Director- DevOps

FortinetSunnyvale, CA, United States
Full-time
Fortinet is looking for an enthusiastic and talented Devops- Director to join our cloud computing DevOps team to work with software developers and other operational specialists to support our Forti...Show moreLast updated: 27 days ago
  • Promoted
Director, Data Center CE Operations

Director, Data Center CE Operations

FluidStackSan Francisco, CA, United States
Full-time
Director, Data Center CE Operations at Fluidstack.Fluidstack is the AI Cloud Platform.We build GPU supercomputers for top AI labs, governments, and enterprises. Our customers include Mistral, Poolsi...Show moreLast updated: 1 day ago
  • Promoted
Director of Revenue Operations

Director of Revenue Operations

DuploCloudSan Jose, CA, United States
Permanent
DuploCloud is a software platform that allows engineering teams to achieve their infrastructure automation, security and compliance goals by offering DevOps-as-a-Service. We strive to make DevOps an...Show moreLast updated: 30+ days ago
  • Promoted
Sr. Engineer Cloud Operations

Sr. Engineer Cloud Operations

MAXIMUSSan Francisco, CA, United States
Full-time
Cloud Engineer is a hands-on position that requires the ability to plan, design, and implement technical cloud solutions. You will help combine software and systems to develop creative engineering s...Show moreLast updated: 30+ days ago
  • Promoted
Director Engineering, Cloud Control Plane

Director Engineering, Cloud Control Plane

F5 Networks, Inc.San Jose, CA, United States
Full-time
Director Engineering, Cloud Control Plane page is loaded## Director Engineering, Cloud Control Planeremote type : Hybridlocations : Seattle : San Josetime type : Full timeposted on : Posted To...Show moreLast updated: 30+ days ago
  • Promoted
Cloud Operations Manager - EC2, VPC, IAM, S3, RDS, EKS

Cloud Operations Manager - EC2, VPC, IAM, S3, RDS, EKS

Talent Search PROSan Francisco, CA, United States
Full-time
LOCAL CANDIDATES ONLY No Relocation Candidates • •.Cloud Operations Manage and optimize AWS infrastructure (EC2, VPC, IAM, S3, RDS, EKS) for performance, availability, and cost efficiency.Automati...Show moreLast updated: 1 day ago
  • Promoted
Cloud Operations Engineer

Cloud Operations Engineer

MongoDBSan Francisco, CA, United States
Full-time
Building on the rapid success and adoption of MongoDB, we are delivering applications and services that make it much easier to manage and scale database deployments. These next-generation systems ar...Show moreLast updated: 1 day ago
  • Promoted
Director, Cloud Operations

Director, Cloud Operations

Cornerstone ResearchSan Francisco, CA, United States
Full-time
If you are a seasoned cloud technology leader looking for an opportunity to showcase your strategic design, implementation and management of cloud infrastructures, then we would like to meet you!.T...Show moreLast updated: 30+ days ago
  • Promoted
Engineering Manager, Cloud Storage

Engineering Manager, Cloud Storage

Crusoe Energy Systems LLCSan Francisco, CA, United States
Full-time
Crusoe's mission is to accelerate the abundance of energy and intelligence.We’re crafting the engine that powers a world where people can create ambitiously with AI — without sacrificing scale, spe...Show moreLast updated: 30+ days ago
  • Promoted
Director, Site Reliability Engineering - Infrastructure Platform

Director, Site Reliability Engineering - Infrastructure Platform

Okta for DevelopersSan Francisco, CA, United States
Permanent
Director, Site Reliability Engineering - Infrastructure Platform.Join as the Director of Infrastructure Platform and Shared Services at Okta for Developers. Oversee multiple teams focused on Edge ne...Show moreLast updated: 3 days ago
  • Promoted
Director, AI Data Center Operations

Director, AI Data Center Operations

NVIDIASanta Clara, CA, United States
Full-time
NVIDIA DGX Cloud is an AI supercomputing service that provides enterprises with instant access to NVIDIA's high-performance AI infrastructure and software, including dedicated DGX AI supercomputing...Show moreLast updated: 1 day ago
  • Promoted
Principal Cloud Site Reliability Engineer, Actimize

Principal Cloud Site Reliability Engineer, Actimize

NICESanta Clara, CA, United States
Full-time
At NiCE, we don't limit our challenges.We set the highest standards and execute beyond them.And if you're like us, we can offer you the ultimate career opportunity that will light a fire within you...Show moreLast updated: 3 days ago
  • Promoted
  • New!
Sr. Director, Business Development Cloud Memory

Sr. Director, Business Development Cloud Memory

Micron TechnologySan Jose, CA, US
Full-time
Our vision is to transform how the world uses information to enrich life for all.Micron Technology is a world leader in innovating memory and storage solutions that accelerate the transformation of...Show moreLast updated: 2 hours ago
  • Promoted
Director, Solution Management (27489)

Director, Solution Management (27489)

SupermicroSan Jose, CA, United States
Full-time
Supermicro is a Top Tier provider of advanced server, storage, and networking solutions for Data Center, Cloud Computing, Enterprise IT, Hadoop / Big Data, Hyperscale, HPC and IoT / Embedded customers...Show moreLast updated: 12 days ago
  • Promoted
Director, Cloud Cost and Capacity Programs

Director, Cloud Cost and Capacity Programs

Databricks Inc.San Francisco, CA, United States
Full-time
Director, Cloud Cost and Capacity Programs.At Databricks, our mission is to help data teams solve the world’s toughest problems — from climate change to healthcare to cybersecurity — with the power...Show moreLast updated: 12 days ago
  • Promoted
Cloud Operations Manager - EC2, VPC, IAM, S3, RDS, EKS

Cloud Operations Manager - EC2, VPC, IAM, S3, RDS, EKS

ResiliencySan Francisco, CA, United States
Full-time
Cloud Operations Manager - EC2, VPC, IAM, S3, RDS, EKS.Cloud Operations Manager and optimize AWS infrastructure (EC2, VPC, IAM, S3, RDS, EKS) for performance, availability, and cost efficiency.Auto...Show moreLast updated: 1 day ago