Nebius Expands in US With First GPU Cluster in Kansas City, Offices in San Francisco, Dallas and New York

  • AI infrastructure provider’s first GPU cluster in the US will service customer workloads from early 2025

  • GPU cluster in Kansas City has potential capacity to house up to about 35 thousand GPUs after expansion

  • Nebius’s new customer hubs in San Francisco and Dallas have been operational since September; third office in New York coming by year-end

AMSTERDAM, November 19, 2024--(BUSINESS WIRE)--Nebius Group N.V. ("Nebius Group", "Nebius" or the "Company"; NASDAQ:NBIS), a leading AI infrastructure company, today announced the launch of its first GPU cluster in the United States with a deployment in Kansas City, MO, bringing its AI-native cloud closer to American customers.

Scheduled to go live in Q1 2025, the Kansas City cluster will house thousands of state-of-the-art NVIDIA GPUs, primarily H200 Tensor Core GPUs in the initial phase, with the energy-efficient NVIDIA Blackwell platform expected to arrive in 2025. The colocation can be expanded from an initial 5 MW up to 40 MW, or about 35 thousand GPUs, at full potential capacity.

Nebius is actively ramping up its presence in the US as part of its strategy to become a leading provider of AI infrastructure to AI builders globally, and is in advanced discussions for a second, larger-scale GPU cluster in the US, also slated to come online in 2025. The Company has also opened two new customer-facing hubs in San Francisco and Dallas, with a third office set to open in New York later this year.

Arkady Volozh, founder and CEO of Nebius, said:

"Our first GPU cluster in the US and new offices represent a pivotal step in our expansion in the US market. Serving American customers from American facilities means lower latency and maximizes the advantages of our AI-native cloud. We will be building out more GPU clusters across the US to meet exploding demand for high-quality AI infrastructure from US AI developers and enterprises."

Nebius’s full-stack AI infrastructure is built on the latest NVIDIA GPUs, with a fleet of H100s already installed and H200s coming onstream this month. It is being purpose-built to meet the demands of the global AI industry and draws on deep technical expertise across hardware and software, cloud engineering and machine learning ("ML").

Publicly announced in October, the AI-native Nebius cloud is designed to manage the full ML lifecycle – from data processing and training through to fine-tuning and inference – all in one place. The recently launched Nebius AI Studio inference service expands the Company’s offering to app builders, with access to a range of state-of-the-art open-source models in a flexible, user-friendly environment at one of the lowest prices per token on the market.