Tenstorrent Galaxy™ Blackhole

Run Anything, Fast—Video, Language, Code

Tenstorrent Galaxy runs any workload, both training and inference. It leads the industry in video generation, decode, and prefill benchmarks, and TT-Forge™, Tenstorrent's open-source MLIR-based compiler, supports more models than any competitor. Video, language, code — pick your model, run it fast.

Buy Tenstorrent Galaxy Today

Tenstorrent Galaxy Blackhole

Run anything with our scalable, ultra-dense AI server.

Starting at $110,000
Tenstorrent Galaxy Wormhole

The Tenstorrent Galaxy server with our previous-generation chip, Wormhole. Still scalable, ultra-dense, and performant.

Starting at $70,000
More Models, Deploy Fast
More than 10,000 Hugging Face models have been tested on Tenstorrent Galaxy Blackhole, covering 75% of the top 100,000 Hugging Face models. Coverage grows every day across LLMs, image generation, speech, vision, embeddings, encoders, and more.
Simple Scale
The underlying Tensix Neo™ architecture is designed to scale from one chip to thousands under one programming model. It’s the same mesh of cores communicating the same way, connected by Ethernet. Scale to fit your needs, big or small.
Yours, End to End
Our full software stack, from compiler to kernel, is open source. Compile a model and run it out of the box, or tune the kernels directly. No black boxes at any layer. TT-Forge, our MLIR-based compiler, can port models from PyTorch, JAX, and ONNX.

Tenstorrent Galaxy Blackhole Specs

Accelerator Compute, Memory, and Connectivity
Accelerators: 32× Blackhole® ASICs
Performance: 23 PFLOPS Block FP8
Accelerator SRAM: 6.2 GB @ 2.9 PB/s
Accelerator DRAM: 1 TB GDDR6 @ 16 TB/s
Accelerator Fabric: 10× 400 GbE links per ASIC for 32 TB/s
Cluster Scale-out: Up to 56× 800 GbE QSFP‑DD ports for 11.2 TB/s

Host Compute, Memory, and Connectivity
Host CPU: 1× AMD EPYC 9004 (Zen 4), up to 32 cores, ≤280 W TDP
Host Memory: Up to 576 GB (6× 96 GB) DDR5-4800 ECC RDIMM (6 slots, 0 free)
Networking: 1× OCP NIC 3.0 PCIe Gen5 x16 SFF (2× 200 GbE default configuration)
Management Network: 1× Dedicated RJ45 1 GbE with baseboard management controller (BMC)
Storage (OS): 2× 960 GB M.2 2280 PCIe Gen4 x4 NVMe SSD
Storage (Internal): Up to 4× E1.S PCIe Gen5 x4 NVMe SSD (9.5/15 mm)
Software: Ubuntu 22.04

Deployment & Operations
Form Factor: 6U rackmount, air‑cooled chassis
System Dimensions: Height: 17.6 in (446.8 mm), Width: 10.4 in (263.4 mm), Length: 34.8 in (884.5 mm)
System Weight: 262 lbs (119 kg)
System Power Usage: 8 – 10 kW avg, 12 kW max (Max system power configurable up to 14.5 kW)
Operating Temperature: 50 – 95 °F (10 – 35 °C)
Pricing: $110,000 list
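
The aggregate bandwidth figures in the Blackhole specs above can be reproduced with simple arithmetic. The sketch below is only a sanity check, not an official derivation. It assumes that the quoted TB/s totals count both directions of each full-duplex Ethernet link and that 8 Gb/s equals 1 GB/s; the spec states neither, and the function name is illustrative.

```python
def aggregate_tbytes_per_s(links, gbit_per_link, bidirectional=True):
    """Total bandwidth in TB/s for `links` Ethernet links of `gbit_per_link` Gb/s each.

    Assumption: the spec's totals count both directions of a full-duplex link.
    """
    gbit_total = links * gbit_per_link
    if bidirectional:
        gbit_total *= 2
    return gbit_total / 8 / 1000  # Gb/s -> GB/s -> TB/s


ASICS = 32

# Accelerator fabric: 10 links of 400 GbE on each of the 32 ASICs.
fabric = aggregate_tbytes_per_s(ASICS * 10, 400)

# Cluster scale-out: up to 56 ports of 800 GbE per system.
scale_out = aggregate_tbytes_per_s(56, 800)

print(f"Accelerator fabric: {fabric:.1f} TB/s")    # 32.0 TB/s
print(f"Cluster scale-out:  {scale_out:.1f} TB/s")  # 11.2 TB/s
```

Under these assumptions the results match the listed 32 TB/s and 11.2 TB/s. Counting a single direction only would give half of each figure.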

Tenstorrent Galaxy Wormhole Specs

| Specification | Single Galaxy Module | Full Galaxy 4U Server |
| --- | --- | --- |
| AI Processor(s) | Tenstorrent Wormhole | 32× Tenstorrent Wormhole |
| Galaxy Modules | 1 | 32 |
| Tensix Cores | 80 | 2,560 |
| AI Clock | 1 GHz | 1 GHz |
| TeraFLOPS (FP8) | 292 | 9,322 (9.3 PetaFLOPS) |
| SRAM | 120 MB (1.5 MB per Tensix Core) | 3.8 GB (120 MB per Module) |
| Memory | 12 GB GDDR6 (192-bit memory bus, 12 GT/s) | 384 GB GDDR6, globally addressable |
| Power | 200 W | 7.5 kW |
| System Interface | 3.2 Tbps Ethernet (16× 200 Gbps) | 41.6 Tbps Ethernet Internal Connectivity |
| Cooling | Passive | 6× 120 mm fans |
| Board Management Controller (BMC) | - | IMX8 |
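
Most of the full-server Wormhole figures above are the single-module figures multiplied by 32. The sketch below redoes that scaling and derives two per-module values from the listed bus parameters. The dictionary keys and variable names are illustrative. The per-module DRAM bandwidth is a theoretical peak computed here and does not appear in the spec.

```python
MODULES = 32  # Galaxy modules in the full server

# Single-module figures as listed in the spec.
module = {
    "tensix_cores": 80,
    "sram_mb": 120,
    "dram_gb": 12,
    "fp8_tflops": 292,
}

# Naive scaling: multiply every per-module figure by the module count.
server = {name: value * MODULES for name, value in module.items()}

# Theoretical peak DRAM bandwidth per module:
# 192-bit bus * 12 GT/s, divided by 8 bits per byte.
dram_gb_per_s = 192 * 12 / 8

# Per-module Ethernet: 16 ports at 200 Gbps each, expressed in Tbps.
ethernet_tbps = 16 * 200 / 1000

for name, value in server.items():
    print(f"{name}: {value}")
print(f"DRAM bandwidth per module: {dram_gb_per_s} GB/s")
print(f"Ethernet per module: {ethernet_tbps} Tbps")
```

The scaled values reproduce the listed 2,560 cores, 384 GB of GDDR6, and roughly 3.8 GB of SRAM (3,840 MB). The FP8 product is 9,344 TFLOPS, slightly above the listed 9,322. That gap would be consistent with the per-module figure having been rounded up to 292, though this is only a guess.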
