NVIDIADatacenterHopper

NVIDIA H100 80GB für lokale KI

NVIDIA H100 80GB bietet 80 GB VRAM für lokale KI. Im LocalIA-Katalog laufen 224 von 242 Modellen komfortabel auf einer Karte.

Kompatible Modelle ansehen →Rig mit 80GB ↗

VRAM

80GB

Kategorie

Datacenter

Serie

Hopper

Vendor

NVIDIA

Modelle, die komfortabel laufen

224 models

Diese Modelle passen in 80 GB mit Reserve für Kontext und stabile Inferenz.

01★Command R+ 104Bcommand65.4 GBkomfortabelQ4 · / 80 GB

02★Qwen3 Next 80B A3B Instructqwen61.5 GBkomfortabelQ5 · / 80 GB

03★Qwen 2.5 72Bqwen55.3 GBkomfortabelQ5 · / 80 GB

04Qwen 2.5 VL 72Bqwen55.3 GBkomfortabelQ5 · / 80 GB

05★Qwen2.5 72B Instructqwen55.3 GBkomfortabelQ5 · / 80 GB

06Llama 2 70Bllama53.8 GBkomfortabelQ5 · / 80 GB

07Llama 3 70Bllama53.8 GBkomfortabelQ5 · / 80 GB

08Llama 3.1 70Bllama53.8 GBkomfortabelQ5 · / 80 GB

09★Llama 3.3 70Bllama53.8 GBkomfortabelQ5 · / 80 GB

10CodeLlama 70Bcodellama53.8 GBkomfortabelQ5 · / 80 GB

11★DeepSeek R1 Distill 70Bdeepseek53.8 GBkomfortabelQ5 · / 80 GB

12Hermes 3 70Bhermes53.8 GBkomfortabelQ5 · / 80 GB

13★Llama 3.1 Nemotron 70Bnemotron53.8 GBkomfortabelQ5 · / 80 GB

14Athene 70Bathene53.8 GBkomfortabelQ5 · / 80 GB

15★Llama 3.3 70B Instructllama53.8 GBkomfortabelQ5 · / 80 GB

16★Llama 3.1 70B Instructllama53.8 GBkomfortabelQ5 · / 80 GB

17★Mixtral 8x7Bmistral52.5 GBkomfortabelQ8 · / 80 GB

18Falcon 40Bfalcon44.7 GBkomfortabelQ8 · / 80 GB

19Command R 35Bcommand39.1 GBkomfortabelQ8 · / 80 GB

20Aya 23 35Baya39.1 GBkomfortabelQ8 · / 80 GB

21CodeLlama 34Bcodellama38.0 GBkomfortabelQ8 · / 80 GB

22Yi 1.5 34Byi38.0 GBkomfortabelQ8 · / 80 GB

23★dolphin 2.9.1 yi 1.5 34byi38.0 GBkomfortabelQ8 · / 80 GB

24★Qwen 2.5 32Bqwen35.8 GBkomfortabelQ8 · / 80 GB

25★Qwen 2.5 Coder 32Bqwen35.8 GBkomfortabelQ8 · / 80 GB

26★Qwen 3 32Bqwen35.8 GBkomfortabelQ8 · / 80 GB

27★QwQ 32Bqwq35.8 GBkomfortabelQ8 · / 80 GB

28★DeepSeek R1 Distill 32Bdeepseek35.8 GBkomfortabelQ8 · / 80 GB

29Qwen 2.5 VL 32Bqwen35.8 GBkomfortabelQ8 · / 80 GB

30★Granite 4 H-Small 32B-A9Bgranite35.8 GBkomfortabelQ8 · / 80 GB

Knappe Modelle

3 models

Diese Modelle passen gerade so. Sie laufen, aber Kontext und Geschwindigkeit sind begrenzt.

01★Mistral Large 123Bmistral77.3 GBknappQ4 · / 80 GB

02★NVIDIA Nemotron 3 Super 120B A12B BF16nemotron75.4 GBknappQ4 · / 80 GB

03★Llama 4 Scout 17Bx16llama68.5 GBknappQ4 · / 80 GB

Freigeschaltet im 2x-Rig

160 GB

Mit zwei Karten parallel (160 GB gesamt) werden größere Modelle erreichbar.

01DeepSeek V2deepseek148.4 GBknappQ4 · / 160 GB

02DeepSeek Coder V2deepseek148.4 GBknappQ4 · / 160 GB

03★Qwen 3 235B A22Bqwen147.7 GBknappQ4 · / 160 GB

04★Qwen3 235B A22Bqwen147.7 GBknappQ4 · / 160 GB

05Falcon 180Bfalcon113.2 GBkomfortabelQ4 · / 160 GB

06Mixtral 8x22Bmistral108.3 GBkomfortabelQ5 · / 160 GB

Freigeschaltet im 4x-Rig

320 GB

Server-Konfiguration (320 GB gesamt) für sehr große Open-Weight-Modelle.

01★Llama 3.1 405Bllama254.6 GBkomfortabelQ4 · / 320 GB

02Hermes 3 405Bhermes254.6 GBkomfortabelQ4 · / 320 GB

03★Llama 4 Maverick 17Bx128llama251.5 GBkomfortabelQ4 · / 320 GB

04Nemotron 340Bnemotron261.2 GBkomfortabelQ5 · / 320 GB