NVIDIAWorkstationRTX A (Ampere)

RTX A5500 für lokale KI

RTX A5500 bietet 24 GB VRAM für lokale KI. Im LocalIA-Katalog laufen 201 von 242 Modellen komfortabel auf einer Karte.

Kompatible Modelle ansehen →Rig mit A5500 ↗

VRAM

24GB

Kategorie

Workstation

Serie

RTX A (Ampere)

Vendor

NVIDIA

Modelle, die komfortabel laufen

201 models

Diese Modelle passen in 24 GB mit Reserve für Kontext und stabile Inferenz.

01★Qwen 2.5 32Bqwen20.1 GBkomfortabelQ4 · / 24 GB

02★Qwen 2.5 Coder 32Bqwen20.1 GBkomfortabelQ4 · / 24 GB

03★Qwen 3 32Bqwen20.1 GBkomfortabelQ4 · / 24 GB

04★QwQ 32Bqwq20.1 GBkomfortabelQ4 · / 24 GB

05★DeepSeek R1 Distill 32Bdeepseek20.1 GBkomfortabelQ4 · / 24 GB

06Qwen 2.5 VL 32Bqwen20.1 GBkomfortabelQ4 · / 24 GB

07★Granite 4 H-Small 32B-A9Bgranite20.1 GBkomfortabelQ4 · / 24 GB

08GLM-4.6glm20.1 GBkomfortabelQ4 · / 24 GB

09★GLM-4.7glm20.1 GBkomfortabelQ4 · / 24 GB

10★GLM-5glm20.1 GBkomfortabelQ4 · / 24 GB

11★GLM-5.1glm20.1 GBkomfortabelQ4 · / 24 GB

12★Qwen3 32Bqwen20.1 GBkomfortabelQ4 · / 24 GB

13★Qwen2.5 Coder 32B Instructqwen20.1 GBkomfortabelQ4 · / 24 GB

14★DeepSeek R1 Distill Qwen 32Bqwen20.1 GBkomfortabelQ4 · / 24 GB

15★Qwen2.5 32B Instructqwen20.1 GBkomfortabelQ4 · / 24 GB

16★Gemma 4 31Bgemma19.5 GBkomfortabelQ4 · / 24 GB

17★Qwen 3 30B A3Bqwen18.9 GBkomfortabelQ4 · / 24 GB

18MPT 30Bmpt18.9 GBkomfortabelQ4 · / 24 GB

19★Qwen3 Coder 30B A3B Instructqwen18.9 GBkomfortabelQ4 · / 24 GB

20★Qwen3 30B A3Bqwen18.9 GBkomfortabelQ4 · / 24 GB

21★Qwen3 30B A3B Instruct 2507qwen18.9 GBkomfortabelQ4 · / 24 GB

22★NVIDIA Nemotron 3 Nano 30B A3B BF16nemotron18.9 GBkomfortabelQ4 · / 24 GB

23★Gemma 2 27Bgemma17.0 GBkomfortabelQ4 · / 24 GB

24★Gemma 3 27Bgemma17.0 GBkomfortabelQ4 · / 24 GB

25★Gemma 4 26B A4Bgemma20.0 GBkomfortabelQ5 · / 24 GB

26★Mistral Small 3 24Bmistral18.4 GBkomfortabelQ5 · / 24 GB

27★Mistral Small 3.1 24Bmistral18.4 GBkomfortabelQ5 · / 24 GB

28★Mistral Small 3.2 24Bmistral18.4 GBkomfortabelQ5 · / 24 GB

29★Devstral Small 2 24Bdevstral18.4 GBkomfortabelQ5 · / 24 GB

30Mistral Small 22Bmistral16.9 GBkomfortabelQ5 · / 24 GB

Knappe Modelle

5 models

Diese Modelle passen gerade so. Sie laufen, aber Kontext und Geschwindigkeit sind begrenzt.

01Command R 35Bcommand22.0 GBknappQ4 · / 24 GB

02Aya 23 35Baya22.0 GBknappQ4 · / 24 GB

03CodeLlama 34Bcodellama21.4 GBknappQ4 · / 24 GB

04Yi 1.5 34Byi21.4 GBknappQ4 · / 24 GB

05★dolphin 2.9.1 yi 1.5 34byi21.4 GBknappQ4 · / 24 GB

Freigeschaltet im 2x-Rig

48 GB

Mit zwei Karten parallel (48 GB gesamt) werden größere Modelle erreichbar.

01★Qwen 2.5 72Bqwen45.3 GBknappQ4 · / 48 GB

02Qwen 2.5 VL 72Bqwen45.3 GBknappQ4 · / 48 GB

03★Qwen2.5 72B Instructqwen45.3 GBknappQ4 · / 48 GB

04Llama 2 70Bllama44.0 GBknappQ4 · / 48 GB

05Llama 3 70Bllama44.0 GBknappQ4 · / 48 GB

06Llama 3.1 70Bllama44.0 GBknappQ4 · / 48 GB

07★Llama 3.3 70Bllama44.0 GBknappQ4 · / 48 GB

08CodeLlama 70Bcodellama44.0 GBknappQ4 · / 48 GB

09★DeepSeek R1 Distill 70Bdeepseek44.0 GBknappQ4 · / 48 GB

10Hermes 3 70Bhermes44.0 GBknappQ4 · / 48 GB

11★Llama 3.1 Nemotron 70Bnemotron44.0 GBknappQ4 · / 48 GB

12Athene 70Bathene44.0 GBknappQ4 · / 48 GB

13★Llama 3.3 70B Instructllama44.0 GBknappQ4 · / 48 GB

14★Llama 3.1 70B Instructllama44.0 GBknappQ4 · / 48 GB

15★Mixtral 8x7Bmistral36.1 GBkomfortabelQ5 · / 48 GB

Freigeschaltet im 4x-Rig

96 GB

Server-Konfiguration (96 GB gesamt) für sehr große Open-Weight-Modelle.

01Mixtral 8x22Bmistral88.6 GBknappQ4 · / 96 GB

02★Mistral Large 123Bmistral77.3 GBkomfortabelQ4 · / 96 GB

03★NVIDIA Nemotron 3 Super 120B A12B BF16nemotron75.4 GBkomfortabelQ4 · / 96 GB

04★Llama 4 Scout 17Bx16llama68.5 GBkomfortabelQ4 · / 96 GB

05★Command R+ 104Bcommand79.9 GBkomfortabelQ5 · / 96 GB

06★Qwen3 Next 80B A3B Instructqwen61.5 GBkomfortabelQ5 · / 96 GB