
RTX A6000 for local AI

The RTX A6000 provides 48 GB of VRAM for local AI. In the LocalIA catalog, 208 of 242 models run comfortably on a single card.

VRAM: 48 GB
Category: Workstation
Series: RTX A (Ampere)
Vendor: NVIDIA
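The per-model sizes in the tables below come from quantized weights. As a rough sketch of how such figures can be estimated (the catalog's exact method is not stated; the ~12% overhead factor for runtime buffers is an assumption):

```python
def estimate_vram_gb(params_b: float, quant_bits: int, overhead: float = 1.12) -> float:
    """Rough VRAM estimate: weights at quant_bits per parameter,
    plus an assumed ~12% overhead for runtime buffers and KV cache."""
    return params_b * quant_bits / 8 * overhead

# A 32B-parameter model at Q8 (~1 byte/parameter) lands near
# the 35.8 GB figure the 32B rows below quote.
print(round(estimate_vram_gb(32, 8), 1))
```

The same arithmetic explains why 70B-class models only fit this card at Q4: halving the bits per parameter roughly halves the weight footprint.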

Models that run comfortably

These models fit in 48 GB with room for context and stable inference.

Mixtral 8x7B (mistral): 36.1 GB, comfortable, Q5 / 48 GB
Falcon 40B (falcon): 30.7 GB, comfortable, Q5 / 48 GB
Command R 35B (command): 39.1 GB, comfortable, Q8 / 48 GB
Aya 23 35B (aya): 39.1 GB, comfortable, Q8 / 48 GB
CodeLlama 34B (codellama): 38.0 GB, comfortable, Q8 / 48 GB
Yi 1.5 34B (yi): 38.0 GB, comfortable, Q8 / 48 GB
Dolphin 2.9.1 Yi 1.5 34B (yi): 38.0 GB, comfortable, Q8 / 48 GB
Qwen 2.5 32B (qwen): 35.8 GB, comfortable, Q8 / 48 GB
Qwen 2.5 Coder 32B (qwen): 35.8 GB, comfortable, Q8 / 48 GB
Qwen 3 32B (qwen): 35.8 GB, comfortable, Q8 / 48 GB
QwQ 32B (qwq): 35.8 GB, comfortable, Q8 / 48 GB
DeepSeek R1 Distill 32B (deepseek): 35.8 GB, comfortable, Q8 / 48 GB
Qwen 2.5 VL 32B (qwen): 35.8 GB, comfortable, Q8 / 48 GB
Granite 4 H-Small 32B-A9B (granite): 35.8 GB, comfortable, Q8 / 48 GB
GLM-4.6 (glm): 35.8 GB, comfortable, Q8 / 48 GB
GLM-4.7 (glm): 35.8 GB, comfortable, Q8 / 48 GB
GLM-5 (glm): 35.8 GB, comfortable, Q8 / 48 GB
GLM-5.1 (glm): 35.8 GB, comfortable, Q8 / 48 GB
Qwen3 32B (qwen): 35.8 GB, comfortable, Q8 / 48 GB
Qwen2.5 Coder 32B Instruct (qwen): 35.8 GB, comfortable, Q8 / 48 GB
DeepSeek R1 Distill Qwen 32B (qwen): 35.8 GB, comfortable, Q8 / 48 GB
Qwen2.5 32B Instruct (qwen): 35.8 GB, comfortable, Q8 / 48 GB
Gemma 4 31B (gemma): 34.6 GB, comfortable, Q8 / 48 GB
Qwen 3 30B A3B (qwen): 33.5 GB, comfortable, Q8 / 48 GB
MPT 30B (mpt): 33.5 GB, comfortable, Q8 / 48 GB
Qwen3 Coder 30B A3B Instruct (qwen): 33.5 GB, comfortable, Q8 / 48 GB
Qwen3 30B A3B (qwen): 33.5 GB, comfortable, Q8 / 48 GB
Qwen3 30B A3B Instruct 2507 (qwen): 33.5 GB, comfortable, Q8 / 48 GB
NVIDIA Nemotron 3 Nano 30B A3B BF16 (nemotron): 33.5 GB, comfortable, Q8 / 48 GB
Gemma 2 27B (gemma): 30.2 GB, comfortable, Q8 / 48 GB
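The comfortable/tight split tracks how much of the card's 48 GB a model consumes. A minimal sketch of such a classifier (the 85% and 95% thresholds are illustrative assumptions, not the catalog's exact rule, though they are consistent with the figures listed here):

```python
def classify_fit(model_gb: float, vram_gb: float = 48.0) -> str:
    """Bucket a model by the share of VRAM its quantized weights consume.
    Thresholds (85% / 95%) are assumed for illustration."""
    ratio = model_gb / vram_gb
    if ratio <= 0.85:
        return "comfortable"  # headroom left for context (KV cache)
    if ratio <= 0.95:
        return "tight"        # runs, but little room for context
    return "too large"

print(classify_fit(35.8))  # a 32B model at Q8
print(classify_fit(44.0))  # a 70B model at Q4
```

Under these assumed thresholds, every 32B Q8 entry above classifies as comfortable and every 70B Q4 entry below as tight, matching the tables.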

Tight models

These models barely fit: they will run, but context length and speed will be limited.

Qwen 2.5 72B (qwen): 45.3 GB, tight, Q4 / 48 GB
Qwen 2.5 VL 72B (qwen): 45.3 GB, tight, Q4 / 48 GB
Qwen2.5 72B Instruct (qwen): 45.3 GB, tight, Q4 / 48 GB
Llama 2 70B (llama): 44.0 GB, tight, Q4 / 48 GB
Llama 3 70B (llama): 44.0 GB, tight, Q4 / 48 GB
Llama 3.1 70B (llama): 44.0 GB, tight, Q4 / 48 GB
Llama 3.3 70B (llama): 44.0 GB, tight, Q4 / 48 GB
CodeLlama 70B (codellama): 44.0 GB, tight, Q4 / 48 GB
DeepSeek R1 Distill 70B (deepseek): 44.0 GB, tight, Q4 / 48 GB
Hermes 3 70B (hermes): 44.0 GB, tight, Q4 / 48 GB
Llama 3.1 Nemotron 70B (nemotron): 44.0 GB, tight, Q4 / 48 GB
Athene 70B (athene): 44.0 GB, tight, Q4 / 48 GB
Llama 3.3 70B Instruct (llama): 44.0 GB, tight, Q4 / 48 GB
Llama 3.1 70B Instruct (llama): 44.0 GB, tight, Q4 / 48 GB

Unlocked in a 2x rig

With two cards in parallel (96 GB total), larger models become reachable.

Mixtral 8x22B (mistral): 88.6 GB, tight, Q4 / 96 GB
Mistral Large 123B (mistral): 77.3 GB, comfortable, Q4 / 96 GB
NVIDIA Nemotron 3 Super 120B A12B BF16 (nemotron): 75.4 GB, comfortable, Q4 / 96 GB
Llama 4 Scout 17Bx16 (llama): 68.5 GB, comfortable, Q4 / 96 GB
Command R+ 104B (command): 79.9 GB, comfortable, Q5 / 96 GB
Qwen3 Next 80B A3B Instruct (qwen): 61.5 GB, comfortable, Q5 / 96 GB
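With weights split across cards, the combined VRAM becomes the budget, and the same fit arithmetic applies. A minimal sketch of filtering a catalog by rig size (model sizes taken from the tables here; the 85% comfort threshold is again an assumption):

```python
# (model, quantized size in GB) taken from the tables on this page
CATALOG = [
    ("Mistral Large 123B", 77.3),
    ("Mixtral 8x22B", 88.6),
    ("DeepSeek V2", 148.4),
]

def unlocked(catalog, cards: int, vram_per_card: float = 48.0):
    """Models whose quantized weights fit comfortably within the
    combined VRAM of `cards` GPUs (assumed ~15% headroom reserved)."""
    budget = cards * vram_per_card * 0.85
    return [name for name, gb in catalog if gb <= budget]

print(unlocked(CATALOG, 2))  # 2x rig, 96 GB total
print(unlocked(CATALOG, 4))  # 4x rig, 192 GB total
```

Note the consistency with the tables: at 96 GB, Mixtral 8x22B (88.6 GB) exceeds the assumed 85% headroom and is indeed listed as tight, while Mistral Large 123B clears it comfortably.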

Unlocked in a 4x rig

Server-style configuration (192 GB total) for the largest open-weight models.

DeepSeek V2 (deepseek): 148.4 GB, comfortable, Q4 / 192 GB
DeepSeek Coder V2 (deepseek): 148.4 GB, comfortable, Q4 / 192 GB
Qwen 3 235B A22B (qwen): 147.7 GB, comfortable, Q4 / 192 GB
Qwen3 235B A22B (qwen): 147.7 GB, comfortable, Q4 / 192 GB
Falcon 180B (falcon): 138.3 GB, comfortable, Q5 / 192 GB

VRAM estimates updated 2026-05-12.