AMDDatacenterInstinct CDNA 3+

Instinct MI325X for local AI

Instinct MI325X provides 256 GB of VRAM for local AI. In the LocalIA catalog, 233 out of 242 models run comfortably on a single card.

View all compatible models →Rig around the MI325X ↗

VRAM

256GB

Models that run comfortably

233 models

These models fit in 256 GB with room for context and stable inference.

01Nemotron 340Bnemotron213.7 GBcomfortableQ4 · / 256 GB

02DeepSeek V2deepseek181.3 GBcomfortableQ5 · / 256 GB

03DeepSeek Coder V2deepseek181.3 GBcomfortableQ5 · / 256 GB

04★Qwen 3 235B A22Bqwen180.6 GBcomfortableQ5 · / 256 GB

05★Qwen3 235B A22Bqwen180.6 GBcomfortableQ5 · / 256 GB

06Falcon 180Bfalcon201.2 GBcomfortableQ8 · / 256 GB

07Mixtral 8x22Bmistral157.6 GBcomfortableQ8 · / 256 GB

08★Mistral Large 123Bmistral137.5 GBcomfortableQ8 · / 256 GB

09★NVIDIA Nemotron 3 Super 120B A12B BF16nemotron134.1 GBcomfortableQ8 · / 256 GB

10★Llama 4 Scout 17Bx16llama121.8 GBcomfortableQ8 · / 256 GB

11★Command R+ 104Bcommand116.2 GBcomfortableQ8 · / 256 GB

12★Qwen 2.5 72Bqwen160.9 GBcomfortableFP16 · / 256 GB

13Qwen 2.5 VL 72Bqwen160.9 GBcomfortableFP16 · / 256 GB

14★Qwen2.5 72B Instructqwen160.9 GBcomfortableFP16 · / 256 GB

15Llama 2 70Bllama156.5 GBcomfortableFP16 · / 256 GB

16Llama 3 70Bllama156.5 GBcomfortableFP16 · / 256 GB

17Llama 3.1 70Bllama156.5 GBcomfortableFP16 · / 256 GB

18★Llama 3.3 70Bllama156.5 GBcomfortableFP16 · / 256 GB

19CodeLlama 70Bcodellama156.5 GBcomfortableFP16 · / 256 GB

20★DeepSeek R1 Distill 70Bdeepseek156.5 GBcomfortableFP16 · / 256 GB

21Hermes 3 70Bhermes156.5 GBcomfortableFP16 · / 256 GB

22★Llama 3.1 Nemotron 70Bnemotron156.5 GBcomfortableFP16 · / 256 GB

23Athene 70Bathene156.5 GBcomfortableFP16 · / 256 GB

24★Llama 3.3 70B Instructllama156.5 GBcomfortableFP16 · / 256 GB

25★Llama 3.1 70B Instructllama156.5 GBcomfortableFP16 · / 256 GB

26★DeepSeek R1 Distill Llama 70Bllama156.5 GBcomfortableFP16 · / 256 GB

27★Llama 3_3 Nemotron Super 49B v1_5llama109.5 GBcomfortableFP16 · / 256 GB

28★Mixtral 8x7Bmistral105.1 GBcomfortableFP16 · / 256 GB

29Falcon 40Bfalcon89.4 GBcomfortableFP16 · / 256 GB

30Command R 35Bcommand78.2 GBcomfortableFP16 · / 256 GB

Tight models

4 models

These models barely fit. They can run, but context and speed will be limited.

01★Llama 3.1 405Bllama254.6 GBtightQ4 · / 256 GB

02Hermes 3 405Bhermes254.6 GBtightQ4 · / 256 GB

03★Llama 3.1 405Bllama254.6 GBtightQ4 · / 256 GB

04★Llama 4 Maverick 17Bx128llama251.5 GBtightQ4 · / 256 GB

Unlocked in a 2x rig

512 GB

With two cards in parallel (512 GB total), larger models become reachable.

01★DeepSeek V3.2deepseek430.6 GBcomfortableQ4 · / 512 GB

02★DeepSeek V4 Prodeepseek430.6 GBcomfortableQ4 · / 512 GB

03★DeepSeek R1deepseek421.8 GBcomfortableQ4 · / 512 GB

04★DeepSeek V3deepseek421.8 GBcomfortableQ4 · / 512 GB

05★DeepSeek R1 (0528 snapshot)deepseek421.8 GBcomfortableQ4 · / 512 GB

Similar GPUs

Instinct MI300X192 GB · Instinct CDNA 3+

VRAM estimates updated 2026-06-27.