AMD · Datacenter · Instinct CDNA 1-2

Instinct MI250 for local AI

Instinct MI250 provides 128 GB of VRAM for local AI. In the LocalIA catalog, 228 out of 242 models run comfortably on a single card.


VRAM

128 GB

Models that run comfortably

228 models

These models fit in 128 GB with room for context and stable inference.

01 Mixtral 8x22B · mistral · 108.3 / 128 GB · comfortable · Q5
02 ★ Mistral Large 123B · mistral · 94.5 / 128 GB · comfortable · Q5
03 ★ NVIDIA Nemotron 3 Super 120B A12B BF16 · nemotron · 92.2 / 128 GB · comfortable · Q5
04 ★ Llama 4 Scout 17Bx16 · llama · 83.7 / 128 GB · comfortable · Q5
05 ★ Command R+ 104B · command · 79.9 / 128 GB · comfortable · Q5
06 ★ Qwen3 Next 80B A3B Instruct · qwen · 89.4 / 128 GB · comfortable · Q8
07 ★ Qwen 2.5 72B · qwen · 80.5 / 128 GB · comfortable · Q8
08 Qwen 2.5 VL 72B · qwen · 80.5 / 128 GB · comfortable · Q8
09 ★ Qwen2.5 72B Instruct · qwen · 80.5 / 128 GB · comfortable · Q8
10 Llama 2 70B · llama · 78.2 / 128 GB · comfortable · Q8
11 Llama 3 70B · llama · 78.2 / 128 GB · comfortable · Q8
12 Llama 3.1 70B · llama · 78.2 / 128 GB · comfortable · Q8
13 ★ Llama 3.3 70B · llama · 78.2 / 128 GB · comfortable · Q8
14 CodeLlama 70B · codellama · 78.2 / 128 GB · comfortable · Q8
15 ★ DeepSeek R1 Distill 70B · deepseek · 78.2 / 128 GB · comfortable · Q8
16 Hermes 3 70B · hermes · 78.2 / 128 GB · comfortable · Q8
17 ★ Llama 3.1 Nemotron 70B · nemotron · 78.2 / 128 GB · comfortable · Q8
18 Athene 70B · athene · 78.2 / 128 GB · comfortable · Q8
19 ★ Llama 3.3 70B Instruct · llama · 78.2 / 128 GB · comfortable · Q8
20 ★ Llama 3.1 70B Instruct · llama · 78.2 / 128 GB · comfortable · Q8
21 ★ Mixtral 8x7B · mistral · 105.1 / 128 GB · comfortable · FP16
22 Falcon 40B · falcon · 89.4 / 128 GB · comfortable · FP16
23 Command R 35B · command · 78.2 / 128 GB · comfortable · FP16
24 Aya 23 35B · aya · 78.2 / 128 GB · comfortable · FP16
25 CodeLlama 34B · codellama · 76.0 / 128 GB · comfortable · FP16
26 Yi 1.5 34B · yi · 76.0 / 128 GB · comfortable · FP16
27 ★ dolphin 2.9.1 yi 1.5 34b · yi · 76.0 / 128 GB · comfortable · FP16
28 ★ Qwen 2.5 32B · qwen · 71.5 / 128 GB · comfortable · FP16
29 ★ Qwen 2.5 Coder 32B · qwen · 71.5 / 128 GB · comfortable · FP16
30 ★ Qwen 3 32B · qwen · 71.5 / 128 GB · comfortable · FP16

Tight models

1 model

These models barely fit. They can run, but context and speed will be limited.

01 Falcon 180B · falcon · 113.2 / 128 GB · tight · Q4
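The "comfortable" and "tight" labels can be read as a headroom check: how much of the available VRAM the model's estimated footprint consumes. The catalog does not publish its cutoff, so the sketch below uses a hypothetical threshold of 85% (the name `COMFORTABLE_MAX_FRACTION` and the function `classify_fit` are illustrative, not part of the catalog). That value was chosen only because it agrees with the rows shown on this page.

```python
# Hypothetical cutoff: the catalog does not state one. 0.85 is an assumption
# picked only because it matches the labels on the rows listed on this page.
COMFORTABLE_MAX_FRACTION = 0.85


def classify_fit(needed_gb: float, capacity_gb: float) -> str:
    """Label a model by how much of the available VRAM it would occupy."""
    if needed_gb > capacity_gb:
        return "does not fit"
    if needed_gb / capacity_gb <= COMFORTABLE_MAX_FRACTION:
        return "comfortable"
    return "tight"


# VRAM estimates copied from this page, checked against one 128 GB card.
rows = [
    ("Mixtral 8x22B", 108.3),
    ("Mistral Large 123B", 94.5),
    ("Falcon 180B", 113.2),
    ("Llama 3.1 405B", 254.6),
]
for name, needed in rows:
    print(f"{name}: {classify_fit(needed, 128.0)}")
```

Under this assumed cutoff, Falcon 180B at 113.2 GB uses about 88% of the card and lands in "tight", while Mixtral 8x22B at 108.3 GB uses just under 85% and stays "comfortable".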

Unlocked in a 2x rig

256 GB

With two cards in parallel (256 GB total), larger models become reachable.

01 ★ Llama 3.1 405B · llama · 254.6 / 256 GB · tight · Q4
02 Hermes 3 405B · hermes · 254.6 / 256 GB · tight · Q4
03 ★ Llama 4 Maverick 17Bx128 · llama · 251.5 / 256 GB · tight · Q4
04 Nemotron 340B · nemotron · 213.7 / 256 GB · comfortable · Q4
05 DeepSeek V2 · deepseek · 181.3 / 256 GB · comfortable · Q5
06 DeepSeek Coder V2 · deepseek · 181.3 / 256 GB · comfortable · Q5
07 ★ Qwen 3 235B A22B · qwen · 180.6 / 256 GB · comfortable · Q5
08 ★ Qwen3 235B A22B · qwen · 180.6 / 256 GB · comfortable · Q5

Unlocked in a 4x rig

512 GB

Server-style configuration (512 GB total) for the largest open-weight models.

01 ★ DeepSeek V3.2 · deepseek · 430.6 / 512 GB · comfortable · Q4
02 ★ DeepSeek V4 Pro · deepseek · 430.6 / 512 GB · comfortable · Q4
03 ★ DeepSeek R1 · deepseek · 421.8 / 512 GB · comfortable · Q4
04 ★ DeepSeek V3 · deepseek · 421.8 / 512 GB · comfortable · Q4
05 ★ DeepSeek R1 (0528 snapshot) · deepseek · 421.8 / 512 GB · comfortable · Q4
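The 1x, 2x, and 4x tiers amount to picking the smallest rig whose pooled VRAM covers a model's estimate. A minimal sketch follows, assuming pooled capacity is simply the card count times 128 GB and ignoring any per-card overhead from splitting a model across devices. The function name `smallest_rig` is illustrative.

```python
from typing import Optional

CARD_VRAM_GB = 128.0    # per-card VRAM stated on this page
RIG_SIZES = (1, 2, 4)   # rig sizes listed on this page


def smallest_rig(needed_gb: float) -> Optional[int]:
    """Return the smallest listed card count whose pooled VRAM covers needed_gb.

    Assumption: pooled VRAM is cards * CARD_VRAM_GB, with no allowance for
    the overhead of sharding a model across cards.
    """
    for cards in RIG_SIZES:
        if needed_gb <= cards * CARD_VRAM_GB:
            return cards
    return None  # larger than the biggest rig listed here


# Estimates copied from the lists above.
for name, needed in [
    ("Falcon 180B", 113.2),
    ("Llama 3.1 405B", 254.6),
    ("DeepSeek R1", 421.8),
]:
    print(f"{name}: {smallest_rig(needed)}x rig")
```

This only answers whether the weights fit. A model that barely fits, such as Llama 3.1 405B on two cards, is still listed as tight.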

Similar GPUs

Instinct MI250X · 128 GB · Instinct CDNA 1-2

VRAM estimates updated 2026-05-12.
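The page does not say how these estimates are computed. The dense-model figures are, however, consistent with a simple rule: parameter count times bytes per weight, times a fixed 20% overhead, expressed in binary gigabytes. The sketch below encodes that reading. Every constant in it is reverse-fitted from the rows on this page and is an assumption rather than the catalog's documented method, and the name `estimate_vram_gb` is illustrative.

```python
# All constants are inferred from the listed rows, not documented by the catalog.
BITS_PER_WEIGHT = {"Q4": 4.5, "Q5": 5.5, "Q8": 8.0, "FP16": 16.0}
OVERHEAD = 1.2   # assumed allowance for context and runtime buffers
GIB = 2**30      # assumption: the page's "GB" figures are binary gigabytes


def estimate_vram_gb(params_billion: float, quant: str) -> float:
    """Estimate the VRAM footprint for a parameter count and quantization level."""
    weight_bytes = params_billion * 1e9 * BITS_PER_WEIGHT[quant] / 8
    return round(weight_bytes * OVERHEAD / GIB, 1)


# Compare against figures listed on this page.
checks = [
    ("Llama 3.3 70B", 70, "Q8", 78.2),
    ("Mistral Large 123B", 123, "Q5", 94.5),
    ("Falcon 180B", 180, "Q4", 113.2),
    ("Qwen 3 32B", 32, "FP16", 71.5),
]
for name, params, quant, listed in checks:
    print(f"{name}: estimated {estimate_vram_gb(params, quant)} GB, listed {listed} GB")
```

Mixture-of-experts entries need their total parameter count rather than the active count, which this page does not give, so they are left out of the comparison.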