
Instinct MI100 for local AI

The Instinct MI100 provides 32 GB of VRAM for local AI. Of the 242 models in the LocalIA catalog, 207 run comfortably on a single card.

VRAM: 32 GB
Category: Datacenter
Series: Instinct CDNA 1-2
Vendor: AMD

Models that run comfortably

These models fit in 32 GB with room for context and stable inference.

Falcon 40B (falcon) · 25.1 GB · Q4 / 32 GB · comfortable
Command R 35B (command) · 26.9 GB · Q5 / 32 GB · comfortable
Aya 23 35B (aya) · 26.9 GB · Q5 / 32 GB · comfortable
CodeLlama 34B (codellama) · 26.1 GB · Q5 / 32 GB · comfortable
Yi 1.5 34B (yi) · 26.1 GB · Q5 / 32 GB · comfortable
dolphin 2.9.1 yi 1.5 34b (yi) · 26.1 GB · Q5 / 32 GB · comfortable
Qwen 2.5 32B (qwen) · 24.6 GB · Q5 / 32 GB · comfortable
Qwen 2.5 Coder 32B (qwen) · 24.6 GB · Q5 / 32 GB · comfortable
Qwen 3 32B (qwen) · 24.6 GB · Q5 / 32 GB · comfortable
QwQ 32B (qwq) · 24.6 GB · Q5 / 32 GB · comfortable
DeepSeek R1 Distill 32B (deepseek) · 24.6 GB · Q5 / 32 GB · comfortable
Qwen 2.5 VL 32B (qwen) · 24.6 GB · Q5 / 32 GB · comfortable
Granite 4 H-Small 32B-A9B (granite) · 24.6 GB · Q5 / 32 GB · comfortable
GLM-4.6 (glm) · 24.6 GB · Q5 / 32 GB · comfortable
GLM-4.7 (glm) · 24.6 GB · Q5 / 32 GB · comfortable
GLM-5 (glm) · 24.6 GB · Q5 / 32 GB · comfortable
GLM-5.1 (glm) · 24.6 GB · Q5 / 32 GB · comfortable
Qwen3 32B (qwen) · 24.6 GB · Q5 / 32 GB · comfortable
Qwen2.5 Coder 32B Instruct (qwen) · 24.6 GB · Q5 / 32 GB · comfortable
DeepSeek R1 Distill Qwen 32B (qwen) · 24.6 GB · Q5 / 32 GB · comfortable
Qwen2.5 32B Instruct (qwen) · 24.6 GB · Q5 / 32 GB · comfortable
Gemma 4 31B (gemma) · 23.8 GB · Q5 / 32 GB · comfortable
Qwen 3 30B A3B (qwen) · 23.1 GB · Q5 / 32 GB · comfortable
MPT 30B (mpt) · 23.1 GB · Q5 / 32 GB · comfortable
Qwen3 Coder 30B A3B Instruct (qwen) · 23.1 GB · Q5 / 32 GB · comfortable
Qwen3 30B A3B (qwen) · 23.1 GB · Q5 / 32 GB · comfortable
Qwen3 30B A3B Instruct 2507 (qwen) · 23.1 GB · Q5 / 32 GB · comfortable
NVIDIA Nemotron 3 Nano 30B A3B BF16 (nemotron) · 23.1 GB · Q5 / 32 GB · comfortable
Gemma 2 27B (gemma) · 20.7 GB · Q5 / 32 GB · comfortable
Gemma 3 27B (gemma) · 20.7 GB · Q5 / 32 GB · comfortable
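The sizes above follow a familiar rule of thumb: quantized weight size ≈ parameter count × effective bits per weight ÷ 8. A minimal sketch in Python; the effective-bpw values are assumptions inferred from the figures on this page (not LocalIA's published formula), and the 4 GB headroom threshold is likewise a hypothetical illustration of the comfortable/tight split:

```python
# Effective bits per weight inferred from this page's figures (assumptions):
# K-quants store scales alongside weights, so Q4 lands near ~5.0 bpw
# and Q5 near ~6.15 bpw in practice.
EFFECTIVE_BPW = {"Q4": 5.0, "Q5": 6.15}


def model_size_gb(params_billion: float, quant: str) -> float:
    """Rough quantized weight size in GB: params x bits-per-weight / 8."""
    return params_billion * EFFECTIVE_BPW[quant] / 8


def fits(params_billion: float, quant: str, vram_gb: float,
         headroom_gb: float = 4.0) -> str:
    """Classify fit: 'comfortable' leaves headroom for KV cache and context.

    The 4 GB headroom cutoff is a hypothetical value for illustration.
    """
    size = model_size_gb(params_billion, quant)
    if size + headroom_gb <= vram_gb:
        return "comfortable"
    if size <= vram_gb:
        return "tight"
    return "too large"
```

For example, a 32B model at Q5 estimates to about 24.6 GB, matching the rows above, and a ~47B Mixtral-class model at Q4 (~29.5 GB) classifies as tight on a 32 GB card.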

Tight models

These models barely fit. They can run, but context and speed will be limited.

Mixtral 8x7B (mistral) · 29.5 GB · Q4 / 32 GB · tight
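Why a tight fit limits context: whatever VRAM the weights leave free must hold the KV cache, which grows linearly with context length. A back-of-the-envelope sketch; the Mixtral-like geometry (32 layers, 8 KV heads via GQA, head dim 128) and the ~2 GB of leftover headroom are illustrative assumptions, not figures from this page:

```python
def kv_bytes_per_token(n_layers: int, n_kv_heads: int,
                       head_dim: int, dtype_bytes: int = 2) -> int:
    """Per-token KV-cache cost: keys and values (factor 2) for every layer."""
    return 2 * n_layers * n_kv_heads * head_dim * dtype_bytes


# Mixtral-8x7B-like geometry (illustrative): 32 layers, 8 KV heads (GQA),
# head_dim 128, fp16 cache entries -> 131072 bytes per token of context.
per_token = kv_bytes_per_token(32, 8, 128)

# If ~29.5 GB of a 32 GB card is taken by weights, roughly 2 GB of
# headroom supports only a modest context window (~15k tokens).
max_tokens = int(2e9 / per_token)
```

The same arithmetic explains why the "comfortable" rows, which leave several GB free, can sustain longer contexts.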

Unlocked in a 2x rig

With two cards in parallel (64 GB total), larger models become reachable.

Qwen3 Next 80B A3B Instruct (qwen) · 50.3 GB · Q4 / 64 GB · comfortable
Qwen 2.5 72B (qwen) · 45.3 GB · Q4 / 64 GB · comfortable
Qwen 2.5 VL 72B (qwen) · 45.3 GB · Q4 / 64 GB · comfortable
Qwen2.5 72B Instruct (qwen) · 45.3 GB · Q4 / 64 GB · comfortable
Llama 2 70B (llama) · 53.8 GB · Q5 / 64 GB · comfortable
Llama 3 70B (llama) · 53.8 GB · Q5 / 64 GB · comfortable
Llama 3.1 70B (llama) · 53.8 GB · Q5 / 64 GB · comfortable
Llama 3.3 70B (llama) · 53.8 GB · Q5 / 64 GB · comfortable
CodeLlama 70B (codellama) · 53.8 GB · Q5 / 64 GB · comfortable
DeepSeek R1 Distill 70B (deepseek) · 53.8 GB · Q5 / 64 GB · comfortable
Hermes 3 70B (hermes) · 53.8 GB · Q5 / 64 GB · comfortable
Llama 3.1 Nemotron 70B (nemotron) · 53.8 GB · Q5 / 64 GB · comfortable
Athene 70B (athene) · 53.8 GB · Q5 / 64 GB · comfortable
Llama 3.3 70B Instruct (llama) · 53.8 GB · Q5 / 64 GB · comfortable
Llama 3.1 70B Instruct (llama) · 53.8 GB · Q5 / 64 GB · comfortable

Unlocked in a 4x rig

A server-style configuration (128 GB total) brings the largest open-weight models within reach.

Falcon 180B (falcon) · 113.2 GB · Q4 / 128 GB · tight
Mixtral 8x22B (mistral) · 108.3 GB · Q5 / 128 GB · comfortable
Mistral Large 123B (mistral) · 94.5 GB · Q5 / 128 GB · comfortable
NVIDIA Nemotron 3 Super 120B A12B BF16 (nemotron) · 92.2 GB · Q5 / 128 GB · comfortable
Llama 4 Scout 17Bx16 (llama) · 83.7 GB · Q5 / 128 GB · comfortable
Command R+ 104B (command) · 79.9 GB · Q5 / 128 GB · comfortable

VRAM estimates updated 2026-05-12.