AppleAppleMac Studiounified memory

Mac Studio M3 Ultra (256GB) for local AI

Mac Studio M3 Ultra (256GB) provides 256 GB of VRAM for local AI. In the LocalIA catalog, 233 out of 242 models run comfortably on a single card.

View all compatible models →Rig around the (256GB) ↗

VRAM

256GB

Models that run comfortably

233 models

These models fit in 256 GB with room for context and stable inference.

01Nemotron 340Bnemotron213.7 GBcomfortableQ4 · / 256 GB

02DeepSeek V2deepseek181.3 GBcomfortableQ5 · / 256 GB

03DeepSeek Coder V2deepseek181.3 GBcomfortableQ5 · / 256 GB

04★Qwen 3 235B A22Bqwen180.6 GBcomfortableQ5 · / 256 GB

05★Qwen3 235B A22Bqwen180.6 GBcomfortableQ5 · / 256 GB

06Falcon 180Bfalcon201.2 GBcomfortableQ8 · / 256 GB

07Mixtral 8x22Bmistral157.6 GBcomfortableQ8 · / 256 GB

08★Mistral Large 123Bmistral137.5 GBcomfortableQ8 · / 256 GB

09★NVIDIA Nemotron 3 Super 120B A12B BF16nemotron134.1 GBcomfortableQ8 · / 256 GB

10★Llama 4 Scout 17Bx16llama121.8 GBcomfortableQ8 · / 256 GB

11★Command R+ 104Bcommand116.2 GBcomfortableQ8 · / 256 GB

12★Qwen 2.5 72Bqwen160.9 GBcomfortableFP16 · / 256 GB

13Qwen 2.5 VL 72Bqwen160.9 GBcomfortableFP16 · / 256 GB

14★Qwen2.5 72B Instructqwen160.9 GBcomfortableFP16 · / 256 GB

15Llama 2 70Bllama156.5 GBcomfortableFP16 · / 256 GB

16Llama 3 70Bllama156.5 GBcomfortableFP16 · / 256 GB

17Llama 3.1 70Bllama156.5 GBcomfortableFP16 · / 256 GB

18★Llama 3.3 70Bllama156.5 GBcomfortableFP16 · / 256 GB

19CodeLlama 70Bcodellama156.5 GBcomfortableFP16 · / 256 GB

20★DeepSeek R1 Distill 70Bdeepseek156.5 GBcomfortableFP16 · / 256 GB

21Hermes 3 70Bhermes156.5 GBcomfortableFP16 · / 256 GB

22★Llama 3.1 Nemotron 70Bnemotron156.5 GBcomfortableFP16 · / 256 GB

23Athene 70Bathene156.5 GBcomfortableFP16 · / 256 GB

24★Llama 3.3 70B Instructllama156.5 GBcomfortableFP16 · / 256 GB

25★Llama 3.1 70B Instructllama156.5 GBcomfortableFP16 · / 256 GB

26★DeepSeek R1 Distill Llama 70Bllama156.5 GBcomfortableFP16 · / 256 GB

27★Llama 3_3 Nemotron Super 49B v1_5llama109.5 GBcomfortableFP16 · / 256 GB

28★Mixtral 8x7Bmistral105.1 GBcomfortableFP16 · / 256 GB

29Falcon 40Bfalcon89.4 GBcomfortableFP16 · / 256 GB

30Command R 35Bcommand78.2 GBcomfortableFP16 · / 256 GB

Tight models

4 models

These models barely fit. They can run, but context and speed will be limited.

01★Llama 3.1 405Bllama254.6 GBtightQ4 · / 256 GB

02Hermes 3 405Bhermes254.6 GBtightQ4 · / 256 GB

03★Llama 3.1 405Bllama254.6 GBtightQ4 · / 256 GB

04★Llama 4 Maverick 17Bx128llama251.5 GBtightQ4 · / 256 GB

Unlocked in a 2x rig

512 GB

With two cards in parallel (512 GB total), larger models become reachable.

01★DeepSeek V3.2deepseek430.6 GBcomfortableQ4 · / 512 GB

02★DeepSeek V4 Prodeepseek430.6 GBcomfortableQ4 · / 512 GB

03★DeepSeek R1deepseek421.8 GBcomfortableQ4 · / 512 GB

04★DeepSeek V3deepseek421.8 GBcomfortableQ4 · / 512 GB

05★DeepSeek R1 (0528 snapshot)deepseek421.8 GBcomfortableQ4 · / 512 GB

Similar GPUs

VRAM estimates updated 2026-06-27. Apple Silicon: part of unified memory remains reserved for the system.