AppleAppleMac Studiounified memory

Mac Studio M2 Ultra (192GB) for local AI

Mac Studio M2 Ultra (192GB) provides 192 GB of VRAM for local AI. In the LocalIA catalog, 232 out of 242 models run comfortably on a single card.

View all compatible models →Rig around the (192GB) ↗

VRAM

192GB

Models that run comfortably

232 models

These models fit in 192 GB with room for context and stable inference.

01DeepSeek V2deepseek148.4 GBcomfortableQ4 · / 192 GB

02DeepSeek Coder V2deepseek148.4 GBcomfortableQ4 · / 192 GB

03★Qwen 3 235B A22Bqwen147.7 GBcomfortableQ4 · / 192 GB

04★Qwen3 235B A22Bqwen147.7 GBcomfortableQ4 · / 192 GB

05Falcon 180Bfalcon138.3 GBcomfortableQ5 · / 192 GB

06Mixtral 8x22Bmistral157.6 GBcomfortableQ8 · / 192 GB

07★Mistral Large 123Bmistral137.5 GBcomfortableQ8 · / 192 GB

08★NVIDIA Nemotron 3 Super 120B A12B BF16nemotron134.1 GBcomfortableQ8 · / 192 GB

09★Llama 4 Scout 17Bx16llama121.8 GBcomfortableQ8 · / 192 GB

10★Command R+ 104Bcommand116.2 GBcomfortableQ8 · / 192 GB

11★Qwen 2.5 72Bqwen160.9 GBcomfortableFP16 · / 192 GB

12Qwen 2.5 VL 72Bqwen160.9 GBcomfortableFP16 · / 192 GB

13★Qwen2.5 72B Instructqwen160.9 GBcomfortableFP16 · / 192 GB

14Llama 2 70Bllama156.5 GBcomfortableFP16 · / 192 GB

15Llama 3 70Bllama156.5 GBcomfortableFP16 · / 192 GB

16Llama 3.1 70Bllama156.5 GBcomfortableFP16 · / 192 GB

17★Llama 3.3 70Bllama156.5 GBcomfortableFP16 · / 192 GB

18CodeLlama 70Bcodellama156.5 GBcomfortableFP16 · / 192 GB

19★DeepSeek R1 Distill 70Bdeepseek156.5 GBcomfortableFP16 · / 192 GB

20Hermes 3 70Bhermes156.5 GBcomfortableFP16 · / 192 GB

21★Llama 3.1 Nemotron 70Bnemotron156.5 GBcomfortableFP16 · / 192 GB

22Athene 70Bathene156.5 GBcomfortableFP16 · / 192 GB

23★Llama 3.3 70B Instructllama156.5 GBcomfortableFP16 · / 192 GB

24★Llama 3.1 70B Instructllama156.5 GBcomfortableFP16 · / 192 GB

25★DeepSeek R1 Distill Llama 70Bllama156.5 GBcomfortableFP16 · / 192 GB

26★Llama 3_3 Nemotron Super 49B v1_5llama109.5 GBcomfortableFP16 · / 192 GB

27★Mixtral 8x7Bmistral105.1 GBcomfortableFP16 · / 192 GB

28Falcon 40Bfalcon89.4 GBcomfortableFP16 · / 192 GB

29Command R 35Bcommand78.2 GBcomfortableFP16 · / 192 GB

30Aya 23 35Baya78.2 GBcomfortableFP16 · / 192 GB

Unlocked in a 2x rig

384 GB

With two cards in parallel (384 GB total), larger models become reachable.

01★Llama 3.1 405Bllama311.2 GBcomfortableQ5 · / 384 GB

02Hermes 3 405Bhermes311.2 GBcomfortableQ5 · / 384 GB

03★Llama 3.1 405Bllama311.2 GBcomfortableQ5 · / 384 GB

04★Llama 4 Maverick 17Bx128llama307.3 GBcomfortableQ5 · / 384 GB

05Nemotron 340Bnemotron261.2 GBcomfortableQ5 · / 384 GB

Unlocked in a 4x rig

768 GB

Server-style configuration (768 GB total) for the largest open-weight models.

01★DeepSeek V3.2deepseek526.3 GBcomfortableQ5 · / 768 GB

02★DeepSeek V4 Prodeepseek526.3 GBcomfortableQ5 · / 768 GB

03★DeepSeek R1deepseek515.6 GBcomfortableQ5 · / 768 GB

04★DeepSeek V3deepseek515.6 GBcomfortableQ5 · / 768 GB

05★DeepSeek R1 (0528 snapshot)deepseek515.6 GBcomfortableQ5 · / 768 GB

Similar GPUs

VRAM estimates updated 2026-06-27. Apple Silicon: part of unified memory remains reserved for the system.