
GTX 1070 for local AI

The GTX 1070 provides 8 GB of VRAM for local AI. In the LocalIA catalog, 145 of 242 models run comfortably on a single card.


VRAM: 8 GB

Models that run comfortably

145 models

These models fit in 8 GB with room for context and stable inference.

| # | Model | Family | VRAM needed (of 8 GB) | Fit | Quant |
|---|---|---|---|---|---|
| 01 | Solar 10.7B | solar | 6.7 GB | comfortable | Q4 |
| 02 | Falcon 3 10B | falcon | 6.3 GB | comfortable | Q4 |
| 03 | ★ Gemma 2 9B | gemma | 5.7 GB | comfortable | Q4 |
| 04 | Yi 1.5 9B | yi | 5.7 GB | comfortable | Q4 |
| 05 | ★ Qwen 3.5 9B | qwen | 5.7 GB | comfortable | Q4 |
| 06 | ★ GLM-4 9B | glm | 5.7 GB | comfortable | Q4 |
| 07 | ★ GLM-4.7 Flash | glm | 5.7 GB | comfortable | Q4 |
| 08 | GLM-4.1V 9B Thinking | glm | 5.7 GB | comfortable | Q4 |
| 09 | ★ NVIDIA Nemotron Nano 9B | nemotron | 5.7 GB | comfortable | Q4 |
| 10 | ★ gemma 2 9b it | gemma | 5.7 GB | comfortable | Q4 |
| 11 | Llama 3 8B | llama | 6.1 GB | comfortable | Q5 |
| 12 | ★ Llama 3.1 8B | llama | 6.1 GB | comfortable | Q5 |
| 13 | Ministral 8B | mistral | 6.1 GB | comfortable | Q5 |
| 14 | ★ Qwen 3 8B | qwen | 6.1 GB | comfortable | Q5 |
| 15 | DeepSeek R1 Distill 8B | deepseek | 6.1 GB | comfortable | Q5 |
| 16 | Aya 23 8B | aya | 6.1 GB | comfortable | Q5 |
| 17 | Granite 3 8B | granite | 6.1 GB | comfortable | Q5 |
| 18 | ★ Hermes 3 8B | hermes | 6.1 GB | comfortable | Q5 |
| 19 | ★ DeepSeek R1 Distill Llama 8B | deepseek | 6.1 GB | comfortable | Q5 |
| 20 | ★ MiniCPM 4.1 8B | minicpm | 6.1 GB | comfortable | Q5 |
| 21 | ★ Qwen3 8B | qwen | 6.1 GB | comfortable | Q5 |
| 22 | ★ Llama 3.1 8B Instruct | llama | 6.1 GB | comfortable | Q5 |
| 23 | ★ Llama 3.1 8B | llama | 6.1 GB | comfortable | Q5 |
| 24 | ★ Meta Llama 3 8B Instruct | llama | 6.1 GB | comfortable | Q5 |
| 25 | ★ Meta Llama 3 8B | llama | 6.1 GB | comfortable | Q5 |
| 26 | ★ DeepSeek R1 0528 Qwen3 8B | qwen | 6.1 GB | comfortable | Q5 |
| 27 | ★ Nemotron Labs Diffusion 8B Base | nemotron | 6.1 GB | comfortable | Q5 |
| 28 | ★ Qwen3 8B Base | qwen | 6.1 GB | comfortable | Q5 |
| 29 | ★ Meta Llama 3.1 8B Instruct | llama | 6.1 GB | comfortable | Q5 |
| 30 | ★ saiga_llama3_8b | llama | 6.1 GB | comfortable | Q5 |

Tight models

3 models

These models barely fit. They can run, but context and speed will be limited.

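| # | Model | Family | VRAM needed (of 8 GB) | Fit | Quant |
|---|---|---|---|---|---|
| 01 | ★ Mistral Nemo 12B | mistral | 7.5 GB | tight | Q4 |
| 02 | ★ Gemma 3 12B | gemma | 7.5 GB | tight | Q4 |
| 03 | StableLM 2 12B | stable | 7.5 GB | tight | Q4 |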
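The comfortable and tight labels on this page are consistent with a simple utilization cutoff. The sketch below is only an illustration: the names `classify_fit` and `COMFORTABLE_RATIO` are hypothetical, and the 0.85 threshold is an assumption inferred from the listed rows (for example, 6.7 GB of 8 GB is comfortable while 7.5 GB of 8 GB is tight), not a rule documented by LocalIA.

```python
# Assumed cutoff: a model is "comfortable" when it needs at most 85% of the
# available VRAM. This value is inferred from the rows on this page and is
# not taken from the catalog's documentation.
COMFORTABLE_RATIO = 0.85


def classify_fit(needed_gb: float, available_gb: float) -> str:
    """Label a model by how much of the available VRAM it would occupy."""
    if needed_gb > available_gb:
        return "does not fit"
    if needed_gb <= COMFORTABLE_RATIO * available_gb:
        return "comfortable"
    # Fits, but leaves little headroom for context (KV cache) and buffers.
    return "tight"


if __name__ == "__main__":
    for name, needed in [("Solar 10.7B", 6.7), ("Mistral Nemo 12B", 7.5)]:
        print(name, classify_fit(needed, available_gb=8.0))
```

The remaining headroom matters because context length consumes VRAM on top of the model weights, which is why tight models run with limited context.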

Unlocked in a 2x rig

16 GB

With two cards in parallel (16 GB total), larger models become reachable.

| # | Model | Family | VRAM needed (of 16 GB) | Fit | Quant |
|---|---|---|---|---|---|
| 01 | ★ Mistral Small 3 24B | mistral | 15.1 GB | tight | Q4 |
| 02 | ★ Mistral Small 3.1 24B | mistral | 15.1 GB | tight | Q4 |
| 03 | ★ Mistral Small 3.2 24B | mistral | 15.1 GB | tight | Q4 |
| 04 | ★ Devstral Small 2 24B | devstral | 15.1 GB | tight | Q4 |
| 05 | Mistral Small 22B | mistral | 13.8 GB | tight | Q4 |
| 06 | ★ Codestral 22B | codestral | 13.8 GB | tight | Q4 |
| 07 | Reka Flash 3 21B | reka | 13.2 GB | comfortable | Q4 |
| 08 | InternLM 2.5 20B | internlm | 12.6 GB | comfortable | Q4 |
| 09 | DeepSeek V2 Lite | deepseek | 12.3 GB | comfortable | Q5 |
| 10 | DeepSeek Coder V2 Lite | deepseek | 12.3 GB | comfortable | Q5 |
| 11 | StarCoder 2 15B | starcoder | 11.5 GB | comfortable | Q5 |
| 12 | ★ Phi-4 Reasoning Vision 15B | phi | 11.5 GB | comfortable | Q5 |
| 13 | ★ Qwen 2.5 14B | qwen | 10.8 GB | comfortable | Q5 |
| 14 | Qwen 2.5 Coder 14B | qwen | 10.8 GB | comfortable | Q5 |
| 15 | ★ Qwen 3 14B | qwen | 10.8 GB | comfortable | Q5 |

Unlocked in a 4x rig

32 GB

Server-style configuration (32 GB total) for the largest open-weight models.

| # | Model | Family | VRAM needed (of 32 GB) | Fit | Quant |
|---|---|---|---|---|---|
| 01 | ★ Llama 3_3 Nemotron Super 49B v1_5 | llama | 30.8 GB | tight | Q4 |
| 02 | ★ Mixtral 8x7B | mistral | 29.5 GB | tight | Q4 |
| 03 | Falcon 40B | falcon | 25.1 GB | comfortable | Q4 |
| 04 | Command R 35B | command | 26.9 GB | comfortable | Q5 |
| 05 | Aya 23 35B | aya | 26.9 GB | comfortable | Q5 |
| 06 | CodeLlama 34B | codellama | 26.1 GB | comfortable | Q5 |
| 07 | Yi 1.5 34B | yi | 26.1 GB | comfortable | Q5 |
| 08 | ★ dolphin 2.9.1 yi 1.5 34b | yi | 26.1 GB | comfortable | Q5 |
| 09 | ★ Qwen 2.5 32B | qwen | 24.6 GB | comfortable | Q5 |
| 10 | ★ Qwen 2.5 Coder 32B | qwen | 24.6 GB | comfortable | Q5 |
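The rig tiers on this page come down to simple arithmetic: each GTX 1070 contributes 8 GB, so two cards give 16 GB and four give 32 GB. The sketch below picks the smallest listed rig for a given VRAM requirement. The names `smallest_rig` and `RIG_SIZES` are hypothetical, and the code assumes VRAM pools linearly across cards, which ignores per-card overhead and depends on the inference runtime being able to split a model across GPUs.

```python
from typing import Optional

CARD_VRAM_GB = 8.0      # VRAM of one GTX 1070, as stated on this page
RIG_SIZES = (1, 2, 4)   # rig configurations listed on this page


def pooled_vram_gb(cards: int) -> float:
    """Total VRAM of a rig, assuming memory adds up linearly across cards."""
    return cards * CARD_VRAM_GB


def smallest_rig(needed_gb: float) -> Optional[int]:
    """Return the smallest listed card count whose pooled VRAM holds the model."""
    for cards in RIG_SIZES:
        if needed_gb <= pooled_vram_gb(cards):
            return cards
    return None  # larger than the biggest rig considered here


if __name__ == "__main__":
    for name, needed in [("Qwen 3 14B", 10.8), ("Mixtral 8x7B", 29.5)]:
        print(name, smallest_rig(needed))
```

A model that only just fits a rig, such as a 15.1 GB model on 16 GB, still lands in the tight category, so the card count alone does not guarantee comfortable headroom.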

VRAM estimates updated 2026-06-27.