All LLM Models

Browse 36 LLM models with VRAM requirements, quantization options, and hardware compatibility.


Understanding LLM VRAM Requirements

How much VRAM you need depends on the model size and quantization level. Quantization reduces the precision of model weights, trading small quality losses for significantly lower VRAM usage. For example, a 7B parameter model needs ~14 GB at FP16 but only ~4 GB at Q4_K_M quantization.
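The arithmetic behind those figures can be sketched as parameters times bits per weight, divided by eight to get bytes. This is a rough estimate under stated assumptions: the function name `estimate_weight_gb` is hypothetical, the ~4.5 bits per weight used for Q4_K_M is an approximate effective average rather than an exact specification, and the result covers weights only, not the KV cache, activations, or runtime overhead.

```python
def estimate_weight_gb(num_params: float, bits_per_weight: float) -> float:
    """Approximate memory for model weights alone, in decimal gigabytes."""
    bytes_total = num_params * bits_per_weight / 8  # 8 bits per byte
    return bytes_total / 1e9


# Assumed effective bits per weight; the Q4_K_M value is an approximation.
BITS_PER_WEIGHT = {"FP16": 16.0, "Q4_K_M": 4.5}

for name, bits in BITS_PER_WEIGHT.items():
    print(f"7B at {name}: ~{estimate_weight_gb(7e9, bits):.1f} GB")
```

Actual requirements are usually somewhat higher than this estimate, since context length and the inference runtime add memory on top of the weights.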

Llama 2 13B HF

Meta · 13.0B · runs from 6.1 GB

27.9K 628

Llama 2 13B HF is a 13.0B-parameter open language model from Meta in the Llama 2 family. See its VRAM requirements by quantization and which GPUs and Macs can run it locally below.

Chat

Llama 2 70B HF

Meta · 69.0B · runs from 151.8 GB

15.4K 855

Llama 2 70B HF is a 69.0B-parameter open language model from Meta in the Llama 2 family. See its VRAM requirements by quantization and which GPUs and Macs can run it locally below.

Chat

Meta Llama Guard 2 8B

Meta · 8.0B · runs from 17.7 GB

8.3K 307

Meta Llama Guard 2 8B is an 8.0B-parameter open language model from Meta in the Llama family. See its VRAM requirements by quantization and which GPUs and Macs can run it locally below.

Chat

Llama 3.2 90B Vision Instruct

Meta · 88.6B · runs from 194.9 GB

849 358

Llama 3.2 90B Vision Instruct is an 88.6B-parameter open language model from Meta in the Llama 3 family. See its VRAM requirements by quantization and which GPUs and Macs can run it locally below.

Vision

KernelLLM

Meta · 8.0B · runs from 4.0 GB

137 202

KernelLLM is an 8.0B-parameter open language model from Meta. It supports a context window of up to 131,072 tokens. See its VRAM requirements by quantization and which GPUs and Macs can run it locally below.

Chat

MobileLLM R1.5 950M

Meta · 950M · runs from 2.1 GB

56 19

MobileLLM R1.5 950M is a 950M-parameter open language model from Meta. See its VRAM requirements by quantization and which GPUs and Macs can run it locally below.

Chat · Reasoning
