Trinity Large Preview — Hardware Requirements & GPU Compatibility
ChatTrinity Large Preview is a 398.6B-parameter open language model from Arcee AI. It supports a context window of up to 262,144 tokens. At BF16 it needs about 797.82 GB of VRAM — see which GPUs and Macs can run it below.
Specifications
- Publisher
- Arcee AI
- Parameters
- 398.6B
- Architecture
- AfmoeForCausalLM
- Context Length
- 262,144 tokens
- Vocabulary Size
- 200,192
- Release Date
- 2026-02-20
- License
- Apache 2.0
Get Started
HuggingFace
How Much VRAM Does Trinity Large Preview Need?
Select a quantization to see compatible GPUs below.
| Quantization | Bits | VRAM | + Context | File Size | Quality |
|---|---|---|---|---|---|
| BF16 | 16.00 | 797.8 GB | 829.8 GB | 797.27 GB | Brain floating point 16 — preferred for training |
Which GPUs Can Run Trinity Large Preview?
BF16 · 797.8 GBTrinity Large Preview (BF16) requires 797.8 GB of VRAM to load the model weights. For comfortable inference with headroom for KV cache and system overhead, 1038+ GB is recommended. Using the full 262K context window can add up to 32.0 GB, bringing total usage to 829.8 GB. No single GPU has enough memory — multi-GPU or cluster setups are needed.
Related Models
Frequently Asked Questions
- How much VRAM does Trinity Large Preview need?
Trinity Large Preview requires 797.8 GB of VRAM at BF16. Full 262K context adds up to 32.0 GB (829.8 GB total).
VRAM = Weights + KV Cache + Overhead
Weights = 398.6B × 16 bits ÷ 8 = 797.3 GB
KV Cache + Overhead ≈ 0.5 GB (at 2K context + ~0.3 GB framework)
KV Cache + Overhead ≈ 32.5 GB (at full 262K context)
VRAM usage by quantization
BF16797.8 GBBF16 + full context829.8 GB- Can NVIDIA GeForce RTX 5090 run Trinity Large Preview?
No — Trinity Large Preview requires at least 797.8 GB at BF16, which exceeds the NVIDIA GeForce RTX 5090's 32 GB of VRAM.
- Can I run Trinity Large Preview on a Mac?
Trinity Large Preview requires at least 797.8 GB at BF16, which exceeds the unified memory of most consumer Macs. You would need a Mac Studio or Mac Pro with a high-memory configuration.
- Can I run Trinity Large Preview locally?
Yes — Trinity Large Preview can run locally on consumer hardware. At BF16 quantization it needs 797.8 GB of VRAM. Popular tools include Ollama, LM Studio, and llama.cpp.
- What's the download size of Trinity Large Preview?
At BF16, the download is about 797.27 GB.
- Which GPUs can run Trinity Large Preview?
No single consumer GPU has enough VRAM to run Trinity Large Preview at BF16 (797.8 GB). Multi-GPU or professional hardware is required.
- Which devices can run Trinity Large Preview?
Trinity Large Preview requires at least 797.8 GB at BF16, which exceeds the unified memory of most consumer devices. A high-memory Mac Studio, Mac Pro, or multi-GPU desktop setup is recommended.