Mistral · Dense · Apache 2.0

Mistral 7B v0.3

Parameters: 7.3B
Max Context: 32K
Architecture: Dense
Released: May 20, 2024
Modality: Text

About Mistral 7B v0.3

Mistral 7B v0.3 is the refined version of the model that launched the European open-source LLM movement. At 7.25B parameters, it punches above its weight on reasoning and multilingual tasks, particularly in French, German, Spanish, and Italian. Its Apache 2.0 license permits commercial deployment. With mature GGUF support and roughly 4 GB of VRAM at Q4_K_M with a short context, it remains a solid alternative to Llama 3.1 8B, especially for multilingual use cases.

Multilingual · Chat · Code · Commercial

Technical Specifications

Total Parameters: 7.3B
Architecture: Dense
Attention Type: GQA (Grouped Query Attention)
Hidden Dimension: d = 4,096
Transformer Layers: 32
Attention Heads: 32
KV Heads: n_kv = 8
Head Dimension: d_head = 128
Activation Function: SwiGLU
Normalization: RMSNorm
Position Embedding: RoPE
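
The attention figures in this list determine how much memory the key/value cache needs per token of context. The sketch below works through that arithmetic using the layer count, KV head count, and head dimension listed above. It assumes an FP16 cache (2 bytes per value) and reads "32K" as 32,768 tokens; neither is stated on this page, and the function name `kv_cache_bytes` is purely illustrative.

```python
# Values copied from the specification list above.
N_LAYERS = 32
N_ATTN_HEADS = 32
N_KV_HEADS = 8
HEAD_DIM = 128

GIB = 1024 ** 3


def kv_cache_bytes(n_tokens: int, n_kv_heads: int = N_KV_HEADS,
                   bytes_per_value: int = 2) -> int:
    """Bytes needed to cache keys and values for n_tokens of context.

    bytes_per_value=2 assumes an FP16 cache (an assumption, not a documented setting).
    """
    # Factor 2: one key tensor and one value tensor per layer.
    return 2 * N_LAYERS * n_kv_heads * HEAD_DIM * bytes_per_value * n_tokens


if __name__ == "__main__":
    for ctx in (1024, 32 * 1024):
        gqa = kv_cache_bytes(ctx) / GIB
        # Hypothetical comparison: the same model with one KV head per attention head.
        mha = kv_cache_bytes(ctx, n_kv_heads=N_ATTN_HEADS) / GIB
        print(f"{ctx:>6} tokens: GQA {gqa:.3f} GiB, full multi-head {mha:.3f} GiB")
```

With 8 KV heads shared across 32 attention heads, the cache is a quarter of the size it would be if every attention head kept its own keys and values, which is the main reason long contexts stay affordable on a single GPU.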

System Requirements

Estimated VRAM in GB, including 10% overhead, for each quantization method at 1K and 32K context.

Quantization  Bytes/weight  Quality        1K ctx    32K ctx
Q4_K_M        0.50          ~97% of FP16   3.87 GB   7.75 GB
Q8_0          1.00          ~100% of FP16  7.62 GB   11.49 GB
F16           2.00          Reference      15.11 GB  18.99 GB

Fit categories: fits 24 GB consumer GPU; fits 80 GB datacenter GPU; requires cluster / multi-GPU. Every configuration listed here falls in the first category.
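
For a rough sense of where such estimates come from, the sketch below adds weight memory (parameters times bytes per weight) to an FP16 key/value cache and applies a flat overhead factor. This is an assumed formula, not the one behind the figures on this page, so its results will land near the table without matching it exactly. `ModelSpec` and `estimate_vram_gib` are hypothetical names, and the default parameter count is the 7.25B quoted in the description.

```python
from dataclasses import dataclass

GIB = 1024 ** 3


@dataclass(frozen=True)
class ModelSpec:
    """Model dimensions taken from this page (7.25B is the quoted parameter count)."""
    n_params: float = 7.25e9
    n_layers: int = 32
    n_kv_heads: int = 8
    head_dim: int = 128


def estimate_vram_gib(spec: ModelSpec, bytes_per_weight: float, n_ctx: int,
                      overhead: float = 0.10, kv_bytes_per_value: int = 2) -> float:
    """Rough VRAM estimate in GiB.

    Assumptions (not taken from the page): the KV cache is stored in FP16,
    and the overhead factor applies to weights and cache alike.
    """
    weight_bytes = spec.n_params * bytes_per_weight
    # Keys and values (factor 2) for every layer, KV head, and cached token.
    kv_bytes = (2 * spec.n_layers * spec.n_kv_heads * spec.head_dim
                * kv_bytes_per_value * n_ctx)
    return (weight_bytes + kv_bytes) * (1.0 + overhead) / GIB


if __name__ == "__main__":
    spec = ModelSpec()
    # Bytes-per-weight values as listed in the table above.
    for name, bpw in (("Q4_K_M", 0.5), ("Q8_0", 1.0), ("F16", 2.0)):
        short = estimate_vram_gib(spec, bpw, 1024)
        long_ctx = estimate_vram_gib(spec, bpw, 32 * 1024)
        print(f"{name:<7} 1K ctx: {short:6.2f} GiB   32K ctx: {long_ctx:6.2f} GiB")
```

The split makes the trade-off visible: quantization shrinks only the weight term, while the context-dependent term stays the same across all three rows unless the cache itself is quantized.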
