Loading search index…
No recent searches
No results for "Query here"
← Back to Glossary
The file format used by llama.cpp to store quantized model weights, tokenizer data, and metadata in a single file. The standard format for local LLM inference. Replaced the older GGML format.