Custom Builds
Collection
Text Generation Cook with it's GGUF
•
31 items
•
Updated
•
8
The QwQ-4B-Instruct is a lightweight and efficient fine-tuned language model for instruction-following tasks and reasoning. It is based on a quantized version of the Qwen2.5-7B model, optimized for inference speed and reduced memory consumption, while retaining robust capabilities for complex tasks.
File Name | Size | Description | Upload Status |
---|---|---|---|
.gitattributes |
1.57 kB | Tracks files stored with Git LFS. | Uploaded |
README.md |
271 Bytes | Basic project documentation. | Updated |
added_tokens.json |
657 Bytes | Specifies additional tokens for the tokenizer. | Uploaded |
config.json |
1.26 kB | Detailed model configuration file. | Uploaded |
generation_config.json |
281 Bytes | Configuration for text generation settings. | Uploaded |
merges.txt |
1.82 MB | Byte pair encoding (BPE) merge rules for tokenizer. | Uploaded |
model-00001-of-00002.safetensors |
4.46 GB | Part 1 of the model weights in safetensors format. | Uploaded (LFS) |
model-00002-of-00002.safetensors |
1.09 GB | Part 2 of the model weights in safetensors format. | Uploaded (LFS) |
model.safetensors.index.json |
124 kB | Index file for safetensors model sharding. | Uploaded |
special_tokens_map.json |
644 Bytes | Mapping of special tokens (e.g., , ). | Uploaded |
tokenizer.json |
11.4 MB | Complete tokenizer configuration. | Uploaded (LFS) |
tokenizer_config.json |
7.73 kB | Settings for the tokenizer integration. | Uploaded |
vocab.json |
2.78 MB | Vocabulary file containing token-to-id mappings. | Uploaded |
Model Size:
Precision Support:
Model Sharding:
model-00001-of-00002.safetensors
(4.46 GB)model-00002-of-00002.safetensors
(1.09 GB)model.safetensors.index.json
.Tokenizer:
vocab.json
(2.78 MB)merges.txt
(1.82 MB)tokenizer.json
(11.4 MB, pre-trained configuration).special_tokens_map.json
(e.g., <pad>
, <eos>
).Configuration Files:
config.json
: Defines the architecture, hyperparameters, and settings.generation_config.json
: Specifies text generation behavior (e.g., max length, temperature).Instruction-Following:
Reasoning:
Text Generation:
Resource-Constrained Applications:
Base model
Qwen/Qwen2.5-7B