Qwen3-4B-Instruct-2507

Model Description

Qwen3-4B-Instruct-2507 is an updated non-thinking variant in the Qwen3 family, designed for instruction-following tasks without generating <think></think> reasoning blocks.
Trained for enhanced general capabilities—including logic, coding, math, science, and long-tail multilingual knowledge—while natively supporting sprawling 256K-token contexts.

Features

Instruction-tuned performance: Strong at prompts, logic, comprehension, coding.
Multilingual strength: Expanded long-tail coverage across many languages.
Massive context window: Handles up to 262,144 tokens natively.
Clean output: No thinking-mode parsing needed—just straight responses.

Use Cases

High-quality conversational agents and instruction following
Processing long documents, books, legal texts, and source code
Multilingual tasks or low-resource language scenarios

Inputs and Outputs

Input: Text prompts—questions, commands, code tasks—without any special thinking mode flags.
Output: Direct, context-aware responses—answers, explanations, code—with no internal thought annotations.

License

This model is released under the Creative Commons Attribution–NonCommercial 4.0 (CC BY-NC 4.0) license.
Non-commercial use, modification, and redistribution are permitted with attribution.
For commercial licensing, please contact dev@nexa.ai.

References

Model card: https://huggingface.co/Qwen/Qwen3-4B-Instruct-2507

Downloads last month: -; Downloads are not tracked for this model. How to track

Collection including NexaAI/Qwen3-4B-Instruct-2507-npu-mobile

Qualcomm NPU Mobile

Collection

Multimodal models running on Qualcomm NPU for Snapdragon8 Gen4 • 14 items • Updated 5 days ago