Qwen3-4B-Instruct-2507

Model Description

Qwen3-4B-Instruct-2507 is an updated non-thinking variant in the Qwen3 family, designed for instruction-following tasks without generating <think></think> reasoning blocks.
Trained for enhanced general capabilities—including logic, coding, math, science, and long-tail multilingual knowledge—while natively supporting sprawling 256K-token contexts.

Features

  • Instruction-tuned performance: Strong at prompts, logic, comprehension, coding.
  • Multilingual strength: Expanded long-tail coverage across many languages.
  • Massive context window: Handles up to 262,144 tokens natively.
  • Clean output: No thinking-mode parsing needed—just straight responses.

Use Cases

  • High-quality conversational agents and instruction following
  • Processing long documents, books, legal texts, and source code
  • Multilingual tasks or low-resource language scenarios

Inputs and Outputs

Input: Text prompts—questions, commands, code tasks—without any special thinking mode flags.
Output: Direct, context-aware responses—answers, explanations, code—with no internal thought annotations.


License

This model is released under the Creative Commons Attribution–NonCommercial 4.0 (CC BY-NC 4.0) license.
Non-commercial use, modification, and redistribution are permitted with attribution.
For commercial licensing, please contact dev@nexa.ai.

References

Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Collection including NexaAI/Qwen3-4B-Instruct-2507-npu-mobile