deepseek-ai/DeepSeek-R1-Distill-Qwen-7B
Text Generation
•
Updated
•
411k
•
370
Additionally, we feed generated with structured prediction JSON data and feed them and text into DeepSeek-R1 Llama 70B to generate a chain of thought that can explain the extraction process.
Why don't you use R1 original (>600B) to get the best results?