This open source version performs differently from the official trial version. Are these two different versions?
#12
by
ZHUYONGJUN
- opened
The same prompt gives inconsistent results between the open-source version and the trial version at https://chat.deepseek.com/. The trial version on the official website performs much better, while the open-source version performs very poorly. Are these two different versions?
ZHUYONGJUN
changed discussion title from
这个开源版本与官网试用版本表现效果不一样,这是两个不同的版本吗?
to This open source version performs differently from the official trial version. Are these two different versions?
Reduce the repetition penalty to 1. The generated code will be much better and will closely resemble what the website produces (tested multiple times with Pong and Snake).
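For anyone wondering why a penalty of 1 matters: below is a minimal sketch of the commonly used repetition-penalty rule (scores of already-generated tokens are divided by the penalty when positive and multiplied by it when negative). The function name `apply_repetition_penalty` and the toy logits are made up for illustration, and this is not necessarily the exact code the model's serving stack runs. It only shows that a penalty of 1.0 leaves scores unchanged, while a larger value pushes down tokens that code legitimately repeats (variable names, brackets, indentation).

```python
def apply_repetition_penalty(logits, generated_ids, penalty):
    """Return a copy of `logits` with already-generated tokens penalized.

    Hypothetical helper for illustration: positive scores are divided by
    `penalty`, negative scores are multiplied by it, so penalty > 1 always
    makes a previously seen token less likely.
    """
    if penalty <= 0:
        raise ValueError("penalty must be positive")
    out = list(logits)
    for tok in set(generated_ids):
        score = out[tok]
        out[tok] = score / penalty if score > 0 else score * penalty
    return out


# Toy vocabulary of three tokens; tokens 0 and 1 were already generated.
logits = [2.0, -1.0, 0.5]
seen = [0, 1]

print(apply_repetition_penalty(logits, seen, 1.0))  # penalty 1.0: scores unchanged
print(apply_repetition_penalty(logits, seen, 1.2))  # tokens 0 and 1 are pushed down
```

If you run the model through Hugging Face `transformers`, the corresponding setting is the `repetition_penalty` argument of `generate`, where 1.0 means no penalty. Other front ends may expose it under a different name.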
The official website https://chat.deepseek.com/ was updated in January with an SFT-optimized model, so it performs somewhat better than the open-source version. This model might be open-sourced in the future.
luofuli
changed discussion status to
closed