license: apache-2.0 | |
base_model: HuggingFaceTB/SmolLM-135M-Instruct | |
tags: | |
- trl | |
- dpo | |
- generated_from_trainer | |
model-index: | |
- name: output | |
results: [] | |
smollm-135m-instruct but more conversational |
license: apache-2.0 | |
base_model: HuggingFaceTB/SmolLM-135M-Instruct | |
tags: | |
- trl | |
- dpo | |
- generated_from_trainer | |
model-index: | |
- name: output | |
results: [] | |
smollm-135m-instruct but more conversational |