output / README.md
cappuch's picture
Update README.md
d49b36f verified
metadata
license: apache-2.0
base_model: HuggingFaceTB/SmolLM-135M-Instruct
tags:
  - trl
  - dpo
  - generated_from_trainer
model-index:
  - name: output
    results: []

smollm-135m-instruct but more conversational