---
library_name: transformers
tags: []
---
# MarcoroCapy-7B
This model is a DPO fine-tune of [mlabonne/Marcoro14-7B-slerp](https://huggingface.co/mlabonne/Marcoro14-7B-slerp) on [argilla/distilabel-capybara-dpo-7k-binarized](https://huggingface.co/datasets/argilla/distilabel-capybara-dpo-7k-binarized).
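A minimal usage sketch with `transformers`; the repo id below is a placeholder, so substitute the actual Hub path. Since the chat template was realigned to ChatML (see Process below), `apply_chat_template` handles the turn formatting:

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "MarcoroCapy-7B"  # placeholder; replace with the actual Hub repo id
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto")

messages = [{"role": "user", "content": "Explain DPO in one sentence."}]
# The ChatML template wraps each turn in <|im_start|> ... <|im_end|> markers
input_ids = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

output = model.generate(input_ids, max_new_tokens=128)
print(tokenizer.decode(output[0][input_ids.shape[-1]:], skip_special_tokens=True))
```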
## Process
- Realigned the chat template to ChatML
- Trained for 1 epoch (a reproduction sketch follows this list)
- Learning rate of 5e-5
- Training took about 4.5 hours on a single H100
- Cost was ~$20
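For reference, a rough sketch of what a comparable run might look like with TRL's `DPOTrainer`, using the settings above. The output directory and batch size are assumptions not stated in this card, and the dataset's preference columns may need flattening into the plain `prompt`/`chosen`/`rejected` strings `DPOTrainer` expects:

```python
from datasets import load_dataset
from transformers import AutoModelForCausalLM, AutoTokenizer
from trl import DPOConfig, DPOTrainer

# Base model being aligned
base = "mlabonne/Marcoro14-7B-slerp"
model = AutoModelForCausalLM.from_pretrained(base)
tokenizer = AutoTokenizer.from_pretrained(base)

# Preference pairs; may require mapping into prompt/chosen/rejected strings
dataset = load_dataset("argilla/distilabel-capybara-dpo-7k-binarized", split="train")

config = DPOConfig(
    output_dir="MarcoroCapy-7B",    # assumed output path
    num_train_epochs=1,             # one epoch, per the list above
    learning_rate=5e-5,             # learning rate from the list above
    per_device_train_batch_size=4,  # assumption; not stated in the card
)

trainer = DPOTrainer(
    model=model,
    args=config,
    train_dataset=dataset,
    processing_class=tokenizer,     # older trl releases take tokenizer= instead
)
trainer.train()
```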
## GGUF
TODO
## Evaluations
TODO