Model Card for free-solar-dpo-v0.1
Developed by: Freewheelin AI Technical Team
Hardware and Software
- Training Factors: We fine-tuned this model using the Hugging Face TRL Trainer.
Method
- This model was trained using the learning method introduced in the SOLAR paper.
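The card does not spell out the training objective. The "dpo" in the model name suggests Direct Preference Optimization, so the sketch below shows the standard per-example DPO loss in plain Python as an illustration only. The function name `dpo_loss`, its argument names, and the default `beta=0.1` are assumptions made for this example; they are not taken from this model's training configuration or from the TRL API.

```python
import math


def dpo_loss(policy_chosen_logp, policy_rejected_logp,
             ref_chosen_logp, ref_rejected_logp, beta=0.1):
    """Per-example DPO loss from summed sequence log-probabilities.

    All names and the beta default are hypothetical, chosen for illustration.
    """
    # Log-ratio of policy vs. frozen reference model for each response.
    chosen_logratio = policy_chosen_logp - ref_chosen_logp
    rejected_logratio = policy_rejected_logp - ref_rejected_logp

    # Scaled margin by which the policy prefers the chosen response.
    logits = beta * (chosen_logratio - rejected_logratio)

    # Loss is -log(sigmoid(logits)), computed in a numerically stable way.
    if logits >= 0:
        return math.log1p(math.exp(-logits))
    return -logits + math.log1p(math.exp(logits))
```

When the policy and the reference model assign identical log-probabilities, the margin is zero and the loss equals log 2. The loss falls below that value as the policy shifts probability toward the chosen response relative to the reference.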
Base Model