Model Card for Model ID
Summary
This is a preference tuned model for text completion based on Phi 1.5. It has been tuned on a filtered version of the The Complete Works of William Shakespeare, which can be found and downloaded from here: https://www.gutenberg.org/ebooks/100.
Model Description
- Developed by: Course Organizers
- Finetuned from model: microsoft/phi-1_5
Training Details
This model has been trained using the TRL library and OPROTrainer class from Huggingface.
Training Data
The Complete Works of William Shakespeare, which can be found and downloaded from here: https://www.gutenberg.org/ebooks/100
Training Hyperparameters
The following hyperparameters were used during training:
- learning_rate: 1e-06
- per_device_train_batch_size: 1
- lr_scheduler_type: cosine
- weight_decay: 0.01
- num_epochs: 1
Framework Versions
- accelerate==0.26.1
- datasets==2.16.1
- transformers==4.45.2
- trl==0.11.2
Compute Infrastructure and Hardware
Slurm cluster with 8 x H100 Nvidia GPUs.
- Downloads last month
- 86
Model tree for course-genai-w24/week5-phi-1.5-pref-shakespeare
Base model
microsoft/phi-1_5