pt-sk committed on
Commit
b451002
1 Parent(s): c9af816

Update README.md

Files changed (1)
  1. README.md +1 -1
README.md CHANGED
@@ -3,7 +3,7 @@ license: mit
  datasets: pt-sk/imdb
  tags: ["PPO", "RLHF"]
  ---
- Fine-tuning a GPT-2 model on the IMDB dataset using Proximal Policy Optimization (PPO). The goal is to train the model to generate positive sentiment reviews. The training process utilizes the `trl` library for reinforcement learning, the `transformers` library for model handling, and `datasets` for dataset management.
+ GPT2-IMDB is pretrained on IMDB dataset. Aligning the model using Proximal Policy Optimization (PPO). The goal is to train the model to generate positive sentiment reviews. The training process utilizes the `trl` library for reinforcement learning, the `transformers` library for model handling, and `datasets` for dataset management.
  Implementation code is available here: [GitHub](https://github.com/sathishkumar67/GPT-2-IMDB-Sentiment-Fine-Tuning-with-PPO)
  ```python
  # Load model and tokenizer directly