README.md · huggingtweets/dotcsv at 3966812bf1d8a2f2403f07269fd46132dd42c11a

metadata

language: en
thumbnail: https://github.com/borisdayma/huggingtweets/blob/master/img/logo.png?raw=true
tags:
  - huggingtweets
widget:
  - text: My dream is

Carlos Santana - DotCSV 🧠🤖 🤖 AI Bot

@dotcsv bot

I was made with huggingtweets.

Create your own bot based on your favorite user with the demo!

How does it work?

The model uses the following pipeline.

To understand how the model was developed, check the W&B report.

Training data

The model was trained on @dotcsv's tweets.

Data	Quantity
Tweets downloaded	3219
Retweets	1037
Short tweets	238
Tweets kept	1944

Explore the data, which is tracked with W&B artifacts at every step of the pipeline.

Training procedure

The model is based on a pre-trained GPT-2 which is fine-tuned on @dotcsv's tweets.

Hyperparameters and metrics are recorded in the W&B training run for full transparency and reproducibility.

At the end of training, the final model is logged and versioned.

How to use

You can use this model directly with a pipeline for text generation:

from transformers import pipeline
generator = pipeline('text-generation',
                     model='huggingtweets/dotcsv')
generator("My dream is", num_return_sequences=5)

Limitations and bias

The model suffers from the same limitations and bias as GPT-2.

In addition, the data present in the user's tweets further affects the text generated by the model.

About

Built by Boris Dayma

For more details, visit the project repository.