dotcsv / README.md
boris's picture
New model from https://wandb.ai/wandb/huggingtweets/runs/3g04fco4
3966812
|
raw
history blame
2.83 kB
metadata
language: en
thumbnail: https://github.com/borisdayma/huggingtweets/blob/master/img/logo.png?raw=true
tags:
  - huggingtweets
widget:
  - text: My dream is
Carlos Santana - DotCSV 🧠🤖 🤖 AI Bot
@dotcsv bot

I was made with huggingtweets.

Create your own bot based on your favorite user with the demo!

How does it work?

The model uses the following pipeline.

pipeline

To understand how the model was developed, check the W&B report.

Training data

The model was trained on @dotcsv's tweets.

Data Quantity
Tweets downloaded 3219
Retweets 1037
Short tweets 238
Tweets kept 1944

Explore the data, which is tracked with W&B artifacts at every step of the pipeline.

Training procedure

The model is based on a pre-trained GPT-2 which is fine-tuned on @dotcsv's tweets.

Hyperparameters and metrics are recorded in the W&B training run for full transparency and reproducibility.

At the end of training, the final model is logged and versioned.

How to use

You can use this model directly with a pipeline for text generation:

from transformers import pipeline
generator = pipeline('text-generation',
                     model='huggingtweets/dotcsv')
generator("My dream is", num_return_sequences=5)

Limitations and bias

The model suffers from the same limitations and bias as GPT-2.

In addition, the data present in the user's tweets further affects the text generated by the model.

About

Built by Boris Dayma

Follow

For more details, visit the project repository.

GitHub stars