star / README.md
jahuerta92's picture
Update README.md
e2a08d0 verified
---
pipeline_tag: feature-extraction
---
# Style Transformer for Authorship Representations - STAR
This is the repository for the [Style Transformer for Authorship Representations (STAR)](https://arxiv.org/abs/2310.11081) model. We present the weights of our model here.
Also check out our [github repo for STAR](https://github.com/jahuerta92/star) for replication.
## Feature extraction
```python
tokenizer = AutoTokenizer.from_pretrained('roberta-large')
model = AutoModel.from_pretrained('AIDA-UPM/star')
examples = ['My text 1', 'This is another text']
def extract_embeddings(texts):
encoded_texts = tokenizer(texts)
with torch.no_grad():
style_embeddings = model(encoded_texts.input_ids,
attention_mask=encoded_texts.attention_mask).pooler_output
return style_embeddings
print(extract_embeddings(examples))
```
## Citation
```
@article{Huertas-Tato2023Oct,
author = {Huertas-Tato, Javier and Martin, Alejandro and Camacho, David},
title = {{Understanding writing style in social media with a supervised contrastively pre-trained transformer}},
journal = {arXiv},
year = {2023},
month = oct,
eprint = {2310.11081},
doi = {10.48550/arXiv.2310.11081}
}
```