Spaces:
Runtime error
Runtime error
title: Perplexity Lenses | |
emoji: 🌸 | |
colorFrom: pink | |
colorTo: blue | |
sdk: streamlit | |
app_file: app.py | |
pinned: false | |
# Installation: | |
Requires Python >= 3.7 and < 3.10 | |
``` | |
pip install . | |
``` | |
Or with [poetry](https://python-poetry.org/) | |
``` | |
poetry install | |
``` | |
# Web App: | |
The app is hosted [here](https://huggingface.co/spaces/edugp/perplexity-lenses). To run it locally: | |
``` | |
python -m streamlit run app.py | |
``` | |
# CLI: | |
The CLI with no arguments defaults to running mc4 in Spanish. | |
For full usage: | |
``` | |
python cli.py --help | |
``` | |
Example: Running on 1000 sentences extracted from Spanish OSCAR docs specifying all arguments: | |
``` | |
python cli.py \ | |
--dataset oscar \ | |
--dataset-config unshuffled_deduplicated_es \ | |
--dataset-split train \ | |
--text-column text \ | |
--language es \ | |
--doc-type sentence \ | |
--sample 1000 \ | |
--dimensionality-reduction umap \ | |
--model-name distiluse-base-multilingual-cased-v1 \ | |
--output-file perplexity.html | |
``` | |
# Tests: | |
``` | |
python -m unittest discover -s ./tests/ -p "test_*.py" | |
``` | |