Update README.md
Browse files
README.md
CHANGED
@@ -33,6 +33,7 @@ pipeline_tag: token-classification
|
|
33 |
<a target="_blank" href="https://colab.research.google.com/github/williamgao1729/piiranha-quickstart/blob/main/piiranha_quickstart.ipynb">
|
34 |
<img src="https://colab.research.google.com/assets/colab-badge.svg" alt="Open In Colab"/>
|
35 |
</a>
|
|
|
36 |
Piiranha is trained to **detect 17 types** of Personally Identifiable Information (PII) across six languages. It successfully **catches 98.27% of PII** tokens, with an overall classification **accuracy of 99.44%**.
|
37 |
Piiranha is especially accurate at detecting passwords, emails (100%), phone numbers, and usernames.
|
38 |
|
@@ -42,7 +43,9 @@ Performance on PII vs. Non PII classification task:
|
|
42 |
- **Specificity: 99.84%** (correctly identifies 99.84% of Non PII tokens)
|
43 |
|
44 |
<img src="https://cloud-3i4ld6u5y-hack-club-bot.vercel.app/0home.png" alt="Akash Network logo" width="250"/>
|
|
|
45 |
Piiranha was trained on an H100 GPU rented through the Akash Network (https://akash.network)
|
|
|
46 |
## Model Description
|
47 |
Piiranha is a fine-tuned version of [microsoft/mdeberta-v3-base](https://huggingface.co/microsoft/mdeberta-v3-base).
|
48 |
The context length is 256 Deberta tokens. If your text is longer than that, just split it up.
|
|
|
33 |
<a target="_blank" href="https://colab.research.google.com/github/williamgao1729/piiranha-quickstart/blob/main/piiranha_quickstart.ipynb">
|
34 |
<img src="https://colab.research.google.com/assets/colab-badge.svg" alt="Open In Colab"/>
|
35 |
</a>
|
36 |
+
|
37 |
Piiranha is trained to **detect 17 types** of Personally Identifiable Information (PII) across six languages. It successfully **catches 98.27% of PII** tokens, with an overall classification **accuracy of 99.44%**.
|
38 |
Piiranha is especially accurate at detecting passwords, emails (100%), phone numbers, and usernames.
|
39 |
|
|
|
43 |
- **Specificity: 99.84%** (correctly identifies 99.84% of Non PII tokens)
|
44 |
|
45 |
<img src="https://cloud-3i4ld6u5y-hack-club-bot.vercel.app/0home.png" alt="Akash Network logo" width="250"/>
|
46 |
+
|
47 |
Piiranha was trained on an H100 GPU rented through the Akash Network (https://akash.network)
|
48 |
+
|
49 |
## Model Description
|
50 |
Piiranha is a fine-tuned version of [microsoft/mdeberta-v3-base](https://huggingface.co/microsoft/mdeberta-v3-base).
|
51 |
The context length is 256 Deberta tokens. If your text is longer than that, just split it up.
|