gaodrew committed on
Commit 506726d · verified · 1 Parent(s): 398498e

Update README.md

  1. README.md +3 -0
README.md CHANGED
@@ -33,6 +33,7 @@ pipeline_tag: token-classification
 <a target="_blank" href="https://colab.research.google.com/github/williamgao1729/piiranha-quickstart/blob/main/piiranha_quickstart.ipynb">
 <img src="https://colab.research.google.com/assets/colab-badge.svg" alt="Open In Colab"/>
 </a>
+
 Piiranha is trained to **detect 17 types** of Personally Identifiable Information (PII) across six languages. It successfully **catches 98.27% of PII** tokens, with an overall classification **accuracy of 99.44%**.
 Piiranha is especially accurate at detecting passwords, emails (100%), phone numbers, and usernames.
 
@@ -42,7 +43,9 @@ Performance on PII vs. Non PII classification task:
 - **Specificity: 99.84%** (correctly identifies 99.84% of Non PII tokens)
 
 <img src="https://cloud-3i4ld6u5y-hack-club-bot.vercel.app/0home.png" alt="Akash Network logo" width="250"/>
+
 Piiranha was trained on an H100 GPU rented through the Akash Network (https://akash.network)
+
 ## Model Description
 Piiranha is a fine-tuned version of [microsoft/mdeberta-v3-base](https://huggingface.co/microsoft/mdeberta-v3-base).
 The context length is 256 Deberta tokens. If your text is longer than that, just split it up.
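
The README only says to "just split it up" and does not prescribe a method. One possible way to do that is sketched below, under stated assumptions: the input is a list of tokens already produced by the model's tokenizer, two positions are reserved for special tokens, and an optional overlap lets an entity that straddles a boundary appear whole in at least one window. The function name `split_into_windows` and the `reserved` and `overlap` parameters are hypothetical and not part of Piiranha.

```python
from typing import List, Sequence, TypeVar

T = TypeVar("T")


def split_into_windows(
    tokens: Sequence[T],
    max_len: int = 256,   # context length stated in the README
    reserved: int = 2,    # assumption: room for two special tokens per window
    overlap: int = 0,     # assumption: optional shared tokens between windows
) -> List[List[T]]:
    """Split a token sequence into consecutive windows that fit the context."""
    budget = max_len - reserved
    if budget <= 0:
        raise ValueError("max_len must be larger than reserved")
    if not 0 <= overlap < budget:
        raise ValueError("overlap must be in the range [0, max_len - reserved)")

    step = budget - overlap
    windows: List[List[T]] = []
    for start in range(0, len(tokens), step):
        windows.append(list(tokens[start:start + budget]))
        # Stop once the current window already reaches the end of the input.
        if start + budget >= len(tokens):
            break
    return windows


if __name__ == "__main__":
    fake_tokens = list(range(600))  # stand-in for real token IDs
    for i, window in enumerate(split_into_windows(fake_tokens, overlap=10)):
        print(i, window[0], window[-1], len(window))
```

With a non-zero overlap, the same token can be classified in two windows, so predictions for the shared positions need to be merged afterwards (for example, by keeping the prediction from the window where the token is farther from the edge). That merging step is left out of this sketch.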