andriadze/ai-chat-censor6

Text Classification

Generated from Trainer

andriadze committed on Oct 7, 2024

Commit 35137d0 · verified · 1 parent: 46010ee

Update README.md

Files changed (1)

README.md +11 -10

@@ -16,24 +16,25 @@ should probably proofread and complete it, then remove this comment. -->
 # ai-chat-censor6
-This model is a fine-tuned version of [distilbert/distilbert-base-uncased](https://huggingface.co/distilbert/distilbert-base-uncased) on an unknown dataset.
-It achieves the following results on the evaluation set:
-- Loss: 0.0637
-- Accuracy: 0.9903
-## Model description
-More information needed
-## Intended uses & limitations
-More information needed
 ## Training and evaluation data
-More information needed
-## Training procedure
 ### Training hyperparameters

 # ai-chat-censor6
+The primary focus of the model is detecting sexual/minors category of messages.
+Possible flags are: regular, racist, underage, sexual
+# BEWARE
+The model might categorize any talk about race as racism, for example: "Black people suffer so much in America" will be flagged as "racist".
 ## Training and evaluation data
+Model was trained on a fully synthetic dataset generated by uncensored 72b models based on qwen2.
+This model is a fine-tuned version of [distilbert/distilbert-base-uncased](https://huggingface.co/distilbert/distilbert-base-uncased) on an unknown dataset.
+It achieves the following results on the evaluation set:
+- Loss: 0.0637
+- Accuracy: 0.9903
 ### Training hyperparameters
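
The BEWARE note added in this commit suggests that a "racist" flag should not be acted on blindly. The following is a minimal sketch of one way a caller might handle that, not part of the model or its README. It assumes the classifier output is a list of `{"label", "score"}` dicts (the shape the Transformers text-classification pipeline returns when all scores are requested) and that the label strings match the flags listed above. The function name `decide_flag` and the `review_threshold` value of 0.95 are hypothetical.

```python
from typing import Any, Dict, List, Tuple

# Flags listed in the README; whether the model emits exactly these
# label strings is an assumption.
FLAGS = ("regular", "racist", "underage", "sexual")


def decide_flag(
    scores: List[Dict[str, Any]],
    review_threshold: float = 0.95,  # hypothetical cutoff, not from the README
) -> Tuple[str, bool]:
    """Return the top-scoring flag and whether it should get a human review.

    A "racist" flag below the threshold is marked for review, because the
    README warns that neutral talk about race may be flagged as racist.
    """
    if not scores:
        raise ValueError("scores must contain at least one label")
    top = max(scores, key=lambda item: item["score"])
    label = top["label"]
    if label not in FLAGS:
        raise ValueError(f"unexpected label: {label!r}")
    needs_review = label == "racist" and top["score"] < review_threshold
    return label, needs_review


# Example with made-up scores (not real model output).
example_scores = [
    {"label": "regular", "score": 0.15},
    {"label": "racist", "score": 0.80},
    {"label": "underage", "score": 0.02},
    {"label": "sexual", "score": 0.03},
]
print(decide_flag(example_scores))
```

In practice the scores would come from the model itself, and the threshold would need tuning on real traffic rather than the placeholder used here.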