sfarrukh committed on
Commit 46ae0d9 · verified · 1 Parent(s): f61f746

Update README.md

Files changed (1)
  1. README.md +15 -8
README.md CHANGED
@@ -45,19 +45,23 @@ should probably proofread and complete it, then remove this comment. -->

  # modernbert-setfit-nli

- This model was trained from scratch on an unknown dataset.
-
- ## Model description
-
- More information needed
-
- ## Intended uses & limitations
-
- More information needed
-
- ## Training and evaluation data
-
- More information needed
+ ## Model Description
+
+ This model is a fine-tuned version of `answerdotai/ModernBERT-base` trained on a subset of the SetFit/mnli dataset. It is designed for natural language inference (NLI) tasks, where the goal is to determine the relationship between two text inputs (e.g., entailment, contradiction, or neutrality).
+
+ ## Intended Uses & Limitations
+
+ ### Intended Uses
+ - **Natural Language Inference (NLI):** Suitable for classifying relationships between pairs of sentences.
+ - **Text Understanding Tasks:** Can be applied to other similar tasks requiring sentence-pair classification.
+
+ ### Limitations
+ - **Dataset-Specific Biases:** The model was fine-tuned on 30,000 samples from the SetFit/mnli dataset and may not generalize well to domains significantly different from the training data.
+ - **Context Length:** The tokenizer’s maximum sequence length is 512 tokens. Inputs longer than this will be truncated.
+ - **Resource Intensive:** May require a modern GPU for efficient inference on large datasets.
+
+ This model is a starting point for NLI tasks and may need further fine-tuning for domain-specific applications.

  ## Training procedure

@@ -77,4 +81,7 @@ The following hyperparameters were used during training:
  - Transformers 4.48.0
  - Pytorch 2.5.1+cu121
  - Datasets 3.2.0
- - Tokenizers 0.21.0
+ - Tokenizers 0.21.0
+ ## References
+
+ - **GitHub Repository:** The training code is available at [GitHub Repository Link](https://github.com/sfarrukhm/model_finetune.git).
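
The updated card describes sentence-pair NLI classification as the intended use. Below is a minimal inference sketch, assuming the checkpoint is published on the Hub under a hypothetical repo id `sfarrukh/modernbert-setfit-nli` and exposes a standard `transformers` sequence-classification head; neither detail is stated in this diff, so adjust both before use.

```python
import torch
from transformers import AutoTokenizer, AutoModelForSequenceClassification

# Hypothetical Hub id -- substitute the actual repo id of this checkpoint.
model_id = "sfarrukh/modernbert-setfit-nli"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForSequenceClassification.from_pretrained(model_id)
model.eval()

premise = "A soccer game with multiple males playing."
hypothesis = "Some men are playing a sport."

# Encode the premise/hypothesis pair; truncate to the 512-token limit
# noted in the card's limitations section.
inputs = tokenizer(
    premise,
    hypothesis,
    truncation=True,
    max_length=512,
    return_tensors="pt",
)

with torch.no_grad():
    logits = model(**inputs).logits

pred_id = logits.argmax(dim=-1).item()
# The label mapping (entailment / neutral / contradiction) is read from the
# checkpoint's config; verify id2label on the actual model before relying on it.
print(model.config.id2label[pred_id])
```

For single sentence pairs a CPU is sufficient; the "resource intensive" caveat in the card mainly concerns batch inference over large datasets.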
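The limitations section mentions fine-tuning on 30,000 samples of SetFit/mnli. A sketch of drawing such a subset with the `datasets` library follows; it is purely illustrative, since the commit does not say how the samples were selected.

```python
from datasets import load_dataset

# Load the MNLI copy hosted under the SetFit organization on the Hub.
mnli = load_dataset("SetFit/mnli", split="train")

# Assumption: a shuffled 30,000-example subset; the actual sampling strategy
# used for this checkpoint is not documented in the diff.
subset = mnli.shuffle(seed=42).select(range(30_000))
print(subset)
```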