
	---
	license: apache-2.0
	datasets:
	- conll2003
	- ai4privacy/pii-masking-200k
	language:
	- en
	metrics:
	- accuracy
	- f1
	library_name: transformers
	pipeline_tag: token-classification
	---

	## Model Details

	### Model Description

	This model is ELECTRA-small fine-tuned for named entity recognition (NER). It currently predicts three entity types:
	1. Location
	2. Person
	3. Organization


	- Developed by: விபின் (Vipin)
	- Model type: Google's ELECTRA small discriminator
	- Language(s) (NLP): English
	- License: Apache 2.0
	- Finetuned from model: Google's ELECTRA small discriminator

	### Model Sources

	- Repository (base model): https://huggingface.co/google/electra-small-discriminator

	## Uses

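	This model uses a tokenizer from the DistilBERT family, which splits words into sub-word tokens. As a result, the model may predict different entities for different sub-words of the same word. Set `aggregation_strategy` to `"max"` when using the Transformers pipeline so that each word receives a single label.

	For example, the word `ashwin` may be split and labeled as:
	- `ash` => Person
	- `win` => Location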
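
To make the effect of `aggregation_strategy="max"` concrete, here is a minimal sketch of the idea in plain Python: the whole word takes the label of its single most confident sub-word. The scores and label names below are invented for illustration and are not real output from this model, and `aggregate_max` is a hypothetical helper, not a function from the Transformers library.

```python
from typing import Dict, List, Tuple

# Hypothetical per-sub-word scores for the word "ashwin".
# The numbers and label names are made up for illustration only.
subword_scores: List[Tuple[str, Dict[str, float]]] = [
    ("ash", {"Person": 0.81, "Location": 0.12, "O": 0.07}),
    ("win", {"Person": 0.35, "Location": 0.55, "O": 0.10}),
]


def aggregate_max(pieces: List[Tuple[str, Dict[str, float]]]) -> Tuple[str, str, float]:
    """Label a whole word with the label of its most confident sub-word."""
    best_label, best_score = "", -1.0
    for _piece, scores in pieces:
        # Top label and score for this sub-word.
        label, score = max(scores.items(), key=lambda item: item[1])
        if score > best_score:
            best_label, best_score = label, score
    word = "".join(piece for piece, _ in pieces)
    return word, best_label, best_score


word, label, score = aggregate_max(subword_scores)
print(word, label, score)
```

In this made-up case, taken one sub-word at a time, `ash` would be tagged Person and `win` Location; after aggregation the word gets one label, taken from the sub-word with the highest score.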


	### Out-of-Scope Use

	The model may not work well on some long sentences.
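
One possible workaround, sketched here as an assumption rather than a tested recommendation, is to tag long text one sentence at a time and shift the character offsets back to the original string. `ner_by_sentence` is a hypothetical helper, the regex splitter is deliberately naive (it will break on abbreviations such as "Mr."), and `tagger` stands for any callable that returns dicts with `start` and `end` keys, as the Transformers NER pipeline does when offsets are available.

```python
import re
from typing import Callable, Dict, List


def ner_by_sentence(text: str, tagger: Callable[[str], List[Dict]]) -> List[Dict]:
    """Run `tagger` on each sentence and shift offsets back to `text`."""
    results: List[Dict] = []
    # Naive splitter: a sentence is a run of text ending in '.', '!' or '?'.
    for match in re.finditer(r"[^.!?]+[.!?]*", text):
        sentence = match.group()
        if not sentence.strip():
            continue
        for entity in tagger(sentence):
            shifted = dict(entity)
            # Offsets from the tagger are relative to the sentence.
            shifted["start"] = entity["start"] + match.start()
            shifted["end"] = entity["end"] + match.start()
            results.append(shifted)
    return results


# Usage with the pipeline object `nlp` from the getting-started code:
# entities = ner_by_sentence(long_text, nlp)
```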


	## How to Get Started with the Model

	Use the code below to get started with the model.

	```
	from transformers import AutoModelForTokenClassification, AutoTokenizer, pipeline

	# Load the fine-tuned model and its tokenizer from the Hugging Face Hub.
	model = AutoModelForTokenClassification.from_pretrained("rv2307/electra-small-ner")
	tokenizer = AutoTokenizer.from_pretrained("rv2307/electra-small-ner")

	# "max" aggregation gives each word the label of its highest-scoring sub-word.
	nlp = pipeline(
	    "ner",
	    model=model,
	    tokenizer=tokenizer,
	    device="cpu",
	    aggregation_strategy="max",
	)

	```
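
With an aggregation strategy set, the pipeline returns a list of dicts with the keys `entity_group`, `score`, `word`, `start`, and `end`. The sketch below groups such output by entity type. `group_by_type` is a hypothetical helper, and `sample_output` is a hand-written stand-in: the label names (`PER`, `ORG`, `LOC`) and scores are assumptions, not real output from this model.

```python
from collections import defaultdict
from typing import Dict, List


def group_by_type(entities: List[Dict]) -> Dict[str, List[str]]:
    """Collect entity text by predicted type from aggregated pipeline output."""
    grouped = defaultdict(list)
    for entity in entities:
        grouped[entity["entity_group"]].append(entity["word"])
    return dict(grouped)


# Stand-in for nlp("ashwin works for google in london").
# Label names and scores are assumed for illustration.
sample_output = [
    {"entity_group": "PER", "score": 0.99, "word": "ashwin", "start": 0, "end": 6},
    {"entity_group": "ORG", "score": 0.97, "word": "google", "start": 17, "end": 23},
    {"entity_group": "LOC", "score": 0.98, "word": "london", "start": 27, "end": 33},
]

print(group_by_type(sample_output))
```

With the real pipeline, the same helper would be called as `group_by_type(nlp("your sentence here"))`; check `model.config.id2label` for the label names this model actually uses.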

	## Training Details

	### Training Procedure

	This model was trained for 6 epochs with a learning rate of 3e-4.

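	Training progress: 39168/39168 steps, 41:18 elapsed, epoch 6/6.

	| Step | Training Loss | Validation Loss | Precision | Recall | F1 | Accuracy |
	|------|---------------|-----------------|-----------|--------|----|----------|
	| 10000 | 0.086300 | 0.088625 | 0.863476 | 0.876271 | 0.869827 | 0.972581 |
	| 20000 | 0.059800 | 0.079611 | 0.894612 | 0.884521 | 0.889538 | 0.976563 |
	| 30000 | 0.050400 | 0.074552 | 0.895812 | 0.902591 | 0.899188 | 0.978380 |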
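
As a quick sanity check on the logged metrics above, F1 is the harmonic mean of precision and recall, so it can be recomputed from the other two columns. The sketch below does that for the three logged steps and also derives the number of steps per epoch from the progress line; the values are copied from the log, and `f1_score` is a small helper written for this example.

```python
def f1_score(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall."""
    return 2 * precision * recall / (precision + recall)


# (step, precision, recall, logged F1) copied from the training log.
log_rows = [
    (10000, 0.863476, 0.876271, 0.869827),
    (20000, 0.894612, 0.884521, 0.889538),
    (30000, 0.895812, 0.902591, 0.899188),
]

for step, precision, recall, logged_f1 in log_rows:
    recomputed = f1_score(precision, recall)
    print(f"step {step}: recomputed F1 = {recomputed:.6f}, logged F1 = {logged_f1:.6f}")

# 39168 total steps over 6 epochs.
steps_per_epoch = 39168 // 6
print("steps per epoch:", steps_per_epoch)
```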

	## Evaluation

	The validation loss for this model is about 0.07 (0.074552 at step 30000).