metadata

tags:
  - text-classification
  - language-identification
inference: false
license: cc-by-sa-3.0
language: multilingual
library_name: staticvectors
base_model:
  - NeuML/language-id

Language Detection with StaticVectors

This model is an export of this FastText Language Identification model for staticvectors. staticvectors enables running inference in Python with NumPy. This helps it maintain solid runtime performance.

Language detection is an important task and identification with n-gram models is an efficient and highly accurate way to do it.

This model is a quantized version of the base language id model. It's using 2x256 Product Quantization like the original quantized model from FastText. This shrinks this model down to 4MB with only a minor hit on accuracy.

Usage with StaticVectors

from staticvectors import StaticVectors

model = StaticVectors("neuml/language-id-quantized")
model.predict(["What language is this text?"])