Occupational CANINE: HISCO Classification Model
Overview
OccCANINE is a version of CANINE which has been finetuned to automatically convert occupational descriptions into standardized HISCO codes using a CANINE model. This tool facilitates historical occupational data analysis with over 90% accuracy across 13 languages.
See more on: GitHub.com/christianvedels/OccCANINE
Read the paper on arXiv: https://arxiv.org/abs/2402.13604
Key Features
- High Accuracy: Over 90% accuracy, recall, and precision.
- Multilingual Support: Trained on 14 million description-HISCO code pairs across 13 languages.
- Efficiency: Rapidly processes descriptions into HISCO codes.
Contribution and Support
Developed at the University of Southern Denmark by Christian Møller Dahl, Torben Johansen and Christian Vedel with contributions from various sources.
- Downloads last month
- 126
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social
visibility and check back later, or deploy to Inference Endpoints (dedicated)
instead.