Model Card for Model ID
A fine-tune of Google's ViT model (vit-base-patch16-384) for multi-label classification of tongue images.
Model Details
Model Description
The model predicts the presence or absence of three features: Cracks, Red Dots, and Toothmarks.
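Multi-label classifiers usually score each label independently, so one image can show any combination of the three features. The sketch below illustrates that decoding step, assuming the model emits one raw logit per feature. The label order in `LABELS`, the helper name `decode_multilabel`, and the 0.5 threshold are assumptions for illustration and are not stated in this model card.

```python
import math

# Assumed label order; the card does not state the actual index-to-label mapping.
LABELS = ["Cracks", "Red Dots", "Toothmarks"]


def sigmoid(x):
    """Numerically stable logistic function."""
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    e = math.exp(x)
    return e / (1.0 + e)


def decode_multilabel(logits, threshold=0.5):
    """Map one raw logit per label to a present/absent decision.

    Each label is thresholded independently (no softmax across labels),
    which is what distinguishes multi-label from single-label decoding.
    The 0.5 threshold is an assumed default, not a value from the card.
    """
    if len(logits) != len(LABELS):
        raise ValueError(f"expected {len(LABELS)} logits, got {len(logits)}")
    return {label: sigmoid(z) >= threshold for label, z in zip(LABELS, logits)}


# Hypothetical logits for a single image, in the assumed label order.
example = decode_multilabel([2.0, -1.5, 0.3])
print(example)
```

With real model outputs, the mapping stored in the checkpoint's configuration should replace the assumed `LABELS` order, and the threshold can be tuned per label on a validation set.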
- Model type: Vision Transformer
- Fine-tuned from model: https://huggingface.co/google/vit-base-patch16-384