Edit model card

Model Card for Model ID

A fine-tune of Google's ViT-384 model for multi-label image classification on tongue images.

Model Details

Model Description

The model will predict the presence/absence of three features; Cracks, Red Dots and Toothmarks.

Model type: Vision Transformer
Finetuned from model [optional]: https://huggingface.co/google/vit-base-patch16-384

Downloads last month: 29

Safetensors

Model size

86.1M params

Tensor type

F32

Inference Examples

Image Classification

This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.

e1010101
/

vit-384-tongue-image

Model Card for Model ID

Model Details

Model Description

Space using e1010101/vit-384-tongue-image 1