gaunernst committed on
Commit
59db684
1 Parent(s): 787dde8

Update README.md

Files changed (1)
  1. README.md +3 -5
README.md CHANGED
@@ -8,11 +8,9 @@ pipeline_tag: audio-classification
 
 A Vision Transformer (ViT) for audio. Pretrained on AudioSet-2M with Self-Supervised Masked Autoencoder (MAE) method, and fine-tuned on AudioSet-20k.
 
-This is a port of AudioMAE ViT-B/32 weights for usage with `timm`. The naming convention is adopted from other `timm`'s ViT models.
-
-See the original repo here: https://github.com/facebookresearch/AudioMAE
-
-For the AudioSet-2M pre-trained checkpoint (without Audioset-20k fine-tuning), see https://huggingface.co/gaunernst/vit_base_patch16_1024_128.audiomae_as2m
+- This is a port of AudioMAE ViT-B/32 weights for usage with `timm`. The naming convention is adopted from other `timm`'s ViT models.
+- See the original repo here: https://github.com/facebookresearch/AudioMAE
+- For the AudioSet-2M pre-trained checkpoint (without Audioset-20k fine-tuning), see https://huggingface.co/gaunernst/vit_base_patch16_1024_128.audiomae_as2m
 
 
 ## Model Details