---
title: MultiModal Phi2
emoji: π
colorFrom: blue
colorTo: red
sdk: gradio
sdk_version: 3.35.2
app_file: app.py
pinned: false
license: mit
---
## Phi2: Multimodal Finetuning
### Details
1. LLM Backbone: Phi2
2. Vision Tower: clip-vit-large-patch14-336
3. Audio Model: Whisper
4. Pretraining Dataset: LAION-CC-SBU dataset with BLIP captions (200k samples)
5. Finetuning Dataset: Instruct 150k dataset based on COCO
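
The components listed above imply some fixed shapes. The sketch below derives the number of image patch tokens from the vision tower name (336-pixel input, 14-pixel patches) and estimates the size of a projector that maps CLIP features into the Phi2 embedding space. The single linear layer is an assumption made for illustration only, since this README does not describe the projector; `VisionTowerShape` and `linear_projector_params` are hypothetical helper names, and the hidden sizes (1024 for CLIP ViT-L, 2560 for Phi2) come from the published model configurations, not from this repository.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class VisionTowerShape:
    """Shape facts for clip-vit-large-patch14-336 (hypothetical helper)."""

    image_size: int   # input resolution in pixels (336, from the model name)
    patch_size: int   # side of each square patch in pixels (14, from the model name)
    hidden_size: int  # width of each patch feature vector (1024 for CLIP ViT-L)

    @property
    def patches_per_side(self) -> int:
        return self.image_size // self.patch_size

    @property
    def num_patch_tokens(self) -> int:
        # One token per patch; the extra CLS token is not counted here.
        return self.patches_per_side ** 2


def linear_projector_params(in_features: int, out_features: int) -> int:
    """Weights plus biases of one fully connected layer.

    Assumption: a single linear projector, used only to illustrate scale.
    """
    return in_features * out_features + out_features


clip = VisionTowerShape(image_size=336, patch_size=14, hidden_size=1024)
PHI2_HIDDEN_SIZE = 2560  # embedding width of Phi2

print(clip.patches_per_side)   # 24
print(clip.num_patch_tokens)   # 576
print(linear_projector_params(clip.hidden_size, PHI2_HIDDEN_SIZE))  # 2624000
```

Under these assumptions each image contributes 576 tokens to the LLM context, which is worth keeping in mind when choosing the maximum sequence length for finetuning.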
### Design

### Pretraining
#### Training Loss Curve

#### Learning Rate

#### Training Logs

### Finetuning
#### Training Loss Curve

#### Learning Rate

#### Training Logs

### Results
