HaoyeZhang
commited on
Commit
β’
e0c7f4a
1
Parent(s):
fe64c14
Update README.md
Browse files
README.md
CHANGED
@@ -10,17 +10,17 @@ language:
|
|
10 |
|
11 |
[GitHub ](https://github.com/RLHF-V/RLAIF-V)
|
12 |
|
13 |
-
**RLAIF-V-12B** is a model exhibits super GPT-4V trustworthiness
|
14 |
|
15 |
-
We utilize a novel framework,
|
16 |
|
17 |
|
18 |
## Model Details
|
19 |
|
20 |
### Key Features
|
21 |
|
22 |
-
* π
**Super GPT-4V Trustworthiness
|
23 |
-
* πͺ **Maintaining Well Performance on General Abilities**: On benchmarks tested with the general abilities (e.g.
|
24 |
|
25 |
<p align="center">
|
26 |
<img src="https://cdn-uploads.huggingface.co/production/uploads/6566e0c493e30c8a60048eb3/ypXZxb4HE-jDPJU9115bi.png" alt="fig1" width="90%"/>
|
|
|
10 |
|
11 |
[GitHub ](https://github.com/RLHF-V/RLAIF-V)
|
12 |
|
13 |
+
**RLAIF-V-12B** is a multimodal large language model (MLLM) that exhibits **super GPT-4V trustworthiness**. The model is built up on OmniLMM from the [MiniCPM-V](https://github.com/OpenBMB/MiniCPM-V) series.
|
14 |
|
15 |
+
We utilize a novel framework, [RLAIF-V](https://github.com/RLHF-V/RLAIF-V), which **aligns MLLMs in a fully open-source paradigm**. This framework maximally exploits the [open-source feedback](https://huggingface.co/datasets/HaoyeZhang/RLAIF-V-Dataset) from two key perspectives, including **high-quality feedback data** and an **online feedback learning algorithm**.
|
16 |
|
17 |
|
18 |
## Model Details
|
19 |
|
20 |
### Key Features
|
21 |
|
22 |
+
* π
**Super GPT-4V Trustworthiness**: By learning from open-source AI feedback, RLAIF-V-12B achieves super GPT-4V trustworthiness in both generative and discriminative tasks.
|
23 |
+
* πͺ **Maintaining Well Performance on General Abilities**: On benchmarks tested with the general abilities (e.g. LLaVA Bench, MMStar), RLAIF-V-12B also exhibits good performance.
|
24 |
|
25 |
<p align="center">
|
26 |
<img src="https://cdn-uploads.huggingface.co/production/uploads/6566e0c493e30c8a60048eb3/ypXZxb4HE-jDPJU9115bi.png" alt="fig1" width="90%"/>
|