Text Generation
Transformers
Safetensors
English
omnilmm
conversational
Inference Endpoints
HaoyeZhang commited on
Commit
e0c7f4a
β€’
1 Parent(s): fe64c14

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +4 -4
README.md CHANGED
@@ -10,17 +10,17 @@ language:
10
 
11
  [GitHub ](https://github.com/RLHF-V/RLAIF-V)
12
 
13
- **RLAIF-V-12B** is a model exhibits super GPT-4V trustworthiness. The model is built on the SFT version of OmniLMM-12B, which is one of the first version of MiniCPM-V series.
14
 
15
- We utilize a novel framework, **RLAIF-V**, which **aligns MLLMs in a fully open-source paradigm**. This alignment framework maximally exploits the open-source feedback from two key perspectives, including **high-quality feedback data** and an **online feedback learning algorithm**.
16
 
17
 
18
  ## Model Details
19
 
20
  ### Key Features
21
 
22
- * πŸ… **Super GPT-4V Trustworthiness via Open-source Feedback**. By learning from open-source AI feedback, RLAIF-V 12B achieves super GPT-4V trustworthiness in both generative and discriminative tasks.
23
- * πŸ’ͺ **Maintaining Well Performance on General Abilities**: On benchmarks tested with the general abilities (e.g. LLaVABench, MMStar), RLAIF-V-12B also exhibits good performance.
24
 
25
  <p align="center">
26
  <img src="https://cdn-uploads.huggingface.co/production/uploads/6566e0c493e30c8a60048eb3/ypXZxb4HE-jDPJU9115bi.png" alt="fig1" width="90%"/>
 
10
 
11
  [GitHub ](https://github.com/RLHF-V/RLAIF-V)
12
 
13
+ **RLAIF-V-12B** is a multimodal large language model (MLLM) that exhibits **super GPT-4V trustworthiness**. The model is built up on OmniLMM from the [MiniCPM-V](https://github.com/OpenBMB/MiniCPM-V) series.
14
 
15
+ We utilize a novel framework, [RLAIF-V](https://github.com/RLHF-V/RLAIF-V), which **aligns MLLMs in a fully open-source paradigm**. This framework maximally exploits the [open-source feedback](https://huggingface.co/datasets/HaoyeZhang/RLAIF-V-Dataset) from two key perspectives, including **high-quality feedback data** and an **online feedback learning algorithm**.
16
 
17
 
18
  ## Model Details
19
 
20
  ### Key Features
21
 
22
+ * πŸ… **Super GPT-4V Trustworthiness**: By learning from open-source AI feedback, RLAIF-V-12B achieves super GPT-4V trustworthiness in both generative and discriminative tasks.
23
+ * πŸ’ͺ **Maintaining Well Performance on General Abilities**: On benchmarks tested with the general abilities (e.g. LLaVA Bench, MMStar), RLAIF-V-12B also exhibits good performance.
24
 
25
  <p align="center">
26
  <img src="https://cdn-uploads.huggingface.co/production/uploads/6566e0c493e30c8a60048eb3/ypXZxb4HE-jDPJU9115bi.png" alt="fig1" width="90%"/>