Update README.md
README.md
CHANGED
@@ -14,7 +14,7 @@ inference:
 license: apache-2.0
 ---
 # Wenzhong2.0-GPT2-3.5B model (chinese),one model of [Fengshenbang-LM](https://github.com/IDEA-CCNL/Fengshenbang-LM).
-As we all know, the single direction language model based on decoder structure has strong generation ability, such as GPT model. The 3.5 billion parameter Wenzhong-GPT2-3.5B large model, using 100G chinese common data, 32 A100 training for 28 hours, is the largest open source **GPT2 large model of chinese**. **Our model performs well in Chinese continuation generation.** **
+As we all know, the single direction language model based on decoder structure has strong generation ability, such as GPT model. The 3.5 billion parameter Wenzhong-GPT2-3.5B large model, using 100G chinese common data, 32 A100 training for 28 hours, is the largest open source **GPT2 large model of chinese**. **Our model performs well in Chinese continuation generation.** **Wenzhong2.0-GPT2-3.5B is a Chinese gpt2 model trained with cleaner data on the basis of Wenzhong-GPT2-3.5B.**
 
 ## Usage
 