wubingheng committed • Commit bca8d24 • Parent(s): c18a51f
Update README.md

README.md CHANGED
@@ -12,7 +12,7 @@ pipeline_tag: text-generation
 library_name: transformers
 ---
 
-
+## **Base model: Doge 197M**
 
 Doge is an ongoing research project where we aim to train a series of small language models to further explore whether the Transformer framework allows for more complex feedforward network structures, enabling the model to have fewer cache states and larger knowledge capacity.
 
@@ -68,10 +68,11 @@ In addition, Doge uses Inner Function Attention with Dynamic Mask as sequence transformation
 ...     tokenizer=tokenizer,
 ...     generation_config=generation_config,
 ...     streamer=steamer
-... )
+... )
+```
 
 **Fine-tune Task**:
-We selected an open-source Chinese medical question answering dataset for fine-tuning.
+- We selected an open-source Chinese medical question answering dataset for fine-tuning.
 
 
 **Fine-tune Environment**:
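The `... )` lines in the hunk above are the tail of a standard `transformers` generation call rendered in Python-console style. As context for the change, here is a minimal, self-contained sketch of what the full snippet plausibly looks like; the checkpoint id, prompt, and generation settings are assumptions rather than details from the diff, and the variable is named `steamer` only to match the fragment's spelling.

```python
# Hedged sketch of the generation call the diff fragment belongs to.
# The model id, prompt, and sampling settings below are assumptions.
from transformers import AutoModelForCausalLM, AutoTokenizer, GenerationConfig, TextStreamer

model_id = "SmallDoge/Doge-197M"  # hypothetical checkpoint name
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id)

generation_config = GenerationConfig(
    max_new_tokens=128,
    do_sample=True,
    temperature=0.8,
)
# Prints tokens to stdout as they are generated; named `steamer`
# only to mirror the README fragment.
steamer = TextStreamer(tokenizer, skip_prompt=True)

prompt = "What are the common symptoms of a cold?"
inputs = tokenizer(prompt, return_tensors="pt")

outputs = model.generate(
    **inputs,
    tokenizer=tokenizer,
    generation_config=generation_config,
    streamer=steamer,
)
```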
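The diff names the fine-tuning task but not the dataset or the training setup. Below is a hedged sketch of how such a supervised fine-tune is commonly wired up with `datasets` and the `transformers` `Trainer`; the dataset id, column names, and hyperparameters are placeholders, not details taken from this commit.

```python
# Hedged sketch of supervised fine-tuning on a Chinese medical QA dataset.
# The dataset id, column schema, and hyperparameters are placeholders; the
# diff only states that an open-source Chinese medical QA dataset was used.
from datasets import load_dataset
from transformers import (
    AutoModelForCausalLM,
    AutoTokenizer,
    DataCollatorForLanguageModeling,
    Trainer,
    TrainingArguments,
)

model_id = "SmallDoge/Doge-197M"            # hypothetical base checkpoint
dataset_id = "some-org/chinese-medical-qa"  # placeholder dataset id

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id)
if tokenizer.pad_token is None:
    tokenizer.pad_token = tokenizer.eos_token  # the collator needs a pad token

raw = load_dataset(dataset_id, split="train")

def format_and_tokenize(example):
    # Assumes "question"/"answer" columns; adjust to the real schema.
    text = f"问题：{example['question']}\n回答：{example['answer']}"
    return tokenizer(text, truncation=True, max_length=1024)

train_ds = raw.map(format_and_tokenize, remove_columns=raw.column_names)

trainer = Trainer(
    model=model,
    args=TrainingArguments(
        output_dir="doge-197m-medical-sft",
        per_device_train_batch_size=8,
        num_train_epochs=2,
        learning_rate=3e-5,
        logging_steps=50,
    ),
    train_dataset=train_ds,
    data_collator=DataCollatorForLanguageModeling(tokenizer, mlm=False),
)
trainer.train()
```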