收集繁體中文在語言模型上存在多國語言翻譯的資料集,例如:中轉英、中轉越南等。繁體中文與東亞、東南亞關係密切,需考量未來延展性
Heng-Shiou Sheu | 許恆修
Heng666
AI & ML interests
Graph Neural Learning
Recent Activity
liked
a model
5 days ago
1-800-BAD-CODE/xlm-roberta_punctuation_fullstop_truecase
upvoted
an
article
about 2 months ago
Hugging Face welcomes the Aya Expanse family of multilingual models
upvoted
a
collection
about 2 months ago
Traditional Chinese LLM Corpus
Organizations
Collections
6
spaces
16
models
31
Heng666/gemma-2b-GGUF
Updated
•
2
Heng666/paligemma_construction_safety
Updated
Heng666/my_awesome_billsum_model
Updated
Heng666/madlad400-10b-mt-ct2-int8
Updated
•
3
Heng666/madlad400-7b-bt-mt-ct2-int8
Updated
•
4
Heng666/madlad400-7b-mt-ct2-int8
Translation
•
Updated
•
35
•
3
Heng666/madlad400-3b-mt-ct2
Translation
•
Updated
•
8
Heng666/madlad400-3b-mt-ct2-int8
Translation
•
Updated
•
18
Heng666/NeuralPipe-7B-slerp
Text Generation
•
Updated
•
15
Heng666/phi-2-GGUF
Updated
•
2
datasets
11
Heng666/dot_embedding
Viewer
•
Updated
•
152
•
36
Heng666/Taiwan-patent-corpus
Viewer
•
Updated
•
28
•
39
•
1
Heng666/Taiwan-patent-qa
Viewer
•
Updated
•
1.22k
•
129
•
3
Heng666/Taiwan-patent-qa-eval
Viewer
•
Updated
•
192
•
61
•
2
Heng666/OpenSubtitles-TW-Corpus
Viewer
•
Updated
•
7.22M
•
55
•
2
Heng666/Traditional_Chinese-aya_evaluation_suite
Viewer
•
Updated
•
650
•
68
•
3
Heng666/Traditional_Chinese-aya_dataset
Viewer
•
Updated
•
4.91k
•
152
•
3
Heng666/Traditional_Chinese-aya_collection
Viewer
•
Updated
•
2.02M
•
2.2k
•
6
Heng666/MultiCCAligned-TW-Corpus
Viewer
•
Updated
•
3.13M
•
61
•
3
Heng666/Taoyuan-Airport-MRT-MT-Challenge
Viewer
•
Updated
•
1.14k
•
60