- Cached Transformers: Improving Transformers with Differentiable Memory Cache
  Paper • 2312.12742 • Published • 13
- ProTIP: Progressive Tool Retrieval Improves Planning
  Paper • 2312.10332 • Published • 7
- Paloma: A Benchmark for Evaluating Language Model Fit
  Paper • 2312.10523 • Published • 12
- The FineWeb Datasets: Decanting the Web for the Finest Text Data at Scale
  Paper • 2406.17557 • Published • 91
daje kang
daje
AI & ML interests
None yet
Recent Activity
Updated a model 12 days ago: daje/Qwen2-VL-7B-instruct-ScienceQA
Published a model 12 days ago: daje/Qwen2-VL-7B-instruct-ScienceQA
Updated a model 13 days ago: daje/qwen2-7b-instruct-harmful-detector-8500
Organizations
None yet
Collections
1
Models
29
daje/Qwen2-VL-7B-instruct-ScienceQA • Updated • 3
daje/qwen2-7b-instruct-harmful-detector-8500 • Image-Text-to-Text • Updated • 1
daje/qwen2-7b-instruct-hamful-detector • Image-Text-to-Text • Updated • 5
daje/Qwen2.5-coder-7B-en-all-merged • Text Generation • Updated • 15
daje/Qwen2.5-coder-7B-ko-all • Updated
daje/llama3-8B-ko-all • Updated
daje/Qwen2.5-coder-7B-en-all • Updated
daje/ko-sql-Qwen-2.5-coder-7B-instruct-15000 • Updated
daje/ko-sql-Qwen-2.5-coder-7B-instruct-all • Updated
daje/ko-sql-Qwen-2.5-coder-7B-instruct • Updated
Datasets
12
daje/ko-hatefulmemes_train_8500 • Updated • 8.2k • 33
daje/ko-hatefulmemes_train_8500_kmhas • Updated • 95.3k • 45
daje/ko-hatefulmemes_train_2000 • Updated • 1.91k • 36
daje/Ko-SciecneQA • Updated • 12.7k • 73
daje/keyword_summary • Updated • 1k • 188
daje/kotext-to-sql-v1 • Updated • 262k • 119 • 1
daje/mistral_tokenized_en_wiki • Updated • 16.1M • 68
daje/mistral_tokenized_ko_wiki • Updated • 1.7M • 34
daje/tokenized_enwiki • Updated • 16.4M • 194
daje/tokenized_kowiki • Updated • 1.71M • 36