-
Cached Transformers: Improving Transformers with Differentiable Memory Cache
Paper • 2312.12742 • Published • 12 -
ProTIP: Progressive Tool Retrieval Improves Planning
Paper • 2312.10332 • Published • 7 -
Paloma: A Benchmark for Evaluating Language Model Fit
Paper • 2312.10523 • Published • 12 -
The FineWeb Datasets: Decanting the Web for the Finest Text Data at Scale
Paper • 2406.17557 • Published • 86
daje kang
daje
·
AI & ML interests
None yet
Recent Activity
updated
a model
8 days ago
daje/Qwen2-VL-72B-instruct-ScienceQA
liked
a model
10 days ago
boltz-community/boltz-1
updated
a dataset
27 days ago
daje/Ko-SciecneQA
Organizations
None yet
Collections
1
models
19
daje/Qwen2-VL-72B-instruct-ScienceQA
Updated
•
10
daje/Qwen2-VL-72B-instruct-ScienceQA-LoRA
Updated
daje/llama3.1-8B-naver_news-summary-llamafactory
Updated
•
7
daje/code-llama-7b-text-to-sql
Updated
daje/chapter5_code-llama3-8B-text-to-sql-ver0.1
Updated
daje/chapter5_psychological_chatbots
Updated
daje/20240830_model
Updated
daje/meta-llama3.1-8B-qna-koalpaca-v1.1
Text Generation
•
Updated
•
7
daje/model_output
Updated
daje/chinese_results_20240729_021938
Updated
datasets
9
daje/Ko-SciecneQA
Viewer
•
Updated
•
12.7k
•
54
daje/keyword_summary
Viewer
•
Updated
•
1k
•
33
daje/kotext-to-sql-v1
Viewer
•
Updated
•
262k
•
43
daje/mistral_tokenized_en_wiki
Viewer
•
Updated
•
16.1M
•
187
daje/mistral_tokenized_ko_wiki
Viewer
•
Updated
•
1.7M
•
37
daje/tokenized_enwiki
Viewer
•
Updated
•
16.4M
•
187
daje/tokenized_kowiki
Viewer
•
Updated
•
1.71M
•
41
daje/en_wiki
Viewer
•
Updated
•
5.09M
•
285
daje/ko_wiki
Viewer
•
Updated
•
311k
•
74
•
6