tyzhu (Tongyao)

Collections 1

RetrievalAttention: Accelerating Long-Context LLM Inference via Vector Retrieval

Paper • 2409.10516 • Published Sep 16, 2024 • 41
Measuring and Enhancing Trustworthiness of LLMs in RAG through Grounded Attributions and Learning to Refuse

Paper • 2409.11242 • Published Sep 17, 2024 • 6
Promptriever: Instruction-Trained Retrievers Can Be Prompted Like Language Models

Paper • 2409.11136 • Published Sep 17, 2024 • 22
On the Diagram of Thought

Paper • 2409.10038 • Published Sep 16, 2024 • 13

models 223

datasets 815

tyzhu/tpo

Viewer • Updated 5 days ago • 269 • 44

tyzhu/quality

Viewer • Updated 5 days ago • 173 • 65

tyzhu/the-stack-py

Viewer • Updated 16 days ago • 16.3M • 64 • 1

tyzhu/pystack_clean

Viewer • Updated 17 days ago • 9.44M • 46

tyzhu/id_cc_pool

Viewer • Updated Dec 23, 2024 • 72.5M • 189

tyzhu/proweb

Viewer • Updated Dec 23, 2024 • 46.3M • 179

tyzhu/anchorcontext_5M_v3_models

Updated Dec 22, 2024 • 1

tyzhu/cmmlu_filtered

Updated Oct 7, 2024 • 42

tyzhu/lmind_nq_train6000_eval6489_v1_docidx_v3

Viewer • Updated Jun 4, 2024 • 76.7k • 46

tyzhu/flan_max_300_added

Viewer • Updated Apr 3, 2024 • 1.46M • 40

Tongyao PRO

AI & ML interests

Recent Activity

Organizations

Collections 1

RetrievalAttention: Accelerating Long-Context LLM Inference via Vector Retrieval

Measuring and Enhancing Trustworthiness of LLMs in RAG through Grounded Attributions and Learning to Refuse

Promptriever: Instruction-Trained Retrievers Can Be Prompted Like Language Models

On the Diagram of Thought

models 223

tyzhu/tiny_LLaMA_120M_2k_cc_repeat_2k_iter-060000-ckpt-step-15000_hf

tyzhu/tiny_LLaMA_120M_8k_cc_8k_iter-400000-ckpt-step-100000_hf

tyzhu/tiny_LLaMA_1b_8k_intramask_cc_8k_iter-480000-ckpt-step-60000_hf

tyzhu/tiny_LLaMA_1b_8k_intramask_cc_8k_iter-320000-ckpt-step-40000_hf

tyzhu/tiny_LLaMA_1b_8k_cc_8k_iter-400000-ckpt-step-50000_hf

tyzhu/tiny_LLaMA_1b_2k_cc_2k_iter-400000-ckpt-step-50000_hf

tyzhu/llama3.2_3b_8k_intramask_cc_8k_iter-400000-ckpt-step-100000_hf

tyzhu/tiny_LLaMA_1b_32k_cc_32k_iter-100000-ckpt-step-100000_hf

tyzhu/tiny_LLaMA_1b_32k_dm2_cc_32k_iter-100000-ckpt-step-100000_hf

tyzhu/temp_models

datasets 815

tyzhu/tpo

tyzhu/quality

tyzhu/the-stack-py

tyzhu/pystack_clean

tyzhu/id_cc_pool

tyzhu/proweb

tyzhu/anchorcontext_5M_v3_models

tyzhu/cmmlu_filtered

tyzhu/lmind_nq_train6000_eval6489_v1_docidx_v3

tyzhu/flan_max_300_added

Tongyao PRO

AI & ML interests

Recent Activity

Organizations

Collections 1

models 223 Sort: Recently updated

datasets 815 Sort: Recently updated

models 223

datasets 815