jiakai's picture

48 417

jiakai

real-jiakai

·

https://blog.gujiakai.top

AI & ML interests

LLM && Smart QA

Recent Activity

liked a model about 1 hour ago

google/paligemma2-10b-ft-docci-448

liked a model about 1 hour ago

google/paligemma2-28b-pt-224

liked a model about 1 hour ago

google/paligemma2-3b-pt-896

View all activity

Organizations

real-jiakai's activity

upvoted a paper 2 days ago

Phi-4 Technical Report

Paper • 2412.08905 • Published 6 days ago • 82

upvoted a paper 5 days ago

StreamChat: Chatting with Streaming Video

Paper • 2412.08646 • Published 6 days ago • 17

upvoted a paper 11 days ago

Judging LLM-as-a-judge with MT-Bench and Chatbot Arena

Paper • 2306.05685 • Published Jun 9, 2023 • 31

upvoted a collection 14 days ago

🔱 Sailor2 Language Models

Sailing in South-East Asia with Inclusive Multilingual LLMs • 9 items • Updated 14 days ago • 21

upvoted 2 papers 14 days ago

Open-Sora Plan: Open-Source Large Video Generation Model

Paper • 2412.00131 • Published 20 days ago • 32

o1-Coder: an o1 Replication for Coding

Paper • 2412.00154 • Published 19 days ago • 39

upvoted a collection 15 days ago

Nov 29 Releases 🌲🌲

25 items • Updated 16 days ago • 9

upvoted a collection 21 days ago

Qwen2.5

Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 45 items • Updated 20 days ago • 424

upvoted a paper 21 days ago

Star Attention: Efficient LLM Inference over Long Sequences

Paper • 2411.17116 • Published 22 days ago • 45

upvoted a paper 28 days ago

LLaVA-o1: Let Vision Language Models Reason Step-by-Step

Paper • 2411.10440 • Published Nov 15 • 109

upvoted an article about 1 month ago

Article

Releasing the largest multilingual open pretraining dataset

By

•

Nov 13

• 98

upvoted a paper about 1 month ago

TableGPT2: A Large Multimodal Model with Tabular Data Integration

Paper • 2411.02059 • Published Nov 4 • 5

upvoted 2 collections about 1 month ago

Qwen2.5-Coder

Code-specific model series based on Qwen2.5 • 40 items • Updated 20 days ago • 253

OpenCoder

OpenCoder is an open and reproducible code LLM family which matches the performance of top-tier code LLMs. • 8 items • Updated 25 days ago • 76

upvoted a paper about 1 month ago

Large Language Models Orchestrating Structured Reasoning Achieve Kaggle Grandmaster Level

Paper • 2411.03562 • Published Nov 5 • 60

upvoted 3 papers about 2 months ago

AgentStore: Scalable Integration of Heterogeneous Agents As Specialized Generalist Computer Assistant

Paper • 2410.18603 • Published Oct 24 • 31

FasterCache: Training-Free Video Diffusion Model Acceleration with High Quality

Paper • 2410.19355 • Published Oct 25 • 23

Why Does the Effective Context Length of LLMs Fall Short?

Paper • 2410.18745 • Published Oct 24 • 16

upvoted a paper 2 months ago

LlamaFactory: Unified Efficient Fine-Tuning of 100+ Language Models

Paper • 2403.13372 • Published Mar 20 • 62

upvoted an article 2 months ago

Article

GaLore: Advancing Large Model Training on Consumer-grade Hardware

Mar 20

• 26