Makise Kurisu's picture

3 16 5

Makise Kurisu

kurisu0306

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 3 days ago

LongCat-Flash-Thinking-2601 Technical Report

upvoted a paper 8 days ago

Not All Correct Answers Are Equal: Why Your Distillation Source Matters

liked a model 8 days ago

Emperorizzis/ASTRA-14B-Thinking-v1

View all activity

Organizations

None yet

upvoted a paper 3 days ago

LongCat-Flash-Thinking-2601 Technical Report

Paper • 2601.16725 • Published 7 days ago • 163

upvoted a paper 8 days ago

Not All Correct Answers Are Equal: Why Your Distillation Source Matters

Paper • 2505.14464 • Published May 20, 2025 • 10

liked 2 models 8 days ago

Emperorizzis/ASTRA-14B-Thinking-v1

15B • Updated about 4 hours ago • 13 • 6

Emperorizzis/ASTRA-32B-Thinking-v1

33B • Updated about 4 hours ago • 19 • 5

upvoted 2 collections 9 days ago

ASTRA Dataset

2 items • Updated 9 days ago • 3

ASTRA Models

2 items • Updated 8 days ago • 1

upvoted 2 papers 9 days ago

Toward Efficient Agents: Memory, Tool learning, and Planning

Paper • 2601.14192 • Published 10 days ago • 51

Advances and Frontiers of LLM-based Issue Resolution in Software Engineering: A Comprehensive Survey

Paper • 2601.11655 • Published 15 days ago • 60

upvoted a paper 11 days ago

Unlocking Implicit Experience: Synthesizing Tool-Use Trajectories from Text

Paper • 2601.10355 • Published 15 days ago • 39

upvoted a paper 12 days ago

STEP3-VL-10B Technical Report

Paper • 2601.09668 • Published 16 days ago • 189

upvoted a paper 17 days ago

EnvScaler: Scaling Tool-Interactive Environments for LLM Agent via Programmatic Synthesis

Paper • 2601.05808 • Published 21 days ago • 36

upvoted a paper 26 days ago

Let It Flow: Agentic Crafting on Rock and Roll, Building the ROME Model within an Open Agentic Learning Ecosystem

Paper • 2512.24873 • Published 30 days ago • 104

upvoted a paper about 2 months ago

In-the-Flow Agentic System Optimization for Effective Planning and Tool Use

Paper • 2510.05592 • Published Oct 7, 2025 • 107

liked a model 4 months ago

deepseek-ai/DeepSeek-V3.2-Exp

Text Generation • 685B • Updated Nov 18, 2025 • 55.3k • • 946

upvoted a paper 8 months ago

Advancing Multimodal Reasoning: From Optimized Cold Start to Staged Reinforcement Learning

Paper • 2506.04207 • Published Jun 4, 2025 • 48

liked a Space 9 months ago

Qwen3 Demo

Chat with AI assistant powered by Qwen3 model

upvoted a collection 9 months ago

Qwen3

Qwen's new Qwen3 models. In Unsloth Dynamic 2.0, GGUF, 4-bit and 16-bit Safetensor formats. Includes 128K Context Length variants. • 79 items • Updated 8 days ago • 258

upvoted a collection 11 months ago

Qwen2.5-VL

Vision-language model series based on Qwen2.5 • 11 items • Updated about 1 month ago • 554

New activity in google/siglip2-base-patch16-224 11 months ago

Missing Vocab file

#4 opened 11 months ago by

Error while loading processor: TypeError: expected str, bytes or os.PathLike object, not NoneType

#2 opened 11 months ago by