kje2952

AI & ML interests

None yet

Recent Activity

updated a collection about 19 hours ago

reasoning

upvoted a paper about 19 hours ago

Reasoning Cache: Continual Improvement Over Long Horizons via Short-Horizon RL

updated a collection about 19 hours ago

reasoning

Organizations

None yet

updated a collection about 19 hours ago

reasoning

Collection

4 items • Updated about 19 hours ago

upvoted a paper about 19 hours ago

Reasoning Cache: Continual Improvement Over Long Horizons via Short-Horizon RL

Paper • 2602.03773 • Published 10 days ago • 5

updated a collection about 19 hours ago

reasoning

Collection

4 items • Updated about 19 hours ago

upvoted a paper 1 day ago

When to Memorize and When to Stop: Gated Recurrent Memory for Long-Context Reasoning

Paper • 2602.10560 • Published 3 days ago • 25

updated a collection 3 days ago

diffusion

Collection

2 items • Updated 3 days ago

upvoted 2 papers 3 days ago

Residual Context Diffusion Language Models

Paper • 2601.22954 • Published 14 days ago • 31

Thinking with Comics: Enhancing Multimodal Reasoning through Structured Visual Storytelling

Paper • 2602.02453 • Published 11 days ago • 35

updated 2 collections 3 days ago

reasoning

Collection

4 items • Updated about 19 hours ago

moe

Collection

2 items • Updated 3 days ago

upvoted 3 papers 3 days ago

updated a model 4 days ago

kje2952/moe-prefill-lora-v2

Updated 3 days ago

upvoted 7 papers 4 days ago

SEMA: Simple yet Effective Learning for Multi-Turn Jailbreak Attacks

Paper • 2602.06854 • Published 7 days ago • 6

LatentMem: Customizing Latent Memory for Multi-Agent Systems

Paper • 2602.03036 • Published 11 days ago • 14

Scaling Embedding Layers in Language Models

Paper • 2502.01637 • Published Feb 3, 2025 • 24

Scaling Embeddings Outperforms Scaling Experts in Language Models

Paper • 2601.21204 • Published 16 days ago • 99

Think-at-Hard: Selective Latent Iterations to Improve Reasoning Language Models

Paper • 2511.08577 • Published Nov 11, 2025 • 108

TiDAR: Think in Diffusion, Talk in Autoregression

Paper • 2511.08923 • Published Nov 12, 2025 • 127

DFlash: Block Diffusion for Flash Speculative Decoding

Paper • 2602.06036 • Published 8 days ago • 41
