1 21 10

ℏεsam

hesamation

AI & ML interests

post-training / reasonign models / RAG

Recent Activity

upvoted a paper 1 day ago

Agent Learning via Early Experience

new activity 6 days ago

hesamation/primer-llm-embedding:Fixed typo

reacted to SelmaNajih001's post with 🔥 6 days ago

I found it excellent and very well done. One of the best explanations of embedding I've ever read. Well done, @hesamation! Had to share this: https://huggingface.co/spaces/hesamation/primer-llm-embedding

View all activity

Organizations

upvoted a paper 1 day ago

Agent Learning via Early Experience

Paper • 2510.08558 • Published 4 days ago • 177

upvoted a paper about 1 month ago

Loong: Synthesize Long Chain-of-Thoughts at Scale through Verifiers

Paper • 2509.03059 • Published Sep 3 • 24

upvoted a paper about 2 months ago

AgentFly: Fine-tuning LLM Agents without Fine-tuning LLMs

Paper • 2508.16153 • Published Aug 22 • 150

upvoted a paper 3 months ago

A Survey of Context Engineering for Large Language Models

Paper • 2507.13334 • Published Jul 17 • 256

upvoted a paper 4 months ago

Revisiting Reinforcement Learning for LLM Reasoning from A Cross-Domain Perspective

Paper • 2506.14965 • Published Jun 17 • 49

upvoted 8 papers 6 months ago

CipherBank: Exploring the Boundary of LLM Reasoning Capabilities through Cryptography Challenges

Paper • 2504.19093 • Published Apr 27 • 17

Skywork R1V2: Multimodal Hybrid Reinforcement Learning for Reasoning

Paper • 2504.16656 • Published Apr 23 • 57

Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?

Paper • 2504.13837 • Published Apr 18 • 135

WORLDMEM: Long-term Consistent World Simulation with Memory

Paper • 2504.12369 • Published Apr 16 • 34

Genius: A Generalizable and Purely Unsupervised Self-Training Framework For Advanced Reasoning

Paper • 2504.08672 • Published Apr 11 • 55

InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models

Paper • 2504.10479 • Published Apr 14 • 295

Advances and Challenges in Foundation Agents: From Brain-Inspired Intelligence to Evolutionary, Collaborative, and Safe Systems

Paper • 2504.01990 • Published Mar 31 • 300

Open-Reasoner-Zero: An Open Source Approach to Scaling Up Reinforcement Learning on the Base Model

Paper • 2503.24290 • Published Mar 31 • 62

upvoted a paper 7 months ago

A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, and Beyond

Paper • 2503.21614 • Published Mar 27 • 41

upvoted a collection 7 months ago

🤔 Reasoning about Reasoning

Collection

papers and articles about reasoning LLMs • 12 items • Updated Jun 22 • 6

upvoted 2 papers 7 months ago

AdaptiVocab: Enhancing LLM Efficiency in Focused Domains through Lightweight Vocabulary Adaptation

Paper • 2503.19693 • Published Mar 25 • 76

Inside-Out: Hidden Factual Knowledge in LLMs

Paper • 2503.15299 • Published Mar 19 • 55

upvoted 2 papers 9 months ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22 • 418

2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining

Paper • 2501.00958 • Published Jan 1 • 107

upvoted a paper 10 months ago

On Domain-Specific Post-Training for Multimodal Large Language Models

Paper • 2411.19930 • Published Nov 29, 2024 • 29

ℏεsam

AI & ML interests

Recent Activity

Organizations

hesamation's activity