Reasoning Core: A Scalable RL Environment for LLM Symbolic Reasoning Paper β’ 2509.18083 β’ Published Sep 22 β’ 5 β’ 2
Saturation-Driven Dataset Generation for LLM Mathematical Reasoning in the TPTP Ecosystem Paper β’ 2509.06809 β’ Published Sep 8 β’ 2 β’ 2
TAROT: Task-Oriented Authorship Obfuscation Using Policy Optimization Methods Paper β’ 2407.21630 β’ Published Jul 31, 2024 β’ 8 β’ 2
Attention Overflow: Language Model Input Blur during Long-Context Missing Items Recommendation Paper β’ 2407.13481 β’ Published Jul 18, 2024 β’ 10 β’ 3