Mini Reasoning

university

https://joshuaongg21.github.io/

AI & ML interests

None defined yet.

submitted a paper to Daily Papers 3 months ago

Can I Have Your Order? Monte-Carlo Tree Search for Slot Filling Ordering in Diffusion Language Models

Paper • 2602.12586 • Published Feb 13 • 2

authored 2 papers 4 months ago

Beyond Data Filtering: Knowledge Localization for Capability Removal in LLMs

Paper • 2512.05648 • Published Dec 5, 2025

The Hot Mess of AI: How Does Misalignment Scale With Model Intelligence and Task Complexity?

Paper • 2601.23045 • Published Jan 30

authored a paper 8 months ago

Learning GUI Grounding with Spatial Reasoning from Visual Feedback

Paper • 2509.21552 • Published Sep 25, 2025 • 11

authored 2 papers 9 months ago

Theorem Prover as a Judge for Synthetic Data Generation

Paper • 2502.13137 • Published Feb 18, 2025 • 1

PiCSAR: Probabilistic Confidence Selection And Ranking

Paper • 2508.21787 • Published Aug 29, 2025 • 4

authored a paper 9 months ago

PiCSAR: Probabilistic Confidence Selection And Ranking

Paper • 2508.21787 • Published Aug 29, 2025 • 4

authored 4 papers 10 months ago

Self-Training Large Language Models for Tool-Use Without Demonstrations

Paper • 2502.05867 • Published Feb 9, 2025

Parameter-Efficient Fine-Tuning of LLaMA for the Clinical Domain

Paper • 2307.03042 • Published Jul 6, 2023

Scalpel vs. Hammer: GRPO Amplifies Existing Capabilities, SFT Replaces Them

Paper • 2507.10616 • Published Jul 13, 2025 • 1

Inverse Scaling in Test-Time Compute

Paper • 2507.14417 • Published Jul 19, 2025 • 28

authored a paper about 1 year ago

What Is That Talk About? A Video-to-Text Summarization Dataset for Scientific Presentations

Paper • 2502.08279 • Published Feb 12, 2025 • 1

authored a paper about 1 year ago

MMLongBench: Benchmarking Long-Context Vision-Language Models Effectively and Thoroughly

Paper • 2505.10610 • Published May 15, 2025 • 55

authored a paper about 1 year ago

MMLongBench: Benchmarking Long-Context Vision-Language Models Effectively and Thoroughly

Paper • 2505.10610 • Published May 15, 2025 • 55

authored a paper about 1 year ago

An Analysis of Decoding Methods for LLM-based Agents for Faithful Multi-Hop Question Answering

Paper • 2503.23415 • Published Mar 30, 2025 • 1

authored a paper about 1 year ago

Q-Filters: Leveraging QK Geometry for Efficient KV Cache Compression

Paper • 2503.02812 • Published Mar 4, 2025 • 10

authored 2 papers over 1 year ago

Lost in Time: Clock and Calendar Understanding Challenges in Multimodal LLMs

Paper • 2502.05092 • Published Feb 7, 2025 • 8

PosterSum: A Multimodal Benchmark for Scientific Poster Summarization

Paper • 2502.17540 • Published Feb 24, 2025 • 3

authored a paper over 1 year ago

Lost in Time: Clock and Calendar Understanding Challenges in Multimodal LLMs

Paper • 2502.05092 • Published Feb 7, 2025 • 8

authored a paper over 1 year ago

CoMAT: Chain of Mathematically Annotated Thought Improves Mathematical Reasoning

Paper • 2410.10336 • Published Oct 14, 2024 • 2