1 25

Le Van Duc

levanduc

AI & ML interests

None yet

Recent Activity

updated a collection 10 days ago

LLM-Papers

updated a collection 10 days ago

LLM-Papers

upvoted a paper 10 days ago

Organizations

None yet

levanduc's activity

upvoted a paper 10 days ago

WebRL: Training LLM Web Agents via Self-Evolving Online Curriculum Reinforcement Learning

Paper • 2411.02337 • Published 18 days ago • 36

upvoted a paper 24 days ago

MMAU: A Massive Multi-Task Audio Understanding and Reasoning Benchmark

Paper • 2410.19168 • Published 29 days ago • 19

upvoted 6 papers about 1 month ago

NaturalBench: Evaluating Vision-Language Models on Natural Adversarial Samples

Paper • 2410.14669 • Published Oct 18 • 35

MMed-RAG: Versatile Multimodal RAG System for Medical Vision Language Models

Paper • 2410.13085 • Published Oct 16 • 20

WorldCuisines: A Massive-Scale Benchmark for Multilingual and Multicultural Visual Question Answering on Global Cuisines

Paper • 2410.12705 • Published Oct 16 • 29

StructRAG: Boosting Knowledge Intensive Reasoning of LLMs via Inference-time Hybrid Information Structurization

Paper • 2410.08815 • Published Oct 11 • 42

Vector-ICL: In-context Learning with Continuous Vector Representations

Paper • 2410.05629 • Published Oct 8 • 3

MathCoder2: Better Math Reasoning from Continued Pretraining on Model-translated Mathematical Code

Paper • 2410.08196 • Published Oct 10 • 44

upvoted 2 papers 3 months ago

OLMoE: Open Mixture-of-Experts Language Models

Paper • 2409.02060 • Published Sep 3 • 77

Text2SQL is Not Enough: Unifying AI and Databases with TAG

Paper • 2408.14717 • Published Aug 27 • 24

upvoted 10 papers 4 months ago

Show Less, Instruct More: Enriching Prompts with Definitions and Guidelines for Zero-Shot NER

Paper • 2407.01272 • Published Jul 1 • 8

SeaKR: Self-aware Knowledge Retrieval for Adaptive Retrieval Augmented Generation

Paper • 2406.19215 • Published Jun 27 • 29

Test of Time: A Benchmark for Evaluating LLMs on Temporal Reasoning

Paper • 2406.09170 • Published Jun 13 • 24

The Prompt Report: A Systematic Survey of Prompting Techniques

Paper • 2406.06608 • Published Jun 6 • 55

Towards a Personal Health Large Language Model

Paper • 2406.06474 • Published Jun 10 • 18

Perplexed by Perplexity: Perplexity-Based Data Pruning With Small Reference Models

Paper • 2405.20541 • Published May 30 • 21

TnT-LLM: Text Mining at Scale with Large Language Models

Paper • 2403.12173 • Published Mar 18 • 19