levanduc
's Collections
LLM-Papers
updated
PDFTriage: Question Answering over Long, Structured Documents
Paper
•
2309.08872
•
Published
•
53
Adapting Large Language Models via Reading Comprehension
Paper
•
2309.09530
•
Published
•
75
Table-GPT: Table-tuned GPT for Diverse Table Tasks
Paper
•
2310.09263
•
Published
•
39
Context-Aware Meta-Learning
Paper
•
2310.10971
•
Published
•
16
Data-Centric Financial Large Language Models
Paper
•
2310.17784
•
Published
•
14
TeacherLM: Teaching to Fish Rather Than Giving the Fish, Language
Modeling Likewise
Paper
•
2310.19019
•
Published
•
9
Contrastive Chain-of-Thought Prompting
Paper
•
2311.09277
•
Published
•
33
Orca 2: Teaching Small Language Models How to Reason
Paper
•
2311.11045
•
Published
•
70
Context Tuning for Retrieval Augmented Generation
Paper
•
2312.05708
•
Published
•
16
SOLAR 10.7B: Scaling Large Language Models with Simple yet Effective
Depth Up-Scaling
Paper
•
2312.15166
•
Published
•
56
Improving Text Embeddings with Large Language Models
Paper
•
2401.00368
•
Published
•
79
DocLLM: A layout-aware generative language model for multimodal document
understanding
Paper
•
2401.00908
•
Published
•
178
Chain-of-Table: Evolving Tables in the Reasoning Chain for Table
Understanding
Paper
•
2401.04398
•
Published
•
20
ReFT: Reasoning with Reinforced Fine-Tuning
Paper
•
2401.08967
•
Published
•
27
SliceGPT: Compress Large Language Models by Deleting Rows and Columns
Paper
•
2401.15024
•
Published
•
67
Tag-LLM: Repurposing General-Purpose LLMs for Specialized Domains
Paper
•
2402.05140
•
Published
•
20
AutoMathText: Autonomous Data Selection with Language Models for
Mathematical Texts
Paper
•
2402.07625
•
Published
•
11
How to Train Data-Efficient LLMs
Paper
•
2402.09668
•
Published
•
38
FinTral: A Family of GPT-4 Level Multimodal Financial Large Language
Models
Paper
•
2402.10986
•
Published
•
76
Knowledge Fusion of Large Language Models
Paper
•
2401.10491
•
Published
•
3
SaulLM-7B: A pioneering Large Language Model for Law
Paper
•
2403.03883
•
Published
•
74
RAFT: Adapting Language Model to Domain Specific RAG
Paper
•
2403.10131
•
Published
•
66
TnT-LLM: Text Mining at Scale with Large Language Models
Paper
•
2403.12173
•
Published
•
19
LlamaFactory: Unified Efficient Fine-Tuning of 100+ Language Models
Paper
•
2403.13372
•
Published
•
58
Perplexed by Perplexity: Perplexity-Based Data Pruning With Small
Reference Models
Paper
•
2405.20541
•
Published
•
20
Towards a Personal Health Large Language Model
Paper
•
2406.06474
•
Published
•
17
Test of Time: A Benchmark for Evaluating LLMs on Temporal Reasoning
Paper
•
2406.09170
•
Published
•
24
Instruction Pre-Training: Language Models are Supervised Multitask
Learners
Paper
•
2406.14491
•
Published
•
85
The FineWeb Datasets: Decanting the Web for the Finest Text Data at
Scale
Paper
•
2406.17557
•
Published
•
84
SeaKR: Self-aware Knowledge Retrieval for Adaptive Retrieval Augmented
Generation
Paper
•
2406.19215
•
Published
•
29
Show Less, Instruct More: Enriching Prompts with Definitions and
Guidelines for Zero-Shot NER
Paper
•
2407.01272
•
Published
•
8
LETS-C: Leveraging Language Embedding for Time Series Classification
Paper
•
2407.06533
•
Published
•
2
SPIQA: A Dataset for Multimodal Question Answering on Scientific Papers
Paper
•
2407.09413
•
Published
•
9
Scaling Retrieval-Based Language Models with a Trillion-Token Datastore
Paper
•
2407.12854
•
Published
•
29
MMAU: A Holistic Benchmark of Agent Capabilities Across Diverse Domains
Paper
•
2407.18961
•
Published
•
38
SaulLM-54B & SaulLM-141B: Scaling Up Domain Adaptation for the Legal
Domain
Paper
•
2407.19584
•
Published
•
60
Visual Riddles: a Commonsense and World Knowledge Challenge for Large
Vision and Language Models
Paper
•
2407.19474
•
Published
•
22
Self-Training with Direct Preference Optimization Improves
Chain-of-Thought Reasoning
Paper
•
2407.18248
•
Published
•
30
MindSearch: Mimicking Human Minds Elicits Deep AI Searcher
Paper
•
2407.20183
•
Published
•
37
LAMBDA: A Large Model Based Data Agent
Paper
•
2407.17535
•
Published
•
34
Knowledge Mechanisms in Large Language Models: A Survey and Perspective
Paper
•
2407.15017
•
Published
•
33
AgentInstruct: Toward Generative Teaching with Agentic Flows
Paper
•
2407.03502
•
Published
•
43
Text2SQL is Not Enough: Unifying AI and Databases with TAG
Paper
•
2408.14717
•
Published
•
23
Foundation Models for Music: A Survey
Paper
•
2408.14340
•
Published
•
38
Efficient Detection of Toxic Prompts in Large Language Models
Paper
•
2408.11727
•
Published
•
11
Sapiens: Foundation for Human Vision Models
Paper
•
2408.12569
•
Published
•
84
Controllable Text Generation for Large Language Models: A Survey
Paper
•
2408.12599
•
Published
•
61
TableBench: A Comprehensive and Complex Benchmark for Table Question
Answering
Paper
•
2408.09174
•
Published
•
51
OLMoE: Open Mixture-of-Experts Language Models
Paper
•
2409.02060
•
Published
•
74
MMEvol: Empowering Multimodal Large Language Models with Evol-Instruct
Paper
•
2409.05840
•
Published
•
43
Towards a Unified View of Preference Learning for Large Language Models:
A Survey
Paper
•
2409.02795
•
Published
•
70