Models
Datasets
Spaces
Posts
Docs
Pricing
Log In
Sign Up

Collections

Discover the best community collections!

Collections including paper arxiv:2401.17268

interesting stuff

Chain-of-Verification Reduces Hallucination in Large Language Models

Paper • 2309.11495 • Published Sep 20, 2023 • 38
Adapting Large Language Models via Reading Comprehension

Paper • 2309.09530 • Published Sep 18, 2023 • 77
CulturaX: A Cleaned, Enormous, and Multilingual Dataset for Large Language Models in 167 Languages

Paper • 2309.09400 • Published Sep 17, 2023 • 82
Language Modeling Is Compression

Paper • 2309.10668 • Published Sep 19, 2023 • 82

Fundational - Deep Learning

Just How Flexible are Neural Networks in Practice?

Paper • 2406.11463 • Published Jun 17 • 7
Not All Language Model Features Are Linear

Paper • 2405.14860 • Published May 23 • 39
KAN: Kolmogorov-Arnold Networks

Paper • 2404.19756 • Published Apr 30 • 108
An Interactive Agent Foundation Model

Paper • 2402.05929 • Published Feb 8 • 27

Weaver: Foundation Models for Creative Writing

Paper • 2401.17268 • Published Jan 30 • 43

Track-Over-Time

Weaver: Foundation Models for Creative Writing

Paper • 2401.17268 • Published Jan 30 • 43

Weaver: Foundation Models for Creative Writing

Paper • 2401.17268 • Published Jan 30 • 43

Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs

Paper • 2401.11708 • Published Jan 22 • 30
Weaver: Foundation Models for Creative Writing

Paper • 2401.17268 • Published Jan 30 • 43
PokéLLMon: A Human-Parity Agent for Pokémon Battles with Large Language Models

Paper • 2402.01118 • Published Feb 2 • 29
Training-Free Consistent Text-to-Image Generation

Paper • 2402.03286 • Published Feb 5 • 65

AnimateLCM: Accelerating the Animation of Personalized Diffusion Models and Adapters with Decoupled Consistency Learning

Paper • 2402.00769 • Published Feb 1 • 20
LCM-LoRA: A Universal Stable-Diffusion Acceleration Module

Paper • 2311.05556 • Published Nov 9, 2023 • 81
LongAlign: A Recipe for Long Context Alignment of Large Language Models

Paper • 2401.18058 • Published Jan 31 • 21
Efficient Tool Use with Chain-of-Abstraction Reasoning

Paper • 2401.17464 • Published Jan 30 • 16

Weaver: Foundation Models for Creative Writing

Paper • 2401.17268 • Published Jan 30 • 43

TinyLlama: An Open-Source Small Language Model

Paper • 2401.02385 • Published Jan 4 • 89
MM-LLMs: Recent Advances in MultiModal Large Language Models

Paper • 2401.13601 • Published Jan 24 • 45
SliceGPT: Compress Large Language Models by Deleting Rows and Columns

Paper • 2401.15024 • Published Jan 26 • 69
Rephrasing the Web: A Recipe for Compute and Data-Efficient Language Modeling

Paper • 2401.16380 • Published Jan 29 • 48

DocLLM: A layout-aware generative language model for multimodal document understanding

Paper • 2401.00908 • Published Dec 31, 2023 • 181
Lightning Attention-2: A Free Lunch for Handling Unlimited Sequence Lengths in Large Language Models

Paper • 2401.04658 • Published Jan 9 • 25
Weaver: Foundation Models for Creative Writing

Paper • 2401.17268 • Published Jan 30 • 43
Efficient Tool Use with Chain-of-Abstraction Reasoning

Paper • 2401.17464 • Published Jan 30 • 16

Previous
1
2
Next

Company

© Hugging Face

TOS Privacy About Jobs

Website

Models Datasets Spaces Pricing Docs