SwiftBrush v2: Make Your One-step Diffusion Model Better Than Its Teacher Paper • 2408.14176 • Published Aug 26, 2024 • 62
The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits Paper • 2402.17764 • Published Feb 27, 2024 • 610
LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens Paper • 2402.13753 • Published Feb 21, 2024 • 115
Post • Here is my selection of papers for today (12 Jan): https://huggingface.co/papers
PALP: Prompt Aligned Personalization of Text-to-Image Models
Object-Centric Diffusion for Efficient Video Editing
TRIPS: Trilinear Point Splatting for Real-Time Radiance Field Rendering
Diffusion Priors for Dynamic View Synthesis from Monocular Videos
Parrot: Pareto-optimal Multi-Reward Reinforcement Learning Framework for Text-to-Image Generation
TOFU: A Task of Fictitious Unlearning for LLMs
Patchscope: A Unifying Framework for Inspecting Hidden Representations of Language Models
Secrets of RLHF in Large Language Models Part II: Reward Modeling
LEGO: Language Enhanced Multi-modal Grounding Model
DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models
Tuning LLMs with Contrastive Alignment Instructions for Machine Translation in Unseen, Low-resource Languages
A Shocking Amount of the Web is Machine Translated: Insights from Multi-Way Parallelism
Towards Conversational Diagnostic AI
Transformers are Multi-State RNNs
Sleeper Agents: Training Deceptive LLMs that Persist Through Safety Training
Distilling Vision-Language Models on Millions of Videos
Efficient LLM inference solution on Intel GPU
TrustLLM: Trustworthiness in Large Language Models
❤️ 14 • 🤗 2
LLM in a flash: Efficient Large Language Model Inference with Limited Memory Paper • 2312.11514 • Published Dec 12, 2023 • 259
distil-whisper/distil-large-v2 Automatic Speech Recognition • Updated about 19 hours ago • 405k • 505