Llama 3.1 Collection This collection hosts the transformers and original repos of the Llama 3.1, Llama Guard 3 and Prompt Guard models • 11 items • Updated Sep 25 • 626
Efficient Stable Diffusion Collection Block-removed Knowledge-distilled SD models; https://github.com/Nota-NetsPresso/BK-SDM • 9 items • Updated Jul 1 • 2
Efficient Large Language Model Collection Shortened LLMs from Depth Pruning; https://github.com/Nota-NetsPresso/shortened-llm • 14 items • Updated Jul 23 • 4
LD-Pruner: Efficient Pruning of Latent Diffusion Models using Task-Agnostic Insights Paper • 2404.11936 • Published Apr 18 • 1
A Unified Compression Framework for Efficient Speech-Driven Talking-Face Generation Paper • 2304.00471 • Published Apr 2, 2023 • 1
On Architectural Compression of Text-to-Image Diffusion Models Paper • 2305.15798 • Published May 25, 2023 • 4
Optimizing diffusion models Collection Provides a list of papers focusing on optimizing T2I diffusion models, targeting fewer timesteps, architecture optimization, and more. • 21 items • Updated Aug 22 • 19
Rethinking Optimization and Architecture for Tiny Language Models Paper • 2402.02791 • Published Feb 5 • 12
Shortened LLaMA: A Simple Depth Pruning for Large Language Models Paper • 2402.02834 • Published Feb 5 • 14
Diffusers at ICCV 2023 Collection This collection lists the demos to be presented at ICCV 2023 utilizing the Diffusers library (https://iccv2023.thecvf.com/demos-111.php). • 7 items • Updated Oct 4, 2023 • 3