Efficient - a zhidong-gao Collection

zhidong-gao 's Collections

Video

3D

SD

Audio

Attack

LLMs

dataset

align

Agent

Efficient

updated Aug 14

GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection

Paper • 2403.03507 • Published Mar 6 • 182
Mixture-of-Subspaces in Low-Rank Adaptation

Paper • 2406.11909 • Published Jun 16 • 3
Grass: Compute Efficient Low-Memory LLM Training with Structured Sparse Gradients

Paper • 2406.17660 • Published Jun 25 • 5
From GaLore to WeLore: How Low-Rank Weights Non-uniformly Emerge from Low-Rank Gradients

Paper • 2407.11239 • Published Jul 15 • 7