OneBit: Towards Extremely Low-bit Large Language Models Paper • 2402.11295 • Published Feb 17 • 22 • 13
LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens Paper • 2402.13753 • Published Feb 21 • 111 • 20
World Model on Million-Length Video And Language With RingAttention Paper • 2402.08268 • Published Feb 13 • 36 • 5
Rephrasing the Web: A Recipe for Compute and Data-Efficient Language Modeling Paper • 2401.16380 • Published Jan 29 • 48 • 7
DL3DV-10K: A Large-Scale Scene Dataset for Deep Learning-based 3D Vision Paper • 2312.16256 • Published Dec 26, 2023 • 15 • 4
Shai: A large language model for asset management Paper • 2312.14203 • Published Dec 21, 2023 • 4 • 2
TEAL: Tokenize and Embed ALL for Multi-modal Large Language Models Paper • 2311.04589 • Published Nov 8, 2023 • 18 • 5