HyperAgent: Generalist Software Engineering Agents to Solve Coding Tasks at Scale Paper • 2409.16299 • Published Sep 9 • 10
Gemma 2: Improving Open Language Models at a Practical Size Paper • 2408.00118 • Published Jul 31 • 75
GoldFinch: High Performance RWKV/Transformer Hybrid with Linear Pre-Fill and Extreme KV-Cache Compression Paper • 2407.12077 • Published Jul 16 • 54
Searching for Best Practices in Retrieval-Augmented Generation Paper • 2407.01219 • Published Jul 1 • 11
GUI Odyssey: A Comprehensive Dataset for Cross-App GUI Navigation on Mobile Devices Paper • 2406.08451 • Published Jun 12 • 23
Model Merging Collection Model merging is a very popular technique in LLMs nowadays. Here is a chronological list of papers in the space that will help you get started with it! • 30 items • Updated Jun 12 • 218
DBRX Collection DBRX is a mixture-of-experts (MoE) large language model trained from scratch by Databricks. • 3 items • Updated Mar 27 • 91
Daily Picks in Interpretability & Analysis of LMs Collection Outstanding research in interpretability and evaluation of language models, summarized • 88 items • Updated about 20 hours ago • 92
In-Context Language Learning: Architectures and Algorithms Paper • 2401.12973 • Published Jan 23 • 4
Fin-RWKV-V1 Collection Attention-free financial expert model - RWKV V4 • 6 items • Updated Feb 2 • 1
MusicAgent: An AI Agent for Music Understanding and Generation with Large Language Models Paper • 2310.11954 • Published Oct 18, 2023 • 25
Self-RAG: Learning to Retrieve, Generate, and Critique through Self-Reflection Paper • 2310.11511 • Published Oct 17, 2023 • 74
Progressive3D: Progressively Local Editing for Text-to-3D Content Creation with Complex Semantic Prompts Paper • 2310.11784 • Published Oct 18, 2023 • 10