Small LMs Text Embedding Collection Contrastive fine-tuned version of Language Models up to 2B parameters using LoRA • 3 items • Updated May 8 • 4
Papers I want to read Collection Papers in my to-read list • 247 items • Updated 4 days ago • 26
Matryoshka Embedding Models Collection https://huggingface.co/blog/matryoshka • 14 items • Updated Jun 4 • 13
MINT-1T Collection Data for "MINT-1T: Scaling Open-Source Multimodal Data by 10x: A Multimodal Dataset with One Trillion Tokens" • 13 items • Updated Jul 24 • 54
GTE models Collection General Text Embedding Models Released by Alibaba Group • 19 items • Updated Aug 6 • 13
Article Fine-tune Llama 3.1 Ultra-Efficiently with Unsloth By mlabonne • Jul 29 • 245
SpreadsheetLLM: Encoding Spreadsheets for Large Language Models Paper • 2407.09025 • Published Jul 12 • 128
LongRAG: Enhancing Retrieval-Augmented Generation with Long-context LLMs Paper • 2406.15319 • Published Jun 21 • 61