LoRA vs Full Fine-tuning: An Illusion of Equivalence Paper • 2410.21228 • Published 24 days ago • 2
Stronger Models are NOT Stronger Teachers for Instruction Tuning Paper • 2411.07133 • Published 10 days ago • 28
Article SauerkrautLM's Multi-Phase Spectrum Training: A Technical Deep Dive By DavidGF • 13 days ago • 9
Article 🇮🇹🇯🇵🇧🇷 Generating multilingual instruction datasets with Magpie 🐦‍⬛ By anakin87 • Oct 21 • 18
Article Model2Vec: Distill a Small Fast Model from any Sentence Transformer By Pringled • Oct 14 • 55
Non Verbis, Sed Rebus: Large Language Models are Weak Solvers of Italian Rebuses Paper • 2408.00584 • Published Aug 1 • 6
Article Selective fine-tuning of Language Models with Spectrum By anakin87 • Sep 3 • 29
🧩 Verbalized Rebus @ CLiC-it 2024 Collection Materials for the paper "Non Verbis, Sed Rebus: Large Language Models are Weak Solvers of Italian Rebuses" • 13 items • Updated Aug 5 • 3
Article 🔥 Argilla 2.0: the data-centric tool for AI makers 🤗 By dvilasuero • Jul 30 • 37
Article Mixedbread 🤝 deepset: Announcing our New German/English Embedding Model By shadeMe • Jul 19 • 15
Article 🦙⚗️ Using Llama3 and distilabel to build fine-tuning datasets By dvilasuero • Jun 4 • 73
Refusal in Language Models Is Mediated by a Single Direction Paper • 2406.11717 • Published Jun 17 • 2
abliterated-v3 Collection Latest gen of the abliterated models I've produced • 17 items • Updated Jun 3 • 97
Article ⚗️ 🔥 Building High-Quality Datasets with distilabel and Prometheus 2 By burtenshaw • Jun 3 • 26
Article Training and Finetuning Embedding Models with Sentence Transformers v3 May 28 • 158