Article Releasing the largest multilingual open pretraining dataset By Pclanglais • 10 days ago • 94
Back to Basics: Revisiting REINFORCE Style Optimization for Learning from Human Feedback in LLMs Paper • 2402.14740 • Published Feb 22 • 11
🧠 Abliteration Collection Uncensored models using abliteration. See this article for more information: huggingface.co/blog/mlabonne/abliteration • 7 items • Updated 5 days ago • 22
The Impact of Hyperparameters on Large Language Model Inference Performance: An Evaluation of vLLM and HuggingFace Pipelines Paper • 2408.01050 • Published Aug 2 • 8
Article The case for specialized pre-training: ultra-fast foundation models for dedicated tasks By Pclanglais • Aug 4 • 26
Improving Text Embeddings for Smaller Language Models Using Contrastive Fine-tuning Paper • 2408.00690 • Published Aug 1 • 22
Probably function calling datasets Collection Created using the https://huggingface.co/spaces/librarian-bots/dataset-column-search-api Space. • 39 items • Updated Jul 17 • 36
Understanding Reference Policies in Direct Preference Optimization Paper • 2407.13709 • Published Jul 18 • 16
Bad Data Toolbox Collection PleIAs collection of models for the data processing of challenging document and data sources. • 5 items • Updated Jul 18 • 11
OpenCulture Collection A multilingual dataset of public domain books and newspapers. • 27 items • Updated 16 days ago • 117
Finance Commons Collection A large collection of multimodal financial documents in open data. • 7 items • Updated Jul 17 • 4
The Importance of Online Data: Understanding Preference Fine-tuning via Coverage Paper • 2406.01462 • Published Jun 3 • 6