Model2Vec: Distill a Small Fast Model from any Sentence Transformer Article • By Pringled • 27 days ago • 54
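As a rough illustration of the distillation step the article describes, here is a minimal sketch using the model2vec library; the `distill` signature, the `pca_dims` parameter, and the source checkpoint "BAAI/bge-base-en-v1.5" are assumptions based on the library's README and may differ across versions.

```python
# Hedged sketch: distill a small static-embedding model from a Sentence Transformer.
# Checkpoint name, pca_dims, and output path are illustrative placeholders.
from model2vec.distill import distill

m2v_model = distill(model_name="BAAI/bge-base-en-v1.5", pca_dims=256)
m2v_model.save_pretrained("m2v-bge-base")

# The distilled model embeds text without running the original transformer.
embeddings = m2v_model.encode(["Distillation makes inference fast."])
print(embeddings.shape)
```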
IrokoBench Collection A human-translated benchmark dataset for 16 African languages covering three tasks: NLI, MMLU, and MGSM • 6 items • Updated May 31 • 18
Arcee's MergeKit: A Toolkit for Merging Large Language Models Paper • 2403.13257 • Published Mar 20 • 20
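For context on what a merge toolkit automates, below is a naive sketch of the simplest strategy it covers (equal-weight linear parameter averaging) written directly in PyTorch; this is not MergeKit's API, and the checkpoint names are placeholders for two architecturally identical models.

```python
# Hedged sketch of linear model merging (not MergeKit itself).
# "org/model-a" and "org/model-b" are placeholder names for models
# that share the exact same architecture and parameter shapes.
import torch
from transformers import AutoModelForCausalLM

model_a = AutoModelForCausalLM.from_pretrained("org/model-a", torch_dtype=torch.float32)
model_b = AutoModelForCausalLM.from_pretrained("org/model-b", torch_dtype=torch.float32)

state_b = model_b.state_dict()
# Equal-weight average of every parameter tensor.
merged = {name: 0.5 * p + 0.5 * state_b[name] for name, p in model_a.state_dict().items()}

model_a.load_state_dict(merged)
model_a.save_pretrained("merged-model")
```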
Pretrained Text-Generation Models Below 250M Parameters Collection Great candidates for fine-tuning with Transformers.js as the deployment target, ordered by number of parameters. • 8 items • Updated Aug 10 • 7
Contrastive Preference Optimization: Pushing the Boundaries of LLM Performance in Machine Translation Paper • 2401.08417 • Published Jan 16 • 32
Open LLM Leaderboard best models ❤️🔥 Collection A daily updated list of models with the best evaluations on the LLM leaderboard. • 57 items • Updated about 1 hour ago • 431
Trained Models 🏋️ Collection They may be small, but they're training like giants! • 8 items • Updated May 13 • 16
EIPE-text: Evaluation-Guided Iterative Plan Extraction for Long-Form Narrative Text Generation Paper • 2310.08185 • Published Oct 12, 2023 • 6
TinyGSM: achieving >80% on GSM8k with small language models Paper • 2312.09241 • Published Dec 14, 2023 • 37
ChatGPT-Mini Collection A collection of fine-tuned GPT-2 models, each designed to provide a ChatGPT-like model at home. These models can also run on an old computer. • 8 items • Updated Nov 16, 2023 • 4
smol llama Collection 🚧"raw" pretrained smol_llama checkpoints - WIP 🚧 • 4 items • Updated Apr 29 • 6
Indic language fine-tunes Collection Halted: attempting to create acceptable-quality fine-tunes of different models • 1 item • Updated Nov 23, 2023 • 1
PIC (Partner-in-Crime) project Collection Empathetic, small, really useful personalised models. • 3 items • Updated Dec 10, 2023 • 2
Cramp(ed) Models Collection Smaller models trained locally on my 2xA6000 Lambda Vector • 3 items • Updated Oct 10, 2023 • 1
Shrink Llama - V1 Collection Parts of Meta's LlamaV2 models, chopped up and trained. CoreX means the first X layers were kept. • 2 items • Updated Sep 12, 2023 • 2
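A minimal sketch of what the "CoreX" truncation described above could look like with transformers, assuming a Llama-style checkpoint; the checkpoint name, output path, and X = 8 are placeholders, and the truncated model would still need further training to be useful.

```python
# Hedged sketch of "CoreX"-style truncation: keep only the first X decoder layers.
# Checkpoint name, X, and output path are illustrative placeholders.
from transformers import AutoModelForCausalLM

X = 8
model = AutoModelForCausalLM.from_pretrained("meta-llama/Llama-2-7b-hf")
model.model.layers = model.model.layers[:X]  # ModuleList slice keeps layers 0..X-1
model.config.num_hidden_layers = X
model.save_pretrained("shrink-llama-core8")
```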