David Adelani's picture

David Adelani

Davlan

·

https://dadelani.github.io/

AI & ML interests

Low resource NLP

Recent Activity

updated a Space about 13 hours ago

Davlan/msteb_leaderboard

published a Space about 13 hours ago

Davlan/msteb_leaderboard

updated a dataset 2 days ago

DDD-Kenya/Luhya-ASR-Data-subset-50h

View all activity

Organizations

upvoted an article about 1 month ago

Article

mmBERT: ModernBERT goes Multilingual

Sep 9

• 116

upvoted a collection about 2 months ago

VibeVoice

Frontier Text-to-Speech Models https://microsoft.github.io/VibeVoice/ • 5 items • Updated Sep 1 • 129

upvoted an article 4 months ago

Article

SmolLM3: smol, multilingual, long-context reasoner

Jul 8

• 701

upvoted a collection 9 months ago

GemmaX2

GemmaX2 language models, including pretrained and instruction-tuned models of 2 sizes, including 2B, 9B. • 7 items • Updated Feb 7 • 23

upvoted a collection 10 months ago

Multilingual LLM Evaluation

Multilingual Evaluation Benchmarks • 8 items • Updated Jul 31 • 27

upvoted a collection over 1 year ago

IrokoBench

a human-translated benchmark dataset for 16 African languages covering three tasks: NLI, MMLU and MGSM • 6 items • Updated May 31, 2024 • 21

upvoted a paper over 1 year ago

Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

Paper • 2404.14219 • Published Apr 22, 2024 • 257

upvoted an article over 1 year ago

Article

Fine-tune Llama 3 with ORPO

By

•

Apr 22, 2024

• 239

upvoted 8 papers over 1 year ago

HyperCLOVA X Technical Report

Paper • 2404.01954 • Published Apr 2, 2024 • 25

OFA: A Framework of Initializing Unseen Subword Embeddings for Efficient Large-scale Multilingual Continued Pretraining

Paper • 2311.08849 • Published Nov 15, 2023 • 5

GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection

Paper • 2403.03507 • Published Mar 6, 2024 • 189

Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models

Paper • 2402.19427 • Published Feb 29, 2024 • 56

LexC-Gen: Generating Data for Extremely Low-Resource Languages with Large Language Models and Bilingual Lexicons

Paper • 2402.14086 • Published Feb 21, 2024 • 12

SpiRit-LM: Interleaved Spoken and Written Language Model

Paper • 2402.05755 • Published Feb 8, 2024 • 15

SeaLLMs -- Large Language Models for Southeast Asia

Paper • 2312.00738 • Published Dec 1, 2023 • 25

OWSM v3.1: Better and Faster Open Whisper-Style Speech Models based on E-Branchformer

Paper • 2401.16658 • Published Jan 30, 2024 • 14

upvoted 3 papers almost 2 years ago

MaLA-500: Massive Language Adaptation of Large Language Models

Paper • 2401.13303 • Published Jan 24, 2024 • 12

Multilingual Instruction Tuning With Just a Pinch of Multilinguality

Paper • 2401.01854 • Published Jan 3, 2024 • 11

MEDITRON-70B: Scaling Medical Pretraining for Large Language Models

Paper • 2311.16079 • Published Nov 27, 2023 • 19

upvoted a paper over 2 years ago

Prompting Large Language Models with Speech Recognition Abilities

Paper • 2307.11795 • Published Jul 21, 2023 • 17