Lukman Aliyu

lukmanaj

AI & ML interests

AI in healthcare | NLP | Computer Vision

Recent Activity

Organizations

HausaNLP's profile picture ZeroGPU Explorers's profile picture HausaNLP EA-MT Semeval's profile picture

lukmanaj's activity

upvoted an article 5 months ago
view article
Article

Fine-tune Llama 3.1 Ultra-Efficiently with Unsloth

By mlabonne
266
reacted to its5Q's post with 👍 5 months ago
view post
Post
1351
Continuing my streak by releasing the Wikireading dataset: a large collection of scraped non-fiction books predominantly in Russian language.
its5Q/wikireading

Here's the highlights:
- ~7B tokens, or ~28B characters, making it a great candidate for use in pretraining
- Contains non-fiction works from many knowledge domains
- Includes both the original HTML and extracted text of book chapters