14 9 95

TokenBender

https://buymeacoffee.com/tokenbender

AI & ML interests

Fine-tune small useful models, build datasets and anything related to local LLM hosting and serving.

Recent Activity

published a dataset 16 days ago

TokenBender/avataRL_openwebtext

updated a model 21 days ago

TokenBender/avataRL-critic

published a model 21 days ago

TokenBender/avataRL-critic

View all activity

Organizations

upvoted a collection 4 months ago

Llama Nemotron

Collection

Open, Production-ready Enterprise Models • 11 items • Updated 2 days ago • 67

upvoted a paper 4 months ago

Agentic Reasoning and Tool Integration for LLMs via Reinforcement Learning

Paper • 2505.01441 • Published Apr 28 • 39

upvoted an article 10 months ago

Article

Releasing the largest multilingual open pretraining dataset

and 2 others •

Nov 13, 2024

• 102

upvoted an article about 1 year ago

Article

Introduction to ggml

and 2 others •

Aug 13, 2024

• 235

upvoted a collection about 1 year ago

Gemma 2 2B Release

Collection

The 2.6B parameter version of Gemma 2. • 6 items • Updated Jul 10 • 81

upvoted a paper about 1 year ago

Instruction Pre-Training: Language Models are Supervised Multitask Learners

Paper • 2406.14491 • Published Jun 20, 2024 • 95

upvoted an article over 1 year ago

Article

StarCoder2-Instruct: Fully Transparent and Permissive Self-Alignment for Code Generation

and 8 others •

Apr 29, 2024

• 79

upvoted 2 papers about 2 years ago

LongNet: Scaling Transformers to 1,000,000,000 Tokens

Paper • 2307.02486 • Published Jul 5, 2023 • 81

MEGABYTE: Predicting Million-byte Sequences with Multiscale Transformers

Paper • 2305.07185 • Published May 12, 2023 • 9

TokenBender

AI & ML interests

Recent Activity

Organizations

TokenBender's activity

Releasing the largest multilingual open pretraining dataset

Introduction to ggml

StarCoder2-Instruct: Fully Transparent and Permissive Self-Alignment for Code Generation