
Ken Tsui

kenhktsui

https://kenhktsui.github.io/

AI & ML interests

ML engineer, researcher. VLM, LLM benchmark. Opinions are my own.

Recent Activity

upvoted a paper 1 day ago

Less is More: Recursive Reasoning with Tiny Networks

Paper • 2510.04871 • Published 5 days ago • 289

authored a paper 6 days ago

MixtureVitae: Open Web-Scale Pretraining Dataset With High Quality Instruction and Reasoning Data Built from Permissive-First Text Sources

Paper • 2509.25531 • Published 12 days ago • 6

upvoted 3 papers 7 days ago

VLA-Adapter: An Effective Paradigm for Tiny-Scale Vision-Language-Action Model

Paper • 2509.09372 • Published about 1 month ago • 224

The Dragon Hatchling: The Missing Link between the Transformer and Models of the Brain

Paper • 2509.26507 • Published 11 days ago • 452

MixtureVitae: Open Web-Scale Pretraining Dataset With High Quality Instruction and Reasoning Data Built from Permissive-First Text Sources

Paper • 2509.25531 • Published 12 days ago • 6

liked a dataset 27 days ago

HuggingFaceFW/finepdfs

Viewer • Updated Sep 8 • 475M • 73.9k • 604

authored a paper 3 months ago

Self-Correction Bench: Revealing and Addressing the Self-Correction Blind Spot in LLMs

Paper • 2507.02778 • Published Jul 3 • 9

commented a paper 3 months ago

Self-Correction Bench: Revealing and Addressing the Self-Correction Blind Spot in LLMs

Paper • 2507.02778 • Published Jul 3 • 9

upvoted a paper 3 months ago

Self-Correction Bench: Revealing and Addressing the Self-Correction Blind Spot in LLMs

Paper • 2507.02778 • Published Jul 3 • 9

commented a paper 3 months ago

Self-Correction Bench: Revealing and Addressing the Self-Correction Blind Spot in LLMs

Paper • 2507.02778 • Published Jul 3 • 9

upvoted a paper 3 months ago

FineWeb2: One Pipeline to Scale Them All -- Adapting Pre-Training Data Processing to Every Language

Paper • 2506.20920 • Published Jun 26 • 71

published a dataset 3 months ago

kenhktsui/num_seq_bench

Viewer • Updated Aug 5, 2024 • 2.12k • 12

published an article 3 months ago

NumSeqBench: Benchmarking Inductive Reasoning in Language Models via Number Sequences

Article • Jul 3

updated 3 models 3 months ago

updated a model 4 months ago

kenhktsui/llm-data-textbook-quality-fasttext-classifier-v2

Text Classification • Updated Jun 26 • 859 • 28

upvoted a paper 4 months ago

Large Language Models are Locally Linear Mappings

Paper • 2505.24293 • Published May 30 • 14

liked a dataset 4 months ago

ontocord/MixtureVitae-211BT

Viewer • Updated 2 days ago • 60M • 1.4k • 10

liked a model 4 months ago

lerobot/smolvla_base

Robotics • Updated 1 day ago • 28.7k • 277
