Hamish Ivison's picture

Hamish Ivison

hamishivi

·

https://ivison.id.au

AI & ML interests

NLP :)

Recent Activity

updated a model 1 day ago

hamishivi/Qwen3.5-2B

updated a model 5 days ago

hamishivi/swerl_qwen35_9b_fp32lm_dppo_cli

updated a model 5 days ago

hamishivi/swerl_qwen35_9b_fp32lm_dppo_endless

View all activity

Organizations

upvoted a paper 4 months ago

Golden Goose: A Simple Trick to Synthesize Unlimited RLVR Tasks from Unverifiable Internet Text

Paper • 2601.22975 • Published Jan 30 • 113

upvoted a paper 6 months ago

ToolOrchestra: Elevating Intelligence via Efficient Model and Tool Orchestration

Paper • 2511.21689 • Published Nov 26, 2025 • 128

upvoted 2 papers 7 months ago

DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research

Paper • 2511.19399 • Published Nov 24, 2025 • 63

RLVE: Scaling Up Reinforcement Learning for Language Models with Adaptive Verifiable Environments

Paper • 2511.07317 • Published Nov 10, 2025 • 18

upvoted a collection 10 months ago

massive-serve

One command to download and serve a datastore---that's it 😎. https://github.com/RulinShao/massive-serve • 8 items • Updated Nov 18, 2025 • 2

upvoted 2 papers over 1 year ago

TESS 2: A Large-Scale Generalist Diffusion Language Model

Paper • 2502.13917 • Published Feb 19, 2025 • 6

TÜLU 3: Pushing Frontiers in Open Language Model Post-Training

Paper • 2411.15124 • Published Nov 22, 2024 • 68

upvoted 3 collections over 1 year ago

Tulu 3 Models

All models released with Tulu 3 -- state of the art open post-training recipes. • 11 items • Updated Dec 23, 2025 • 103

Tulu 3 Datasets

All datasets released with Tulu 3 -- state of the art open post-training recipes. • 32 items • Updated Mar 2 • 98

OLMo 2

Artifacts for the OLMo 2 release. • 35 items • Updated Mar 3 • 157

upvoted a paper almost 2 years ago

OLMoE: Open Mixture-of-Experts Language Models

Paper • 2409.02060 • Published Sep 3, 2024 • 81

upvoted a collection almost 2 years ago

Tulu V2.5 Suite

A suite of models trained using DPO and PPO across a wide variety (up to 14) of preference datasets. See https://arxiv.org/abs/2406.09279 for more! • 44 items • Updated Dec 23, 2025 • 16

upvoted a paper almost 2 years ago

Husky: A Unified, Open-Source Language Agent for Multi-Step Reasoning

Paper • 2406.06469 • Published Jun 10, 2024 • 29

upvoted a collection about 2 years ago

SciRIFF

Data and models to enhance instruction-following for scientific literature understanding. • 4 items • Updated Mar 2 • 10

upvoted a collection over 2 years ago

OLMo Suite

Artifacts for the first set of OLMo models. • 18 items • Updated Dec 23, 2025 • 76

upvoted a paper over 2 years ago

OLMo: Accelerating the Science of Language Models

Paper • 2402.00838 • Published Feb 1, 2024 • 86

upvoted a collection over 2 years ago

Paloma

Dataset and baseline models for Paloma, a benchmark of language model fit to 546 textual domains • 8 items • Updated Dec 23, 2025 • 17

upvoted a paper over 2 years ago

Camels in a Changing Climate: Enhancing LM Adaptation with Tulu 2

Paper • 2311.10702 • Published Nov 17, 2023 • 19

upvoted a collection over 2 years ago

Tulu V2 Suite

The set of models associated with the paper "Camels in a Changing Climate: Enhancing LM Adaptation with Tulu 2" • 19 items • Updated Dec 23, 2025 • 46