OptimalScale

university

https://github.com/OptimalScale

OptimalScale

optimalscale

Activity Feed Request to join this org

AI & ML interests

Large foundation models, large language models.

Recent Activity

hendrydong authored a paper about 1 month ago

Offline Reinforcement Learning for LLM Multi-Step Reasoning

shizhediao authored a paper 2 months ago

Hymba: A Hybrid-head Architecture for Small Language Models

research4pan authored a paper 4 months ago

Personalized Visual Instruction Tuning

View all activity

OptimalScale's activity

hendrydong

authored a paper about 1 month ago

Offline Reinforcement Learning for LLM Multi-Step Reasoning

Paper • 2412.16145 • Published Dec 20, 2024 • 38

shizhediao

authored a paper 2 months ago

Hymba: A Hybrid-head Architecture for Small Language Models

Paper • 2411.13676 • Published Nov 20, 2024 • 41

research4pan

authored a paper 4 months ago

Personalized Visual Instruction Tuning

Paper • 2410.07113 • Published Oct 9, 2024 • 70

shizhediao

authored a paper 4 months ago

Grounded-VideoLLM: Sharpening Fine-grained Temporal Grounding in Video Large Language Models

Paper • 2410.03290 • Published Oct 4, 2024 • 7

hendrydong

authored a paper 4 months ago

MathHay: An Automated Benchmark for Long-Context Mathematical Reasoning in LLMs

Paper • 2410.04698 • Published Oct 7, 2024 • 13

hendrydong

authored a paper 6 months ago

ThinK: Thinner Key Cache by Query-Driven Pruning

Paper • 2407.21018 • Published Jul 30, 2024 • 31

shizhediao

authored 14 papers 7 months ago

LMFlow: An Extensible Toolkit for Finetuning and Inference of Large Foundation Models

Paper • 2306.12420 • Published Jun 21, 2023 • 2

Mixture-of-Domain-Adapters: Decoupling and Injecting Domain Knowledge to Pre-trained Language Models Memories

Paper • 2306.05406 • Published Jun 8, 2023

Automatic Prompt Augmentation and Selection with Chain-of-Thought from Labeled Data

Paper • 2302.12822 • Published Feb 24, 2023

R-Tuning: Teaching Large Language Models to Refuse Unknown Questions

Paper • 2311.09677 • Published Nov 16, 2023 • 3

Can We Verify Step by Step for Incorrect Answer Detection?

Paper • 2402.10528 • Published Feb 16, 2024

Plum: Prompt Learning using Metaheuristic

Paper • 2311.08364 • Published Nov 14, 2023

LISA: Layerwise Importance Sampling for Memory-Efficient Large Language Model Fine-Tuning

Paper • 2403.17919 • Published Mar 26, 2024 • 16

Arithmetic Control of LLMs for Diverse User Preferences: Directional Preference Alignment with Multi-Objective Rewards

Paper • 2402.18571 • Published Feb 28, 2024

Mitigating the Alignment Tax of RLHF

Paper • 2309.06256 • Published Sep 12, 2023

SaySelf: Teaching LLMs to Express Confidence with Self-Reflective Rationales

Paper • 2405.20974 • Published May 31, 2024

AI & ML interests

Recent Activity

Team members 5

OptimalScale's activity