Bo Liu's picture

Building on HF

Bo Liu

Benjamin-eecs

·

AI & ML interests

None yet

Recent Activity

authored a paper about 1 month ago

From $P(y|x)$ to $P(y)$: Investigating Reinforcement Learning in Pre-train Space

authored a paper 2 months ago

Reasoning over mathematical objects: on-policy reward modeling and test time aggregation

upvoted a paper 2 months ago

Reasoning over mathematical objects: on-policy reward modeling and test time aggregation

View all activity

Organizations

Collections 1

models 2

Benjamin-eecs/Llama-3.1-8B-Instruct-NLRL-TicTacToe-Policy

Feature Extraction • 8B • Updated Nov 24, 2024 • 2

Benjamin-eecs/Llama-3.1-8B-Instruct-NLRL-TicTacToe-Value

Feature Extraction • 8B • Updated Nov 24, 2024 • 3

datasets 0

None public yet