What will happen if we train a Q function for digital agents?
HAO BAI
JackBAI
AI & ML interests
Representation learning, language models.
Recent Activity
liked
a model
about 10 hours ago
Qwen/Qwen2.5-VL-7B-Instruct
authored
a paper
17 days ago
Fine-Tuning Large Vision-Language Models as Decision-Making Agents via
Reinforcement Learning
updated
a collection
23 days ago
Digi-Q
Organizations
Collections
2
models
18

JackBAI/aitw-general-digiq-agent
Updated

JackBAI/aitw-webshop-digiq-agent
Updated

JackBAI/llava-v1.5-7b-sfted-pad-inputtext
Updated

JackBAI/CRATE-GPT-12L-Pile-600000steps
Updated

JackBAI/webshop-off2on-filteredbc
Updated
•
2

JackBAI/general-off2on-filteredbc
Updated
•
1

JackBAI/general-off2on-digirl
Updated
•
2

JackBAI/webshop-off2on-digirl
Updated
•
2

JackBAI/crate-3l-l0-sae-1x
Updated

JackBAI/crate-1l-l0-sae-1x
Updated
datasets
6
JackBAI/autoui-zeroshot-trajectories
Preview
•
Updated
•
100
JackBAI/pile_uncopyrighted_bin
Updated
•
7
JackBAI/bert_pretrain_datasets
Viewer
•
Updated
•
80.5M
•
1.44k
•
1
JackBAI/redbajama-sampled
Viewer
•
Updated
•
24.3M
•
4.61k
JackBAI/merged_roberta_dataset
Updated
•
36
JackBAI/chatgpt-woi-finetune
Preview
•
Updated
•
89
•
3