charlesxu0124/RLDG


charlesxu0124 committed 14 days ago

Commit d85bcee • 1 parent: c834223

Update README.md

Files changed (1)

README.md +1 -1


@@ -7,4 +7,4 @@ base_model:
 This repository contains [OpenVLA](https://huggingface.co/openvla/openvla-7b) and [Octo](https://huggingface.co/rail-berkeley/octo-base) checkpoints fine-tuned on tele-op and RL generated datasets for the Connctor Insertion task in the paper [RLDG: Robotic Generalist Policy Distillation via Reinforcement Learning
 ](https://generalist-distillation.github.io/).
-Each model is trained on 45 episodes either collected by human tele-operation or rolling out converged RL policies. Each dataset contains 15 episodes for a VGA connector, 15 episodes for a USB-A connector, and 15 episodes for a ethernet connector. The models take 1 wrist camera and 1 language command as input, and outputs a 6D end-effector twist expressed in the wrist frame.

+Each model is trained on 45 episodes either collected by human tele-operation or rolling out converged RL policies. Each dataset contains 15 episodes for a VGA connector, 15 episodes for a USB-A connector, and 15 episodes for an ethernet connector. The models take 1 wrist camera and 1 language command as input, and outputs a 6D end-effector twist expressed in the wrist frame.
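
Note (not part of the diff): the README states that the models output a 6D end-effector twist expressed in the wrist frame. A controller that expects base-frame commands would need to re-express that twist first. The sketch below shows one way this might look, under loud assumptions: the component ordering `[vx, vy, vz, wx, wy, wz]`, the function names, and the availability of a wrist-to-base rotation matrix are all hypothetical and not confirmed by the repository. It only rotates the coordinate axes and keeps the same reference point on the end effector.

```python
from typing import List, Sequence

Matrix3 = Sequence[Sequence[float]]


def rotate(R: Matrix3, v: Sequence[float]) -> List[float]:
    """Multiply a 3x3 rotation matrix by a 3-vector."""
    return [sum(R[i][j] * v[j] for j in range(3)) for i in range(3)]


def twist_wrist_to_base_axes(R_base_wrist: Matrix3,
                             twist_wrist: Sequence[float]) -> List[float]:
    """Re-express a 6D twist, given in wrist-frame axes, in base-frame axes.

    ASSUMPTION: the twist is ordered [vx, vy, vz, wx, wy, wz]; the model
    card does not state the ordering. R_base_wrist is the wrist orientation
    in the base frame (hypothetical input, e.g. from forward kinematics).
    """
    if len(twist_wrist) != 6:
        raise ValueError("expected a 6D twist")
    linear = rotate(R_base_wrist, twist_wrist[:3])
    angular = rotate(R_base_wrist, twist_wrist[3:])
    return linear + angular


# Example: wrist rotated 90 degrees about the base z-axis.
R_example = [
    [0.0, -1.0, 0.0],
    [1.0, 0.0, 0.0],
    [0.0, 0.0, 1.0],
]
twist_example = [0.1, 0.0, 0.0, 0.0, 0.0, 0.5]
twist_in_base = twist_wrist_to_base_axes(R_example, twist_example)
```

Here a 0.1 forward velocity along the wrist x-axis becomes a 0.1 velocity along the base y-axis, while the rotation about z is unchanged. If the actual ordering or frame convention differs, the slicing would need to be adjusted accordingly.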