rail-berkeley committed
Commit 8f87d58 • 1 Parent(s): c00abd5
Update README.md
README.md CHANGED
@@ -1,3 +1,7 @@
+---
+license: mit
+pipeline_tag: robotics
+---
 # Octo Base
 This model is trained with a window size of 2, predicting 7-dimensional actions 4 steps into the future using a diffusion policy. The model is a Transformer with 93M parameters (equivalent to a ViT-B). Images are tokenized by preprocessing with a lightweight convolutional encoder, then grouped into 16x16 patches. Language is tokenized by applying the T5 tokenizer, and then applying the T5-Base language encoder.
 
@@ -27,7 +31,7 @@ Tasks:
 At inference, you may pass in any subset of these observation and task keys, with a history window up to 2 timesteps.
 
 
-This model was trained on a mix of datasets from the Open X-Embodiment dataset
+This model was trained on a mix of datasets from the Open X-Embodiment dataset.
 
 | Dataset | Proportion of batch |
 |------------------------------------------------------------|---------------------|
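
For context beyond the diff: the Octo codebase ships a Python API for loading this checkpoint. A minimal sketch, assuming the `octo` package from the Octo GitHub repository and JAX are installed; `get_pretty_spec()` prints the observation and task keys the checkpoint expects:

```python
# Minimal sketch: load the Octo-Base checkpoint from the Hugging Face Hub.
# Assumes the `octo` package (github.com/octo-models/octo) is installed.
from octo.model.octo_model import OctoModel

model = OctoModel.load_pretrained("hf://rail-berkeley/octo-base")

# Print the observation/task keys and shapes this checkpoint was trained on.
print(model.get_pretty_spec())
```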
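
The model card's note that any subset of observation and task keys may be passed, with a history window of up to 2 timesteps, translates to inference calls like the sketch below. The 256x256 image resolution, the `timestep_pad_mask` key name, and the exact `sample_actions` signature are assumptions based on the Octo example notebooks and may differ across package versions:

```python
import jax
import numpy as np

# Hypothetical inputs: a 2-step history from the primary camera only
# (any subset of the supported observation keys is allowed).
# The (batch, window, height, width, channels) shape convention and the
# 256x256 resolution are assumptions from the Octo examples.
observation = {
    "image_primary": np.zeros((1, 2, 256, 256, 3), dtype=np.uint8),
    "timestep_pad_mask": np.array([[True, True]]),  # both history steps valid
}

# Condition on a language instruction; a goal image could be passed instead.
task = model.create_tasks(texts=["pick up the spoon"])

# The diffusion head samples a chunk of 4 future 7-dimensional actions.
actions = model.sample_actions(observation, task, rng=jax.random.PRNGKey(0))
print(actions.shape)  # expected: (1, 4, 7)
```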