p3nGu1nZz commited on
Commit
f048d67
1 Parent(s): ffd41ae

initial model

Browse files
This view is limited to 50 files because it contains too many changes.   See raw diff
Files changed (50) hide show
  1. README.md +138 -0
  2. results/tau_agent_A1_2M/Tau-A1-2M.onnx +3 -0
  3. results/tau_agent_A1_2M/checkpoints/TauAgent-1199744.onnx +3 -0
  4. results/tau_agent_A1_2M/checkpoints/TauAgent-1199744.pt +3 -0
  5. results/tau_agent_A1_2M/checkpoints/TauAgent-1299958.onnx +3 -0
  6. results/tau_agent_A1_2M/checkpoints/TauAgent-1299958.pt +3 -0
  7. results/tau_agent_A1_2M/checkpoints/TauAgent-1399744.onnx +3 -0
  8. results/tau_agent_A1_2M/checkpoints/TauAgent-1399744.pt +3 -0
  9. results/tau_agent_A1_2M/checkpoints/TauAgent-1499776.onnx +3 -0
  10. results/tau_agent_A1_2M/checkpoints/TauAgent-1499776.pt +3 -0
  11. results/tau_agent_A1_2M/checkpoints/TauAgent-1599808.onnx +3 -0
  12. results/tau_agent_A1_2M/checkpoints/TauAgent-1599808.pt +3 -0
  13. results/tau_agent_A1_2M/checkpoints/TauAgent-1699840.onnx +3 -0
  14. results/tau_agent_A1_2M/checkpoints/TauAgent-1699840.pt +3 -0
  15. results/tau_agent_A1_2M/checkpoints/TauAgent-1799808.onnx +3 -0
  16. results/tau_agent_A1_2M/checkpoints/TauAgent-1799808.pt +3 -0
  17. results/tau_agent_A1_2M/checkpoints/TauAgent-1899840.onnx +3 -0
  18. results/tau_agent_A1_2M/checkpoints/TauAgent-1899840.pt +3 -0
  19. results/tau_agent_A1_2M/checkpoints/TauAgent-1999872.onnx +3 -0
  20. results/tau_agent_A1_2M/checkpoints/TauAgent-1999872.pt +3 -0
  21. results/tau_agent_A1_2M/checkpoints/TauAgent-2005504.onnx +3 -0
  22. results/tau_agent_A1_2M/checkpoints/TauAgent-2005504.pt +3 -0
  23. results/tau_agent_A1_2M/checkpoints/checkpoint.pt +3 -0
  24. results/tau_agent_A1_2M/configuration.yaml +93 -0
  25. results/tau_agent_A3_1M/Tau-A3-1M.onnx +3 -0
  26. results/tau_agent_A3_1M/checkpoints/TauAgent-1001575.onnx +3 -0
  27. results/tau_agent_A3_1M/checkpoints/TauAgent-1001575.pt +3 -0
  28. results/tau_agent_A3_1M/checkpoints/TauAgent-12324.onnx +3 -0
  29. results/tau_agent_A3_1M/checkpoints/TauAgent-12324.pt +3 -0
  30. results/tau_agent_A3_1M/checkpoints/TauAgent-199903.onnx +3 -0
  31. results/tau_agent_A3_1M/checkpoints/TauAgent-199903.pt +3 -0
  32. results/tau_agent_A3_1M/checkpoints/TauAgent-28282.onnx +3 -0
  33. results/tau_agent_A3_1M/checkpoints/TauAgent-28282.pt +3 -0
  34. results/tau_agent_A3_1M/checkpoints/TauAgent-299879.onnx +3 -0
  35. results/tau_agent_A3_1M/checkpoints/TauAgent-299879.pt +3 -0
  36. results/tau_agent_A3_1M/checkpoints/TauAgent-399831.onnx +3 -0
  37. results/tau_agent_A3_1M/checkpoints/TauAgent-399831.pt +3 -0
  38. results/tau_agent_A3_1M/checkpoints/TauAgent-499989.onnx +3 -0
  39. results/tau_agent_A3_1M/checkpoints/TauAgent-499989.pt +3 -0
  40. results/tau_agent_A3_1M/checkpoints/TauAgent-599755.onnx +3 -0
  41. results/tau_agent_A3_1M/checkpoints/TauAgent-599755.pt +3 -0
  42. results/tau_agent_A3_1M/checkpoints/TauAgent-699907.onnx +3 -0
  43. results/tau_agent_A3_1M/checkpoints/TauAgent-699907.pt +3 -0
  44. results/tau_agent_A3_1M/checkpoints/TauAgent-799975.onnx +3 -0
  45. results/tau_agent_A3_1M/checkpoints/TauAgent-799975.pt +3 -0
  46. results/tau_agent_A3_1M/checkpoints/TauAgent-899787.onnx +3 -0
  47. results/tau_agent_A3_1M/checkpoints/TauAgent-899787.pt +3 -0
  48. results/tau_agent_A3_1M/checkpoints/TauAgent-999987.onnx +3 -0
  49. results/tau_agent_A3_1M/checkpoints/TauAgent-999987.pt +3 -0
  50. results/tau_agent_A3_1M/checkpoints/checkpoint.pt +3 -0
README.md CHANGED
@@ -1,3 +1,141 @@
1
  ---
2
  license: mit
3
  ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
  license: mit
3
  ---
4
+
5
+ # Tau LLM Unity ML Agents Project
6
+
7
+ Welcome to the Tau LLM Unity ML Agents Project repository! This project focuses on training reinforcement learning agents using Unity ML-Agents and the PPO algorithm. Our goal is to optimize the performance of the agents through various configurations and training runs.
8
+
9
+ ## Project Overview
10
+
11
+ This repository contains the code and configurations for training agents in a Unity environment using the Proximal Policy Optimization (PPO) algorithm. The agents are designed to learn and adapt to their environment, improving their performance over time.
12
+
13
+ ### Key Features
14
+
15
+ - **Reinforcement Learning**: Utilizes the PPO algorithm for training agents.
16
+ - **Unity ML-Agents**: Integrates with Unity ML-Agents for a seamless training experience.
17
+ - **Custom Reward Functions**: Implements gradient-based reward functions for nuanced feedback.
18
+ - **Memory Networks**: Incorporates memory networks to handle temporal dependencies.
19
+ - **TensorBoard Integration**: Monitors training progress and performance using TensorBoard.
20
+
21
+ ## Configuration
22
+
23
+ Below is the configuration used for training the agents:
24
+
25
+ ```yaml
26
+ behaviors:
27
+ TauAgent:
28
+ trainer_type: ppo
29
+ hyperparameters:
30
+ batch_size: 256
31
+ buffer_size: 4096
32
+ learning_rate: 0.00003
33
+ beta: 0.005
34
+ epsilon: 0.2
35
+ lambd: 0.95
36
+ num_epoch: 10
37
+ learning_rate_schedule: linear
38
+ network_settings:
39
+ normalize: true
40
+ hidden_units: 256
41
+ num_layers: 4
42
+ vis_encode_type: simple
43
+ memory:
44
+ memory_size: 256
45
+ sequence_length: 256
46
+ num_layers: 4
47
+ reward_signals:
48
+ extrinsic:
49
+ gamma: 0.99
50
+ strength: 1.0
51
+ curiosity:
52
+ gamma: 0.995
53
+ strength: 0.1
54
+ network_settings:
55
+ normalize: true
56
+ hidden_units: 256
57
+ num_layers: 4
58
+ learning_rate: 0.00003
59
+ keep_checkpoints: 10
60
+ checkpoint_interval: 100000
61
+ threaded: true
62
+ max_steps: 3000000
63
+ time_horizon: 256
64
+ summary_freq: 10000
65
+ ```
66
+
67
+ ## Model Naming Convention
68
+
69
+ The models in this repository follow the naming convention `Tau_<series>_<max_steps>`. This helps in easily identifying the series and the number of training steps for each model.
70
+
71
+ ## Getting Started
72
+
73
+ ### Prerequisites
74
+
75
+ - Unity 6
76
+ - Unity ML-Agents Toolkit
77
+ - Python 3.10.11
78
+ - PyTorch
79
+ - Transformers
80
+
81
+ ### Installation
82
+
83
+ 1. Clone the repository:
84
+ ```bash
85
+ git clone https://github.com/yourusername/tau-llm-unity-ml-agents.git
86
+ cd tau-llm-unity-ml-agents
87
+ ```
88
+
89
+ 2. Install the required Python packages:
90
+ ```bash
91
+ pip install -r requirements.txt
92
+ ```
93
+
94
+ 3. Open the Unity project:
95
+ - Launch Unity Hub and open the project folder.
96
+
97
+ ### Training the Agent
98
+
99
+ To start training the agent, run the following command:
100
+ ```bash
101
+ mlagents-learn config/trainer_config.yaml --run-id=run1
102
+ ```
103
+
104
+ ### Monitoring Training
105
+
106
+ You can monitor the training progress using TensorBoard:
107
+ ```bash
108
+ tensorboard --logdir=results --port=6006
109
+ ```
110
+
111
+ ## Results
112
+
113
+ The training results, including the average reward and cumulative reward, can be visualized using TensorBoard. The graphs below show the performance of the agent over time:
114
+
115
+ ![Average Reward](path/to/average_reward.png)
116
+ ![Cumulative Reward](path/to/cumulative_reward.png)
117
+
118
+ ## Citation
119
+
120
+ If you use this project in your research, please cite it as follows:
121
+
122
+ ```bibtex
123
+ @misc{Tau,
124
+ author = {K. Rawson},
125
+ title = {Tau LLM Unity ML Agents Project},
126
+ year = {2024},
127
+ publisher = {GitHub},
128
+ journal = {GitHub repository},
129
+ howpublished = {\url{https://github.com/p3nGu1nZz/Tau}},
130
+ }
131
+ ```
132
+
133
+ ## License
134
+
135
+ This project is licensed under the MIT License - see the [LICENSE](LICENSE) file for details.
136
+
137
+ ## Acknowledgments
138
+
139
+ - Unity ML-Agents Toolkit
140
+ - TensorFlow and PyTorch communities
141
+ - Hugging Face for hosting the model repository
results/tau_agent_A1_2M/Tau-A1-2M.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:08931e19cffa93c14fed86e9bb88278424715303928d4761bf3dcc257fdde73d
3
+ size 2186395
results/tau_agent_A1_2M/checkpoints/TauAgent-1199744.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:b24d7a70f3f708362ccd3b35ccbf309d81696c379a5c2111810676ffda6c9c3d
3
+ size 2186395
results/tau_agent_A1_2M/checkpoints/TauAgent-1199744.pt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:4f556a11f0fea58e2cc679cf2f9ad6e86403425ced6496f25188080f8f29bc8e
3
+ size 15534256
results/tau_agent_A1_2M/checkpoints/TauAgent-1299958.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:bb6f5d1ee696b00963d7cb00a10b924fdf123f5eb46b618ba006117c7d843919
3
+ size 2186395
results/tau_agent_A1_2M/checkpoints/TauAgent-1299958.pt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:0c0244046f46126c14c82f60ef46b799325fb0f35205cec04ccec9141784a93c
3
+ size 15534256
results/tau_agent_A1_2M/checkpoints/TauAgent-1399744.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:9d0b419212303e89f05b6d735d04bb392166df4dd491fd0036ea2fce40a3abd6
3
+ size 2186395
results/tau_agent_A1_2M/checkpoints/TauAgent-1399744.pt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:c120a1fa8108b6b9558d9667fe949808b3e16394d499693e20392d2ea1f6c28e
3
+ size 15534256
results/tau_agent_A1_2M/checkpoints/TauAgent-1499776.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:f424ea2d9e633119050d04d95e1079bee5e8c3a1a9fee31282ca95855bd7d885
3
+ size 2186395
results/tau_agent_A1_2M/checkpoints/TauAgent-1499776.pt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:912696d11a6837fc71783e311bed38195e1dda57fe4123a64141db5e96083ba3
3
+ size 15534256
results/tau_agent_A1_2M/checkpoints/TauAgent-1599808.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:56f777fafa9cc0919950a0231834f75954a17c4e07b9bcf7c6b2b3dbc5426c41
3
+ size 2186395
results/tau_agent_A1_2M/checkpoints/TauAgent-1599808.pt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:f311e89bfe1d1fa8a578efe91d9ceafece8fa50a349a0721013634ff0e664ef9
3
+ size 15534256
results/tau_agent_A1_2M/checkpoints/TauAgent-1699840.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:90533537979d9abeb815e499a777474c4c0c66e3068c3e9de39c17512f6cd35c
3
+ size 2186395
results/tau_agent_A1_2M/checkpoints/TauAgent-1699840.pt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:09f5ead3bb039e49e211a2ca8d7afa788223c1ed2b9883342efc46ef66799982
3
+ size 15534256
results/tau_agent_A1_2M/checkpoints/TauAgent-1799808.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:de1ea3ba5c8d90ce7be467ee2871441c2dbb220e8761d14b8c3d70439bc9ad7b
3
+ size 2186395
results/tau_agent_A1_2M/checkpoints/TauAgent-1799808.pt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:32def0f45b71e638ddd1ad302b620df2526e4099b0bc65c9f4b1ec7a2737b092
3
+ size 15534256
results/tau_agent_A1_2M/checkpoints/TauAgent-1899840.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:9677e46c8e368b0e5f5a3aa982ca3949bf1f4489fa58ae55cea8801e56563aba
3
+ size 2186395
results/tau_agent_A1_2M/checkpoints/TauAgent-1899840.pt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:a9cacd7cdf76e019c35c7618a719b7c58d5b468f7a47d136dc4d1dcea7ede6b7
3
+ size 15534256
results/tau_agent_A1_2M/checkpoints/TauAgent-1999872.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:5eb6959372271405f646cec449cddcc6d19f604a7d02b5422b02aa7035aa9906
3
+ size 2186395
results/tau_agent_A1_2M/checkpoints/TauAgent-1999872.pt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:cfc7b6a20afe2601acf9523584f01e56a6a62f274dff5066b5b53ae4621953aa
3
+ size 15534256
results/tau_agent_A1_2M/checkpoints/TauAgent-2005504.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:08931e19cffa93c14fed86e9bb88278424715303928d4761bf3dcc257fdde73d
3
+ size 2186395
results/tau_agent_A1_2M/checkpoints/TauAgent-2005504.pt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:260196c3491aca9156c57f3d29bba9cb40b9655acd53407f09075f191df035ae
3
+ size 15534256
results/tau_agent_A1_2M/checkpoints/checkpoint.pt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:37080da1574efbcf39b5938261f36738c44437f2ceb72501058d86a7ffe8d386
3
+ size 15533332
results/tau_agent_A1_2M/configuration.yaml ADDED
@@ -0,0 +1,93 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ default_settings: null
2
+ behaviors:
3
+ TauAgent:
4
+ trainer_type: ppo
5
+ hyperparameters:
6
+ batch_size: 256
7
+ buffer_size: 4096
8
+ learning_rate: 3.0e-05
9
+ beta: 0.005
10
+ epsilon: 0.2
11
+ lambd: 0.95
12
+ num_epoch: 7
13
+ shared_critic: false
14
+ learning_rate_schedule: linear
15
+ beta_schedule: linear
16
+ epsilon_schedule: linear
17
+ checkpoint_interval: 100000
18
+ network_settings:
19
+ normalize: true
20
+ hidden_units: 256
21
+ num_layers: 4
22
+ vis_encode_type: simple
23
+ memory:
24
+ sequence_length: 256
25
+ memory_size: 256
26
+ goal_conditioning_type: hyper
27
+ deterministic: false
28
+ reward_signals:
29
+ extrinsic:
30
+ gamma: 0.99
31
+ strength: 1.0
32
+ network_settings:
33
+ normalize: false
34
+ hidden_units: 128
35
+ num_layers: 2
36
+ vis_encode_type: simple
37
+ memory: null
38
+ goal_conditioning_type: hyper
39
+ deterministic: false
40
+ curiosity:
41
+ gamma: 0.995
42
+ strength: 0.1
43
+ network_settings:
44
+ normalize: true
45
+ hidden_units: 256
46
+ num_layers: 4
47
+ vis_encode_type: simple
48
+ memory: null
49
+ goal_conditioning_type: hyper
50
+ deterministic: false
51
+ learning_rate: 0.0003
52
+ encoding_size: null
53
+ init_path: null
54
+ keep_checkpoints: 10
55
+ even_checkpoints: false
56
+ max_steps: 2000000
57
+ time_horizon: 256
58
+ summary_freq: 10000
59
+ threaded: true
60
+ self_play: null
61
+ behavioral_cloning: null
62
+ env_settings:
63
+ env_path: .\Build
64
+ env_args: null
65
+ base_port: 5005
66
+ num_envs: 1
67
+ num_areas: 1
68
+ timeout_wait: 300
69
+ seed: -1
70
+ max_lifetime_restarts: 10
71
+ restarts_rate_limit_n: 1
72
+ restarts_rate_limit_period_s: 60
73
+ engine_settings:
74
+ width: 84
75
+ height: 84
76
+ quality_level: 5
77
+ time_scale: 20
78
+ target_frame_rate: -1
79
+ capture_frame_rate: 60
80
+ no_graphics: false
81
+ environment_parameters: null
82
+ checkpoint_settings:
83
+ run_id: tau_agent_ppo_A1
84
+ initialize_from: null
85
+ load_model: false
86
+ resume: false
87
+ force: true
88
+ train_model: false
89
+ inference: false
90
+ results_dir: results
91
+ torch_settings:
92
+ device: cuda
93
+ debug: false
results/tau_agent_A3_1M/Tau-A3-1M.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:9c5d20133e25c7b1f17c9fe045373f12448adfba0a341d2ce0ab683dc0a505e9
3
+ size 1983173
results/tau_agent_A3_1M/checkpoints/TauAgent-1001575.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:9c5d20133e25c7b1f17c9fe045373f12448adfba0a341d2ce0ab683dc0a505e9
3
+ size 1983173
results/tau_agent_A3_1M/checkpoints/TauAgent-1001575.pt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:2c1e3f84038d1be138512d0c2b168f8fed07c9e123765c992dbb88ce19ad9729
3
+ size 23269214
results/tau_agent_A3_1M/checkpoints/TauAgent-12324.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:eaf958d6b87edcf7ebcfbfe28296a04bbdb8f8a5fad6aa1b8f23bcf747cd89d1
3
+ size 1983173
results/tau_agent_A3_1M/checkpoints/TauAgent-12324.pt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:c98eb2737900135b3e52175a3b2c94a6d3bd3c88d1b85667f32af788be5b6075
3
+ size 23268710
results/tau_agent_A3_1M/checkpoints/TauAgent-199903.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:4f024ccd7748fde0b0b7e8fa59d06951bf15fcefb555457746ee03d2f7a90bbc
3
+ size 1983173
results/tau_agent_A3_1M/checkpoints/TauAgent-199903.pt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:1b77bbf92946f07c4ea7dba982ce78ad57c00ee5a9bca7edd7271d7918bbbda7
3
+ size 23268962
results/tau_agent_A3_1M/checkpoints/TauAgent-28282.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:cd48d742cff7e006aa71e8dcb2c31c31c74bc9ae03d54b76559c3b3dd8745c61
3
+ size 1983173
results/tau_agent_A3_1M/checkpoints/TauAgent-28282.pt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:8513da1f1353113fd99c51dd85e61f57e4dfb4b8d31b644d0f7af73ac4c3b47e
3
+ size 23268710
results/tau_agent_A3_1M/checkpoints/TauAgent-299879.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:3ff77dadbb9a1fafc12a4946bdd5c63f4e6f43fa1763cff4dd0929fa3d499a2b
3
+ size 1983173
results/tau_agent_A3_1M/checkpoints/TauAgent-299879.pt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:b03039f2bdfbaa4625a6f13b8580c72c43bef5bfd9025e9f915e2bf3a51b8dfa
3
+ size 23268962
results/tau_agent_A3_1M/checkpoints/TauAgent-399831.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:f0f215196437e7d6a5e1757fb59a21b4ea1cda1a3b7af937a427659a375b9a0d
3
+ size 1983173
results/tau_agent_A3_1M/checkpoints/TauAgent-399831.pt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:a0bd14a4a2ab99d335f034d318f7c372770c62ea615a4e2804bd8994de6b8050
3
+ size 23268962
results/tau_agent_A3_1M/checkpoints/TauAgent-499989.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:187b76bb76c34cba0dac85f49f58a7ea5bde3e4a23f9bfc899e2023cdbba3e70
3
+ size 1983173
results/tau_agent_A3_1M/checkpoints/TauAgent-499989.pt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:782a3ce94817b5f93f82f245128eabd0e5944a6f1ce158921c7cb500cc53c1d2
3
+ size 23268962
results/tau_agent_A3_1M/checkpoints/TauAgent-599755.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:545ee682b7953e6ed53cb351c7e17d16fcd665bf5d9a9f3cb1e32452bceaa760
3
+ size 1983173
results/tau_agent_A3_1M/checkpoints/TauAgent-599755.pt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:53a8c5e1897e7aa448878fc936a0b0968addbffe7e0b73cfa17999fd75de4aa6
3
+ size 23268962
results/tau_agent_A3_1M/checkpoints/TauAgent-699907.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:f5f5612c9190f8d25f0b2e1ae27d72aca029f7962cd1d1c9c2605296389979b4
3
+ size 1983173
results/tau_agent_A3_1M/checkpoints/TauAgent-699907.pt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:01a48a1e97204e86b79e57dde69e65a2e156df2159595e7583f69355a2726e41
3
+ size 23268962
results/tau_agent_A3_1M/checkpoints/TauAgent-799975.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:3946bf8a6a03014256bedd19b4965ec552903ba24a711ac5b3ae5bfd96e18f82
3
+ size 1983173
results/tau_agent_A3_1M/checkpoints/TauAgent-799975.pt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:50854837ea697627700f4b23582acd5083cbc41be015f5177bec6fa417a75141
3
+ size 23268962
results/tau_agent_A3_1M/checkpoints/TauAgent-899787.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:3834f38a6da9d23a037c64da280ad1b87c77d9228ffc461ccc7c312a0f147c38
3
+ size 1983173
results/tau_agent_A3_1M/checkpoints/TauAgent-899787.pt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:5c4867e2a04b63b7412b26745df14b83b2a4465dfcd36b37018158fa2b685661
3
+ size 23268962
results/tau_agent_A3_1M/checkpoints/TauAgent-999987.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:3f599e796aa05601ba2a7a5c54f4c702594601fa5c5d3861f60964129a4d4109
3
+ size 1983173
results/tau_agent_A3_1M/checkpoints/TauAgent-999987.pt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:5d7607c7b79859970d0162c13f77a89730ac548341ae0438001612c2ab0745ab
3
+ size 23268962
results/tau_agent_A3_1M/checkpoints/checkpoint.pt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:94b764243f51b496724081db5f3d7a11d270dc706ec1c5635abfc81dc20cfb0b
3
+ size 23267702