2024-07-10 18:02:50 | INFO | model_worker | args: Namespace(host='0.0.0.0', port=40006, worker_address='http://10.140.60.25:40006', controller_address='http://10.140.60.209:10075', model_path='share_internvl/InternVL2-40B/', model_name=None, device='auto', limit_model_concurrency=5, stream_interval=1, load_8bit=False)
2024-07-10 18:02:50 | INFO | model_worker | Loading the model InternVL2-40B on worker 30a4c1 ...
2024-07-10 18:02:50 | WARNING | transformers.tokenization_utils_base | Special tokens have been added in the vocabulary, make sure the associated word embeddings are fine-tuned or trained.
2024-07-10 18:02:50 | WARNING | transformers.tokenization_utils_base | Special tokens have been added in the vocabulary, make sure the associated word embeddings are fine-tuned or trained.
2024-07-10 18:02:52 | ERROR | stderr | /mnt/petrelfs/wangweiyun/miniconda3/envs/internvl/lib/python3.10/site-packages/transformers/generation/configuration_utils.py:397: UserWarning: `do_sample` is set to `False`. However, `top_p` is set to `None` -- this flag is only used in sample-based generation modes. You should set `do_sample=True` or unset `top_p`. This was detected when initializing the generation config instance, which means the corresponding file may hold incorrect parameterization and should be fixed.
2024-07-10 18:02:52 | ERROR | stderr | warnings.warn(
2024-07-10 18:02:56 | ERROR | stderr | Loading checkpoint shards:   0%|          | 0/17 [00:00<?, ?it/s]
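
The UserWarning above suggests the checkpoint's generation_config.json carries sampling-only flags while do_sample is False. A minimal sketch of one way to fix the file, assuming the generation_config.json under the logged model_path is the file "that should be fixed"; the path is copied from the worker args, not verified here:

# Sketch: repair an inconsistent generation config for the checkpoint above.
# Assumption: model_path matches the worker args from the log; adjust as needed.
from transformers import GenerationConfig

model_path = "share_internvl/InternVL2-40B/"  # model_path from the worker args

# Load the possibly inconsistent generation config shipped with the checkpoint.
gen_config = GenerationConfig.from_pretrained(model_path)

# The warning fires because a sampling-only flag is set while do_sample=False.
# Either enable sampling so top_p is meaningful, or restore top_p to its
# default and stay greedy.
gen_config.do_sample = True   # option 1: enable sample-based generation
# gen_config.top_p = 1.0      # option 2: stay greedy, restore the default

gen_config.save_pretrained(model_path)  # rewrites generation_config.json

After saving, restarting the model worker should load the corrected config without emitting the UserWarning.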