runtime error
Downloading (…)chat.ggmlv3.q2_K.bin: 100%|██████████| 2.87G/2.87G [01:04<00:00, 44.8MB/s]
Fetching 1 files: 100%|██████████| 1/1 [01:04<00:00, 64.31s/it]
WARNING: failed to allocate 0.09 MB of pinned memory: CUDA driver version is insufficient for CUDA runtime version
CUDA error 35 at /home/runner/work/ctransformers/ctransformers/models/ggml/ggml-cuda.cu:5067: CUDA driver version is insufficient for CUDA runtime version