qaihm-bot commited on
Commit
11f8623
1 Parent(s): cb6dc66

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +5 -12
README.md CHANGED
@@ -36,8 +36,8 @@ More details on model performance across various devices, can be found
36
 
37
  | Device | Chipset | Target Runtime | Inference Time (ms) | Peak Memory Range (MB) | Precision | Primary Compute Unit | Target Model
38
  | ---|---|---|---|---|---|---|---|
39
- | Samsung Galaxy S23 Ultra (Android 13) | Snapdragon® 8 Gen 2 | TFLite | 0.297 ms | 0 - 1 MB | INT8 | NPU | [GoogLeNetQuantized.tflite](https://huggingface.co/qualcomm/GoogLeNetQuantized/blob/main/GoogLeNetQuantized.tflite)
40
- | Samsung Galaxy S23 Ultra (Android 13) | Snapdragon® 8 Gen 2 | QNN Model Library | 0.346 ms | 0 - 133 MB | INT8 | NPU | [GoogLeNetQuantized.so](https://huggingface.co/qualcomm/GoogLeNetQuantized/blob/main/GoogLeNetQuantized.so)
41
 
42
 
43
  ## Installation
@@ -97,18 +97,11 @@ python -m qai_hub_models.models.googlenet_quantized.export
97
  ```
98
  Profile Job summary of GoogLeNetQuantized
99
  --------------------------------------------------
100
- Device: QCS8550 (Proxy) (12)
101
- Estimated Inference Time: 0.32 ms
102
- Estimated Peak Memory Range: 0.01-1.95 MB
103
  Compute Units: NPU (86) | Total (86)
104
 
105
- Profile Job summary of GoogLeNetQuantized
106
- --------------------------------------------------
107
- Device: QCS8550 (Proxy) (12)
108
- Estimated Inference Time: 0.36 ms
109
- Estimated Peak Memory Range: 0.61-5.14 MB
110
- Compute Units: NPU (88) | Total (88)
111
-
112
 
113
  ```
114
  ## How does this work?
 
36
 
37
  | Device | Chipset | Target Runtime | Inference Time (ms) | Peak Memory Range (MB) | Precision | Primary Compute Unit | Target Model
38
  | ---|---|---|---|---|---|---|---|
39
+ | Samsung Galaxy S23 Ultra (Android 13) | Snapdragon® 8 Gen 2 | TFLite | 0.297 ms | 0 - 2 MB | INT8 | NPU | [GoogLeNetQuantized.tflite](https://huggingface.co/qualcomm/GoogLeNetQuantized/blob/main/GoogLeNetQuantized.tflite)
40
+ | Samsung Galaxy S23 Ultra (Android 13) | Snapdragon® 8 Gen 2 | QNN Model Library | 0.345 ms | 0 - 4 MB | INT8 | NPU | [GoogLeNetQuantized.so](https://huggingface.co/qualcomm/GoogLeNetQuantized/blob/main/GoogLeNetQuantized.so)
41
 
42
 
43
  ## Installation
 
97
  ```
98
  Profile Job summary of GoogLeNetQuantized
99
  --------------------------------------------------
100
+ Device: Snapdragon X Elite CRD (11)
101
+ Estimated Inference Time: 0.47 ms
102
+ Estimated Peak Memory Range: 0.52-0.52 MB
103
  Compute Units: NPU (86) | Total (86)
104
 
 
 
 
 
 
 
 
105
 
106
  ```
107
  ## How does this work?