์ฝ˜ํ…์ธ ๋กœ ๊ฑด๋„ˆ๋›ฐ๊ธฐ

Neural Magic์˜ ๋”ฅ์ŠคํŽ˜์ด์Šค

์†Œํ”„ํŠธ์›จ์–ด ์ œ๊ณต AI์— ์˜ค์‹  ๊ฒƒ์„ ํ™˜์˜ํ•ฉ๋‹ˆ๋‹ค.

์ด ๊ฐ€์ด๋“œ์—์„œ๋Š” Neural Magic ์˜ DeepSparse๋ฅผ ์‚ฌ์šฉํ•˜์—ฌ YOLOv5 ๋ฅผ ๋ฐฐํฌํ•˜๋Š” ๋ฐฉ๋ฒ•์„ ์„ค๋ช…ํ•ฉ๋‹ˆ๋‹ค.

DeepSparse๋Š” CPU์—์„œ ํƒ์›”ํ•œ ์„ฑ๋Šฅ์„ ๋ฐœํœ˜ํ•˜๋Š” ์ถ”๋ก  ๋Ÿฐํƒ€์ž„์ž…๋‹ˆ๋‹ค. ์˜ˆ๋ฅผ ๋“ค์–ด, ONNX ๋Ÿฐํƒ€์ž„ ๊ธฐ์ค€๊ณผ ๋น„๊ตํ–ˆ์„ ๋•Œ, ๋™์ผํ•œ ์ปดํ“จํ„ฐ์—์„œ ์‹คํ–‰๋˜๋Š” YOLOv5์˜ ๊ฒฝ์šฐ DeepSparse๋Š” 5.8๋ฐฐ์˜ ์†๋„ ํ–ฅ์ƒ์„ ์ œ๊ณตํ•ฉ๋‹ˆ๋‹ค!

YOLOv5 ์†๋„ ํ–ฅ์ƒ

For the first time, your deep learning workloads can meet the performance demands of production without the complexity and costs of hardware accelerators. Put simply, DeepSparse gives you the performance of GPUs and the simplicity of software:

  • ์œ ์—ฐํ•œ ๋ฐฐํฌ: ํด๋ผ์šฐ๋“œ, ๋ฐ์ดํ„ฐ์„ผํ„ฐ, ์—ฃ์ง€ ์ „๋ฐ˜์—์„œ ์ผ๊ด€๋˜๊ฒŒ ์‹คํ–‰: Intel , AMD, ARM ๋“ฑ ๋ชจ๋“  ํ•˜๋“œ์›จ์–ด ๊ณต๊ธ‰์—…์ฒด์™€ ํ•จ๊ป˜ ์‹คํ–‰ํ•˜์„ธ์š”.
  • ๋ฌดํ•œํ•œ ํ™•์žฅ์„ฑ: 100๊ฐœ์˜ ์ฝ”์–ด๋กœ ์ˆ˜์ง ํ™•์žฅํ•˜๊ฑฐ๋‚˜, ํ‘œ์ค€ Kubernetes๋กœ ํ™•์žฅํ•˜๊ฑฐ๋‚˜, ์„œ๋ฒ„๋ฆฌ์Šค๋กœ ์™„์ „ํžˆ ์ถ”์ƒํ™”ํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.
  • ๊ฐ„ํŽธํ•œ ํ†ตํ•ฉ: ๋ชจ๋ธ์„ ์• ํ”Œ๋ฆฌ์ผ€์ด์…˜์— ํ†ตํ•ฉํ•˜๊ณ  ํ”„๋กœ๋•์…˜ ํ™˜๊ฒฝ์—์„œ ๋ชจ๋‹ˆํ„ฐ๋งํ•˜๊ธฐ ์œ„ํ•œ ๊น”๋”ํ•œ API

DeepSparse๋Š” ์–ด๋–ป๊ฒŒ GPU-ํด๋ž˜์Šค ์„ฑ๋Šฅ์„ ๋‹ฌ์„ฑํ•˜๋‚˜์š”?

DeepSparse๋Š” ๋ชจ๋ธ ํฌ์†Œ์„ฑ์„ ํ™œ์šฉํ•˜์—ฌ ์„ฑ๋Šฅ ์†๋„๋ฅผ ๋†’์ž…๋‹ˆ๋‹ค.

Sparsification through pruning and quantization is a broadly studied technique, allowing order-of-magnitude reductions in the size and compute needed to execute a network, while maintaining high accuracy. DeepSparse is sparsity-aware, meaning it skips the zeroed out parameters, shrinking amount of compute in a forward pass. Since the sparse computation is now memory bound, DeepSparse executes the network depth-wise, breaking the problem into Tensor Columns, vertical stripes of computation that fit in cache.

YOLO ๋ชจ๋ธ ๊ฐ€์ง€์น˜๊ธฐ

์บ์‹œ์—์„œ ๊นŠ์ด ๋‹จ์œ„๋กœ ์‹คํ–‰๋˜๋Š” ์••์ถ• ์—ฐ์‚ฐ์ด ํฌํ•จ๋œ ์ŠคํŒŒ์Šค ๋„คํŠธ์›Œํฌ๋ฅผ ํ†ตํ•ด DeepSparse๋Š” CPU์—์„œ GPU-๊ธ‰ ์„ฑ๋Šฅ์„ ์ œ๊ณตํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค!

๋‚ด ๋ฐ์ดํ„ฐ๋กœ ํ›ˆ๋ จ๋œ YOLOv5 ์˜ ์ŠคํŒŒ์Šค ๋ฒ„์ „์„ ๋งŒ๋“ค๋ ค๋ฉด ์–ด๋–ป๊ฒŒ ํ•˜๋‚˜์š”?

Neural Magic์˜ ์˜คํ”ˆ ์†Œ์Šค ๋ชจ๋ธ ์ €์žฅ์†Œ์ธ SparseZoo์—๋Š” ๊ฐ YOLOv5 ๋ชจ๋ธ์— ๋Œ€ํ•ด ๋ฏธ๋ฆฌ ์ง€์ •๋œ ์ฒดํฌํฌ์ธํŠธ๊ฐ€ ํฌํ•จ๋˜์–ด ์žˆ์Šต๋‹ˆ๋‹ค. Ultralytics ๊ณผ ํ†ตํ•ฉ๋œ SparseML์„ ์‚ฌ์šฉํ•˜๋ฉด CLI ๋ช…๋ น ํ•œ ๋ฒˆ์œผ๋กœ ๋ฐ์ดํ„ฐ์— ๋Œ€ํ•œ ์ŠคํŒŒ์Šค ์ฒดํฌํฌ์ธํŠธ๋ฅผ ๋ฏธ์„ธ ์กฐ์ •ํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.

์ž์„ธํ•œ ๋‚ด์šฉ์€ Neural Magic ์˜ YOLOv5 ๋ฌธ์„œ๋ฅผ ์ฐธ์กฐํ•˜์„ธ์š”.

๋”ฅ์ŠคํŽ˜์ด์Šค ์‚ฌ์šฉ

DeepSparse๋ฅผ ์‚ฌ์šฉํ•˜์—ฌ YOLOv5์˜ ์ŠคํŒŒ์Šค ๋ฒ„์ „์„ ๋ฒค์น˜๋งˆํ‚นํ•˜๊ณ  ๋ฐฐํฌํ•˜๋Š” ์˜ˆ์ œ๋ฅผ ์‚ดํŽด๋ณด๊ฒ ์Šต๋‹ˆ๋‹ค.

DeepSparse ์„ค์น˜

๋‹ค์Œ์„ ์‹คํ–‰ํ•˜์—ฌ DeepSparse๋ฅผ ์„ค์น˜ํ•ฉ๋‹ˆ๋‹ค. Python ์„ ์‚ฌ์šฉํ•˜์—ฌ ๊ฐ€์ƒ ํ™˜๊ฒฝ์„ ์‚ฌ์šฉํ•˜๋Š” ๊ฒƒ์ด ์ข‹์Šต๋‹ˆ๋‹ค.

pip install "deepsparse[server,yolo,onnxruntime]"

ONNX ํŒŒ์ผ ์ˆ˜์ง‘

DeepSparse๋Š” ONNX ํ˜•์‹์œผ๋กœ ์ „๋‹ฌ๋œ ๋ชจ๋ธ์„ ํ—ˆ์šฉํ•ฉ๋‹ˆ๋‹ค:

  • SparseZoo์˜ ONNX ํŒŒ์ผ์„ ์‹๋ณ„ํ•˜๋Š” SparseZoo ์Šคํ…์ž…๋‹ˆ๋‹ค.
  • ํŒŒ์ผ ์‹œ์Šคํ…œ์—์„œ ONNX ๋ชจ๋ธ์— ๋Œ€ํ•œ ๋กœ์ปฌ ๊ฒฝ๋กœ

์•„๋ž˜ ์˜ˆ์‹œ์—์„œ๋Š” ๋‹ค์Œ SparseZoo ์Šคํ…์œผ๋กœ ์‹๋ณ„๋˜๋Š” ํ‘œ์ค€ ์กฐ๋ฐ€ํ•˜๊ณ  ์ •๋Ÿ‰ํ™”๋œ YOLOv5s ์ฒดํฌํฌ์ธํŠธ๋ฅผ ์‚ฌ์šฉํ•ฉ๋‹ˆ๋‹ค:

zoo:cv/detection/yolov5-s/pytorch/ultralytics/coco/base-none
zoo:cv/detection/yolov5-s/pytorch/ultralytics/coco/pruned65_quant-none

๋ชจ๋ธ ๋ฐฐํฌ

DeepSparse๋Š” ๋ชจ๋ธ์„ ์• ํ”Œ๋ฆฌ์ผ€์ด์…˜์— ํ†ตํ•ฉํ•˜๊ธฐ ์œ„ํ•œ ํŽธ๋ฆฌํ•œ API๋ฅผ ์ œ๊ณตํ•ฉ๋‹ˆ๋‹ค.

์•„๋ž˜ ๋ฐฐํฌ ์˜ˆ์ œ๋ฅผ ์‚ฌ์šฉํ•ด ๋ณด๋ ค๋ฉด ์ƒ˜ํ”Œ ์ด๋ฏธ์ง€๋ฅผ ๋Œ์–ด์™€์„œ ๋‹ค์Œ ์ด๋ฆ„์œผ๋กœ ์ €์žฅํ•˜์„ธ์š”. basilica.jpg ๋ฅผ ๋‹ค์Œ๊ณผ ๊ฐ™์ด ์„ค์ •ํ•ฉ๋‹ˆ๋‹ค:

wget -O basilica.jpg https://raw.githubusercontent.com/neuralmagic/deepsparse/main/src/deepsparse/yolo/sample_images/basilica.jpg

Python API

Pipelines ๋Š” ์ „์ฒ˜๋ฆฌ์™€ ์ถœ๋ ฅ ํ›„์ฒ˜๋ฆฌ๋ฅผ ๋Ÿฐํƒ€์ž„์— ๋ž˜ํ•‘ํ•˜์—ฌ ์• ํ”Œ๋ฆฌ์ผ€์ด์…˜์— DeepSparse๋ฅผ ์ถ”๊ฐ€ํ•˜๊ธฐ ์œ„ํ•œ ๊น”๋”ํ•œ ์ธํ„ฐํŽ˜์ด์Šค๋ฅผ ์ œ๊ณตํ•ฉ๋‹ˆ๋‹ค. DeepSparse-Ultralytics ํ†ตํ•ฉ์—๋Š” ์ฆ‰์‹œ ์‚ฌ์šฉ ๊ฐ€๋Šฅํ•œ Pipeline ์›์‹œ ์ด๋ฏธ์ง€๋ฅผ ๋ฐ›์•„๋“ค์ด๊ณ  ๊ฒฝ๊ณ„ ์ƒ์ž๋ฅผ ์ถœ๋ ฅํ•˜๋Š” ํ•จ์ˆ˜์ž…๋‹ˆ๋‹ค.

๋งŒ๋“ค๊ธฐ Pipeline ๋ฅผ ํด๋ฆญํ•˜๊ณ  ์ถ”๋ก ์„ ์‹คํ–‰ํ•ฉ๋‹ˆ๋‹ค:

from deepsparse import Pipeline

# list of images in local filesystem
images = ["basilica.jpg"]

# create Pipeline
model_stub = "zoo:cv/detection/yolov5-s/pytorch/ultralytics/coco/pruned65_quant-none"
yolo_pipeline = Pipeline.create(
    task="yolo",
    model_path=model_stub,
)

# run inference on images, receive bounding boxes + classes
pipeline_outputs = yolo_pipeline(images=images, iou_thres=0.6, conf_thres=0.001)
print(pipeline_outputs)

ํด๋ผ์šฐ๋“œ์—์„œ ์‹คํ–‰ ์ค‘์ธ ๊ฒฝ์šฐ, open-cv๋ฅผ ์ฐพ์„ ์ˆ˜ ์—†๋‹ค๋Š” ์˜ค๋ฅ˜๊ฐ€ ๋ฐœ์ƒํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค. libGL.so.1. Ubuntu์—์„œ ๋‹ค์Œ์„ ์‹คํ–‰ํ•˜๋ฉด ์„ค์น˜๋ฉ๋‹ˆ๋‹ค:

apt-get install libgl1

HTTP ์„œ๋ฒ„

DeepSparse Server runs on top of the popular FastAPI web framework and Uvicorn web server. With just a single CLI command, you can easily setup a model service endpoint with DeepSparse. The Server supports any Pipeline from DeepSparse, including object detection with YOLOv5, enabling you to send raw images to the endpoint and receive the bounding boxes.

์ •๋ฆฌ๋œ ์ •๋Ÿ‰ํ™”๋œ YOLOv5๋กœ ์„œ๋ฒ„๋ฅผ ์Šคํ•€์—…ํ•ฉ๋‹ˆ๋‹ค:

deepsparse.server \
    --task yolo \
    --model_path zoo:cv/detection/yolov5-s/pytorch/ultralytics/coco/pruned65_quant-none

Python ๋ฅผ ์‚ฌ์šฉํ•œ ์š”์ฒญ ์˜ˆ์‹œ requests ํŒจํ‚ค์ง€์ž…๋‹ˆ๋‹ค:

import json

import requests

# list of images for inference (local files on client side)
path = ["basilica.jpg"]
files = [("request", open(img, "rb")) for img in path]

# send request over HTTP to /predict/from_files endpoint
url = "http://0.0.0.0:5543/predict/from_files"
resp = requests.post(url=url, files=files)

# response is returned in JSON
annotations = json.loads(resp.text)  # dictionary of annotation results
bounding_boxes = annotations["boxes"]
labels = annotations["labels"]

์ฃผ์„ ๋‹ฌ๊ธฐ CLI

์ฃผ์„ ๋‹ฌ๊ธฐ ๋ช…๋ น์„ ์‚ฌ์šฉํ•˜์—ฌ ์—”์ง„์ด ์ฃผ์„์ด ๋‹ฌ๋ฆฐ ์‚ฌ์ง„์„ ๋””์Šคํฌ์— ์ €์žฅํ•˜๋„๋ก ํ•  ์ˆ˜๋„ ์žˆ์Šต๋‹ˆ๋‹ค. ์†Œ์Šค 0์„ ์‚ฌ์šฉํ•ด ๋ผ์ด๋ธŒ ์›น์บ  ํ”ผ๋“œ์— ์ฃผ์„์„ ๋‹ฌ์•„ ๋ณด์„ธ์š”!

deepsparse.object_detection.annotate --model_filepath zoo:cv/detection/yolov5-s/pytorch/ultralytics/coco/pruned65_quant-none --source basilica.jpg

์œ„ ๋ช…๋ น์„ ์‹คํ–‰ํ•˜๋ฉด annotation-results ํด๋”์— ์ฃผ์„์ด ๋‹ฌ๋ฆฐ ์ด๋ฏธ์ง€๋ฅผ ์ €์žฅํ•ฉ๋‹ˆ๋‹ค.

์ฃผ์„์ด ๋‹ฌ๋ฆฐ

๋ฒค์น˜๋งˆํ‚น ์„ฑ๋Šฅ

DeepSparse์˜ ๋ฒค์น˜๋งˆํ‚น ์Šคํฌ๋ฆฝํŠธ๋ฅผ ์‚ฌ์šฉํ•˜์—ฌ YOLOv5์—์„œ ONNX ๋Ÿฐํƒ€์ž„์˜ ์ฒ˜๋ฆฌ๋Ÿ‰๊ณผ DeepSparse์˜ ์ฒ˜๋ฆฌ๋Ÿ‰์„ ๋น„๊ตํ•ด ๋ณด๊ฒ ์Šต๋‹ˆ๋‹ค.

๋ฒค์น˜๋งˆํฌ๋Š” AWS์—์„œ ์‹คํ–‰๋˜์—ˆ์Šต๋‹ˆ๋‹ค. c6i.8xlarge ์ธ์Šคํ„ด์Šค(16์ฝ”์–ด).

๋ฐฐ์น˜ 32 ์„ฑ๋Šฅ ๋น„๊ต

ONNX ๋Ÿฐํƒ€์ž„ ๊ธฐ์ค€์„ 

๋ฐฐ์น˜ 32์—์„œ ONNX ๋Ÿฐํƒ€์ž„์€ ํ‘œ์ค€ ๊ณ ๋ฐ€๋„ YOLOv5๋ฅผ ์‚ฌ์šฉํ•˜์—ฌ ์ดˆ๋‹น 42๊ฐœ์˜ ์ด๋ฏธ์ง€๋ฅผ ๋‹ฌ์„ฑํ•ฉ๋‹ˆ๋‹ค:

deepsparse.benchmark zoo:cv/detection/yolov5-s/pytorch/ultralytics/coco/base-none -s sync -b 32 -nstreams 1 -e onnxruntime

> Original Model Path: zoo:cv/detection/yolov5-s/pytorch/ultralytics/coco/base-none
> Batch Size: 32
> Scenario: sync
> Throughput (items/sec): 41.9025

๋”ฅ์ŠคํŽ˜์ด์Šค ๊ณ ๋ฐ€๋„ ์„ฑ๋Šฅ

๋”ฅ์ŠคํŒŒ์Šค๋Š” ์ตœ์ ํ™”๋œ ์ŠคํŒŒ์Šค ๋ชจ๋ธ์—์„œ ์ตœ๊ณ ์˜ ์„ฑ๋Šฅ์„ ์ œ๊ณตํ•˜์ง€๋งŒ, ํ‘œ์ค€ ๊ณ ๋ฐ€๋„ YOLOv5์—์„œ๋„ ์šฐ์ˆ˜ํ•œ ์„ฑ๋Šฅ์„ ๋ฐœํœ˜ํ•ฉ๋‹ˆ๋‹ค.

๋ฐฐ์น˜ 32์—์„œ ๋”ฅ์ŠคํŽ˜์ด์Šค๋Š” ํ‘œ์ค€ ๊ณ ๋ฐ€๋„ YOLOv5s๋กœ ์ดˆ๋‹น 70๊ฐœ์˜ ์ด๋ฏธ์ง€๋ฅผ ๋‹ฌ์„ฑํ•˜์—ฌ ORT๋ณด๋‹ค 1.7๋ฐฐ ์„ฑ๋Šฅ์ด ํ–ฅ์ƒ๋˜์—ˆ์Šต๋‹ˆ๋‹ค!

deepsparse.benchmark zoo:cv/detection/yolov5-s/pytorch/ultralytics/coco/base-none -s sync -b 32 -nstreams 1

> Original Model Path: zoo:cv/detection/yolov5-s/pytorch/ultralytics/coco/base-none
> Batch Size: 32
> Scenario: sync
> Throughput (items/sec): 69.5546

๋”ฅ์ŠคํŽ˜์–ด์Šค ์ŠคํŒŒ์Šค ์„ฑ๋Šฅ

๋ชจ๋ธ์— ํฌ์†Œ์„ฑ์„ ์ ์šฉํ•˜๋ฉด ONNX ๋Ÿฐํƒ€์ž„์— ๋น„ํ•ด DeepSparse์˜ ์„ฑ๋Šฅ ํ–ฅ์ƒ์€ ๋”์šฑ ๊ฐ•๋ ฅํ•ด์ง‘๋‹ˆ๋‹ค.

๋ฐฐ์น˜ 32์—์„œ ๋”ฅ์ŠคํŽ˜์ด์Šค๋Š” ํ”„๋ฃจ๋‹๋œ ์ •๋Ÿ‰ํ™”๋œ YOLOv5s๋กœ ์ดˆ๋‹น 241 ์ด๋ฏธ์ง€๋ฅผ ๋‹ฌ์„ฑํ•˜๋ฉฐ, ์ด๋Š” ORT๋ณด๋‹ค 5.8๋ฐฐ ํ–ฅ์ƒ๋œ ์„ฑ๋Šฅ์ž…๋‹ˆ๋‹ค!

deepsparse.benchmark zoo:cv/detection/yolov5-s/pytorch/ultralytics/coco/pruned65_quant-none -s sync -b 32 -nstreams 1

> Original Model Path: zoo:cv/detection/yolov5-s/pytorch/ultralytics/coco/pruned65_quant-none
> Batch Size: 32
> Scenario: sync
> Throughput (items/sec): 241.2452

๋ฐฐ์น˜ 1 ์„ฑ๋Šฅ ๋น„๊ต

๋˜ํ•œ ๋”ฅ์ŠคํŽ˜์ด์Šค๋Š” ์ง€์—ฐ ์‹œ๊ฐ„์— ๋ฏผ๊ฐํ•œ ๋ฐฐ์น˜ 1 ์‹œ๋‚˜๋ฆฌ์˜ค์˜ ๊ฒฝ์šฐ ONNX ๋Ÿฐํƒ€์ž„๋ณด๋‹ค ์†๋„๋ฅผ ๋†’์ผ ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.

ONNX ๋Ÿฐํƒ€์ž„ ๊ธฐ์ค€์„ 

๋ฐฐ์น˜ 1์—์„œ ONNX ๋Ÿฐํƒ€์ž„์€ ํ‘œ์ค€ ๊ณ ๋ฐ€๋„ YOLOv5๋ฅผ ์‚ฌ์šฉํ•˜์—ฌ ์ดˆ๋‹น 48๊ฐœ์˜ ์ด๋ฏธ์ง€๋ฅผ ๋‹ฌ์„ฑํ•ฉ๋‹ˆ๋‹ค.

deepsparse.benchmark zoo:cv/detection/yolov5-s/pytorch/ultralytics/coco/base-none -s sync -b 1 -nstreams 1 -e onnxruntime

> Original Model Path: zoo:cv/detection/yolov5-s/pytorch/ultralytics/coco/base-none
> Batch Size: 1
> Scenario: sync
> Throughput (items/sec): 48.0921

๋”ฅ์ŠคํŽ˜์–ด์Šค ์ŠคํŒŒ์Šค ์„ฑ๋Šฅ

๋ฐฐ์น˜ 1์—์„œ DeepSparse๋Š” ์ •๋ฆฌ๋œ ์ •๋Ÿ‰ํ™”๋œ YOLOv5s๋กœ ์ดˆ๋‹น 135๊ฐœ์˜ ํ•ญ๋ชฉ์„ ๋‹ฌ์„ฑํ•˜์—ฌ ONNX ๋Ÿฐํƒ€์ž„์— ๋น„ํ•ด 2.8๋ฐฐ์˜ ์„ฑ๋Šฅ ํ–ฅ์ƒ์„ ๋‹ฌ์„ฑํ–ˆ์Šต๋‹ˆ๋‹ค !

deepsparse.benchmark zoo:cv/detection/yolov5-s/pytorch/ultralytics/coco/pruned65_quant-none -s sync -b 1 -nstreams 1

> Original Model Path: zoo:cv/detection/yolov5-s/pytorch/ultralytics/coco/pruned65_quant-none
> Batch Size: 1
> Scenario: sync
> Throughput (items/sec): 134.9468

์ดํ›„ c6i.8xlarge ์ธ์Šคํ„ด์Šค์— VNNI ๋ช…๋ น์–ด๊ฐ€ ์žˆ๋Š” ๊ฒฝ์šฐ, ๊ฐ€์ค‘์น˜๋ฅผ 4๋ธ”๋ก ๋‹จ์œ„๋กœ ์ž˜๋ผ๋‚ด๋ฉด DeepSparse์˜ ์ฒ˜๋ฆฌ๋Ÿ‰์„ ๋” ๋Š˜๋ฆด ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.

๋ฐฐ์น˜ 1์—์„œ DeepSparse๋Š” 4๋ธ”๋ก ํ”„๋ฃจ๋‹ ์ •๋Ÿ‰ํ™”๋œ YOLOv5s๋กœ ์ดˆ๋‹น 180๊ฐœ ํ•ญ๋ชฉ์„ ๋‹ฌ์„ฑํ•˜์—ฌ ONNX ๋Ÿฐํƒ€์ž„ ๋Œ€๋น„ 3.7๋ฐฐ์˜ ์„ฑ๋Šฅ ํ–ฅ์ƒ์„ ๋‹ฌ์„ฑํ–ˆ์Šต๋‹ˆ๋‹ค !

deepsparse.benchmark zoo:cv/detection/yolov5-s/pytorch/ultralytics/coco/pruned35_quant-none-vnni -s sync -b 1 -nstreams 1

> Original Model Path: zoo:cv/detection/yolov5-s/pytorch/ultralytics/coco/pruned35_quant-none-vnni
> Batch Size: 1
> Scenario: sync
> Throughput (items/sec): 179.7375

DeepSparse ์‹œ์ž‘ํ•˜๊ธฐ

์—ฐ๊ตฌ ๋˜๋Š” ํ…Œ์ŠคํŠธ? ๋”ฅ์ŠคํŽ˜์ด์Šค ์ปค๋ฎค๋‹ˆํ‹ฐ๋Š” ์—ฐ๊ตฌ ๋ฐ ํ…Œ์ŠคํŠธ๋ฅผ ์œ„ํ•ด ๋ฌด๋ฃŒ๋กœ ์ œ๊ณต๋ฉ๋‹ˆ๋‹ค. ๋ฌธ์„œ์—์„œ ์‹œ์ž‘ํ•˜์„ธ์š”.


๐Ÿ“… Created 11 months ago โœ๏ธ Updated 17 days ago

๋Œ“๊ธ€