Skip to content

Latest commit

 

History

History
32 lines (22 loc) · 1.81 KB

File metadata and controls

32 lines (22 loc) · 1.81 KB

YOLOv2-COCO

Description

This model aims to detect objects in real time. It detects 80 different classes from the COCO Datasets. For information on network architecture, see the author's page and white paper.

Model

The model was converted to ONNX from PyTorch version of YOLOv2 using PyTorch-Yolo2. The output is fully verified by generating bounding boxes under PyTorch and onnxruntime.

Model Download ONNX version Opset version
YOLOv2 203.9 MB 1.5 9

Inference

Input to model

shape (1x3x416x416)

Output of model

shape (1x425x13x13)

Postprocessing steps

The output is a (1x425x13x13) tensor where 13x13 is the number of grid cells that the image gets divided into. Each grid cell corresponds to 5 anchors, made up of the 5 bounding boxes predicted by the grid cell and the 80 classes that describe each bounding box (5 x (80 classes + 5) = 425). For more information on how to derive the final bounding boxes and their corresponding confidence scores, refer to this post and PyTorch source code.

Dataset (Train and validation)

The YOLOv2 model was trained on the COCO datasets and was sourced from the original yolov2-voc .cfg and .weights files from link.

References

"YOLO9000: Better, Faster, Stronger" arXiv:1612.08242

License

MIT License