YOLOv5s Models

Overview

YOLOv5s is a variant of the YOLO (You Only Look Once) family of object detection models, specifically designed to be a smaller and faster version suitable for real-time object detection tasks. Developed by Ultralytics, YOLOv5 is the latest iteration in the YOLO series, offering improved speed and accuracy compared to previous versions.

Getting Started & How it work ?

To see detailed steps on how to use the model converter you can go to MTK Converter ,it will direct you to the appropriate page

Model Details

General Information

Property	Value
Category	Detection
Input Size	640x640
FLOPs@640 (B)	16.5
#Params (M)	7.2
Training Framework	PyTorch
Inference Framework	TFLite
Quant8 Model package	Download
Float32 Model package	Download

Model Properties

Quant8

Format: TensorFlow Lite v3
Description: Exported by NeuroPilot converter v7.14.1+release

Inputs

Property	Value
Name	x.1
Tensor	int8[1,3,640,640]
Identifier	67
Quantization	Linear
Quantization Range	0.0039 * (q + 128) ≤ 0.9993

Outputs

Property	Value
Name	77
Tensor	int8[1,255,80,80]
Identifier	315
Quantization	Linear
Quantization Range	-19.3298 ≤ 0.0966 * (q - 72) ≤ 5.3157
Name	78
Tensor	int8[1,255,40,40]
Identifier	279
Quantization	Linear
Quantization Range	-15.8150 ≤ 0.0841 * (q - 60) ≤ 5.6362
Name	79
Tensor	int8[1,255,20,20]
Identifier	15
Quantization	Linear
Quantization Range	-15.7213 ≤ 0.0845 * (q - 58) ≤ 5.8321

Fp32

Format: TensorFlow Lite v3
Description: Exported by NeuroPilot converter v7.14.1+release

Inputs

Property	Value
Name	x.1
Tensor	float32[1,3,640,640]
Identifier	315

Outputs

Property	Value
Name	77
Tensor	float32[1,255,80,80]
Identifier	304
Name	78
Tensor	float32[1,255,40,40]
Identifier	272
Name	79
Tensor	float32[1,255,20,20]
Identifier	230

Performance Benchmarks

YOLOv5s-quant8

Run model (.tflite) 10 times	CPU (Thread:8)	GPU	ARMNN(GpuAcc)	ARMNN(CpuAcc)	Neuron Stable Delegate(APU)	APU(MDLA)	APU(VPU)
G350	669.998 ms (Thread:4)	984.989 ms	492.372 ms	456.609 ms	N/A	N/A	70487.4 ms
G510	336.39 ms	358.188 ms	161.230 ms	116.290 ms	17.894 ms	17.47 ms	N/A
G700	115.887 ms	225.351 ms	113.794 ms	104.801 ms	10.899 ms	10.04 ms	N/A
G1200	116.143 ms	150.983 ms	72.639 ms	58.181 ms	19.238 ms	19.05 ms	N/A

YOLOv5s-fp32

Run model (.tflite) 10 times	CPU (Thread:8)	GPU	ARMNN(GpuAcc)	ARMNN(CpuAcc)	Neuron Stable Delegate(APU)	APU(MDLA)	APU(VPU)
G350	1379.79 ms (Thread:4)	935.716 ms	957.083 ms	N/A	N/A	N/A	4775.23 ms
G510	548.035 ms	304.006 ms	302.887 ms	326.755 ms	43.684 ms	46.41 ms	N/A
G700	299.257 ms	209.685 ms	207.253 ms	278.701 ms	31.853 ms	32.04 ms	N/A
G1200	272.845 ms	136.244 ms	133.026 ms	158.299 ms	36.771 ms	36.66 ms	N/A

Widespread: CPU only, light workload.
Performance: CPU and GPU, medium workload.
Ultimate: CPU, GPU, and APUs, heavy workload.

Resources

To preview related documentation about YOLOv5, please visit the GitHub repository.