449 Star 3.5K Fork 851

PaddlePaddle / PaddleOCR

加入 Gitee
与超过 1200万 开发者一起发现、参与优秀开源项目,私有仓库也完全免费 :)
免费加入
克隆/下载
benchmark_en.md 1.90 KB
一键复制 编辑 原始数据 按行查看 历史
Leif 提交于 2021-09-23 20:58 . Merge pull request #4044 from Evezerest/2.3

Benchmark

This document gives the performance of the series models for Chinese and English recognition.

Test Data

We collected 300 images for different real application scenarios to evaluate the overall OCR system, including contract samples, license plates, nameplates, train tickets, test sheets, forms, certificates, street view images, business cards, digital meter, etc. The following figure shows some images of the test set.

Measurement

Explanation:

  • The long size of the input for the text detector is 960.

  • The evaluation time-consuming stage is the complete stage from image input to result output, including image pre-processing and post-processing.

  • Intel Xeon 6148 is the server-side CPU model. Intel MKL-DNN is used in the test to accelerate the CPU prediction speed.

  • Snapdragon 855 is a mobile processing platform model.

Compares the model size and F-score:

Model Name Model Size
of the
Whole System(M)
Model Size
of the Text
Detector(M)
Model Size
of the Direction
Classifier(M)
Model Size
of the Text
Recognizer (M)
F-score
PP-OCRv2 11.6 3.0 0.9 8.6 0.5224
PP-OCR mobile 8.1 2.6 0.9 4.6 0.503
PP-OCR server 155.1 47.2 0.9 107 0.570

Compares the time-consuming on CPU and T4 GPU (ms):

Model Name CPU T4 GPU
PP-OCRv2 330 111
PP-OCR mobile 356 116
PP-OCR server 1056 200

More indicators of PP-OCR series models can be referred to PP-OCR Benchmark

Python
1
https://gitee.com/paddlepaddle/PaddleOCR.git
git@gitee.com:paddlepaddle/PaddleOCR.git
paddlepaddle
PaddleOCR
PaddleOCR
release/2.6

搜索帮助