update pipeline.jpg

WenmuZhou 2021-07-29 12:32:00 +08:00
parent 8aafd69e12
commit 34291229bd
3 changed files with 1 additions and 1 deletions

Binary file not shown. Size before: 611 KiB; after: 1.5 MiB.

doc/table/pipeline_en.jpg Normal file

Binary file not shown. Size after: 1.4 MiB.


@@ -74,7 +74,7 @@ After running, each image will have a directory with the same name under the dir
## 2. PaddleStructure Pipeline
the process is as follows
-![pipeline](../doc/table/pipeline.jpg)
+![pipeline](../doc/table/pipeline_en.jpg)
In PaddleStructure, the image will be analyzed by layoutparser first. In the layout analysis, the area in the image will be classified, including **text, title, image, list and table** 5 categories. For the first 4 types of areas, directly use the PP-OCR to complete the text detection and recognition. The table area will be converted to an excel file of the same table style via Table OCR.
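
The routing described in the paragraph above can be sketched roughly as follows. This is a minimal illustration, not PaddleStructure's actual code: `Region`, `route_regions`, `run_ocr`, and `run_table_ocr` are hypothetical names, and the two callables stand in for PP-OCR and Table OCR respectively.

```python
from dataclasses import dataclass
from typing import Any, Callable, List, Tuple

# The four categories that the text says go straight to PP-OCR.
TEXT_LIKE = {"text", "title", "image", "list"}


@dataclass
class Region:
    """Hypothetical container for one area found by layout analysis."""
    kind: str                        # text, title, image, list, or table
    bbox: Tuple[int, int, int, int]  # assumed (x0, y0, x1, y1) layout


def route_regions(
    regions: List[Region],
    run_ocr: Callable[[Region], Any],        # stand-in for PP-OCR
    run_table_ocr: Callable[[Region], Any],  # stand-in for Table OCR
) -> List[Tuple[str, Any]]:
    """Send each region to the recognizer that matches its category."""
    results = []
    for region in regions:
        if region.kind == "table":
            # Table areas take the Table OCR branch (Excel output in the text).
            results.append((region.kind, run_table_ocr(region)))
        elif region.kind in TEXT_LIKE:
            # The other four categories use text detection and recognition.
            results.append((region.kind, run_ocr(region)))
        else:
            raise ValueError(f"unknown region kind: {region.kind!r}")
    return results
```

In this sketch the caller supplies both recognizers, so the dispatch logic can be exercised with simple stubs before wiring in real models.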