PaddleOCR/ppstructure/README_ch.md

# PaddleStructure

安装layoutparser
```sh
wget  https://paddleocr.bj.bcebos.com/whl/layoutparser-0.0.0-py3-none-any.whl
pip3 install layoutparser-0.0.0-py3-none-any.whl
```

## 1. pipeline介绍

PaddleStructure 是一个用于复杂板式文字OCR的工具包，流程如下
![pipeline](../doc/table/pipeline.jpg)

在PaddleStructure中，图片会先经由layoutparser进行版面分析，在版面分析中，会对图片里的区域进行分类，根据根据类别进行对于的ocr流程。

目前layoutparser会输出五个类别:
1. Text
2. Title
3. Figure
4. List
5. Table
   
1-4类走传统的OCR流程，5走表格的OCR流程。

## 2. LayoutParser

[文档](layout/README.md)

## 3. Table OCR

[文档](table/README_ch.md)

## 4. 预测引擎推理

使用如下命令即可完成预测引擎的推理
```python
python3 table/predict_system.py --det_model_dir=path/to/det_model_dir --rec_model_dir=path/to/rec_model_dir --table_model_dir=path/to/table_model_dir --image_dir=../doc/table/1.png --rec_char_dict_path=../ppocr/utils/dict/table_dict.txt --table_char_dict_path=../ppocr/utils/dict/table_structure_dict.txt --rec_char_type=EN --det_limit_side_len=736 --det_limit_type=min --output ../output/table
```
运行完成后，每张图片会output字段指定的目录下有一个同名目录，图片里的每个表格会存储为一个excel，excel文件名为表格在图片里的坐标。

## 5. PaddleStructure whl包介绍

### 5.1 使用

5.1.1 代码使用
```python
import os
import cv2
from paddlestructure import PaddleStructure,draw_result,save_res

table_engine = PaddleStructure(show_log=True)

save_folder = './output/table'
img_path = '../doc/table/1.png'
img = cv2.imread(img_path)
result = table_engine(img)
save_res(result, save_folder,os.path.basename(img_path).split('.')[0])

for line in result:
    print(line)

from PIL import Image

font_path = 'path/tp/PaddleOCR/doc/fonts/simfang.ttf'
image = Image.open(img_path).convert('RGB')
im_show = draw_result(image, result,font_path=font_path)
im_show = Image.fromarray(im_show)
im_show.save('result.jpg')
```

5.1.2 命令行使用
```bash
paddlestructure --image_dir=../doc/table/1.png
```

### 参数说明
大部分参数和paddleocr whl包保持一致，见 [whl包文档](../doc/doc_ch/whl.md)

| 字段                    | 说明                                            | 默认值           |
|------------------------|------------------------------------------------------|------------------|
| output                 | excel和识别结果保存的地址                    | ./output/table            |
| table_max_len      |  表格结构模型预测时，图像的长边resize尺度             |  488            |
| table_model_dir      |  表格结构模型 inference 模型地址             |  None            |
| table_char_type      |  表格结构模型所用字典地址             |  ../ppocr/utils/dict/table_structure_dict.tx            |
-												merge dygraph

											
										
										
											2021-06-10 14:24:59 +08:00
+								# PaddleStructure
-												rename test to test1

											
										
										
											2021-06-18 12:55:44 +08:00
+								安装layoutparser
 								```sh
 								wget  https://paddleocr.bj.bcebos.com/whl/layoutparser-0.0.0-py3-none-any.whl
 								pip3 install layoutparser-0.0.0-py3-none-any.whl
 								```
-												add en doc

											
										
										
											2021-06-11 14:17:59 +08:00
+								## 1. pipeline介绍
-												merge dygraph

											
										
										
											2021-06-10 14:24:59 +08:00
 								PaddleStructure 是一个用于复杂板式文字OCR的工具包，流程如下
-												fix link error

											
										
										
											2021-07-27 19:24:40 +08:00
+								![pipeline](../doc/table/pipeline.jpg)
-												merge dygraph

											
										
										
											2021-06-10 14:24:59 +08:00
 								在PaddleStructure中，图片会先经由layoutparser进行版面分析，在版面分析中，会对图片里的区域进行分类，根据根据类别进行对于的ocr流程。
 								目前layoutparser会输出五个类别:
 . Text
 . Title
 . Figure
 . List
 . Table
 -4类走传统的OCR流程，5走表格的OCR流程。
-												add en doc

											
										
										
											2021-06-11 14:17:59 +08:00
+								## 2. LayoutParser
-												merge dygraph

											
										
										
											2021-06-10 14:24:59 +08:00
-												add layout model

											
										
										
											2021-06-16 16:05:37 +08:00
+								[文档](layout/README.md)
-												merge dygraph

											
										
										
											2021-06-10 14:24:59 +08:00
-												add en doc

											
										
										
											2021-06-11 14:17:59 +08:00
+								## 3. Table OCR
-												merge dygraph

											
										
										
											2021-06-10 14:24:59 +08:00
 								[文档](table/README_ch.md)
-												opt doc

											
										
										
											2021-06-23 12:45:05 +08:00
+								## 4. 预测引擎推理
-												merge dygraph

											
										
										
											2021-06-10 14:24:59 +08:00
-												opt doc

											
										
										
											2021-06-23 12:45:05 +08:00
+								使用如下命令即可完成预测引擎的推理
 								```python
 								python3 table/predict_system.py --det_model_dir=path/to/det_model_dir --rec_model_dir=path/to/rec_model_dir --table_model_dir=path/to/table_model_dir --image_dir=../doc/table/1.png --rec_char_dict_path=../ppocr/utils/dict/table_dict.txt --table_char_dict_path=../ppocr/utils/dict/table_structure_dict.txt --rec_char_type=EN --det_limit_side_len=736 --det_limit_type=min --output ../output/table
 								```
 								运行完成后，每张图片会output字段指定的目录下有一个同名目录，图片里的每个表格会存储为一个excel，excel文件名为表格在图片里的坐标。
 								## 5. PaddleStructure whl包介绍
 								### 5.1 使用
-												merge dygraph

											
										
										
											2021-06-10 14:24:59 +08:00
-												opt doc

											
										
										
											2021-06-23 12:45:05 +08:00
+.1.1 代码使用
-												merge dygraph

											
										
										
											2021-06-10 14:24:59 +08:00
+								```python
-												rename structure 2 table

											
										
										
											2021-06-23 12:28:32 +08:00
+								import os
-												merge dygraph

											
										
										
											2021-06-10 14:24:59 +08:00
+								import cv2
-												rename structure 2 table

											
										
										
											2021-06-23 12:28:32 +08:00
+								from paddlestructure import PaddleStructure,draw_result,save_res
-												merge dygraph

											
										
										
											2021-06-10 14:24:59 +08:00
-												rename structure 2 table

											
										
										
											2021-06-23 12:28:32 +08:00
+								table_engine = PaddleStructure(show_log=True)
-												merge dygraph

											
										
										
											2021-06-10 14:24:59 +08:00
-												rename structure 2 table

											
										
										
											2021-06-23 12:28:32 +08:00
+								save_folder = './output/table'
-												merge dygraph

											
										
										
											2021-06-10 14:24:59 +08:00
+								img_path = '../doc/table/1.png'
 								img = cv2.imread(img_path)
 								result = table_engine(img)
-												rename structure 2 table

											
										
										
											2021-06-23 12:28:32 +08:00
+								save_res(result, save_folder,os.path.basename(img_path).split('.')[0])
-												merge dygraph

											
										
										
											2021-06-10 14:24:59 +08:00
+								for line in result:
 								    print(line)
 								from PIL import Image
 								font_path = 'path/tp/PaddleOCR/doc/fonts/simfang.ttf'
 								image = Image.open(img_path).convert('RGB')
 								im_show = draw_result(image, result,font_path=font_path)
 								im_show = Image.fromarray(im_show)
 								im_show.save('result.jpg')
 								```
-												opt doc

											
										
										
											2021-06-23 12:45:05 +08:00
+.1.2 命令行使用
-												merge dygraph

											
										
										
											2021-06-10 14:24:59 +08:00
+								```bash
 								paddlestructure --image_dir=../doc/table/1.png
 								```
-												add table ch readme

											
										
										
											2021-06-10 17:17:46 +08:00
+								### 参数说明
 								大部分参数和paddleocr whl包保持一致，见 [whl包文档](../doc/doc_ch/whl.md)
 								| 字段                    | 说明                                            | 默认值           |
 								|------------------------|------------------------------------------------------|------------------|
 								| output                 | excel和识别结果保存的地址                    | ./output/table            |
-												opt doc

											
										
										
											2021-06-24 10:02:52 +08:00
+								| table_max_len      |  表格结构模型预测时，图像的长边resize尺度             |  488            |
 								| table_model_dir      |  表格结构模型 inference 模型地址             |  None            |
 								| table_char_type      |  表格结构模型所用字典地址             |  ../ppocr/utils/dict/table_structure_dict.tx            |
-												add table ch readme

											
										
										
											2021-06-10 17:17:46 +08:00