PaddleOCR/doc/doc_en/quickstart_en.md


# Quick start of Chinese OCR model

## 1. Prepare for the environment

Please refer to [quick installation](./installation_en.md) to configure the PaddleOCR operating environment.

* Note: Support the use of PaddleOCR through whl package installation，pelease refer  [PaddleOCR Package](https://github.com/PaddlePaddle/PaddleOCR/blob/develop/doc/doc_en/whl_en.md).

## 2.inference models

The detection and recognition models on the mobile and server sides are as follows. For more models  (including multiple languages), please refer to [PP-OCR v1.1 series model list](../doc_ch/models_list.md)


| Model introduction    | Model name     |  Recommended scene      | Detection model | Direction Classifier | Recognition model |
| ------------ | --------------- | ----------------|---- | ---------- | -------- |
| Ultra-lightweight Chinese OCR model（8.1M） | ch_ppocr_mobile_v1.1_xx |Mobile-side/Server-side|[inference model](https://paddleocr.bj.bcebos.com/20-09-22/mobile/det/ch_ppocr_mobile_v1.1_det_infer.tar) / [pretrained model](https://paddleocr.bj.bcebos.com/20-09-22/mobile/det/ch_ppocr_mobile_v1.1_det_train.tar)|[inference model](https://paddleocr.bj.bcebos.com/20-09-22/cls/ch_ppocr_mobile_v1.1_cls_infer.tar) / [pretrained model](https://paddleocr.bj.bcebos.com/20-09-22/cls/ch_ppocr_mobile_v1.1_cls_train.tar) |[inference model](https://paddleocr.bj.bcebos.com/20-09-22/mobile/rec/ch_ppocr_mobile_v1.1_rec_infer.tar) / [pretrained model](https://paddleocr.bj.bcebos.com/20-09-22/mobile/rec/ch_ppocr_mobile_v1.1_rec_pre.tar)      |
| Universal Chinese OCR model（155.1M）   |ch_ppocr_server_v1.1_xx|Server-side |[inference model](https://paddleocr.bj.bcebos.com/20-09-22/server/det/ch_ppocr_server_v1.1_det_infer.tar) / [pretrained model](https://paddleocr.bj.bcebos.com/20-09-22/server/det/ch_ppocr_server_v1.1_det_train.tar)          |[inference model](https://paddleocr.bj.bcebos.com/20-09-22/cls/ch_ppocr_mobile_v1.1_cls_infer.tar) / [pretrained model](https://paddleocr.bj.bcebos.com/20-09-22/cls/ch_ppocr_mobile_v1.1_cls_train.tar)    |[inference model](https://paddleocr.bj.bcebos.com/20-09-22/server/rec/ch_ppocr_server_v1.1_rec_infer.tar) / [pretrained model](https://paddleocr.bj.bcebos.com/20-09-22/server/rec/ch_ppocr_server_v1.1_rec_pre.tar)  |

* If `wget` is not installed in the windows environment, you can copy the link to the browser to download when downloading the model, then uncompress it and place it in the corresponding directory.

Copy the download address of the `inference model` for detection and recognition in the table above, and uncompress them.

```
mkdir inference && cd inference
# Download the detection model and unzip
wget {url/of/detection/inference_model} && tar xf {name/of/detection/inference_model/package}
# Download the recognition model and unzip
wget {url/of/recognition/inference_model} && tar xf {name/of/recognition/inference_model/package}
# Download the direction classifier model and unzip
wget {url/of/classification/inference_model} && tar xf {name/of/classification/inference_model/package}
cd ..
```

Take the ultra-lightweight model as an example:

```
mkdir inference && cd inference
# Download the detection model of the ultra-lightweight Chinese OCR model and uncompress it
wget https://paddleocr.bj.bcebos.com/20-09-22/mobile/det/ch_ppocr_mobile_v1.1_det_infer.tar && tar xf ch_ppocr_mobile_v1.1_det_infer.tar
# Download the recognition model of the ultra-lightweight Chinese OCR model and uncompress it
wget https://paddleocr.bj.bcebos.com/20-09-22/mobile/rec/ch_ppocr_mobile_v1.1_rec_infer.tar && tar xf ch_ppocr_mobile_v1.1_rec_infer.tar
# Download the direction classifier model of the ultra-lightweight Chinese OCR model and uncompress it
wget https://paddleocr.bj.bcebos.com/20-09-22/cls/ch_ppocr_mobile_v1.1_cls_infer.tar && tar xf ch_ppocr_mobile_v1.1_cls_infer.tar
cd ..
```

After decompression, the file structure should be as follows:

```
|-inference
    |-ch_ppocr_mobile_v1.1_det_infer
        |- model
        |- params
    |-ch_ppocr_mobile_v1.1_rec_infer
        |- model
        |- params
    |-ch_ppocr_mobile_v1.1_cls_infer
        |- model
        |- params
    ...
```

## 3. Single image or image set prediction

* The following code implements text detection and recognition process. When performing prediction, you need to specify the path of a single image or image set through the parameter `image_dir`, the parameter `det_model_dir` specifies the path to detect the inference model, the parameter `rec_model_dir` specifies the path to identify the inference model, the parameter `use_angle_cls` specifies whether to use the direction classifier, the parameter `cls_model_dir` specifies the path to identify the direction classifier model, the parameter `use_space_char` specifies whether to predict the space char. The visual results are saved to the `./inference_results` folder by default.


```bash

# Predict a single image specified by image_dir
python3 tools/infer/predict_system.py --image_dir="./doc/imgs/11.jpg" --det_model_dir="./inference/ch_ppocr_mobile_v1.1_det_infer/"  --rec_model_dir="./inference/ch_ppocr_mobile_v1.1_rec_infer/" --cls_model_dir="./inference/ch_ppocr_mobile_v1.1_cls_infer/" --use_angle_cls=True --use_space_char=True

# Predict imageset specified by image_dir
python3 tools/infer/predict_system.py --image_dir="./doc/imgs/" --det_model_dir="./inference/ch_ppocr_mobile_v1.1_det_infer/"  --rec_model_dir="./inference/ch_ppocr_mobile_v1.1_rec_infer/" --cls_model_dir="./inference/ch_ppocr_mobile_v1.1_cls_infer/" --use_angle_cls=True --use_space_char=True

# If you want to use the CPU for prediction, you need to set the use_gpu parameter to False
python3 tools/infer/predict_system.py --image_dir="./doc/imgs/11.jpg" --det_model_dir="./inference/ch_ppocr_mobile_v1.1_det_infer/"  --rec_model_dir="./inference/ch_ppocr_mobile_v1.1_rec_infer/" --cls_model_dir="./inference/ch_ppocr_mobile_v1.1_cls_infer/" --use_angle_cls=True --use_space_char=True --use_gpu=False
```

- Universal Chinese OCR model

Please follow the above steps to download the corresponding models and update the relevant parameters, The example is as follows.

```
# Predict a single image specified by image_dir
python3 tools/infer/predict_system.py --image_dir="./doc/imgs/11.jpg" --det_model_dir="./inference/ch_ppocr_server_v1.1_det_infer/"  --rec_model_dir="./inference/ch_ppocr_server_v1.1_rec_infer/" --cls_model_dir="./inference/ch_ppocr_mobile_v1.1_cls_infer/" --use_angle_cls=True --use_space_char=True
```

* Note
    - If you want to use the recognition model which does not support space char recognition, please update the source code to the latest version and add parameters `--use_space_char=False`.
    - If you do not want to use direction classifier, please update the source code to the latest version and add parameters `--use_angle_cls=False`.


For more text detection and recognition tandem reasoning, please refer to the document tutorial
: [Inference with Python inference engine](./inference_en.md)。

In addition, the tutorial also provides other deployment methods for the Chinese OCR model:
- [Server-side C++ inference](../../deploy/cpp_infer/readme_en.md)
- [Service deployment](../../deploy/hubserving/readme_en.md)
- [End-to-end deployment](../../deploy/lite/readme_en.md)
-												add en docs

											
										
										
											2020-07-17 11:51:19 +08:00
 								# Quick start of Chinese OCR model
 								## 1. Prepare for the environment
 								Please refer to [quick installation](./installation_en.md) to configure the PaddleOCR operating environment.
-												add quick start en model

											
										
										
											2020-09-21 13:24:35 +08:00
+								* Note: Support the use of PaddleOCR through whl package installation，pelease refer  [PaddleOCR Package](https://github.com/PaddlePaddle/PaddleOCR/blob/develop/doc/doc_en/whl_en.md).
-												add en docs

											
										
										
											2020-07-17 11:51:19 +08:00
 								## 2.inference models
-												fix link in quickstart doc, test=document_fix

											
										
										
											2020-10-12 17:21:48 +08:00
+								The detection and recognition models on the mobile and server sides are as follows. For more models  (including multiple languages), please refer to [PP-OCR v1.1 series model list](../doc_ch/models_list.md)
-												add en docs

											
										
										
											2020-07-17 11:51:19 +08:00
-												add quick start en model

											
										
										
											2020-09-21 13:24:35 +08:00
 								| Model introduction    | Model name     |  Recommended scene      | Detection model | Direction Classifier | Recognition model |
 								| ------------ | --------------- | ----------------|---- | ---------- | -------- |
-												update clas_train link

											
										
										
											2020-09-22 17:05:47 +08:00
+								| Ultra-lightweight Chinese OCR model（8.1M） | ch_ppocr_mobile_v1.1_xx |Mobile-side/Server-side|[inference model](https://paddleocr.bj.bcebos.com/20-09-22/mobile/det/ch_ppocr_mobile_v1.1_det_infer.tar) / [pretrained model](https://paddleocr.bj.bcebos.com/20-09-22/mobile/det/ch_ppocr_mobile_v1.1_det_train.tar)|[inference model](https://paddleocr.bj.bcebos.com/20-09-22/cls/ch_ppocr_mobile_v1.1_cls_infer.tar) / [pretrained model](https://paddleocr.bj.bcebos.com/20-09-22/cls/ch_ppocr_mobile_v1.1_cls_train.tar) |[inference model](https://paddleocr.bj.bcebos.com/20-09-22/mobile/rec/ch_ppocr_mobile_v1.1_rec_infer.tar) / [pretrained model](https://paddleocr.bj.bcebos.com/20-09-22/mobile/rec/ch_ppocr_mobile_v1.1_rec_pre.tar)      |
 								| Universal Chinese OCR model（155.1M）   |ch_ppocr_server_v1.1_xx|Server-side |[inference model](https://paddleocr.bj.bcebos.com/20-09-22/server/det/ch_ppocr_server_v1.1_det_infer.tar) / [pretrained model](https://paddleocr.bj.bcebos.com/20-09-22/server/det/ch_ppocr_server_v1.1_det_train.tar)          |[inference model](https://paddleocr.bj.bcebos.com/20-09-22/cls/ch_ppocr_mobile_v1.1_cls_infer.tar) / [pretrained model](https://paddleocr.bj.bcebos.com/20-09-22/cls/ch_ppocr_mobile_v1.1_cls_train.tar)    |[inference model](https://paddleocr.bj.bcebos.com/20-09-22/server/rec/ch_ppocr_server_v1.1_rec_infer.tar) / [pretrained model](https://paddleocr.bj.bcebos.com/20-09-22/server/rec/ch_ppocr_server_v1.1_rec_pre.tar)  |
-												add quick start en model

											
										
										
											2020-09-21 13:24:35 +08:00
-												revert config

											
										
										
											2020-09-21 14:04:53 +08:00
+								* If `wget` is not installed in the windows environment, you can copy the link to the browser to download when downloading the model, then uncompress it and place it in the corresponding directory.
-												add en docs

											
										
										
											2020-07-17 11:51:19 +08:00
 								Copy the download address of the `inference model` for detection and recognition in the table above, and uncompress them.
 								```
 								mkdir inference && cd inference
 								# Download the detection model and unzip
 								wget {url/of/detection/inference_model} && tar xf {name/of/detection/inference_model/package}
 								# Download the recognition model and unzip
 								wget {url/of/recognition/inference_model} && tar xf {name/of/recognition/inference_model/package}
-												add quick start en model

											
										
										
											2020-09-21 13:24:35 +08:00
+								# Download the direction classifier model and unzip
 								wget {url/of/classification/inference_model} && tar xf {name/of/classification/inference_model/package}
-												add en docs

											
										
										
											2020-07-17 11:51:19 +08:00
+								cd ..
 								```
 								Take the ultra-lightweight model as an example:
 								```
 								mkdir inference && cd inference
 								# Download the detection model of the ultra-lightweight Chinese OCR model and uncompress it
-												add quick start en model

											
										
										
											2020-09-21 13:24:35 +08:00
+								wget https://paddleocr.bj.bcebos.com/20-09-22/mobile/det/ch_ppocr_mobile_v1.1_det_infer.tar && tar xf ch_ppocr_mobile_v1.1_det_infer.tar
-												add en docs

											
										
										
											2020-07-17 11:51:19 +08:00
+								# Download the recognition model of the ultra-lightweight Chinese OCR model and uncompress it
-												add quick start en model

											
										
										
											2020-09-21 13:24:35 +08:00
+								wget https://paddleocr.bj.bcebos.com/20-09-22/mobile/rec/ch_ppocr_mobile_v1.1_rec_infer.tar && tar xf ch_ppocr_mobile_v1.1_rec_infer.tar
 								# Download the direction classifier model of the ultra-lightweight Chinese OCR model and uncompress it
-												fix cls link

											
										
										
											2020-09-21 14:45:29 +08:00
+								wget https://paddleocr.bj.bcebos.com/20-09-22/cls/ch_ppocr_mobile_v1.1_cls_infer.tar && tar xf ch_ppocr_mobile_v1.1_cls_infer.tar
-												add en docs

											
										
										
											2020-07-17 11:51:19 +08:00
+								cd ..
 								```
 								After decompression, the file structure should be as follows:
 								```
 								|-inference
-												add quick start en model

											
										
										
											2020-09-21 13:24:35 +08:00
+								    |-ch_ppocr_mobile_v1.1_det_infer
 								        |- model
 								        |- params
 								    |-ch_ppocr_mobile_v1.1_rec_infer
-												add en docs

											
										
										
											2020-07-17 11:51:19 +08:00
+								        |- model
 								        |- params
-												fix cls link

											
										
										
											2020-09-21 14:45:29 +08:00
+								    |-ch_ppocr_mobile_v1.1_cls_infer
-												add en docs

											
										
										
											2020-07-17 11:51:19 +08:00
+								        |- model
 								        |- params
 								    ...
 								```
 								## 3. Single image or image set prediction
-												add quick start en model

											
										
										
											2020-09-21 13:24:35 +08:00
+								* The following code implements text detection and recognition process. When performing prediction, you need to specify the path of a single image or image set through the parameter `image_dir`, the parameter `det_model_dir` specifies the path to detect the inference model, the parameter `rec_model_dir` specifies the path to identify the inference model, the parameter `use_angle_cls` specifies whether to use the direction classifier, the parameter `cls_model_dir` specifies the path to identify the direction classifier model, the parameter `use_space_char` specifies whether to predict the space char. The visual results are saved to the `./inference_results` folder by default.
-												add en docs

											
										
										
											2020-07-17 11:51:19 +08:00
 								```bash
 								# Predict a single image specified by image_dir
-												add quick start en model

											
										
										
											2020-09-21 13:24:35 +08:00
+								python3 tools/infer/predict_system.py --image_dir="./doc/imgs/11.jpg" --det_model_dir="./inference/ch_ppocr_mobile_v1.1_det_infer/"  --rec_model_dir="./inference/ch_ppocr_mobile_v1.1_rec_infer/" --cls_model_dir="./inference/ch_ppocr_mobile_v1.1_cls_infer/" --use_angle_cls=True --use_space_char=True
-												add en docs

											
										
										
											2020-07-17 11:51:19 +08:00
 								# Predict imageset specified by image_dir
-												add quick start en model

											
										
										
											2020-09-21 13:24:35 +08:00
+								python3 tools/infer/predict_system.py --image_dir="./doc/imgs/" --det_model_dir="./inference/ch_ppocr_mobile_v1.1_det_infer/"  --rec_model_dir="./inference/ch_ppocr_mobile_v1.1_rec_infer/" --cls_model_dir="./inference/ch_ppocr_mobile_v1.1_cls_infer/" --use_angle_cls=True --use_space_char=True
-												add en docs

											
										
										
											2020-07-17 11:51:19 +08:00
 								# If you want to use the CPU for prediction, you need to set the use_gpu parameter to False
-												add quick start en model

											
										
										
											2020-09-21 13:24:35 +08:00
+								python3 tools/infer/predict_system.py --image_dir="./doc/imgs/11.jpg" --det_model_dir="./inference/ch_ppocr_mobile_v1.1_det_infer/"  --rec_model_dir="./inference/ch_ppocr_mobile_v1.1_rec_infer/" --cls_model_dir="./inference/ch_ppocr_mobile_v1.1_cls_infer/" --use_angle_cls=True --use_space_char=True --use_gpu=False
-												add en docs

											
										
										
											2020-07-17 11:51:19 +08:00
+								```
 								- Universal Chinese OCR model
 								Please follow the above steps to download the corresponding models and update the relevant parameters, The example is as follows.
 								```
 								# Predict a single image specified by image_dir
-												add quick start en model

											
										
										
											2020-09-21 13:24:35 +08:00
+								python3 tools/infer/predict_system.py --image_dir="./doc/imgs/11.jpg" --det_model_dir="./inference/ch_ppocr_server_v1.1_det_infer/"  --rec_model_dir="./inference/ch_ppocr_server_v1.1_rec_infer/" --cls_model_dir="./inference/ch_ppocr_mobile_v1.1_cls_infer/" --use_angle_cls=True --use_space_char=True
-												add en docs

											
										
										
											2020-07-17 11:51:19 +08:00
+								```
-												add quick start en model

											
										
										
											2020-09-21 13:24:35 +08:00
+								* Note
 								    - If you want to use the recognition model which does not support space char recognition, please update the source code to the latest version and add parameters `--use_space_char=False`.
 								    - If you do not want to use direction classifier, please update the source code to the latest version and add parameters `--use_angle_cls=False`.
-												add en docs

											
										
										
											2020-07-17 11:51:19 +08:00
 								For more text detection and recognition tandem reasoning, please refer to the document tutorial
 								: [Inference with Python inference engine](./inference_en.md)。
 								In addition, the tutorial also provides other deployment methods for the Chinese OCR model:
 								- [Server-side C++ inference](../../deploy/cpp_infer/readme_en.md)
-												fix serving link (#994)

* fix serving link, test=document_fix

* fix link
											
										
										
											2020-10-24 15:33:57 +08:00
+								- [Service deployment](../../deploy/hubserving/readme_en.md)
-												add en docs

											
										
										
											2020-07-17 11:51:19 +08:00
+								- [End-to-end deployment](../../deploy/lite/readme_en.md)