image_shape parameter #7820

ciberola · 2022-10-04T20:34:35Z

ciberola
Oct 4, 2022

Hi Paddle Enthusiasts!
I would like to aks about parameter image_shape.
By default in yml file it is set as:
"image_shape: [3, 48, 320]",
but in documentation about training (https://github.com/PaddlePaddle/PaddleOCR/blob/release/2.6/doc/doc_en/recognition_en.md) during inference tests we see command:
python3 tools/infer/predict_rec.py --image_dir="./doc/imgs_words_en/word_336.png" --rec_model_dir="./your inference model" --rec_image_shape="3, 32, 100" --rec_char_dict_path="your text dict path"
Can someone explain me in detail, what is this parameter used for?

andyjiang1116 · 2022-10-09T06:23:37Z

andyjiang1116
Oct 9, 2022
Collaborator

it depends on which model you used, for PP-OCRv3 rec model, the input image shape is 3,48,320, for CRNN model, the input shape is 3,32,100. you can set this param to fit your model, usually it's same with your training config.

0 replies

sheltonsuen · 2022-12-01T08:12:55Z

sheltonsuen
Dec 1, 2022

I'm also wondering what is this parameter for, and I can see if the rec_image_shape not consitent with your traning config, the predict result of the modle sometime will miss some words

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

image_shape parameter #7820

{{title}}

Replies: 2 comments

{{title}}

{{title}}

Select a reply

image_shape parameter #7820

ciberola Oct 4, 2022

Replies: 2 comments

andyjiang1116 Oct 9, 2022 Collaborator

sheltonsuen Dec 1, 2022

ciberola
Oct 4, 2022

andyjiang1116
Oct 9, 2022
Collaborator

sheltonsuen
Dec 1, 2022