PaddleOCR: an OCR toolkit that recognizes 80+ languages

Introduction to PaddleOCR

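PaddleOCR is a third-party Python library: an OCR toolkit built on PaddlePaddle (飞桨). It is a practical, ultra-lightweight OCR system that recognizes 80+ languages, provides data annotation and data synthesis tools, and supports training and deployment on servers, mobile devices, embedded devices, and IoT devices.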

PaddleOCR usage example

from paddleocr import PaddleOCR, draw_ocr

# PaddleOCR supports Chinese, English, French, German, Korean, and Japanese.
# Set the `lang` parameter to `ch`, `en`, `french`, `german`, `korean`, or `japan`,
# respectively, to select the language model.
# Creating the engine downloads the model and loads it into memory; this only needs to run once.
ocr = PaddleOCR(use_angle_cls=True, lang='en')
img_path = 'PaddleOCR/doc/imgs_en/img_12.jpg'
result = ocr.ocr(img_path, cls=True)
# Each entry of result is a list of [box, (text, score)] items.
for res in result:
    for line in res:
        print(line)

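# Draw the detected boxes, texts, and scores onto the image
from PIL import Image
result = result[0]  # items for the first entry of the result
image = Image.open(img_path).convert('RGB')
boxes = [line[0] for line in result]      # corner points of each text box
txts = [line[1][0] for line in result]    # recognized strings
scores = [line[1][1] for line in result]  # recognition confidence values
# font_path is a placeholder; point it at the font file in your own PaddleOCR checkout
im_show = draw_ocr(image, boxes, txts, scores, font_path='/path/to/PaddleOCR/doc/fonts/simfang.ttf')
im_show = Image.fromarray(im_show)
im_show.save('result.jpg')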
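
The example above reads each recognized item as line[0] (the box), line[1][0] (the text), and line[1][1] (the score). As a rough sketch of how that structure could be post-processed, the helper below keeps only the text whose score passes a threshold. The function name extract_text, the min_score default, and the sample data are made up for illustration and are not part of PaddleOCR; the guard for empty entries is an assumption about how an image with no detected text might be reported.

```python
def extract_text(result, min_score=0.5):
    """Collect recognized strings whose confidence is at least min_score.

    `result` is assumed to have the shape used in the example above:
    a list of entries, each a list of [box, (text, score)] items.
    """
    texts = []
    for page in result:
        if not page:  # assumption: an entry may be None or empty when no text is found
            continue
        for _box, (text, score) in page:
            if score >= min_score:
                texts.append(text)
    return texts


# Hypothetical data in the same shape, so the sketch runs without PaddleOCR installed.
sample = [[
    [[[10, 10], [100, 10], [100, 30], [10, 30]], ("HELLO", 0.98)],
    [[[10, 40], [100, 40], [100, 60], [10, 60]], ("w0rld", 0.42)],
]]

print(extract_text(sample))
```

With real output, something like extract_text(ocr.ocr(img_path, cls=True), min_score=0.8) would be the equivalent call; which threshold is sensible depends on the images.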

PaddleOCR GitHub statistics

Apache-2.0 license

GitHub: 34.8k stars

PaddleOCR installation command

pip install paddleocr

PaddleOCR Python version requirement

Python 3.7+
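
A small pre-flight check can confirm the interpreter version and whether the package is present before running the example. This is only a sketch: the helper names python_ok and installed_version are made up here, the minimum version is simply the 3.7 figure quoted above, and checking for paddlepaddle alongside paddleocr assumes the PaddlePaddle framework is installed as a separate package.

```python
import sys

try:
    from importlib import metadata  # standard library from Python 3.8 on
except ImportError:  # Python 3.7 has no importlib.metadata
    metadata = None

MIN_PYTHON = (3, 7)  # the requirement quoted in this post


def python_ok(version_info=sys.version_info, minimum=MIN_PYTHON):
    """Return True if the (major, minor) version meets the minimum."""
    return tuple(version_info[:2]) >= tuple(minimum)


def installed_version(package):
    """Return the installed version string of a package, or None if unavailable."""
    if metadata is None:
        return None
    try:
        return metadata.version(package)
    except metadata.PackageNotFoundError:
        return None


if __name__ == "__main__":
    print("Python version OK:", python_ok())
    for name in ("paddlepaddle", "paddleocr"):  # assumption: both are needed
        print(name, installed_version(name) or "not installed")
```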

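Disclaimer: this content was compiled from online sources and is for reference only. Its accuracy is not guaranteed, and it should not be used as the basis for any decision. All figures above are as of the date this post was written.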
