PaddleOCR 支持80+语言识别的OCR工具
PaddleOCR简介
PaddleOCR,Python第三方库,基于飞桨的OCR工具包,实用的超轻量级OCR系统,支持80+语言识别,提供数据注释和合成工具,支持服务器、移动、嵌入式和物联网设备之间的培训和部署。
PaddleOCR应用实例
from paddleocr import PaddleOCR,draw_ocr
# Paddleocr supports Chinese, English, French, German, Korean and Japanese.
# You can set the parameter `lang` as `ch`, `en`, `french`, `german`, `korean`, `japan`
# to switch the language model in order.
ocr = PaddleOCR(use_angle_cls=True, lang='en') # need to run only once to download and load model into memory
img_path = 'PaddleOCR/doc/imgs_en/img_12.jpg'
result = ocr.ocr(img_path, cls=True)
for idx in range(len(result)):
res = result[idx]
for line in res:
print(line)
# draw result
from PIL import Image
result = result[0]
image = Image.open(img_path).convert('RGB')
boxes = [line[0] for line in result]
txts = [line[1][0] for line in result]
scores = [line[1][1] for line in result]
im_show = draw_ocr(image, boxes, txts, scores, font_path='/path/to/PaddleOCR/doc/fonts/simfang.ttf')
im_show = Image.fromarray(im_show)
im_show.save('result.jpg')
PaddleOCR Github统计数据
Apache-2.0 license
Github 34.8k stars
PaddleOCR安装命令
pip install paddleocr
PaddleOCR Python版本要求
Python 3.7+
免责声明:内容编辑自网络,仅供参考,不保证正确性,不作任何决策依据!!以上数据皆截止于博文的写稿日期。