WenmuZhou / PytorchOCR

A PyTorch-based OCR toolkit that supports common text detection and recognition algorithms

Text recognition: RecLmdbDataset has serious bugs

wkailiu opened this issue · comments

Bug 1: The alphabet file is never read; the characters of its file name are used instead


self.str2idx = {c: i for i, c in enumerate(config.alphabet)}

Replace with:

        with open(config.alphabet, 'r', encoding='utf-8') as file:
            alphabet = ''.join([s.strip('\n') for s in file.readlines()])
        alphabet += ' '
        self.str2idx = {c: i for i, c in enumerate(alphabet)}
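
To make the difference concrete, here is a minimal standalone sketch, not code from the repository. It assumes that `config.alphabet` holds a path to a text file with one character per line, as the fix above implies. The function names, the path `dict/alphabet.txt`, and the three-character alphabet are all made up for illustration.

```python
import os
import tempfile


def str2idx_from_path_string(alphabet_path):
    # Original behaviour: enumerates the characters of the path string itself.
    return {c: i for i, c in enumerate(alphabet_path)}


def str2idx_from_file(alphabet_path):
    # Proposed behaviour: read the file (one character per line), then append a space.
    with open(alphabet_path, 'r', encoding='utf-8') as file:
        alphabet = ''.join(s.strip('\n') for s in file.readlines())
    alphabet += ' '
    return {c: i for i, c in enumerate(alphabet)}


# Hypothetical path, used only to show what the original line indexes.
buggy = str2idx_from_path_string('dict/alphabet.txt')

# Hypothetical three-character alphabet written to a temporary file.
with tempfile.TemporaryDirectory() as tmp:
    path = os.path.join(tmp, 'alphabet.txt')
    with open(path, 'w', encoding='utf-8') as f:
        f.write('你\n好\n吗\n')
    fixed = str2idx_from_file(path)

print(sorted(buggy))  # characters of the path, not of the alphabet
print(fixed)
```

With the original line, any label character that does not happen to appear in the path has no index, so a lookup for it would fail.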

Bug 2: self.dict is missing the space character


self.dict = {}
for i, char in enumerate(dict_character):
    # NOTE: 0 is reserved for 'blank' token required by CTCLoss
    self.dict[char] = i + 1
# TODO replace ‘ ’ with special symbol
self.character = ['[blank]'] + dict_character + [' ']  # dummy '[blank]' token for CTCLoss (index 0)

Replace with:

        dict_character.append(" ")
        self.dict = {}
        for i, char in enumerate(dict_character):
            # NOTE: 0 is reserved for 'blank' token required by CTCLoss
            self.dict[char] = i + 1
        #TODO replace ‘ ’ with special symbol
        self.character = ['[blank]'] + dict_character  # dummy '[blank]' token for CTCLoss (index 0)
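
The effect of Bug 2 can be reproduced outside the repository with a small sketch. The helpers `build_original`, `build_fixed`, and `encode` are hypothetical stand-ins, not the library's converter API, and the fixed variant copies the list instead of calling `append` so that the caller's list is left unchanged.

```python
def build_original(dict_character):
    # Mirrors the original snippet: ' ' is added to the character list only.
    mapping = {}
    for i, char in enumerate(dict_character):
        mapping[char] = i + 1  # 0 is reserved for the CTC blank token
    character = ['[blank]'] + dict_character + [' ']
    return mapping, character


def build_fixed(dict_character):
    # Mirrors the proposed fix: ' ' is added before the mapping is built.
    dict_character = list(dict_character) + [' ']
    mapping = {}
    for i, char in enumerate(dict_character):
        mapping[char] = i + 1  # 0 is reserved for the CTC blank token
    character = ['[blank]'] + dict_character
    return mapping, character


def encode(text, mapping):
    # Hypothetical encoder: look up each character's index.
    return [mapping[c] for c in text]


chars = ['a', 'b', 'c']
original_map, original_chars = build_original(chars)
fixed_map, fixed_chars = build_fixed(chars)

try:
    encode('a b', original_map)
    original_ok = True
except KeyError:
    original_ok = False  # the space has no index in the original mapping

encoded = encode('a b', fixed_map)
decoded = ''.join(fixed_chars[i] for i in encoded)
```

Both variants produce the same character list for decoding; only the encoding dictionary differs, which is why the problem only shows up for labels that contain a space.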