-
摘要: 提出了一种基于自适应特征与多级反馈模型的新颖的字符分割方法,对文字图像质量与中英文混排格式有较好的自适应能力.该方法的主要思想就是将一个分割过程分成很多层,每层都会由一个主要特征来指导字符分割与中英文预分类,然后将分割层的结果反馈至当前分割层或前面的分割层,并指导下一层的分割.该方法将字符分割、中英文预分类和字符识别这三者进行了很好的融合,大大提高了字符分割与识别的正确率.
-
关键词:
- 中英文混排文档分割 /
- 中英文预分类 /
- 自适应特征与多级反馈模型 /
- 对文档图像的自适应特性 /
- OCR
Abstract: This paper proposes a novel method to segment document image based on adaptive feature and multiple phase feedback (AFMPF) model, which is adaptive to the blurring of document image and various patterns of mixed Chinese/English document. First, the whole process of segmentation is divided into several phases and each phase is allocated a primary feature to segment document image. Second, the segmentation result of each phase is fed back to the current or previous phase, and directs the next phase segmentation. This method causes good and effective combination among character segmentation, pre-classification of Chinese/English and character recognition, which improves greatly the accuracy of character segmentation and recognition.
计量
- 文章访问数: 3361
- HTML全文浏览量: 123
- PDF下载量: 1574
- 被引次数: 0