Paper
4 May 2022 Text region extraction method for historical Tibetan document based on border detection
Yiqun Wang, Weilan Wang, Zhengqi Cai
Author Affiliations +
Proceedings Volume 12172, International Conference on Electronic Information Engineering and Computer Communication (EIECC 2021); 121720C (2022) https://doi.org/10.1117/12.2634657
Event: International Conference on Electronic Information Engineering and Computer Communication (EIECC 2021), 2021, Nanchang, China
Abstract
The text region extraction is the key step for optical character recognition (OCR) operation. According to the layout characteristics of historical Tibetan document, this paper proposes a text region extraction method based on border detection to extract text region from documents. Firstly, the character height and stroke width are estimated and the border region position is detected. Then, the position of decorative lines surrounding the body text region is detected by heuristic search. Finally, the mask image of the document is generated according to the position relationship between the text regions and border, then different text regions are extracted. Experiments on dataset of historical Tibetan document show that this method can effectively overcome the problems of page tilt, border and decorative lines fracture, and demonstrate the effectiveness of the proposed method.
© (2022) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Yiqun Wang, Weilan Wang, and Zhengqi Cai "Text region extraction method for historical Tibetan document based on border detection", Proc. SPIE 12172, International Conference on Electronic Information Engineering and Computer Communication (EIECC 2021), 121720C (4 May 2022); https://doi.org/10.1117/12.2634657
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Binary data

Optical character recognition

Image quality

Image processing

Analytical research

Edge detection

Image segmentation

RELATED CONTENT

Automatic text extraction from color image
Proceedings of SPIE (May 30 2000)
Segmentation of white rat sperm image
Proceedings of SPIE (December 05 2011)
Blurriness estimation in video frames a study on smooth...
Proceedings of SPIE (February 08 2010)

Back to Top