Segmentation and Alignment of Multi-oriented and Curved Text Lines from Document Images
Too, Boaz K.
Prabhakar, C. J.
MetadataShow full item record
In this paper, we present a novel approach to segment and align multi-oriented and curved text-lines from document images. We assumed that the input document image contains text-lines with arbitrary orientation and identified the arbitrary text string based on projection profile. We employed anisotropic Gaussian filter bank on the identified arbitrary text region in order to smooth the text region, which helps to detect the ridges which is a representative of a text-line path. The ridges are then labeled and a cubic B-spline is fitted to the text-line path points. The orientation and curvature features of the text-line path is estimated using orientated gradients for each point and corresponding curvature to these text-line path are computed. Text is aligned along the horizontally transformed line by rotating individual characters based on the computed curvature information. Finally, the aligned text-lines are extracted, which can be fed into OCR for recognition. The evaluation metrics was evaluated at text-line segmentation level and the results posted show a significant improvement. The resulting system is proven to provide better results than most state of the art algorithms