Segmentation and Alignment of Multi-oriented and Curved Text Lines from Document Images
Abstract
In this paper, we present a novel approach to segment and align multi-oriented and curved text-lines from document images. We
assumed that the input document image contains text-lines with arbitrary orientation and identified the arbitrary text string based on projection
profile. We employed anisotropic Gaussian filter bank on the identified arbitrary text region in order to smooth the text region, which helps to
detect the ridges which is a representative of a text-line path. The ridges are then labeled and a cubic B-spline is fitted to the text-line path
points. The orientation and curvature features of the text-line path is estimated using orientated gradients for each point and corresponding
curvature to these text-line path are computed. Text is aligned along the horizontally transformed line by rotating individual characters based
on the computed curvature information. Finally, the aligned text-lines are extracted, which can be fed into OCR for recognition. The evaluation
metrics was evaluated at text-line segmentation level and the results posted show a significant improvement. The resulting system is proven to
provide better results than most state of the art algorithms