Segmentation and Alignment of Multi-oriented and Curved Text Lines from Document Images

dc.contributor.authorToo, Boaz K.
dc.contributor.authorPrabhakar, C. J.
dc.date.accessioned2018-07-10T07:32:45Z
dc.date.available2018-07-10T07:32:45Z
dc.date.issued2015-06
dc.description.abstractIn this paper, we present a novel approach to segment and align multi-oriented and curved text-lines from document images. We assumed that the input document image contains text-lines with arbitrary orientation and identified the arbitrary text string based on projection profile. We employed anisotropic Gaussian filter bank on the identified arbitrary text region in order to smooth the text region, which helps to detect the ridges which is a representative of a text-line path. The ridges are then labeled and a cubic B-spline is fitted to the text-line path points. The orientation and curvature features of the text-line path is estimated using orientated gradients for each point and corresponding curvature to these text-line path are computed. Text is aligned along the horizontally transformed line by rotating individual characters based on the computed curvature information. Finally, the aligned text-lines are extracted, which can be fed into OCR for recognition. The evaluation metrics was evaluated at text-line segmentation level and the results posted show a significant improvement. The resulting system is proven to provide better results than most state of the art algorithmsen_US
dc.identifier.citationInternational journal of Machine Intelligence, (Bioinfo Publications), vol. 6, issue 1, pages 426-434en_US
dc.identifier.issn0975-9166
dc.identifier.urihttp://hdl.handle.net/123456789/1754
dc.language.isoenen_US
dc.subjectText-Lines detectionen_US
dc.subjectText-line segmentationen_US
dc.subjectoriented gradientsen_US
dc.titleSegmentation and Alignment of Multi-oriented and Curved Text Lines from Document Imagesen_US
dc.typeArticleen_US
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Segmentation and Alignment of Multi-oriented and Curved Text Lines from Document Images.pdf
Size:
816.94 KB
Format:
Adobe Portable Document Format
Description:
Full text
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: