Text information extraction in images and video: a survey

Keechul Jung, Kwang In Kim, Anil K. Jain

Research output: Contribution to journal, Article, peer-review

692 Citations (SciVal)


Text data present in images and video contain useful information for automatic annotation, indexing, and structuring of images. Extraction of this information involves detection, localization, tracking, extraction, enhancement, and recognition of the text from a given image. However, variations of text due to differences in size, style, orientation, and alignment, as well as low image contrast and complex backgrounds, make the problem of automatic text extraction extremely challenging. While comprehensive surveys of related problems such as face detection, document analysis, and image and video indexing can be found, the problem of text information extraction is not well surveyed. A large number of techniques have been proposed to address this problem, and the purpose of this paper is to classify and review these algorithms, discuss benchmark data and performance evaluation, and point out promising directions for future research.
Original language: English
Pages (from-to): 977-997
Number of pages: 21
Journal: Pattern Recognition
Issue number: 5
Publication status: Published - 24 Jan 2004

