LHNCBC: Document Abstract
Year: 2002
LHNCBC-2002-004
Text Verification in an Automated System for the Extraction of Bibliographic Data
Thoma GR, Ford G, Le DX, Li Z
Proc. 5th International Workshop on Document Analysis Systems, Springer-Verlag: Berlin. 2002 Aug: 423-32.
An essential stage in any text extraction system is the manual verification of the printed material converted by OCR. This proves to be the most labor-intensive step in the process. In a system built and deployed at the National Library of Medicine to automatically extract bibliographic data from scanned biomedical journals, alternative means were considered to validate the text. This paper describes two approaches and gives preliminary performance data.