AAS 203rd Meeting, January 2004
Session 20 Accessing Data Bases
Poster, Monday, January 5, 2004, 9:20am-6:30pm, Hanover Hall

[Previous] | [Session 20] | [Next]


[20.06] Searching the full text in the ADS article service

E.A. Henneken, G. Eichhorn, M.J. Kurtz, A. Accomazzi, C.S. Grant, S.S. Murray (Harvard-Smithsonian CfA.)

The NASA Astrophysics Data System (ADS) provides world-wide free access to a searchable bibliographic database containing over 3.5 million records and a full-text archive with 2.5 million scanned pages. We have recently extracted the textual information from these scanned pages by using Optical Character Recognition (OCR) software. While this process does not yield a completely accurate reproduction of the text in the original papers, it has allowed us to create a searchable interface for this body of full-text. This allows ADS users to locate any page in the archive which contains a particular word or combination of words. While this interface is still under development, we encourage users to test it and give us feedback. An added benefit of the OCR process is that we are often able to automatically extract and process reference lists from the scanned papers, thus adding to the ADS citation database.

The ADS is funded by NASA Grant NCC5-189


If you would like more information about this abstract, please follow the link to http://ads.harvard.edu. This link was provided by the author. When you follow it, you will leave the Web site for this meeting; to return, you should use the Back comand on your browser.

The author(s) of this abstract have provided an email address for comments about the abstract: ehenneken@cfa.harvard.edu

[Previous] | [Session 20] | [Next]

Bulletin of the American Astronomical Society, 35#5
© 2003. The American Astronomical Soceity.