DocSearcher is a free and open source program that uses the Open Source Lucene and POI Apache APIs as well as the Open Source PDF Box API to provide searching capabilities for HTML, MS Word, MS Excel, RTF, PDF, Open Office (and Star Office), and text document.
Requirements:
· Java 1.5 or later
What`s New in This Release: [ read full changelog ]
· refactored PDF converter
· removed old multivalent PDF extractor
· updated PDF Box to 0.7.3
· changed the Lucene date to new format (DateTools)
· refactored internal filetype handling