Java document parser API to extract text, images, metadata & encoding from databases, Word, Excel, presentations, PDF, email, EPUB and Zip files....protected files and containers like ZIP archives, OST/PST mail data files...files PDF Portfolio Files within ZIP archives Text and Markup documents...