Java document parser API To extract text, images, metadata & encoding from databases, Word, Excel, presentations, PDF, email, EPUB and ZIP files....archives, OST/PST mail data files, eBooks, markups, and PDF portfolios...images Extract metadata Emails eBooks PDF files PDF Portfolio Files...