Java document parser API to extract text, images, metadata and encoding from databases, Word, Excel, presentations, PDF, email, EPUB and ZIP files. ... Text and Markup documents Microsoft & OpenDocument Document parser ... presentations Parse Microsoft Word, Excel, PowerPoint and OpenDocument ...