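Java document parser API

Extract text, images, metadata, and encoding information from databases, Word documents, Excel spreadsheets, presentations, PDF files, emails, EPUB eBooks, and ZIP archives.

Extraction features include ... extracting images and extracting metadata. Covered document types include emails, eBooks, PDF files, and PDF Portfolios ...

Supported formats include:
... XHTML, MHTML, MD, XML
Portable formats: PDF
Email: PST, OST ...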
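The parser's own classes and method names do not appear in this excerpt, so they are not shown here. As a rough illustration of what text extraction from a ZIP archive involves, the sketch below uses only the JDK's `java.util.zip` package, not the product's API. The class name `ZipTextExtractor`, the helper `sampleArchive`, and the assumption that every entry is UTF-8 text are hypothetical and chosen purely for the example.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InputStream;
import java.nio.charset.StandardCharsets;
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.zip.ZipEntry;
import java.util.zip.ZipInputStream;
import java.util.zip.ZipOutputStream;

// Hypothetical helper built on the JDK only; it is NOT the parser library's API.
public class ZipTextExtractor {

    // Reads every non-directory entry of a ZIP stream and decodes it as UTF-8.
    // Returns a map of entry name -> text, in archive order.
    // Assumption: all entries are plain text (requires Java 9+ for readAllBytes).
    public static Map<String, String> extractText(InputStream in) throws IOException {
        Map<String, String> result = new LinkedHashMap<>();
        try (ZipInputStream zip = new ZipInputStream(in)) {
            ZipEntry entry;
            while ((entry = zip.getNextEntry()) != null) {
                if (entry.isDirectory()) {
                    continue;
                }
                // ZipInputStream reports end-of-stream at the end of the current
                // entry, so readAllBytes() returns only this entry's content.
                byte[] bytes = zip.readAllBytes();
                result.put(entry.getName(), new String(bytes, StandardCharsets.UTF_8));
            }
        }
        return result;
    }

    // Builds a small in-memory archive with two text entries for demonstration.
    public static byte[] sampleArchive() throws IOException {
        ByteArrayOutputStream buffer = new ByteArrayOutputStream();
        try (ZipOutputStream out = new ZipOutputStream(buffer)) {
            out.putNextEntry(new ZipEntry("notes/readme.txt"));
            out.write("hello".getBytes(StandardCharsets.UTF_8));
            out.closeEntry();
            out.putNextEntry(new ZipEntry("data.csv"));
            out.write("a,b".getBytes(StandardCharsets.UTF_8));
            out.closeEntry();
        }
        return buffer.toByteArray();
    }

    public static void main(String[] args) throws IOException {
        Map<String, String> texts = extractText(new ByteArrayInputStream(sampleArchive()));
        // Print each entry name with the text recovered from it.
        texts.forEach((name, text) -> System.out.println(name + " -> " + text));
    }
}
```

A dedicated parser would additionally need to detect each entry's real format and encoding (for example a PDF or Word file inside the archive) instead of assuming UTF-8 text, which is the part this sketch leaves out.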