Java document parser API To extract text, images, metadata & encoding from databases, Word, Excel, presentations, PDF, email, EPUB and ZIP files....POTX, POTM, ODP, OTP OneNote : ONE Email : MSG, EML, EMLX, PST,...Document : PDF, POT, POTM, POTX Ebook : CHM, EPUB, FB2 Markup : HTML...