Java document parser API to extract text, images, metadata and encoding from databases, Word, Excel, presentations, PDF, email, EPUB and ZIP files. ... PPTM, PPS, PPSX, PPSM, POT, POTX, POTM; OneNote: ONE; OpenDocument ... EML, XLSX, OST, XHTML, MHTML, XLTX, POTX, XLA, POTM, OTP, RAR, XML, DOTX, TAR ...