Java document parser API to extract text, images, metadata and encoding from databases, Word, Excel, presentations, PDF, email, EPUB and ZIP files. ... XLSM, XLSB, XLT, XLTX, XLTM, XLA, XLAM. Presentations: PPT, PPTX ... XLSX, OST, XHTML, MHTML, XLTX, POTX, XLA, POTM, OTP, RAR, XML, DOTX, TAR, PPTM ...