Java document parser API to extract text, images, metadata, and encoding from databases, Word, Excel, presentations, PDF, email, EPUB, and ZIP files. ... XLS, XLSX, XLSM, XLSB, XLT, XLTX, XLTM, XLA, XLAM Presentations ... BZ2, EML, XLSX, OST, XHTML, MHTML, XLTX, POTX, XLA, POTM, OTP, RAR, XML, DOTX ...