Java document parser API to extract text, images, metadata & encoding from databases, Word, Excel, presentations, PDF, email, EPUB and ZIP files....DOTX, DOTM Spreadsheets : XLS, XLSX, XLSM, XLSB, XLT, XLTX, XLTM...HTML CHM PPSM PPTX PDF BZ2 EML XLSX OST XHTML MHTML XLTX POTX XLA...