Java document parser API to extract text, images, metadata & encoding from databases, Word, Excel, presentations, PDF, email, EPUB and ZIP files....Processing : DOC, DOCX, DOCM, DOT, DOTX, DOTM Spreadsheets : XLS...XLT TIF EPUB PNG ODT JPEG GZ DOT GIF ONE JPG XLTM HTML CHM PPSM...