Java document parser API to extract text, images, metadata & encoding from databases, Word, Excel, presentations, PDF, email, EPUB and ZIP files....Compression & Packaging : ZIP, CHM Database : ADO.NET BOM : UTF32...PDF, POT, POTM, POTX Ebook : CHM, EPUB, FB2 Markup : HTML GroupDocs...