This article explains that how to extract metadata from EPUB e-books getMetadata method is used....Comparison Product Solution GroupDocs...extract data from PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX, Emails...
To extract table of contents from Microsoft Office Word document getToc method is used. Table of contents is generated by paragraphs with H1-H9 build-in styles.
Warning getToc method returns null value if table of contents extraction isn’t supported for the document. For example, table of contents extraction isn’t supported for TXT files. Therefore, for TXT file getToc method returns null. If Microsoft Office Word document has no table of contents, getToc method returns an empty collection....Comparison Product Solution GroupDocs...extract data from PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX, Emails...
This article explains that how to extract containers items and iterate through container items in Java....Comparison Product Solution GroupDocs...extract data from PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX, Emails...
This article gives the knowledge about the regular expression (RegEx) search queries which are universal and very flexible, but at the same time, in large indexes, their performance becomes extremely low using Java search API....Comparison Product Solution GroupDocs...to search over your PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX and more...
It supports Docx, DOCM, DOC, DOT, DOTM, XLS, XLSX, PDF, PPT, JPG, PNG, HTML, EML and many more...Comparison Product Solution GroupDocs...files online? Try the to convert DOCX, XLSX, PPTX, and more. This...
This article explains that how to extract hyperlinks from document page....Comparison Product Solution GroupDocs...extract images from PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX, Emails...
GroupDocs.Metadata for Java provides functionality that allows working with different kinds of WordProcessing documents such as DOC, Docx, ODT, etc. For the full list of supported document formats please refer to Supported document formats.
Detecting the exact type of a document The following sample of code will help you to detect the exact type of a loaded document and extract some additional file format information.
Load a WordProcessing document Extract the root metadata package Use the getWordProcessingType method to obtain file format information advanced_usage....Comparison Product Solution GroupDocs...WordProcessing documents such as DOC, DOCX, ODT, etc. For the full list...
This article demonstrates how to convert files stored on local disk using GroupDocs.Conversion for Java API....Comparison Product Solution GroupDocs...to convert a Word document ( .docx ) to a PDF file and save it...
This article explains how to locate signature in the footer and header of Word document....Comparison Product Solution GroupDocs...signature will be placed in the docx document body Header - signature...