Learn how to get edited document HTML markup - body without head tag, content in raw and base64 form, and others using Groupdocs.Editor for Node.js via Java API....Navigation Products GroupDocs.Total Product Family GroupDocs.Viewer Product...Solution GroupDocs.Annotation Product Solution GroupDocs.Conversion...
This article explains that how to extract text in Raw mode....Navigation Products GroupDocs.Total Product Family GroupDocs.Viewer Product...Solution GroupDocs.Annotation Product Solution GroupDocs.Conversion...
To extract table of contents from Microsoft Office Word document getToc method is used. Table of contents is generated by paragraphs with H1-H9 build-in styles.
Warning getToc method returns null value if table of contents extraction isn’t supported for the document. For example, table of contents extraction isn’t supported for TXT files. Therefore, for TXT file getToc method returns null. If Microsoft Office Word document has no table of contents, getToc method returns an empty collection....Navigation Products GroupDocs.Total Product Family GroupDocs.Viewer Product...Solution GroupDocs.Annotation Product Solution GroupDocs.Conversion...
This article demonstrates how you can convert to and from email formats with Groupdocs.Conversion for Java....Navigation Products GroupDocs.Total Product Family GroupDocs.Viewer Product...Solution GroupDocs.Annotation Product Solution GroupDocs.Conversion...
Using the Groupdocs.Metadata for Python via .NET you can easily search metadata and extract desired metadata properties from PDF, DOCX, PPTX, XLSX, images, audio, video and many other files of different types in your Java solution....Navigation Products GroupDocs.Total Product Family GroupDocs.Viewer Product...Solution GroupDocs.Annotation Product Solution GroupDocs.Conversion...
To extract metadata from PDF documents getMetadata method is used. This method allows to extract the following metadata:
Name Description title The title of the presentation. subject The subject of the presentation. keywords The keyword of the presentation. author The name of the presentation’s author. application The name of the application. application-version The version number of the application that created the presentation. created-time The time of the presentation creation. last-saved-time The time of the the presentation when it was last saved....Navigation Products GroupDocs.Total Product Family GroupDocs.Viewer Product...Solution GroupDocs.Annotation Product Solution GroupDocs.Conversion...
This article explains that how to extract metadata from EPUB e-books getMetadata method is used....Navigation Products GroupDocs.Total Product Family GroupDocs.Viewer Product...Solution GroupDocs.Annotation Product Solution GroupDocs.Conversion...
This article gives the knowledge about the regular expression (RegEx) search queries which are universal and very flexible, but at the same time, in large indexes, their performance becomes extremely low using Java search API....Navigation Products GroupDocs.Total Product Family GroupDocs.Viewer Product...Solution GroupDocs.Annotation Product Solution GroupDocs.Conversion...
Convert web documents (CHM, HTML) to PDF, PNG, or JPEG using the Groupdocs.Viewer Python API....Navigation Products GroupDocs.Total Product Family GroupDocs.Viewer Product...Solution GroupDocs.Annotation Product Solution GroupDocs.Conversion...