To extract table of contents from Microsoft Office Word document getToc method is used. Table of contents is generated by paragraphs with H1-H9 build-in styles.
Warning getToc method returns null value if table of contents extraction isn’t supported for the document. For example, table of contents extraction isn’t supported for TXT files. Therefore, for TXT file getToc method returns null. If Microsoft Office Word document has no table of contents, getToc method returns an empty collection....Comparison Product Solution GroupDocs...data from PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX, Emails and more...
To extract metadata from PDF documents getMetadata method is used. This method allows to extract the following metadata:
Name Description title The title of the presentation. subject The subject of the presentation. keywords The keyword of the presentation. author The name of the presentation’s author. application The name of the application. application-version The version number of the application that created the presentation. created-time The time of the presentation creation. last-saved-time The time of the the presentation when it was last saved....Comparison Product Solution GroupDocs...data from PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX, Emails and more...
This article explains that how to extract metadata from EPUB e-books getMetadata method is used....Comparison Product Solution GroupDocs...data from PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX, Emails and more...
Render documents as HTML, PDF, JPEG, or PNG with GroupDocs.Viewer for Python. No third-party software needed!...Comparison Product Solution GroupDocs...documents in the (DOCX, XLSX, PPTX, PDF, DWG, PSD, HTML, ZIP, and...
Learn how to extract metadata from PDF, Word, Excel, PowerPoint and 50+ document formats using GroupDocs.Parser for .NET. Get document properties like author, title, creation date in C#....Comparison Product Solution GroupDocs...(DOC, DOCX), PowerPoint (PPT, PPTX), Excel (XLS, XLSX), LibreOffice...
This article explains that how to extract images from Microsoft Office Word (.doc, .docx) documents...Comparison Product Solution GroupDocs...data from PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX, Emails and more...
Sometimes when indexing, it is necessary to associate each document with certain additional metadata, for example, a set of tags, a number in the library catalog, the subject of a document, etc. To accomplish this task, additional fields can be added to each indexed document in addition to those already in the document itself.
Additional fields are associated with the document through the arguments of the FileIndexing event that occurs before indexing each added document....Comparison Product Solution GroupDocs...over your PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX and more with our...
This article explains that how to extract hyperlinks from document page....Comparison Product Solution GroupDocs...images from PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX, Emails and more...
It supports DOCX, DOCM, DOC, DOT, DOTM, XLS, XLSX, PDF, PPT, JPG, PNG, HTML, EML and many more...Comparison Product Solution GroupDocs...Try the to convert DOCX, XLSX, PPTX, and more. This topic lists...