To extract table of contents from Microsoft Office Word document getToc method is used. Table of contents is generated by paragraphs with H1-H9 build-in styles.
Warning getToc method returns null value if table of contents extraction isn’t supported for the document. For example, table of contents extraction isn’t supported for TXT files. Therefore, for TXT file getToc method returns null. If Microsoft Office Word document has no table of contents, getToc method returns an empty collection....data from PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX, Emails and more...
This article explains that how To extract metadata from EPUB e-books getMetadata method is used....data from PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX, Emails and more...
Learn how To extract images from documents using GroupDocs.Parser for .NET. Extract images with position data, rotation, and format information from Pdf, Word, Excel in C#....images from PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX, Emails and more...
This article shows how To access IPTC metadata in a file of any supported format....pdf. Reading basic IPTC IIM properties...metadata of PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX, emails, images...
Extract information about known properties available in a particular package using GroupDocs.Metadata for Python via .NET....metadata of PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX, emails, images...
This article explains how the The GroupDocs.Metadata API allows reading some common MPEG audio properties from MP3 files....metadata of PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX, emails, images...