Learn how to extract metadata from PDF, Word, Excel, PowerPoint and 50+ document formats using GroupDocs.Parser for .NET. Get document properties like author, title, creation date in C#....Assembly Product Solution GroupDocs...
An interface is used to receive the information about errors, warnings and events which occur while data extraction....Assembly Product Solution GroupDocs...
To extract table of contents from Microsoft Office Word document getToc method is used. Table of contents is generated by paragraphs with H1-H9 build-in styles.
Warning getToc method returns null value if table of contents extraction isn’t supported for the document. For example, table of contents extraction isn’t supported for TXT files. Therefore, for TXT file getToc method returns null. If Microsoft Office Word document has no table of contents, getToc method returns an empty collection....Assembly Product Solution GroupDocs...
To extract metadata from PDF documents getMetadata method is used. This method allows to extract the following metadata:
Name Description title The title of the presentation. subject The subject of the presentation. keywords The keyword of the presentation. author The name of the presentation’s author. application The name of the application. application-version The version number of the application that created the presentation. created-time The time of the presentation creation. last-saved-time The time of the the presentation when it was last saved....Assembly Product Solution GroupDocs...