Learn how to extract metadata from PDF, Word, Excel, PowerPoint and 50+ document formats using GroupDocs.Parser for .NET. Get document properties like author, title, creation date in C#....Assembly Product Solution GroupDocs...
This article gives the knowledge about the regular expression (RegEx) search queries which are universal and very flexible, but at the same time, in large indexes, their performance becomes extremely low using Java search API....Assembly Product Solution GroupDocs...
An interface is used to receive the information about errors, warnings and events which occur while data extraction....Assembly Product Solution GroupDocs...
This article gives the knowledge of the API methods which can be used to perform operations about document passwords or password dictionary using Java....Assembly Product Solution GroupDocs...
To extract table of contents from Microsoft Office Word document getToc method is used. Table of contents is generated by paragraphs with H1-H9 build-in styles.
Warning getToc method returns null value if table of contents extraction isn’t supported for the document. For example, table of contents extraction isn’t supported for TXT files. Therefore, for TXT file getToc method returns null. If Microsoft Office Word document has no table of contents, getToc method returns an empty collection....Assembly Product Solution GroupDocs...