To extract hyperlinks from Microsoft Office Word document getStructure method is used. This method returns XML representation of the document. Hyperlinks are represented by “hyperlink” tag; “link” attribute contains hyperlink’s URL. For more details, see Extract text structure. Hyperlink can contain a text:
google.com Warning getStructure method returns null value if text structure extraction isn’t supported for the document. For example, text structure extraction isn’t supported for TXT files. Therefore, for TXT file getStructure method returns null....Editor Product Solution GroupDocs...
Learn how to extract metadata from PDF files in C# using GroupDocs.Parser for .NET. Get document properties such as title, author, subject, creation date, and more....Editor Product Solution GroupDocs...
This article shows how to extract table of contents from Microsoft Word (DOC, DOCX etc), PDF documents and Ebooks (CHM, EPUB)....Editor Product Solution GroupDocs...
To extract files from ZIP archives getContainer method is used. This method returns the collection of ContainerItem objects.
Zip Entry can contain the following metadata:
Name Description date The time and date at which the file indicated by the Zip Entry was last modified. crc The 32-bit CRC (Cyclic Redundancy Check) on the contents of the Zip Entry. These metadata refer to a container element itself, not a document.
Here are the steps to extract an email text from Zip archives:...Editor Product Solution GroupDocs...
Learn how to extract table of contents (TOC) from Word documents, PDF files, and eBooks using GroupDocs.Parser for .NET. Extract TOC items with page numbers and depth levels in C#....Editor Product Solution GroupDocs...
Ontdek wat er nieuw is in GroupDocs.Total voor .NET 25.9. Nu beschikbaar op NuGet en de GroupDocs‑website....Editor for .NET 25.7 (no change) Release...