Learn how to extract table of contents (TOC) from Word documents, PDF files, and eBooks using GroupDocs.Parser for .NET. Extract TOC items with page numbers and depth levels in C#....Consulting Blog Knowledge Base New Releases Status Websites aspose.com...
This article explains that how to extract formatted text from document page....Consulting Blog Knowledge Base New Releases Status Websites aspose.com...
To extract hyperlinks from Microsoft Office Word document getStructure method is used. This method returns XML representation of the document. Hyperlinks are represented by “hyperlink” tag; “link” attribute contains hyperlink’s URL. For more details, see Extract text structure. Hyperlink can contain a text:
google.com Warning getStructure method returns null value if text structure extraction isn’t supported for the document. For example, text structure extraction isn’t supported for TXT files. Therefore, for TXT file getStructure method returns null....Consulting Blog Knowledge Base New Releases Status Websites aspose.com...
To extract metadata from Microsoft Office Excel spreadsheets getMetadata method is used. This method allows to extract the following metadata:
Name Description title The title of the spreadsheet. subject The subject of the spreadsheet. keywords The keyword of the spreadsheet. comments The comments of the spreadsheet. content-status The content status of the spreadsheet. category The category of the spreadsheet. company The company of the spreadsheet. manager The manager of the spreadsheet....Consulting Blog Knowledge Base New Releases Status Websites aspose.com...
To extract files from ZIP archives getContainer method is used. This method returns the collection of ContainerItem objects.
Zip Entry can contain the following metadata:
Name Description date The time and date at which the file indicated by the Zip Entry was last modified. crc The 32-bit CRC (Cyclic Redundancy Check) on the contents of the Zip Entry. These metadata refer to a container element itself, not a document.
Here are the steps to extract an email text from Zip archives:...Consulting Blog Knowledge Base New Releases Status Websites aspose.com...
This article shows how to extract table of contents from Microsoft Word (DOC, DOCX etc), PDF documents and Ebooks (CHM, EPUB)....Consulting Blog Knowledge Base New Releases Status Websites aspose.com...
Learn how to extract metadata from Microsoft Excel spreadsheets (.xls, .xlsx) in C# using GroupDocs.Parser for .NET. Step-by-step guide with code example....Consulting Blog Knowledge Base New Releases Status Websites aspose.com...
Learn how to extract metadata from PDF files in C# using GroupDocs.Parser for .NET. Get document properties such as title, author, subject, creation date, and more....Consulting Blog Knowledge Base New Releases Status Websites aspose.com...
This article describes the search options that can be specified in an instance of the SearchOptions class....Consulting Blog Knowledge Base New Releases Status Websites aspose.com...