This article shows how to extract the table of contents from Microsoft Word documents (DOC, DOCX, etc.), PDF documents, and e-books (CHM, EPUB).
Learn how to extract metadata from Microsoft Excel spreadsheets (.xls, .xlsx) in C# using GroupDocs.Parser for .NET. Step-by-step guide with a code example.
To extract files from ZIP archives, use the getContainer method. This method returns a collection of ContainerItem objects.
A Zip Entry can contain the following metadata:
Name    Description
date    The time and date at which the file indicated by the Zip Entry was last modified.
crc     The 32-bit CRC (Cyclic Redundancy Check) on the contents of the Zip Entry.
These metadata refer to the container element itself, not to a document.
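The date and crc fields described above can be inspected without any GroupDocs library. The following is a minimal sketch that uses Python's standard zipfile module rather than the getContainer API; the helper names zip_entry_metadata and build_sample_archive are hypothetical and exist only for this illustration.

```python
import io
import zipfile
import zlib
from datetime import datetime


def zip_entry_metadata(archive_bytes):
    """Return the name, last-modified time, and CRC-32 of every entry."""
    entries = []
    with zipfile.ZipFile(io.BytesIO(archive_bytes)) as archive:
        for info in archive.infolist():
            entries.append({
                "name": info.filename,
                # date_time is a (year, month, day, hour, minute, second) tuple
                "date": datetime(*info.date_time),
                # CRC-32 of the uncompressed entry contents
                "crc": info.CRC,
            })
    return entries


def build_sample_archive():
    """Build a one-entry archive in memory (illustration only)."""
    buffer = io.BytesIO()
    with zipfile.ZipFile(buffer, "w") as archive:
        # ZIP timestamps have two-second resolution, so use an even second.
        entry = zipfile.ZipInfo("message.txt", date_time=(2020, 1, 2, 3, 4, 6))
        archive.writestr(entry, b"hello")
    return buffer.getvalue()


sample_entries = zip_entry_metadata(build_sample_archive())
# The stored CRC should match one computed directly from the contents.
expected_crc = zlib.crc32(b"hello")
```

Both values describe the archive entry itself, so they are available without parsing the document stored in the entry.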
Here are the steps to extract email text from ZIP archives: ...
This article explains how to extract formatted text from a document page.
The easiest way to remove metadata properties from a file is to use the corresponding tags that let you locate the desired properties across all metadata packages with GroupDocs.Metadata for Python via .NET.
This article describes two ways to create a search query, in text form or in object form, using the Java search API.
This article explains how to separately extract data from documents and add the extracted data to the index.
This article shows how to redact the pages of a document as images, redacting entire areas of the page instead of, or in addition to, specific text.
Reading AVI header properties
The GroupDocs.Metadata API supports extracting format-specific information from AVI file headers.
The following are the steps to read the header of an AVI file.
1. Load an AVI video
2. Get the root metadata package
3. Extract the native metadata package using AviRootPackage.Header
4. Read the AVI header properties
AdvancedUsage.ManagingMetadataForSpecificFormats.Video.Avi.AviReadHeaderProperties
using (Metadata metadata = new Metadata(Constants.InputAvi))
{
    // Request the AVI-specific root package so that the Header property is available
    var root = metadata.GetRootPackage<AviRootPackage>();
    Console.WriteLine(root.Header.AviHeaderFlags);
    Console.WriteLine(root.Header.Height);
    Console.WriteLine(root.Header.Width);
    Console.WriteLine(root.Header.TotalFrames);
    Console.WriteLine(root.Header.InitialFrames);
    Console.WriteLine(root.Header.MaxBytesPerSec);
    Console.WriteLine(root.Header.PaddingGranularity);
    Console.WriteLine(root.Header.Streams);
    // ...
}
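For readers curious where these header values come from, the sketch below parses the main AVI header (the avih chunk of a RIFF file) with Python's struct module. It does not use GroupDocs.Metadata and is not how that library is implemented; the names AviMainHeader, read_avi_main_header, and build_sample_header are hypothetical, and the sample bytes are synthetic. It assumes the first occurrence of the bytes "avih" marks the main header chunk, which is a simplification.

```python
import struct
from collections import namedtuple

# The first ten little-endian 32-bit fields of the AVI main header.
AviMainHeader = namedtuple("AviMainHeader", [
    "micro_sec_per_frame", "max_bytes_per_sec", "padding_granularity",
    "flags", "total_frames", "initial_frames", "streams",
    "suggested_buffer_size", "width", "height",
])


def read_avi_main_header(data):
    """Locate the 'avih' chunk and decode its leading fields."""
    if data[0:4] != b"RIFF" or data[8:12] != b"AVI ":
        raise ValueError("not a RIFF AVI stream")
    pos = data.find(b"avih")  # simplification: first match is taken as the chunk
    if pos < 0:
        raise ValueError("no avih chunk found")
    (chunk_size,) = struct.unpack_from("<I", data, pos + 4)
    if chunk_size < 40:
        raise ValueError("avih chunk is too short")
    return AviMainHeader(*struct.unpack_from("<10I", data, pos + 8))


def build_sample_header():
    """Build a minimal synthetic RIFF/AVI header (illustration only)."""
    avih_body = struct.pack(
        "<14I",
        40000, 1000000, 0, 0x10,  # frame time, max bytes/sec, padding, flags
        250, 0, 2, 0,             # total frames, initial frames, streams, buffer
        640, 480,                 # width, height
        0, 0, 0, 0,               # reserved
    )
    avih = b"avih" + struct.pack("<I", len(avih_body)) + avih_body
    hdrl = b"hdrl" + avih
    list_chunk = b"LIST" + struct.pack("<I", len(hdrl)) + hdrl
    riff_body = b"AVI " + list_chunk
    return b"RIFF" + struct.pack("<I", len(riff_body)) + riff_body


sample_header = read_avi_main_header(build_sample_header())
```

The decoded fields correspond by name to the properties printed in the C# sample, such as width, height, total frames, and stream count.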
Get ZIP format metadata
The API allows detecting ZIP archives and reading format metadata. Follow these steps:
1. Load a ZIP archive
2. Get the root metadata package
3. Extract the native metadata package using ZipRootPackage.ZipPackage
4. Read the ZIP archive properties
5. Loop through ZipPackage.Files to extract information about archived files
The following code snippet shows how to get metadata from a ZIP archive.
AdvancedUsage.ManagingMetadataForSpecificFormats.Archive.ZipReadNativeMetadataProperties
Encoding encoding = Encoding...
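Because the C# snippet above is truncated, here is a rough analogue of the same steps (detect the archive, read archive-level properties, loop through the archived files) written with Python's standard zipfile module. It is a sketch only and does not reproduce the GroupDocs.Metadata API; read_zip_metadata and build_commented_archive are hypothetical helper names, and decoding the archive comment as UTF-8 is an assumption.

```python
import io
import zipfile


def read_zip_metadata(archive_bytes):
    """Detect a ZIP archive and collect archive-level and per-file metadata."""
    stream = io.BytesIO(archive_bytes)
    if not zipfile.is_zipfile(stream):
        raise ValueError("input is not a ZIP archive")
    stream.seek(0)
    with zipfile.ZipFile(stream) as archive:
        files = [
            {
                "name": info.filename,
                "compressed_size": info.compress_size,
                "uncompressed_size": info.file_size,
            }
            for info in archive.infolist()
        ]
        return {
            # Assumption: the archive comment is UTF-8 text.
            "comment": archive.comment.decode("utf-8", errors="replace"),
            "total_entries": len(files),
            "files": files,
        }


def build_commented_archive():
    """Build a two-entry, uncompressed archive in memory (illustration only)."""
    buffer = io.BytesIO()
    with zipfile.ZipFile(buffer, "w", compression=zipfile.ZIP_STORED) as archive:
        archive.writestr("a.txt", b"alpha")
        archive.writestr("docs/b.txt", b"bravo!")
        archive.comment = b"sample archive"
    return buffer.getvalue()


zip_metadata = read_zip_metadata(build_commented_archive())
```

The sample archive is stored without compression, so the compressed and uncompressed sizes of each entry are equal.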