There might be cases when the document is presented only as a stream (without a copy on the local disk). To avoid the overhead of saving documents to the disk, GroupDocs.Parser enables to extract data from streams directly.
The following example shows how to load the document from the stream:
// Create the stream try (InputStream stream = new FileInputStream(Constants.SamplePdf)) { // Create an instance of Parser class with the stream try (Parser parser = new Parser(stream)) { // Extract a text into the reader try (TextReader reader = parser....Annotation Product Solution GroupDocs...welcome to extract data from PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX...
Extract information about known properties available in a particular package using GroupDocs.Metadata for Python via .NET....Annotation Product Solution GroupDocs...to view and edit metadata of PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX...
This artcle explains how to remove all or clean metadata properties without applying any filters. The best way to do this is to use the Sanitize method....Annotation Product Solution GroupDocs...to view and edit metadata of PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX...
Reading BMP header properties The GroupDocs.Metadata API supports extracting format-specific information from BMP file headers.
The following are the steps to read the header of a BMP file.
Load a BMP image Get the root metadata package Extract the native metadata package using BmpRootPackage.BmpHeader Read the BMP header properties AdvancedUsage.ManagingMetadataForSpecificFormats.Image.Bmp.BmpReadHeaderProperties
using (Metadata metadata = new Metadata(Constants.InputBmp)) { var root = metadata.GetRootPackage(); Console.WriteLine(root.BmpHeader.BitsPerPixel); Console.WriteLine(root.BmpHeader.ColorsImportant); Console.WriteLine(root.BmpHeader.HeaderSize); Console.WriteLine(root.BmpHeader.ImageSize); Console.WriteLine(root.BmpHeader.Planes); } More resources GitHub examples You may easily run the code above and see the feature in action in our GitHub examples:...Annotation Product Solution GroupDocs...to view and edit metadata of PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX...
This article explains that how to extract Markdown formatted text from document page....Annotation Product Solution GroupDocs...documents and extract data from PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX...
API allows creating of full-text and / or metadata index on documents. To index only metadata without main content of documents, you only need to set IndexType.MetadataIndex when creating an index....Annotation Product Solution GroupDocs...welcome to search over your PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX...
GroupDocs.Metadata allows users to get meta information of a document....Annotation Product Solution GroupDocs...to view and edit metadata of PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX...
This article gives the knowledge about numeric range search which allows you to search in documents any integer numbers in the range from 0 to 9223372036854775807 (Int64.MaxValue) using Java search API....Annotation Product Solution GroupDocs...welcome to search over your PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX...
For all supported image formats the GroupDocs.Metadata API allows extracting common image properties such as width and height, MIME type, byte order, etc. Please see the code snippet below for more information on the feature.
Load an image Extract the root metadata package Use the FileType property to obtain file format information AdvancedUsage.ManagingMetadataForSpecificFormats.Image.ImageReadFileFormatProperties
using (Metadata metadata = new Metadata(Constants.InputPng)) { var root = metadata.GetRootPackage(); Console.WriteLine(root.FileType.FileFormat); Console.WriteLine(root.FileType.ByteOrder); Console.WriteLine(root.FileType.MimeType); Console.WriteLine(root.FileType.Extension); Console.WriteLine(root.FileType.Width); Console.WriteLine(root.FileType.Height); } More resources GitHub examples You may easily run the code above and see the feature in action in our GitHub examples:...Annotation Product Solution GroupDocs...to view and edit metadata of PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX...