Sometimes you just need to remove or clean all metadata properties without applying any filters. The best way to do this is to use the Sanitize method....metadata of PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX, emails, images...
Learn how to get basic document information including file type, page count, and file size using GroupDocs.Parser for .NET. Get document properties in C#....extract data from PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX, Emails and...
This article demonstrates how to save a document to any custom file at any location on the local disk, or even to a Stream...formats like PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX, Emails and...
This article explains how to find not only the words specified in the search query but also their homophones (words that are pronounced the same but differ in meaning) using the Java search API....search over your PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX and more with...
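To make the idea of homophone search concrete, here is a minimal plain-Java sketch. It does not use the search API the article refers to: the class `HomophoneSearchSketch`, the `GROUPS` dictionary, and the `expand` and `search` helpers are all hypothetical names introduced for illustration. The sketch expands a query word into its homophone group and then matches any member of that group against whole-word tokens.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Locale;
import java.util.Map;
import java.util.Set;

class HomophoneSearchSketch {
    // Hypothetical homophone groups (assumption); a real search API ships its own dictionary.
    private static final String[][] GROUPS = {
        {"mail", "male"},
        {"right", "write", "rite"},
        {"sea", "see"}
    };

    // Maps every word to the full group it belongs to.
    private static final Map<String, Set<String>> INDEX = new HashMap<>();

    static {
        for (String[] group : GROUPS) {
            Set<String> words = new LinkedHashSet<>(List.of(group));
            for (String word : group) {
                INDEX.put(word, words);
            }
        }
    }

    /** Returns the lower-cased query word plus all of its known homophones. */
    static Set<String> expand(String word) {
        String key = word.toLowerCase(Locale.ROOT);
        Set<String> result = new LinkedHashSet<>();
        result.add(key);
        result.addAll(INDEX.getOrDefault(key, Set.of()));
        return result;
    }

    /** Returns the documents that contain the word or any of its homophones. */
    static List<String> search(List<String> documents, String word) {
        Set<String> terms = expand(word);
        List<String> hits = new ArrayList<>();
        for (String doc : documents) {
            // Split on non-word characters so only whole words are compared.
            for (String token : doc.toLowerCase(Locale.ROOT).split("\\W+")) {
                if (terms.contains(token)) {
                    hits.add(doc);
                    break;
                }
            }
        }
        return hits;
    }

    public static void main(String[] args) {
        List<String> docs = List.of("Please write soon", "Turn right here", "No match");
        System.out.println(search(docs, "rite"));
    }
}
```

With this approach a query for "rite" also returns documents that contain "write" or "right", which is the behaviour the article describes. The quality of the results depends entirely on the homophone dictionary.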
This page contains information about highlighting search results in the search network....search over your PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX and more with...
Detecting the GIF version
The following code sample helps you detect the version of a loaded GIF image and extract some additional file format information.
1. Load a GIF image
2. Extract the root metadata package
3. Use the FileType property to obtain file format information
AdvancedUsage.ManagingMetadataForSpecificFormats.Image.Gif.GifReadFileFormatProperties
    using (Metadata metadata = new Metadata(Constants.InputGif))
    {
        var root = metadata.GetRootPackage();
        Console.WriteLine(root.FileType.FileFormat);
        Console.WriteLine(root.FileType.Version);
        Console.WriteLine(root.FileType.ByteOrder);
        Console.WriteLine(root.FileType.MimeType);
        Console.WriteLine(root.FileType.Extension);
        Console.WriteLine(root.FileType.Width);
        Console.WriteLine(root.FileType.Height);
    }
Working with XMP Metadata GroupDocs....metadata of PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX, emails, images...
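For background on the GIF version detection sample above, the following plain-Java sketch shows where the version and dimensions live in a GIF file. It does not use the GroupDocs API and is not a description of how that library works internally. `GifHeaderSketch`, `GifInfo`, and `readHeader` are hypothetical names introduced here. The sketch relies only on the GIF header layout: a 3-byte "GIF" signature, a 3-byte version ("87a" or "89a"), then the width and height as little-endian 16-bit values.

```java
import java.io.ByteArrayInputStream;
import java.io.DataInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.nio.charset.StandardCharsets;

class GifHeaderSketch {
    /** Hypothetical holder for the header fields read below. */
    static final class GifInfo {
        final String version;
        final int width;
        final int height;

        GifInfo(String version, int width, int height) {
            this.version = version;
            this.width = width;
            this.height = height;
        }
    }

    /** Reads the first 10 bytes of a GIF stream: signature, version, width, height. */
    static GifInfo readHeader(InputStream in) throws IOException {
        byte[] header = new byte[10];
        new DataInputStream(in).readFully(header);

        String signature = new String(header, 0, 3, StandardCharsets.US_ASCII);
        if (!signature.equals("GIF")) {
            throw new IOException("Not a GIF stream");
        }
        String version = new String(header, 3, 3, StandardCharsets.US_ASCII);

        // Width and height are stored as unsigned little-endian 16-bit integers.
        int width = (header[6] & 0xFF) | ((header[7] & 0xFF) << 8);
        int height = (header[8] & 0xFF) | ((header[9] & 0xFF) << 8);
        return new GifInfo(version, width, height);
    }

    public static void main(String[] args) throws IOException {
        // In-memory sample header: "GIF89a", width 320 (0x0140), height 240 (0x00F0).
        byte[] sample = {'G', 'I', 'F', '8', '9', 'a', 0x40, 0x01, (byte) 0xF0, 0x00};
        GifInfo info = readHeader(new ByteArrayInputStream(sample));
        System.out.println(info.version + " " + info.width + "x" + info.height);
    }
}
```

To inspect a real file, the same method could be given a `FileInputStream`. A metadata library remains the better choice when you also need the MIME type, byte order, or embedded metadata packages.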
This article explains how to extract HTML-formatted text from a document page in Java....extract data from PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX, Emails and...
This code snippet demonstrates how to extract information about known properties that can be encountered in a particular package.
1. Load a file to examine
2. Get a collection of PropertyDescriptor instances for any desired metadata package
3. Iterate through the extracted descriptors
advanced_usage.GettingKnownPropertyDescriptors
    try (Metadata metadata = new Metadata(Constants.InputDoc)) {
        WordProcessingRootPackage root = metadata.getRootPackageGeneric();
        for (PropertyDescriptor descriptor : root.getDocumentProperties().getKnowPropertyDescriptors()) {
            System.out.println(descriptor.getName());
            System.out.println(descriptor.getType());
            System.out.println(descriptor.getAccessLevel());
            for (PropertyTag tag : descriptor.getTags()) {
                System.out.println(tag);
            }
            System.out.println();
        }
    }
Note: Not all possible properties are presented in the getKnowPropertyDescriptors collection....metadata of PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX, emails, images...