This code snippet demonstrates how to extract information about known properties that can be encountered in a particular package.
Load a file to examine Get a collection of PropertyDescriptor instances for any desired metadata package Iterate through the extracted descriptors advanced_usage.GettingKnownPropertyDescriptors
try (Metadata metadata = new Metadata(Constants.InputDoc)) { WordProcessingRootPackage root = metadata.getRootPackageGeneric(); for (PropertyDescriptor descriptor : root.getDocumentProperties().getKnowPropertyDescriptors()) { System.out.println(descriptor.getName()); System.out.println(descriptor.getType()); System.out.println(descriptor.getAccessLevel()); for (PropertyTag tag : descriptor.getTags()) { System.out.println(tag); } System.out.println(); } } Note Not all possible properties are presented in the getKnowPropertyDescriptors collection....metadata of PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX, emails, images...
Detecting the GIF version The following sample of code will help you to detect the version of a loaded GIF image and extract some additional file format information.
Load a GIF image Extract the root metadata package Use the FileType property to obtain file format information AdvancedUsage.ManagingMetadataForSpecificFormats.Image.Gif.GifReadFileFormatProperties
using (Metadata metadata = new Metadata(Constants.InputGif)) { var root = metadata.GetRootPackage(); Console.WriteLine(root.FileType.FileFormat); Console.WriteLine(root.FileType.Version); Console.WriteLine(root.FileType.ByteOrder); Console.WriteLine(root.FileType.MimeType); Console.WriteLine(root.FileType.Extension); Console.WriteLine(root.FileType.Width); Console.WriteLine(root.FileType.Height); } Working with XMP Metadata GroupDocs....metadata of PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX, emails, images...
This article explains that how to extract HTML formatted text from document page in Java....extract data from PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX, Emails and...
Locate and remove the metadata properties you don't want — by tag, category, name, type or value — with GroupDocs.Metadata for Python via .NET....metadata of PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX, emails, images...
This code snippet demonstrates how to extract information about known properties that can be encountered in a particular package.
Load a file to examine Get a collection of PropertyDescriptor instances for any desired metadata package Iterate through the extracted descriptors advanced_usage.GettingKnownPropertyDescriptors
JavaScript const metadata = new groupdocs.metadata.Metadata("input.doc"); var root = metadata.getRootPackageGeneric(); var descriptors = root.getDocumentProperties().getKnowPropertyDescriptors(); for(var i=0;iPPT, PPTX, XLS, XLSX, emails, images...
In some cases it’s required to specify the document format manually to guarantee correct output produced by GroupDocs.Parser. The following are the cases when the document format must be specified manually:
Markdown documents MHTML documents OTP documents (OpenDocument Presentation Template) Databases Emails from remote servers Here are the steps to specify the document format for Markup document.
Instantiate the LoadOptions object and pass the document format in LoadOptions(FileFormat) constructor; Create Parser object and call any method....extract data from PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX, Emails and...
This article demonstrates that how you can specify starting page index (zero based) and the number of pages from this index to save a rasterized PDF...formats like PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX, Emails and...
Outlook Metadata Cleaner is an Outlook add-in powered by GroupDocs.Metadata for .NET that removes/cleans metadata properties in the files selected to be attached with email. It is an attach file button that appears in New Message window in Outlook. When some file is selected to be attached with email, the add-in first cleans all the metadata information saved with that file and then attaches metadata free file with email message....Cleaner Presentation Document (PPT/PPTX) Word Document (DOC/DOCX)...