This article demonstrates that how To save the redacted document, replacing an original file...formats like PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX, Emails and more...
GroupDocs.Parser provides the functionality To extract data from Microsoft Office Word documents. Both classic (doc, dot) and Open XML (docx, dotx) formats are supported. Also LibreOffice Writer (OpenOffice.org Writer) formats and RTF are supported.
The following table provides the list of supported formats:
Format Description DOC Microsoft Office Word Document DOT Microsoft Office Word Document Template DOCX Microsoft Office Open XML Document DOCM Microsoft Office Open XML Macro-Enabled Document DOTX Microsoft Office Open XML Document Template DOTM Microsoft Office Open XML Document Macro-Enabled Template TXT Plain text ODT Open Document Text OTT Open Document Text Template RTF Rich Text Format More resources GitHub examples You may easily run the code above and see the feature in action in our GitHub examples:...data from PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX, Emails and more...
GroupDocs.Parser provides the functionality To handle loading of HTML external resources.
Here are the steps To handle loading of HTML external resources.
Instantiate the ParserSettings object and pass External Resource Handler; Create Parser object and call GetImages method. The following code sample shows how To handle loading of HTML external resources.
// Create an instance of ParserSettings To pass External Resource Handler ParserSettings settings = new ParserSettings(new Handler()); // Create an instance of Parser class To generate spreadsheet page previews try (Parser parser = new Parser(Constants....data from PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX, Emails and more...
Java API To remove all or selective metadata properties of DOCX, XLSX, PPTX, Pdf documents, JPEG, PNG, WebP images, Email, eBooks, Visio Drawings, Zip, etc....spreadsheets, presentations, PDF files, images, emails, eBooks, drawings...
The easiest way To remove metadata properties from a file is To use corresponding tags that allow you To locate the desired properties across all metadata packages....metadata of PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX, emails, images...
For all supported image formats the GroupDocs.Metadata API allows extracting common image properties such as width and height, MIME type, byte order, etc. Please see the code snippet below for more information on the feature.
Load an image Extract the root metadata package Use the FileType property To obtain file format information AdvancedUsage.ManagingMetadataForSpecificFormats.Image.ImageReadFileFormatProperties
using (Metadata metadata = new Metadata(Constants.InputPng)) { var root = metadata.GetRootPackage(); Console.WriteLine(root.FileType.FileFormat); Console.WriteLine(root.FileType.ByteOrder); Console.WriteLine(root.FileType.MimeType); Console.WriteLine(root.FileType.Extension); Console.WriteLine(root.FileType.Width); Console.WriteLine(root.FileType.Height); } More resources GitHub examples You may easily run the code above and see the feature in action in our GitHub examples:...metadata of PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX, emails, images...
Detecting the GIF version The following sample of code will help you To detect the version of a loaded GIF image and extract some additional file format information.
Load a GIF image Extract the root metadata package Use the getGifImageType method To obtain file format information advanced_usage.managing_metadata_for_specific_formats.image.gif.GifReadFileFormatProperties
try (Metadata metadata = new Metadata(Constants.InputGif)) { GifRootPackage root = metadata.getRootPackageGeneric(); System.out.println(root.getGifImageType().getFileFormat()); System.out.println(root.getGifImageType().getVersion()); System.out.println(root.getGifImageType().getByteOrder()); System.out.println(root.getGifImageType().getMimeType()); System.out.println(root.getGifImageType().getExtension()); System.out.println(root.getGifImageType().getWidth()); System.out.println(root.getGifImageType().getHeight()); } Working with XMP Metadata GroupDocs....metadata of PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX, emails, images...
This article demonstrates that how To save the redacted document, replacing an original file... rasterize_to_pdf = False result_path = redactor...formats like PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX, Emails and more...
Reading FLV header properties The GroupDocs.Metadata API supports extracting format-specific information from the FLV file header.
The following are the steps To read the header of an FLV file.
Load an FLV video Get the root metadata package Extract the native metadata package using FlvRootPackage.Header Read the FLV header properties AdvancedUsage.ManagingMetadataForSpecificFormats.Video.Flv.FlvReadHeaderProperties
using (Metadata metadata = new Metadata(Constants.InputFlv)) { var root = metadata.GetRootPackage(); Console.WriteLine(root.Header.Version); Console.WriteLine(root.Header.HasAudioTags); Console.WriteLine(root.Header.HasVideoTags); Console.WriteLine(root.Header.TypeFlags); } Working with XMP metadata GroupDocs....metadata of PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX, emails, images...