GroupDocs.Metadata for Python via .NET lets you read basic file information — format, extension, MIME type, page count, size, and encryption state....metadata of PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX, emails, images...
Reading BMP header properties The GroupDocs.Metadata API supports extracting format-specific information from BMP file headers.
The following are the steps To read the header of a BMP file.
Load a BMP image Get the root metadata package Extract the native metadata package using the BmpRootPackage.getBmpHeader method Read the BMP header properties advanced_usage.managing_metadata_for_specific_formats.image.bmp.BmpReadHeaderProperties
try (Metadata metadata = new Metadata(Constants.InputBmp)) { BmpRootPackage root = metadata.getRootPackageGeneric(); System.out.println(root.getBmpHeader().getBitsPerPixel()); System.out.println(root.getBmpHeader().getColorsImportant()); System.out.println(root.getBmpHeader().getHeaderSize()); System.out.println(root.getBmpHeader().getImageSize()); System.out.println(root.getBmpHeader().getPlanes()); } More resources GitHub examples You may easily run the code above and see the feature in action in our GitHub examples:...metadata of PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX, emails, images...
Reading DICOM metadata properties The GroupDocs.Metadata API supports extracting format-specific information from DICOM images.
The following are the steps To read the native DICOM metadata.
Load a DICOM image Get the root metadata package Extract the native metadata package using the DicomRootPackage.getDicomPackage method Read the DICOM metadata properties examples.advanced_usage.managing_metadata_for_specific_formats.image.dicom.DicomReadNativeMetadataProperties
try (Metadata metadata = new Metadata(Constants.InputDicom)) { DicomRootPackage root = metadata.getRootPackageGeneric(); if (root.getDicomPackage() != null) { System.out.println(root.getDicomPackage().getBitsAllocated()); System.out.println(root.getDicomPackage().getReds()); System.out.println(root.getDicomPackage().getGreens()); System.out.println(root.getDicomPackage().getBlues()); System....metadata of PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX, emails, images...
To extract images from Microsoft Office PowerPoint presentations getImages methods are used. By default images are extracted with its original format. With using ImageOptions class it is possible To extract images from Microsoft Office PowerPoint presentations as bmp, gif, jpeg, png and webp formats.
Warning getImages method returns null value if image extraction isn’t supported for the document. For example, image extraction isn’t supported for TXT files. Therefore, for TXT file getImages method returns null....data from PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX, Emails and more...
Detecting the GIF version The following sample of code will help you To detect the version of a loaded GIF image and extract some additional file format information.
Load a GIF image Extract the root metadata package Use the getGifImageType method To obtain file format information advanced_usage.managing_metadata_for_specific_formats.image.gif.GifReadFileFormatProperties
try (Metadata metadata = new Metadata(Constants.InputGif)) { GifRootPackage root = metadata.getRootPackageGeneric(); System.out.println(root.getGifImageType().getFileFormat()); System.out.println(root.getGifImageType().getVersion()); System.out.println(root.getGifImageType().getByteOrder()); System.out.println(root.getGifImageType().getMimeType()); System.out.println(root.getGifImageType().getExtension()); System.out.println(root.getGifImageType().getWidth()); System.out.println(root.getGifImageType().getHeight()); } Working with XMP Metadata GroupDocs....metadata of PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX, emails, images...
GroupDocs.Parser provides the functionality To extract data from Microsoft Office PowerPoint presentations. Both classic (ppt, pot) and Open XML (pptx, potx) formats are supported. Also LibreOffice Impress (OpenOffice.org Impress) formats are supported.
The following table provides the list of supported formats:
Format Description PPT Microsoft Office PowerPoint Presentation PPS Microsoft Office PowerPoint Slideshow POT Microsoft Office PowerPoint Template PPTX Microsoft Office Open XML Presentation PPTM Microsoft Office Open XML Macro-Enabled Presentation POTX Microsoft Office Open XML Presentation Template POTM Microsoft Office Open XML Macro-Enabled Presentation Template PPSX Microsoft Office Open XML Presentation Slideshow PPSM Microsoft Office Open XML Macro-Enabled Presentation Slideshow ODP Open Document Presentation OTP Open Document Presentation Template More resources GitHub examples You may easily run the code above and see the feature in action in our GitHub examples:...data from PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX, Emails and more...
The easiest way To remove metadata properties from a file is To use corresponding tags that allow you To locate the desired properties across all metadata packages....metadata of PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX, emails, images...
For all supported image formats the GroupDocs.Metadata API allows extracting common image properties such as width and height, MIME type, byte order, etc. Please see the code snippet below for more information on the feature.
Load an image Extract the root metadata package Use the FileType property To obtain file format information AdvancedUsage.ManagingMetadataForSpecificFormats.Image.ImageReadFileFormatProperties
using (Metadata metadata = new Metadata(Constants.InputPng)) { var root = metadata.GetRootPackage(); Console.WriteLine(root.FileType.FileFormat); Console.WriteLine(root.FileType.ByteOrder); Console.WriteLine(root.FileType.MimeType); Console.WriteLine(root.FileType.Extension); Console.WriteLine(root.FileType.Width); Console.WriteLine(root.FileType.Height); } More resources GitHub examples You may easily run the code above and see the feature in action in our GitHub examples:...metadata of PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX, emails, images...
This article explains that how To extract plain formatted text from document page in Java....data from PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX, Emails and more...