An interface is used to receive information about errors, warnings, and events that occur during data extraction. ...data from PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX, Emails and more...
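The callback idea described above can be sketched in plain Java. This is only an illustration under stated assumptions: `ExtractionLogger`, `CollectingLogger`, and `LoggerDemo.extract` are hypothetical names invented for this example and are not the library's own API; the real interface may use different method names and signatures.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical callback interface (an assumption, not the library's type).
interface ExtractionLogger {
    void error(String message, Exception cause);
    void warning(String message);
    void trace(String message);
}

// Collects messages in memory so they can be inspected after extraction.
class CollectingLogger implements ExtractionLogger {
    final List<String> entries = new ArrayList<>();

    @Override
    public void error(String message, Exception cause) {
        String detail = cause == null ? "" : " (" + cause.getMessage() + ")";
        entries.add("ERROR: " + message + detail);
    }

    @Override
    public void warning(String message) {
        entries.add("WARNING: " + message);
    }

    @Override
    public void trace(String message) {
        entries.add("TRACE: " + message);
    }
}

class LoggerDemo {
    // Stand-in for an extraction routine that reports events through the logger.
    static String extract(String input, ExtractionLogger logger) {
        logger.trace("extraction started");
        if (input == null || input.isEmpty()) {
            logger.warning("input is empty");
            return "";
        }
        return input.trim();
    }

    public static void main(String[] args) {
        CollectingLogger logger = new CollectingLogger();
        extract("", logger);
        logger.entries.forEach(System.out::println);
    }
}
```

Passing the logger as a parameter keeps the extraction routine free of any particular output channel; the caller decides whether messages are collected, printed, or forwarded elsewhere.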
The easiest way to remove metadata properties from a file is to use the corresponding tags, which let you locate the desired properties across all metadata packages. ...metadata of PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX, emails, images...
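To make the tag-based approach concrete, here is a minimal self-contained sketch of the idea: each property carries a set of tags, and removal filters on a tag rather than on a property name. The `TaggedProperty` class, the `removeByTag` helper, and the tag strings are all hypothetical and chosen for illustration; they do not reproduce the library's actual classes or tag names.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

// Hypothetical model of a metadata property labeled with tags.
class TaggedProperty {
    final String name;
    final Set<String> tags;

    TaggedProperty(String name, String... tags) {
        this.name = name;
        this.tags = new HashSet<>(Arrays.asList(tags));
    }
}

class TagRemovalDemo {
    // Removes every property carrying the given tag; returns the number removed.
    static int removeByTag(List<TaggedProperty> properties, String tag) {
        int before = properties.size();
        properties.removeIf(p -> p.tags.contains(tag));
        return before - properties.size();
    }

    public static void main(String[] args) {
        // Tag strings below are illustrative assumptions.
        List<TaggedProperty> props = new ArrayList<>(Arrays.asList(
                new TaggedProperty("Author", "person.creator"),
                new TaggedProperty("LastModifiedBy", "person.editor"),
                new TaggedProperty("Title", "content.title")));
        int removed = removeByTag(props, "person.creator");
        System.out.println("Removed: " + removed);
    }
}
```

Because the filter works on tags, the same call would match equivalent properties stored under different names in different metadata packages.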
Render email attachments with GroupDocs.Viewer for .NET in the same way as you would render any other document. ...attachments: emails, Outlook data files, archives, and PDF documents...
This code snippet demonstrates how to extract information about known properties that can be encountered in a particular package.
1. Load a file to examine
2. Get a collection of PropertyDescriptor instances for any desired metadata package
3. Iterate through the extracted descriptors
advanced_usage.GettingKnownPropertyDescriptors
    // Load the file to examine; try-with-resources closes it afterwards
    try (Metadata metadata = new Metadata(Constants.InputDoc)) {
        WordProcessingRootPackage root = metadata.getRootPackageGeneric();
        // Iterate through the known property descriptors of the package
        for (PropertyDescriptor descriptor : root.getDocumentProperties().getKnowPropertyDescriptors()) {
            System.out.println(descriptor.getName());
            System.out.println(descriptor.getType());
            System.out.println(descriptor.getAccessLevel());
            // Print every tag attached to the descriptor
            for (PropertyTag tag : descriptor.getTags()) {
                System.out.println(tag);
            }
            System.out.println();
        }
    }
Note: Not all possible properties are present in the getKnowPropertyDescriptors collection.
...metadata of PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX, emails, images...
GroupDocs.Parser provides the functionality to extract data from Microsoft Office PowerPoint presentations. Both classic (ppt, pot) and Open XML (pptx, potx) formats are supported. LibreOffice Impress (OpenOffice.org Impress) formats are also supported.
The following table provides the list of supported formats:
Format   Description
PPT      Microsoft Office PowerPoint Presentation
PPS      Microsoft Office PowerPoint Slideshow
POT      Microsoft Office PowerPoint Template
PPTX     Microsoft Office Open XML Presentation
PPTM     Microsoft Office Open XML Macro-Enabled Presentation
POTX     Microsoft Office Open XML Presentation Template
POTM     Microsoft Office Open XML Macro-Enabled Presentation Template
PPSX     Microsoft Office Open XML Presentation Slideshow
PPSM     Microsoft Office Open XML Macro-Enabled Presentation Slideshow
ODP      Open Document Presentation
OTP      Open Document Presentation Template
More resources
GitHub examples
You may easily run the code above and see the feature in action in our GitHub examples:...data from PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX, Emails and more...
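As a small illustration of how the format list above might be used in application code, the following sketch maps a file extension to its description before attempting extraction. The `PresentationFormats` class and its `describe` method are hypothetical helpers written for this example; only the extension and description pairs come from the table.

```java
import java.util.LinkedHashMap;
import java.util.Locale;
import java.util.Map;

class PresentationFormats {
    // Extension-to-description pairs copied from the supported formats table.
    static final Map<String, String> FORMATS = new LinkedHashMap<>();
    static {
        FORMATS.put("ppt", "Microsoft Office PowerPoint Presentation");
        FORMATS.put("pps", "Microsoft Office PowerPoint Slideshow");
        FORMATS.put("pot", "Microsoft Office PowerPoint Template");
        FORMATS.put("pptx", "Microsoft Office Open XML Presentation");
        FORMATS.put("pptm", "Microsoft Office Open XML Macro-Enabled Presentation");
        FORMATS.put("potx", "Microsoft Office Open XML Presentation Template");
        FORMATS.put("potm", "Microsoft Office Open XML Macro-Enabled Presentation Template");
        FORMATS.put("ppsx", "Microsoft Office Open XML Presentation Slideshow");
        FORMATS.put("ppsm", "Microsoft Office Open XML Macro-Enabled Presentation Slideshow");
        FORMATS.put("odp", "Open Document Presentation");
        FORMATS.put("otp", "Open Document Presentation Template");
    }

    // Returns the format description for a file name, or null if the
    // extension is missing or not in the table.
    static String describe(String fileName) {
        int dot = fileName.lastIndexOf('.');
        if (dot < 0 || dot == fileName.length() - 1) {
            return null;
        }
        String ext = fileName.substring(dot + 1).toLowerCase(Locale.ROOT);
        return FORMATS.get(ext);
    }

    public static void main(String[] args) {
        System.out.println(describe("deck.PPTX"));
        System.out.println(describe("notes.txt"));
    }
}
```

A check like this only inspects the file name; it says nothing about whether the file content is actually a valid presentation.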
This article shows how to extract metadata with GroupDocs.Parser from documents of various formats: PDF, emails, ebooks, Microsoft Office: Word (DOC, DOCX), PowerPoint (PPT, PPTX), Excel (XLS, XLSX), LibreOffice formats and many others. ...documents of various formats: PDF, Emails, Ebooks, Microsoft Office:...
Using GroupDocs.Metadata for Python via .NET, you can easily search metadata and extract the desired metadata properties from PDF, DOCX, PPTX, XLSX, images, audio, video and many other file types in your Python solution. ...metadata of PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX, emails, images...
Java API to remove all or selected metadata properties of DOCX, XLSX, PPTX, PDF documents, JPEG, PNG, WebP images, email, eBooks, Visio drawings, ZIP, etc. ...spreadsheets, presentations, PDF files, images, emails, eBooks, drawings...