To extract metadata from PDF documents, use the getMetadata method. It can extract the following metadata:
Name                 Description
title                The title of the Presentation.
subject              The subject of the Presentation.
keywords             The keywords of the Presentation.
author               The name of the Presentation’s author.
application          The name of the application.
application-version  The version number of the application that created the Presentation.
created-time         The time when the Presentation was created.
last-saved-time      The time when the Presentation was last saved.
...
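The name/value shape of the fields above can be sketched in plain Java. This is a minimal, self-contained illustration and not the GroupDocs.Parser API: the `Item` class and the `describe` helper are hypothetical stand-ins, and the sample values are made up. In the real library, the items would come from the getMetadata method mentioned above instead of being built by hand.

```java
import java.util.ArrayList;
import java.util.List;

class MetadataSketch {
    // Hypothetical stand-in for one metadata entry (a name/value pair).
    // This is NOT a GroupDocs class; it only mirrors the table's shape.
    static final class Item {
        final String name;
        final String value;

        Item(String name, String value) {
            this.name = name;
            this.value = value;
        }
    }

    // Formats each entry as "name: value", skipping entries with no value.
    static List<String> describe(Iterable<Item> items) {
        List<String> lines = new ArrayList<>();
        for (Item item : items) {
            if (item.value == null || item.value.isEmpty()) {
                continue; // field is absent from the document
            }
            lines.add(item.name + ": " + item.value);
        }
        return lines;
    }

    public static void main(String[] args) {
        // Made-up sample values, for illustration only.
        List<Item> items = new ArrayList<>();
        items.add(new Item("title", "Quarterly Review"));
        items.add(new Item("author", "J. Doe"));
        items.add(new Item("keywords", ""));

        for (String line : describe(items)) {
            System.out.println(line);
        }
    }
}
```

Skipping empty values is a design choice for this sketch: a document does not necessarily define every field in the table, so callers may prefer to see only the fields that are actually set.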
The following tables indicate the file formats from which GroupDocs.Parser for Java can extract data.
Tip: Can’t find your file format?
We’re here to help! Please post a request on our Free Support Forum, and our team will assist you.

Each table has the columns: Document Type, Parse Document by Template, Extract Text (Accurate), Extract Text (Raw), Extract Structured Text and Formatted Text, Extract Text Areas, Extract Metadata, Extract Images, Extract Containers and Attachments, Parse Form Data, Extract Table of Contents, Scan Barcode.

Word Processing
DOC    Microsoft Word Document
DOT    Microsoft Word Document Template
DOCX   Office Open XML Document
DOCM   Office Open XML Macro-Enabled Document
DOTX   Office Open XML Document Template
DOTM   Office Open XML Document Macro-Enabled Template
TXT    Plain text
ODT    Open Document Text
OTT    Open Document Text Template
RTF    Rich Text Format

PDF
PDF    Portable Document Format File

Markup
XHTML  Extensible Hypertext Markup Language File
MHTML  MIME HTML File
MD     Markdown (Formatted Text is not supported)
XML    XML File

Ebook
CHM    Compiled HTML Help File
EPUB   Digital E-Book File Format
FB2    FictionBook 2...

...
NUMBERS  Apple iWork Numbers

Presentation
Document Type, Parse Document...
GroupDocs Blog | Document Automation Solutions for .NET & Java Developers...Conversion for .NET 18.3 is on-board...for PDF conversion is improved, PSD to PDF conversion is improved...
Explore what’s new in GroupDocs.Redaction for Java 25.12. Available now on NuGet and GroupDocs website....redaction Fix Fix details When a presentation was loaded from a stream...(disable rasterization for presentations) RasterizationOptions rasterizationOptions...
This article explains how to extract images from Microsoft Office PowerPoint (.ppt, .pptx) presentations...
This section describes the document types supported by GroupDocs.Merger for Python via .NET...97-2003 Presentation Microsoft PowerPoint Presentation Microsoft...
You can remove a text watermark from a PPTX file using Python in a few simple steps. This guide also explains how to delete a watermark in PowerPoint using Python....Using Python PowerPoint presentations often contain text watermarks...