Learn how to extract images from documents using GroupDocs.Parser for Python via .NET. Extract images with position data, rotation, and format information from PDF, Word, Excel....ppt' ] for file_path in Path ( input_dir...
Detecting the version of a PDF document
The following code sample shows how to detect the PDF version of a loaded document and extract additional file format information.
1. Load a PDF document
2. Extract the root metadata package
3. Use the getPdfType method to obtain file format information
advanced_usage.managing_metadata_for_specific_formats.document.pdf.PdfReadFileFormatProperties
// Load the PDF; try-with-resources closes the Metadata instance afterwards
try (Metadata metadata = new Metadata(Constants.InputPdf)) {
    PdfRootPackage root = metadata.getRootPackageGeneric();
    // Print the file format details reported by getPdfType()
    System.out.println(root.getPdfType().getFileFormat());
    System.out.println(root.getPdfType().getVersion());
    System.out.println(root.getPdfType().getMimeType());
    System.out.println(root.getPdfType().getExtension());
}

Reading built-in metadata properties
To access the built-in metadata of a PDF document, use the getDocumentProperties method defined in the DocumentRootPackage class....metadata of PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX, emails, images...
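For readers curious about where a PDF version number comes from, here is a minimal, library-independent sketch. It relies only on the fact that a PDF file begins with a header of the form %PDF-x.y. It does not use GroupDocs.Metadata and is not how getPdfType is implemented. The class name PdfHeaderVersion and the method detectVersion are hypothetical names chosen for this illustration.

```java
import java.nio.charset.StandardCharsets;
import java.util.Optional;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper for illustration only; not part of GroupDocs.Metadata.
class PdfHeaderVersion {
    // A PDF header looks like "%PDF-1.7"; capture the "major.minor" part.
    private static final Pattern HEADER = Pattern.compile("%PDF-(\\d\\.\\d)");

    // Scans at most the first 1024 bytes for a PDF header and returns its version.
    public static Optional<String> detectVersion(byte[] leadingBytes) {
        int length = Math.min(leadingBytes.length, 1024);
        // ISO-8859-1 maps every byte to one char, so binary data cannot break decoding.
        String head = new String(leadingBytes, 0, length, StandardCharsets.ISO_8859_1);
        Matcher matcher = HEADER.matcher(head);
        return matcher.find() ? Optional.of(matcher.group(1)) : Optional.empty();
    }

    public static void main(String[] args) {
        byte[] sample = "%PDF-1.7\n".getBytes(StandardCharsets.ISO_8859_1);
        System.out.println(detectVersion(sample).orElse("no PDF header found"));
    }
}
```

The header is only a first approximation: from PDF 1.4 onward, a Version entry in the document catalog can override it, so a metadata library may report a different value than this sketch.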
The OperationFinished event occurs when an index operation completes – indexing, updating, merging, or optimizing (segment merging)...search over your PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX and more with...
This page explains how to build text search queries of various types. More examples on building search queries are provided on the page...search over your PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX and more with...
Access, read, update, and remove IPTC IIM metadata using GroupDocs.Metadata for Python via .NET....metadata of PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX, emails, images...
This page describes how to use document filters for indexing and covers every filter type, with examples of creating each....search over your PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX and more with...
This article explains how to access IPTC metadata in a file of any supported format. GroupDocs.Metadata for Java provides the IIptc.getIptcPackage method for this purpose....metadata of PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX, emails, images...
This article explains the main principles and stages of editing documents programmatically with the GroupDocs.Editor for .NET API....separator N/A N/A Presentation PPT, PPTX, PPTM, PPS, PPSX, PPSM...
This article explains how to highlight search results in the text of a document....search over your PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX and more with...