This code snippet demonstrates how to extract information about known properties that can be encountered in a particular package.
Load a file to examine Get a collection of PropertyDescriptor instances for any desired metadata package Iterate through the extracted descriptors advanced_usage.GettingKnownPropertyDescriptors
try (Metadata metadata = new Metadata(Constants.InputDoc)) { WordProcessingRootPackage root = metadata.getRootPackageGeneric(); for (PropertyDescriptor descriptor : root.getDocumentProperties().getKnowPropertyDescriptors()) { System.out.println(descriptor.getName()); System.out.println(descriptor.getType()); System.out.println(descriptor.getAccessLevel()); for (PropertyTag tag : descriptor.getTags()) { System.out.println(tag); } System.out.println(); } } Note Not all possible properties are presented in the getKnowPropertyDescriptors collection....Signature Product Solution GroupDocs...to view and edit metadata of PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX...
It supports DOCX, DOCM, DOC, DOT, DOTM, XLS, XLSX, Pdf, PPT, JPG, PNG, HTML, EML and many more....Signature Product Solution GroupDocs...Filters Saves the document as PDF (Adobe Portable Document) format...
This article shows how to check supported file formats....Signature Product Solution GroupDocs...welcome to extract data from PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX...
To extract a text from HTML documents GetText method is used. This method allows to extract a text from the entire document. Pagination and raw mode is not supported for emails....Signature Product Solution GroupDocs...documents and extract data from PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX...
This article explains how to detect document file type, size and calculate pages count when annotate documents or images with GroupDocs.Comparison....Signature Product Solution GroupDocs...represents the document file type (PDF, Word document, Excel spreadsheet...
This article shows how to get the basic document info....Signature Product Solution GroupDocs...welcome to extract data from PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX...
In some cases it’s required to specify the document format manually to guarantee correct output produced by GroupDocs.Parser. The following are the cases when the document format must be specified manually:
Markdown documents MHTML documents OTP documents (OpenDocument Presentation Template) Databases Emails from remote servers Here are the steps to specify the document format for Markup document.
Instantiate the LoadOptions object and pass the document format in LoadOptions(FileFormat) constructor; Create Parser object and call any method....Signature Product Solution GroupDocs...welcome to extract data from PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX...
This code snippet demonstrates how to extract information about known properties that can be encountered in a particular package.
Load a file to examine Get a collection of PropertyDescriptor instances for any desired metadata package Iterate through the extracted descriptors advanced_usage.GettingKnownPropertyDescriptors
JavaScript const metadata = new groupdocs.metadata.Metadata("input.doc"); var root = metadata.getRootPackageGeneric(); var descriptors = root.getDocumentProperties().getKnowPropertyDescriptors(); for(var i=0;iSignature Product Solution GroupDocs...to view and edit metadata of PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX...
GroupDocs.Metadata allows users to get meta information of a document....Signature Product Solution GroupDocs...to view and edit metadata of PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX...