This article shows how to check supported file formats....Products GroupDocs.Total Product Family GroupDocs.Viewer Product...Solution GroupDocs.Annotation Product Solution GroupDocs.Conversion...
This code snippet demonstrates how to extract information about known properties that can be encountered in a particular package.
Load a file to examine Get a collection of PropertyDescriptor instances for any desired metadata package Iterate through the extracted descriptors advanced_usage.GettingKnownPropertyDescriptors
try (Metadata metadata = new Metadata(Constants.InputDoc)) { WordProcessingRootPackage root = metadata.getRootPackageGeneric(); for (PropertyDescriptor descriptor : root.getDocumentProperties().getKnowPropertyDescriptors()) { System.out.println(descriptor.getName()); System.out.println(descriptor.getType()); System.out.println(descriptor.getAccessLevel()); for (PropertyTag tag : descriptor.getTags()) { System.out.println(tag); } System.out.println(); } } Note Not all possible properties are presented in the getKnowPropertyDescriptors collection....Products GroupDocs.Total Product Family GroupDocs.Viewer Product...Solution GroupDocs.Annotation Product Solution GroupDocs.Conversion...
To extract a text from HTML documents GetText method is used. This method allows to extract a text from the entire document. Pagination and raw mode is not supported for emails....Products GroupDocs.Total Product Family GroupDocs.Viewer Product...Solution GroupDocs.Annotation Product Solution GroupDocs.Conversion...
This article explains that how to extract HTML formatted text from document page in Java....Products GroupDocs.Total Product Family GroupDocs.Viewer Product...Solution GroupDocs.Annotation Product Solution GroupDocs.Conversion...
This article shows how to get the basic document info....Products GroupDocs.Total Product Family GroupDocs.Viewer Product...Solution GroupDocs.Annotation Product Solution GroupDocs.Conversion...
In some cases it’s required to specify the document format manually to guarantee correct output produced by Groupdocs.Parser. The following are the cases when the document format must be specified manually:
Markdown documents MHTML documents OTP documents (OpenDocument Presentation Template) Databases Emails from remote servers Here are the steps to specify the document format for Markup document.
Instantiate the LoadOptions object and pass the document format in LoadOptions(FileFormat) constructor; Create Parser object and call any method....Products GroupDocs.Total Product Family GroupDocs.Viewer Product...Solution GroupDocs.Annotation Product Solution GroupDocs.Conversion...
This article demonstrates that how you can specify starting page index (zero based) and the number of pages from this index to save a rasterized PDF...Products GroupDocs.Total Product Family GroupDocs.Viewer Product...Solution GroupDocs.Annotation Product Solution GroupDocs.Conversion...
Groupdocs.Metadata allows users to get meta information of a document....Products GroupDocs.Total Product Family GroupDocs.Viewer Product...Solution GroupDocs.Annotation Product Solution GroupDocs.Conversion...
This article explains that how to extract barcodes from documents....Products GroupDocs.Total Product Family GroupDocs.Viewer Product...Solution GroupDocs.Annotation Product Solution GroupDocs.Conversion...