This code snippet demonstrates how to extract information about known properties that can be encountered in a particular package.
Load a file to examine Get a collection of PropertyDescriptor instances for any desired metadata package Iterate through the extracted descriptors advanced_usage.GettingKnownPropertyDescriptors
try (Metadata metadata = new Metadata(Constants.InputDoc)) { WordProcessingRootPackage root = metadata.getRootPackageGeneric(); for (PropertyDescriptor descriptor : root.getDocumentProperties().getKnowPropertyDescriptors()) { System.out.println(descriptor.getName()); System.out.println(descriptor.getType()); System.out.println(descriptor.getAccessLevel()); for (PropertyTag tag : descriptor.getTags()) { System.out.println(tag); } System.out.println(); } } Note Not all possible properties are presented in the getKnowPropertyDescriptors collection....Redaction Product Solution GroupDocs...
To extract a text from HTML documents GetText method is used. This method allows to extract a text from the entire document. Pagination and raw mode is not supported for emails....Redaction Product Solution GroupDocs...
GroupDocs.Conversion for Python via .NET is a document conversion API that enables easy conversion of popular file formats, including DOCX, XLSX, PPTX, PDF, CAD drawings, and many more....Redaction Product Solution GroupDocs...
This article gives the knowledge about numeric range search which allows you to search in documents any integer numbers in the range from 0 to 9223372036854775807 (Int64.MaxValue) using Java search API....Redaction Product Solution GroupDocs...