Not all metadata properties extracted from a file are marked with tags. Some file formats and metadata standards allow adding fully custom properties that can’t be properly tagged by the library since their purpose is not clearly defined in the appropriate format/standard specification. In such cases, you can use the name of the property to locate and remove it. The following example demonstrates some advanced usage scenarios of the GroupDocs.Metadata search engine allowing to remove metadata properties....Signature Product Solution GroupDocs...Us Contact Customers Legal Security Events Acquisition Ask AI...
This article shows how to handle search queries if they contain special characters and separator characters....Signature Product Solution GroupDocs...Us Contact Customers Legal Security Events Acquisition GroupDocs...
Follow this guide and learn how to convert email attachments to different format based on attachment type using GroupDocs.Conversion for .Java....Signature Product Solution GroupDocs...Us Contact Customers Legal Security Events Acquisition Ask AI...
This article explains how to separately extract data from documents and add the extracted data to the index....Signature Product Solution GroupDocs...Us Contact Customers Legal Security Events Acquisition GroupDocs...
GroupDocs.Metadata for Java provides functionality that allows working with MPP files created by different versions of Microsoft Project. Please see the code samples below for more information.
Reading built-In metadata properties To access built-in metadata of a ProjectManagement document, please use the getDocumentProperties method defined in the DocumentRootPackage class.
The following code snippet extracts built-in metadata properties and displays them on the screen.
advanced_usage.managing_metadata_for_specific_formats.document.project_management.ProjectManagementReadBuiltInProperties
try (Metadata metadata = new Metadata(Constants.InputMpp)) { ProjectManagementRootPackage root = metadata....Signature Product Solution GroupDocs...Us Contact Customers Legal Security Events Acquisition Ask AI...
Reading Photoshop Metadata properties The GroupDocs Metadata API allows the user to read Adobe Photoshop metadata associated with a PSD image. For more information on the Photoshop file format and metadata blocks please refer to the specification: https://www.adobe.com/devnet-apps/photoshop/fileformatashtml/.
The code sample below demonstrates how to extract image resource blocks (building blocks of the Photoshop file format) from a PSD image.
Load a PSD file Get the root metadata package Extract the ImageResourcePackage instance and obtain a list of ImageResourceBlock objects Iterate trough the collection of resource blocks advanced_usage....Signature Product Solution GroupDocs...Us Contact Customers Legal Security Events Acquisition Ask AI...
System requirements, supported platforms, and environment setup for GroupDocs.Conversion for Node.js via Java. No Microsoft Office or other third‑party software is required....Signature Product Solution GroupDocs...Us Contact Customers Legal Security Events Acquisition Ask AI...
Sometimes when indexing, it is necessary to associate each document with certain additional metadata, for example, a set of tags, a number in the library catalog, the subject of a document, etc. To accomplish this task, additional fields can be added to each indexed document in addition to those already in the document itself.
Additional fields are associated with the document through the arguments of the FileIndexing event that occurs before indexing each added document....Signature Product Solution GroupDocs...Us Contact Customers Legal Security Events Acquisition GroupDocs...
GroupDocs.Parser provides the functionality to extract data from Microsoft Office Word documents. Both classic (doc, dot) and Open XML (docx, dotx) formats are supported. Also LibreOffice Writer (OpenOffice.org Writer) formats and RTF are supported.
The following table provides the list of supported formats:
Format Description DOC Microsoft Office Word Document DOT Microsoft Office Word Document Template DOCX Microsoft Office Open XML Document DOCM Microsoft Office Open XML Macro-Enabled Document DOTX Microsoft Office Open XML Document Template DOTM Microsoft Office Open XML Document Macro-Enabled Template TXT Plain text ODT Open Document Text OTT Open Document Text Template RTF Rich Text Format More resources GitHub examples You may easily run the code above and see the feature in action in our GitHub examples:...Signature Product Solution GroupDocs...Us Contact Customers Legal Security Events Acquisition Ask AI...