GroupDocs.Parser provides the functionality to extract data from Microsoft Office Word documents. Both classic (doc, dot) and Open XML (docx, dotx) formats are supported. Also LibreOffice Writer (OpenOffice.org Writer) formats and RTF are supported.
The following table provides the list of supported formats:
Format Description DOC Microsoft Office Word Document DOT Microsoft Office Word Document Template DOCX Microsoft Office Open XML Document DOCM Microsoft Office Open XML Macro-Enabled Document DOTX Microsoft Office Open XML Document Template DOTM Microsoft Office Open XML Document Macro-Enabled Template TXT Plain text ODT Open Document Text OTT Open Document Text Template RTF Rich Text Format More resources GitHub examples You may easily run the code above and see the feature in action in our GitHub examples:...extract data from PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX, Emails and...
This article shows how to add metadata properties which is the most sophisticated feature of the GroupDocs.Metadata search engine...metadata of PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX, emails, images...
This article gives the knowledge of the API methods which can be used to perform operations about Alphabets....search over your PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX and more with...
This article explains how the The GroupDocs.Metadata API allows reading some common MPEG audio properties from MP3 files....metadata of PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX, emails, images...
Reading PNG metadata properties The GroupDocs.Metadata API supports extracting format-specific information from PNG images.
The following are the steps to read the native PNG metadata.
Load a PNG image Get the root metadata package Extract the native metadata package using PngRootPackage.getPngPackage Read the PNG metadata properties advanced_usage.managing_metadata_for_specific_formats.image.png.PngReadTextChunks
try (Metadata metadata = new Metadata(Constants.InputPng)) { PngRootPackage root = metadata.getRootPackageGeneric(); for (PngTextChunk chunk : root.getPngPackage().getTextChunks()) { System.out.println(chunk.getKeyword()); System.out.println(chunk.getText()); if (chunk instanceof PngCompressedTextChunk) { PngCompressedTextChunk compressedChunk = (PngCompressedTextChunk) chunk; System....metadata of PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX, emails, images...
This article gives the knowledge of the API methods which can be used to perform operations about spelling corrector using Java....search over your PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX and more with...
This page contains information about getting indexed documents in the search network....search over your PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX and more with...