GroupDocs.Parser provides the functionality to extract data from Microsoft Office Word documents. Both classic (doc, dot) and Open XML (docx, dotx) formats are supported. Also LibreOffice Writer (OpenOffice.org Writer) formats and RTF are supported.
The following table provides the list of supported formats:
Format Description DOC Microsoft Office Word Document DOT Microsoft Office Word Document Template DOCX Microsoft Office Open XML Document DOCM Microsoft Office Open XML Macro-Enabled Document DOTX Microsoft Office Open XML Document Template DOTM Microsoft Office Open XML Document Macro-Enabled Template TXT Plain text ODT Open Document Text OTT Open Document Text Template RTF Rich Text Format More resources GitHub examples You may easily run the code above and see the feature in action in our GitHub examples:...Metadata Product Solution GroupDocs...Office Word documents Extract metadata from Microsoft Office Word...
GroupDocs.Parser provides the functionality to extract data from Microsoft Office PowerPoint presentations. Both classic (ppt, pot) and Open XML (pptx, potx) formats are supported. Also LibreOffice Impress (OpenOffice.org Impress) formats are supported.
The following table provides the list of supported formats:
Format Description PPT Microsoft Office PowerPoint Presentation PPS Microsoft Office PowerPoint Slideshow POT Microsoft Office PowerPoint Template PPTX Microsoft Office Open XML Presentation PPTM Microsoft Office Open XML Macro-Enabled Presentation POTX Microsoft Office Open XML Presentation Template POTM Microsoft Office Open XML Macro-Enabled Presentation Template PPSX Microsoft Office Open XML Presentation Slideshow PPSM Microsoft Office Open XML Macro-Enabled Presentation Slideshow ODP Open Document Presentation OTP Open Document Presentation Template More resources GitHub examples You may easily run the code above and see the feature in action in our GitHub examples:...Metadata Product Solution GroupDocs...PowerPoint presentations Extract metadata from Microsoft Office PowerPoint...
Discover how to protect documents with AI-powered watermarks using GroupDocs.Watermark. Learn to add, create, and customize watermarks automatically for enhanced document security and content protection....Document Analysis : Extract metadata including title, author, creation...based on document type and metadata. Building Your Watermarking...
GroupDocs.Conversion for Python via .NET is a document conversion API that enables easy conversion of popular file formats, including DOCX, XLSX, PPTX, PDF, CAD drawings, and many more....Metadata Product Solution GroupDocs...documents, retrieving document metadata, and performing conversions...
This topic describes how to use the GroupDocs.Viewer .NET API (C#) to display a list of archive content in HTML, PDF, PNG, and JPEG formats and to extract arbitrary file from archive....Metadata Product Solution GroupDocs...GroupDocs.Viewer displays a metadata for the files: a size in bytes...
This article details the process of how to extract text from DOCX using Java. It also includes code to assist you in implementing text extraction from DOCX in Java....Metadata Product Family GroupDocs.Search...to pull out text, images, metadata, and other elements from different...
This article shows how to to extract data from Microsoft Office Excel spreadsheets....Metadata Product Solution GroupDocs...Excel spreadsheets Extract metadata from Microsoft Office Excel...
The technology is growing at a huge pace and to stand strong through this storm you need to improve every day. Accordingly, to make you enhance your applications and take them to the next level, we keep trying to meet your emerging requirements by introducing new features and improving the existing ones. This is the reason we have introduced an optimized and simplified version of our Java document sanitization and text redaction API - GroupDocs....or find and replace text, metadata, and annotations in Word,...