GroupDocs.Parser provides the functionality To extract data from Microsoft Office Word documents. Both classic (doc, dot) and Open XML (docx, dotx) formats are supported. Also LibreOffice Writer (OpenOffice.org Writer) formats and RTF are supported.
The following table provides the list of supported formats:
Format Description DOC Microsoft Office Word Document DOT Microsoft Office Word Document Template DOCX Microsoft Office Open XML Document DOCM Microsoft Office Open XML Macro-Enabled Document DOTX Microsoft Office Open XML Document Template DOTM Microsoft Office Open XML Document Macro-Enabled Template TXT Plain text ODT Open Document Text OTT Open Document Text Template RTF Rich Text Format More resources GitHub examples You may easily run the code above and see the feature in action in our GitHub examples:...data from PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX, Emails and more...
This article explains that how To extract images from Microsoft Office Word (.doc, .docx) documents...data from PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX, Emails and more...
This article shows how To access IPTC metadata in a file of any supported format....pdf. Reading basic IPTC IIM properties...metadata of PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX, emails, images...
Understand and extract human-readable (interpreted) values for metadata properties using GroupDocs.Metadata for Python via .NET....metadata of PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX, emails, images...
Java API To remove all or selective metadata properties of DOCX, XLSX, PPTX, Pdf documents, JPEG, PNG, WebP images, Email, eBooks, Visio Drawings, Zip, etc....spreadsheets, presentations, PDF files, images, emails, eBooks, drawings...
This article explains the usage of SetProperties method is used To update or add metadata. You can easily add metadata To phoTos, Pdfs or you can update or add data To mp3 files....easily add metadata to photos, pdfs or you can update or add data...metadata of PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX, emails, images...
This article explains that how To integrate any paid or free OCR solution in Java....formats like PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX, Emails and more...
Using the GroupDocs.Metadata for Node.js via Java you can easily search metadata and extract desired metadata properties from Pdf, DOCX, PPTX, XLSX, images, audio, video and many other files of different types in your Java solution....metadata of PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX, emails, images...
This article shows that how To remove metadata properties from a file is To use corresponding tags that allow you To locate the desired properties across all metadata packages....metadata of PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX, emails, images...
The easiest way To remove metadata properties from a file is To use the corresponding tags that let you locate the desired properties across all metadata packages with GroupDocs.Metadata for Python via .NET....metadata of PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX, emails, images...