There might be cases when the document is presented only as a stream (without a copy on the local disk). To avoid the overhead of saving documents To the disk, GroupDocs.Parser enables To extract data from streams directly.
The following example shows how To load the document from the stream:
// Create the stream try (InputStream stream = new FileInputStream(Constants.SamplePdf)) { // Create an instance of Parser class with the stream try (Parser parser = new Parser(stream)) { // Extract a text inTo the reader try (TextReader reader = parser....data from PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX, Emails and more...
Reading JPEG2000 comments The GroupDocs.Metadata API supports extracting format-specific information from JPEG2000 images.
The following are the steps To read the JPEG2000 comments (pieces of metadata represented as strings with the length up To 64 kbytes).
Load a JPEG2000 image Get the root metadata package Extract the native metadata package using Jpeg2000RootPackage.Jpeg2000Package Read the JPEG2000 comments AdvancedUsage.ManagingMetadataForSpecificFormats.Image.Jpeg2000.Jpeg2000ReadComments
using (Metadata metadata = new Metadata(Constants.InputJpeg2000)) { var root = metadata.GetRootPackage(); if (root....metadata of PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX, emails, images...
This article explains that how To extract images from Microsoft Office Excel (.xls, .xlsx) spreadsheets....data from PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX, Emails and more...
Java API To remove all or selective metadata properties of DOCX, XLSX, PPTX, Pdf documents, JPEG, PNG, WebP images, Email, eBooks, Visio Drawings, Zip, etc....spreadsheets, presentations, PDF files, images, emails, eBooks, drawings...
Using the GroupDocs.Metadata for Node.js via Java you can easily search metadata and extract desired metadata properties from Pdf, DOCX, PPTX, XLSX, images, audio, video and many other files of different types in your Java solution....metadata of PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX, emails, images...
The easiest way To remove metadata properties from a file is To use corresponding tags that allow you To locate the desired properties across all metadata packages....metadata of PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX, emails, images...
Programmatically render animated GIF and APNG images To HTML web page in C# with play and pause feature. Build your own animation viewer using .NET....documents, spreadsheets, PDF, presentations, emails, ZIP archives, Visio...
GroupDocs.Parser provides the functionality To extract data from Microsoft Office Word documents. Both classic (doc, dot) and Open XML (docx, dotx) formats are supported. Also LibreOffice Writer (OpenOffice.org Writer) formats and RTF are supported.
The following table provides the list of supported formats:
Format Description DOC Microsoft Office Word Document DOT Microsoft Office Word Document Template DOCX Microsoft Office Open XML Document DOCM Microsoft Office Open XML Macro-Enabled Document DOTX Microsoft Office Open XML Document Template DOTM Microsoft Office Open XML Document Macro-Enabled Template TXT Plain text ODT Open Document Text OTT Open Document Text Template RTF Rich Text Format More resources GitHub examples You may easily run the code above and see the feature in action in our GitHub examples:...data from PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX, Emails and more...