To extract a text from PDF documents getText and getText(int) methods are used. These methods allow To extract a text from the entire document or a text from the selected page.
Here are the steps To extract a text from PDF document:
Instantiate Parser object for the initial document; Call getText method and obtain TextReader object; Read a text from reader. Warning getText method returns null value if text extraction isn’t supported for the document....Conversion Product Solution GroupDocs...supported for Zip archive. Therefore, for Zip archive method returns...
Retrieving information about a document with GroupDocs.Viewer for Node.js...Conversion Product Solution GroupDocs...information. For example, archive files (.7z, .rar, .zip, etc...
Retrieving information about a document with GroupDocs.Viewer for Java...Conversion Product Solution GroupDocs...information. For example, archive files (.7z, .rar, .zip, etc...
Access document properties and supported formats with GroupDocs.Viewer for Python....Conversion Product Solution GroupDocs...information. For example, archive files (.7z, .rar, .zip, etc...
To extract a text from Microsoft Office Excel spreadsheets getText and getText(int) method is used. These methods allow To extract a text from the entire document or a text from the selected page.
Here are the steps To extract a text from Microsoft Office Excel spreadsheets:
Instantiate Parser object for the initial spreadsheet; Call getText method and obtain TextReader object; Read a text from reader. Warning getText method returns null value if text extraction isn’t supported for the document....Conversion Product Solution GroupDocs...supported for Zip archive. Therefore, for Zip archive method returns...
GroupDocs Blog - GroupDocs Blog | Document AuTomation Solutions for .NET & Java Developers...Conversion for .NET 18.9 covers some...improvements and major bug fixes. Conversion from PLT and LGS formats...
Learn this article and check how To convert Microsoft Word DOCX, DOC, RTF documents To other formats with GroupDocs.Conversion for .NET....Conversion Product Solution GroupDocs...GroupDocs.Conversion Product Family / GroupDocs.Conversion for .NET...
To extract a text from Microsoft Office PowerPoint presentations getText and getText(int) method is used. These methods allow To extract a text from the entire presentation or a text from the selected slide.
Here are the steps To extract a text from Microsoft Office PowerPoint presentations:
Instantiate Parser object for the initial presentation; Call getText method and obtain TextReader object; Read a text from reader. Warning getText method returns null value if text extraction isn’t supported for the document....Conversion Product Solution GroupDocs...supported for Zip archive. Therefore, for Zip archive method returns...
Retrieving information about a document with GroupDocs.Viewer for .NET...Conversion Product Solution GroupDocs...information. For example, archive files (.7z, .rar, .zip, etc...
Render documents To HTML, PNG, JPEG, PDF. Extract text, list attachments, and transform pages with GroupDocs.Viewer for Python....Conversion Product Solution GroupDocs...extracted: Archive – list of folders contained in archive; CAD -...