This article explains document formats and format families supported by GroupDocs.EdiTor for Java and how To operate them in Java code....which includes PPT, PPS, POT, PPTX, PPTM etc. Text-based formats...formats, which includes only PDF at this time (for version 19...
This article describes the search options that can be specified in an instance of the SearchOptions class....search over your PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX and more...
To extract text from EPUB e-books getText and getText(pageIndex) methods is used. These methods allow To extract text from the entire document or a text from the selected page. Raw mode is not supported for EPUB....extract data from PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX, Emails...
Learn how To check which features are supported for a document using GroupDocs.Parser for .NET. Check text extraction, metadata, images, tables, and other feature support in C#....extract data from PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX, Emails...
This article demonstrates the ability To connect an external module (library) for the recognition of printed text (optical character recognition, OCR) on images, either separate or embedded in documents...search over your PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX and more...
GroupDocs.Search allows indexing documents from various sources....pdf' ; // Creating an index const...'c:/MyDocuments/ExampleDocument.pdf' ; // Creating an index const...
Using the GroupDocs.Metadata search engine you can extract desired metadata properties from files of different types. You don’t need To worry about the exact file format and metadata standards it can deal with. The same code will work for all supported formats in the same way. Most commonly used metadata properties are marked with tags that allow searching them across all supported files in various metadata packages. All tags defined in GroupDocs....edit metadata of PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX, emails...