The merge operation is designed to combine two or more indexes into one index to accelerate the search and to simplify the work with indexes. When merging, only the index at which the merge method was called is changed. This index as a result of the operation contains all the documents that were contained in all indexes together. The second index or index repository after the merge can be deleted to free up disk space....search over your PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX and more with...
This article demonstrate that how to associate each document with certain additional metadata....search over your PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX and more with...
The SetProperties method is used to update or add metadata. You can easily add metadata to photos, pdfs or you can update or add data to mp3 files....metadata of PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX, emails, images...
Learn how to extract images from documents using GroupDocs.Parser for .NET. Extract images with position data, rotation, and format information from PDF, Word, Excel in C#....images from PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX, Emails and...
Indexing reports are created for indexing and updating operations. Indexing reports can be retrieved from the index using the getIndexingReports method. Reports are stored in the index only while the index is loaded into RAM for use. If you reload the index, the reports will not be restored.
You can configure the maximum number of stored reports using the setMaxIndexingReportCount method of the IndexSettings class. The default value is 5. Learn more about index settings on the page Search index settings....search over your PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX and more with...
To extract hyperlinks from Microsoft Office Word document getStructure method is used. This method returns XML representation of the document. Hyperlinks are represented by “hyperlink” tag; “link” attribute contains hyperlink’s URL. For more details, see Extract text structure. Hyperlink can contain a text:
google.com Warning getStructure method returns null value if text structure extraction isn’t supported for the document. For example, text structure extraction isn’t supported for TXT files. Therefore, for TXT file getStructure method returns null....extract data from PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX, Emails and...
Here at GroupDocs we always look for new ways to enhance our products. We constantly strive to improve our user’s experience. Hence, we are excited to announce the new release of GroupDocs.Viewer for Java 3.2.2. The latest version of our document viewer API provides 10+ new features, 25+ improvements and fixes. Let’s explore the exciting features in GroupDocs.Viewer for Java 3.2.2.
Document Viewer API for Java - FeaturesFollowing features are announced in this latest release: Ability to specify custom font paths New conversion mechanism for displaying multipage TIFF files Implement option that allows setting text document encoding Implement method that returns supported document formats Implement file description property that returns document type format Provide JPEG image quality setting Implement configuration option that allows set cells sheet conversion mode when converting to PDF Add support for Portuguese locale Add ability to show/hide gridlines for excel files Implement PdfFileOptions same as another Options classes Process files from the stream without specifying the fileName parameter GroupDocs....to HTML Incorrect converting PPT file to HTML The .pdf document...