The merge operation is designed to combine two or more indexes into one index to accelerate the search and to simplify the work with indexes. When merging, only the index at which the merge method was called is changed. This index as a result of the operation contains all the documents that were contained in all indexes together. The second index or index repository after the merge can be deleted to free up disk space....NET library we provide simple, but powerful...
Let's convert HTML to TXT using Python with clean extraction. Easily export HTML to TXT in Python to capture readable text for indexing, logs, and lightweight storage....setting up the conversion library to enable seamless HTML-to-TXT...
This article shows how to sign documents in batch mode within the archive files with one or multiple electronic signatures...NET library we provide simple, but powerful...
To extract hyperlinks from Microsoft Office Word document getStructure method is used. This method returns XML representation of the document. Hyperlinks are represented by “hyperlink” tag; “link” attribute contains hyperlink’s URL. For more details, see Extract text structure. Hyperlink can contain a text:
google.com Warning getStructure method returns null value if text structure extraction isn’t supported for the document. For example, text structure extraction isn’t supported for TXT files. Therefore, for TXT file getStructure method returns null....NET library we provide simple, but powerful...
Indexing reports are created for indexing and updating operations. Indexing reports can be retrieved from the index using the getIndexingReports method. Reports are stored in the index only while the index is loaded into RAM for use. If you reload the index, the reports will not be restored.
You can configure the maximum number of stored reports using the setMaxIndexingReportCount method of the IndexSettings class. The default value is 5. Learn more about index settings on the page Search index settings....NET library we provide simple, but powerful...
This article shows you how to view and edit metadata of PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX, emails, images and more with our free online....NET library we provide simple, but powerful...
To extract a text from HTML documents GetText method is used. This method allows to extract a text from the entire document. Pagination and raw mode is not supported for emails....NET library we provide simple, but powerful...
This article explains how to sign document with Image electronic signatures using extended options with GroupDocs.Signature API....NET library we provide simple, but powerful...