Character replacement during indexing can be used, for example, to convert all text to lowercase characters or to remove diacritics from text. ...search over your PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX and more...
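As an illustration of the two replacements mentioned above, the following is a minimal sketch that uses only the Java standard library. It does not use the GroupDocs.Search API; the class name `CharacterReplacementSketch` and the method `normalize` are hypothetical names chosen for this example.

```java
import java.text.Normalizer;
import java.util.Locale;

public class CharacterReplacementSketch {

    // Hypothetical helper: strips diacritics and lowercases the text,
    // roughly what a character replacement step might do before indexing.
    static String normalize(String text) {
        // NFD splits an accented letter into a base letter plus combining marks.
        String decomposed = Normalizer.normalize(text, Normalizer.Form.NFD);
        // \p{M} matches the combining marks, which are then dropped.
        String withoutMarks = decomposed.replaceAll("\\p{M}+", "");
        // Locale.ROOT avoids locale-specific case rules (for example Turkish "I").
        return withoutMarks.toLowerCase(Locale.ROOT);
    }

    public static void main(String[] args) {
        // Unicode escapes keep the source file ASCII-only: "Creme BRULEE" with accents.
        System.out.println(normalize("Cr\u00E8me BR\u00DBL\u00C9E"));
    }
}
```

Applying the same normalization to search queries as to indexed text is what lets a query without accents match an accented word.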
This article explains how to extract hyperlinks from Microsoft Office Word (.doc, .docx) documents. ...extract data from PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX, Emails...
Indexing reports are created for indexing and updating operations. Indexing reports can be retrieved from the index using the getIndexingReports method. Reports are stored in the index only while the index is loaded into RAM for use. If you reload the index, the reports will not be restored.
You can configure the maximum number of stored reports using the setMaxIndexingReportCount method of the IndexSettings class. The default value is 5. Learn more about index settings on the Search index settings page.
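The idea of a capped, in-memory report list can be sketched as follows. This is not the GroupDocs.Search implementation: the class `BoundedReportLog` and its methods are hypothetical, reports are modeled as plain strings, and the choice to discard the oldest report when the cap is reached is an assumption made for illustration.

```java
import java.util.ArrayDeque;
import java.util.ArrayList;
import java.util.Deque;
import java.util.List;

public class BoundedReportLog {

    private final int maxCount;
    private final Deque<String> reports = new ArrayDeque<>();

    // maxCount plays the role of the configurable maximum (5 by default in the text above).
    BoundedReportLog(int maxCount) {
        if (maxCount < 1) {
            throw new IllegalArgumentException("maxCount must be at least 1");
        }
        this.maxCount = maxCount;
    }

    // Assumption: when the log is full, the oldest report is dropped first.
    void add(String report) {
        if (reports.size() == maxCount) {
            reports.removeFirst();
        }
        reports.addLast(report);
    }

    // Returns a copy, oldest report first. Nothing is persisted, so a new
    // instance (analogous to reloading the index) starts out empty.
    List<String> snapshot() {
        return new ArrayList<>(reports);
    }

    public static void main(String[] args) {
        BoundedReportLog log = new BoundedReportLog(5);
        for (int i = 1; i <= 7; i++) {
            log.add("report-" + i);
        }
        System.out.println(log.snapshot());
    }
}
```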
This article explains how to extract data from documents separately and add the extracted data to the index. ...pdf' ; // Extracting data from a...
GroupDocs.Search for Java supports removing indexed files and folders from an index. Only files or folders that were explicitly added to the index can be deleted.
This article describes the API methods for working with the stop word dictionary.
This article explains that GroupDocs.Metadata represents any loaded file as a tree consisting of metadata properties and nested metadata packages. You can easily iterate through the metadata tree...edit metadata of PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX, emails...
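To make the tree shape concrete, here is a minimal sketch of a recursive walk over packages and properties. It does not use the GroupDocs.Metadata API: the `Pkg` type, its fields, the `collect` method, and the sample names are all hypothetical stand-ins for illustration.

```java
import java.util.ArrayList;
import java.util.List;

public class MetadataTreeSketch {

    // Hypothetical node: a package holds property names and nested packages.
    static final class Pkg {
        final String name;
        final List<String> properties = new ArrayList<>();
        final List<Pkg> children = new ArrayList<>();

        Pkg(String name) {
            this.name = name;
        }
    }

    // Depth-first walk: records the full path of every property in the tree.
    static void collect(Pkg pkg, String prefix, List<String> out) {
        String path = prefix.isEmpty() ? pkg.name : prefix + "/" + pkg.name;
        for (String property : pkg.properties) {
            out.add(path + "/" + property);
        }
        for (Pkg child : pkg.children) {
            collect(child, path, out);
        }
    }

    public static void main(String[] args) {
        // Sample tree with made-up names.
        Pkg root = new Pkg("root");
        root.properties.add("FileFormat");
        Pkg xmp = new Pkg("xmp");
        xmp.properties.add("Creator");
        root.children.add(xmp);

        List<String> paths = new ArrayList<>();
        collect(root, "", paths);
        System.out.println(paths);
    }
}
```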