In some cases the document format must be specified manually to guarantee that GroupDocs.Parser produces correct output. These cases are:
Markdown documents, MHTML documents, OTP documents (OpenDocument Presentation Template), databases, and emails from remote servers. Here are the steps to specify the document format for a Markup document:
Instantiate the LoadOptions object and pass the document format to the LoadOptions(FileFormat) constructor; then create a Parser object and call any method.
Complete guide to extracting data, text, and tables from Excel spreadsheets (.xls, .xlsx) using GroupDocs.Parser for .NET. Learn how to extract tables from Excel in C#, with code examples.
Stop words are frequently used words that do not carry a semantic meaning and can be removed from an index to reduce its size.
You can enable or disable the use of stop words by calling the setUseStopWords method of the IndexSettings class. The default value is true, meaning that stop words are filtered during indexing and not added to the index.
A list of stop words to use during indexing can be specified in the stop word dictionary.
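To make the effect of stop-word filtering concrete, here is a minimal standalone sketch in plain Java. It does not use GroupDocs.Search: the `StopWordSketch` class, its `buildIndex` method, and the `useStopWords` flag are hypothetical names chosen only to mirror the behavior described above, where a `true` flag means stop words are skipped and never reach the index.

```java
import java.util.List;
import java.util.Locale;
import java.util.Map;
import java.util.Set;
import java.util.TreeMap;
import java.util.TreeSet;

public class StopWordSketch {

    // Hypothetical toy inverted index (term -> ids of documents containing it).
    // This illustrates the idea only; it is not the GroupDocs.Search implementation.
    static Map<String, Set<Integer>> buildIndex(List<String> docs,
                                                Set<String> stopWords,
                                                boolean useStopWords) {
        Map<String, Set<Integer>> index = new TreeMap<>();
        for (int id = 0; id < docs.size(); id++) {
            // Lowercase and split on runs of non-word characters.
            String[] tokens = docs.get(id).toLowerCase(Locale.ROOT).split("\\W+");
            for (String token : tokens) {
                if (token.isEmpty()) {
                    continue; // split may yield a leading empty token
                }
                if (useStopWords && stopWords.contains(token)) {
                    continue; // stop word: filtered out, not added to the index
                }
                index.computeIfAbsent(token, k -> new TreeSet<>()).add(id);
            }
        }
        return index;
    }

    public static void main(String[] args) {
        List<String> docs = List.of("The cat and the dog", "A dog in the park");
        // Assumed sample dictionary; a real stop word dictionary is much larger.
        Set<String> stopWords = Set.of("the", "and", "a", "in");

        System.out.println("filtered:   " + buildIndex(docs, stopWords, true).keySet());
        System.out.println("unfiltered: " + buildIndex(docs, stopWords, false).keySet());
    }
}
```

With filtering enabled the index holds fewer terms, which is the size reduction described above. The trade-off is that queries for the filtered words themselves can no longer be answered from the index.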