
Here you can specify more precisely the files and folders, that you want to include to your index file. All defined files, folders and url addresses will be parsed for terms (words), that will be stored in your search index file. The folders will be parsed recursively, the content of specified url addressess will be automatically downloaded and parsed.

Files and folders definition
-
Files and folders
A list of files and folders that will be parsed for search terms. Folders are parsed recursively, depending on the definition of include and exclude masks. -
Root folder (for addressing the files)
This option defines the addressing root for processed files. Here is an example of usage:
Site folder:
"Samples\TestSite"
Files to search:
"Samples\TestSite\Files\home.htm"
"Samples\TestSite\Files\example.htm"
If your root folder will be defined as "Samples\TestSite", then your files will be addressed as "files/home.htm" and "files/example.htm".
If your root folder will be defined as "Samples\TestSite\Files", then your files will be addressed as "home.htm" and "example.htm".
As you can see, this "addressing root folder" may be different from the "site folder", as it is sometimes used.
-
Include masks
Only the files, that matches one of the specified include masks, will be parsed. Of course, you can use this field to define such a mask, that will allow to parse only the wanted subfolders of the folder defined in the "Files and folders" field. If you leave this field empty, generator will automatically use the "*.htm,*.html" mask. -
Exclude masks
This field works opposite from the previous "Include masks" definition. For example, if you define the include mask as "*.*", and then you define exclude mask as a "*.php", the *.php files will be excluded from your search index. -
Additional url addresses
All the web resources included in this field (only http and https protocols are supported), will be also downloaded and parsed for search terms. Using these addresses, keep in mind, that some portions of search engine scripts will not work as expected, because you will access the resources from other domains. So for example, the words from the desired page will be found by a search engine and the page will be listed in a search results, but the highlighting of search words on the displayed page will not be functional.