Realm

Name

Realm -- describes web-space to index using regex/wild patterns

indexer.conf

Synopsis

Realm [Subsection] [Method] [CmpType] [Match | NoMatch] {arg} [alias]

Description

It works almost like "Server" command but takes a regular expression or a string wildcards as it's argument and do not insert any URL into database for indexing. To insert URLs into database use URL command.

"Subsection" is an optional parameter which can take one of the following values: site, world, with world as a default value. If Subsection is set to "site", then a link between two pages is followed only if the pages belong to the same site. If Section is set to world then no site checking is done.

"Method" is an optional parameter which can take one of the following values: Allow, Disallow, CheckOnly, HrefOnly, CheckMP3, CheckMP3Only with Allow as a default value.

"CmpType" is an optional parameter to specify comparison type and can take either String or RegExp value with String as a default value.

Optional "alias" argument allows to provide very complicated URL rewrite more powerful than other aliasing mechanism. Take a look into alias.txt for "alias" argument usage explanation.

Examples


# For example, if you want to index all HTTP sites in ".ru" domain, use:
Realm http://*.ru/*
# The same using "Regex" match:
Realm Regex ^http://.*\.ru/
# Another example. Use this command to index everything without .com domain:
Realm NoMatch http://*.com/*

See also

FlushServerTable, Server, ServerTable, Subnet, URL.