Webmirror 2.0

You can define one or more include commands to specify which pages to download. When a page is downloaded, all links in it are checked against the patterns defined in the include and exclude commands. If a URL matches any of the patterns defined in the include commands and does NOT match any of the patterns in the exclude commands, it is scheduled for retrieval.

The patterns are usually defined like this:

include http://www.mycom.com/mypath/*
to retrieve all URLs that are on the machine www.mycom.com below the path mypath. You can also say
exclude *.rm
exclude *.ram
not to retrieve RealMedia files.

The * (star) character can be used as a wildcard matching zero or more characters.
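
The selection rule described above can be sketched in a few lines of Python. This is only an illustration, not Webmirror's actual code: the function names wildcard_to_regex and should_retrieve are hypothetical, and the sketch assumes that a pattern must match the whole URL and that matching is case sensitive, neither of which is stated here.

```python
import re

def wildcard_to_regex(pattern):
    """Translate a pattern where '*' matches zero or more characters."""
    # Escape every literal piece, then join the pieces with '.*'
    # so that '*' is the only character with a special meaning.
    pieces = [re.escape(piece) for piece in pattern.split("*")]
    return re.compile(".*".join(pieces))

def should_retrieve(url, includes, excludes):
    """Return True if url matches an include pattern and no exclude pattern."""
    # Assumption: a pattern has to match the complete URL (fullmatch).
    included = any(wildcard_to_regex(p).fullmatch(url) for p in includes)
    excluded = any(wildcard_to_regex(p).fullmatch(url) for p in excludes)
    return included and not excluded

# The patterns from the examples above.
includes = ["http://www.mycom.com/mypath/*"]
excludes = ["*.rm", "*.ram"]

for url in ("http://www.mycom.com/mypath/index.html",
            "http://www.mycom.com/mypath/clip.rm",
            "http://www.other.com/index.html"):
    print(url, should_retrieve(url, includes, excludes))
```

With these patterns only the first URL is scheduled: the second is included but also matches an exclude pattern, and the third matches no include pattern.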

Using this command you can define an include domain.

Later versions will allow advanced users to use Perl regular expressions for the domain definition.