CorpusCatcher 0.1 released

CorpusCatcher 0.1 was released on 17 July 2008. Here is the announcement that was sent to the translate-announce mailing list:
Dear list member,

The first version of CorpusCatcher was released recently. CorpusCatcher is a toolset for creating language corpora by crawling the Web. It was based on BootCaT, but evolved into a stand-alone project. Thanks to Kevin Scannell for his advice in this regard.

Its main features are:
- Querying Yahoo! for pages containing specific seed words.
- Crawling the web for relevant pages.
- Extracting the text from found pages.
- Filtering results based on positive and/or negative word lists.
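As an illustration of the last feature, the sketch below shows one way filtering on positive and negative word lists could work. It is not CorpusCatcher's actual code: the function names, the `min_positive` threshold and the sample word lists are all assumptions made for this example, and it is written for current Python rather than Python 2.4.

```python
import re


def tokenize(text):
    """Lowercase the text and return the set of word tokens it contains."""
    return set(re.findall(r"\w+", text.lower()))


def passes_filter(text, positive=(), negative=(), min_positive=1):
    """Hypothetical filter, not CorpusCatcher's API.

    Reject the text if it contains any word from the negative list.
    Otherwise, if a positive list is given, keep the text only when at
    least `min_positive` distinct positive words occur in it.
    """
    words = tokenize(text)
    if any(word.lower() in words for word in negative):
        return False
    if positive:
        hits = sum(1 for word in positive if word.lower() in words)
        return hits >= min_positive
    return True


# Assumed sample data: common Afrikaans words as the positive list,
# common English words as the negative list.
positive_words = ["die", "nie", "het"]
negative_words = ["the", "and"]

pages = [
    "Ek het die boek nie gelees nie.",
    "The cat and the dog.",
    "Lorem ipsum dolor sit amet.",
]

kept = [page for page in pages
        if passes_filter(page, positive_words, negative_words)]
```

In this sketch a single negative word is enough to reject a page. A real tool might instead compare the counts of positive and negative words against a threshold.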

The release is available for download at https://sourceforge.net/project/showfiles.php?group_id=91920&package_id=...
The live documentation is available on the wiki at http://translate.sourceforge.net/wiki/corpuscatcher/index

Dependencies for using CorpusCatcher:
- Python >= 2.4
- mechanize 0.1.7b
- pYsearch 3.0

See http://translate.sourceforge.net/wiki/corpuscatcher/readme#installation for installation details.

Please report any bugs found at http://bugs.locamotion.org/