This utility is deprecated and no longer supported. It will not function properly with versions higher than 0.20.x.
The Check Corpus utility is a tool for use with POPFile to check the entries in the corpus word file to ensure that they do not include illegal imbedded spaces and that the word counts are integer values.
This version has been tested in a Windows environment with version 0.18.1 of POPFile. The author believes that the utility is platform independent and will work properly on non-Windows POPFile installs, but has not tested on those platforms.
POPFile is an automatic email classification tool authored by John Graham-Cumming available from SourceForge.
This program is deprecated, please use pfdiagnose instead, it includes all of the checks formerly performed by ck_corpus plus several new checks. ck_corpus is not compatible with POPFile versions greater than v 0.18.x.
Download the script to your POPFile install directory, normally c:\Program Files\Popfile by clicking here.
Open a DOS Command box (click the DOS icon on your desktop or Start/Run and type command in the open box and click ok).
Change to your POPFile installation directory, e.g.,
cd "\program files\popfile"
Run ck_corpus.pl using Perl.
perl ck_corpus.pl > report.txt
The resulting diagnostic report will be in the file named 'report.txt', open it with a text editor such as notepad.
start notepad.exe report.txt
Copyright (C) 2003 Scott W. Leighton
Licensed under the terms of the GNU General Public License.
Contributed to the POPFile project under the terms of the POPFile License Agreement.