cvs commit: src/games/fortune/datfiles limerick
Jens Schweikhardt
schweikh at FreeBSD.org
Wed Sep 28 11:18:30 PDT 2005
schweikh 2005-09-28 18:18:29 UTC
FreeBSD src repository
Modified files:
games/fortune/datfiles limerick
Log:
Nuke duplicates found via fuzzy logic:
- typos
- different spelling, punctuation, whitespace
- phonetically similar names
- words rearranged ("was once" vs "once was" etc)
If a limerick appeared as a single one and as part of a
double or triple, the singleton was removed.
With a little help from: sort limerick|uniq -d
This still turns up 20 lines being repeated, but the respective
limericks are sufficiently unique to leave them in (i.e. most differ
in at least two lines).
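
For illustration only, the exact-match check above can be stretched a little by normalizing each line before comparing. This is a sketch, not the procedure used for this commit: the helper name dup_lines is hypothetical, and it only catches differences in case, punctuation and whitespace. Typos, phonetically similar names and rearranged words would still need manual review.

```sh
# Hypothetical helper (not part of the commit): print normalized lines
# that occur more than once in the file given as $1.
# Normalization: lowercase, drop punctuation, collapse runs of blanks,
# trim leading and trailing blanks. The "%" separator lines of a fortune
# datfile become empty after punctuation removal and are discarded.
dup_lines() {
    tr '[:upper:]' '[:lower:]' < "$1" |
        tr -d '[:punct:]' |
        sed -e 's/[[:space:]][[:space:]]*/ /g' -e 's/^ //' -e 's/ $//' |
        grep -v '^$' |
        sort | uniq -d
}
```

Run as "dup_lines limerick", this lists each repeated line once in its normalized form. As with the plain sort|uniq pipeline, a repeated line only flags a candidate; whether the surrounding limericks are real duplicates is still a judgment call.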
Nuke spaces in front of colons while I'm here.
Revision  Changes    Path
1.9       +15 -401   src/games/fortune/datfiles/limerick