cvs commit: src/games/fortune/datfiles limerick

Jens Schweikhardt schweikh at FreeBSD.org
Wed Sep 28 11:18:30 PDT 2005


schweikh    2005-09-28 18:18:29 UTC

  FreeBSD src repository

  Modified files:
    games/fortune/datfiles limerick 
  Log:
  Nuke duplicates found via fuzzy logic:
  - typos
  - different spelling, punctuation, whitespace
  - phonetically similar names
  - words rearranged ("was once" vs "once was" etc)
  
  If a limerick appeared as a single one and as part of a
  double or triple, the singleton was removed.
  
  With a little help from: sort limerick|uniq -d
  This still turns up 20 lines being repeated, but the respective
  limericks are sufficiently unique to leave them in (i.e. most differ
  in at least two lines).
  
  Nuke spaces in front of colons while I'm here.
  
  Revision  Changes    Path
  1.9       +15 -401   src/games/fortune/datfiles/limerick


More information about the cvs-src mailing list