ports/132695: [NEW PORT] textproc/pystemmer: Snowball Stemming Algorithms for Information Retrieval
Wen Heping
wenheping at gmail.com
Mon Mar 16 13:10:02 UTC 2009
>Number: 132695
>Category: ports
>Synopsis: [NEW PORT] textproc/pystemmer: Snowball Stemming Algorithms for Information Retrieval
>Confidential: no
>Severity: non-critical
>Priority: low
>Responsible: freebsd-ports-bugs
>State: open
>Quarter:
>Keywords:
>Date-Required:
>Class: change-request
>Submitter-Id: current-users
>Arrival-Date: Mon Mar 16 13:10:01 UTC 2009
>Closed-Date:
>Last-Modified:
>Originator: Wen Heping
>Release: FreeBSD 8.0-CURRENT
>Organization:
ChangAn Middle School
>Environment:
FreeBSD fb8.wenjing.com 8.0-CURRENT-200811 FreeBSD 8.0-CURRENT-200811 #0: Thu Nov 6 00:32:12 UTC 2008 root at almeida.cse.buffalo.edu:/usr/obj/usr/src/sys/GENERIC i386
>Description:
PyStemmer provides access to efficient algorithms for calculating a
"stemmed" form of a word. This is a form with most of the common
morphological endings removed, hopefully representing a common
linguistic base form. This is most useful in building search engines
and information retrieval software; for example, a search with stemming
enabled should be able to find a document containing "cycling" given the
query "cycles".

PyStemmer provides algorithms for several (mainly European) languages,
by wrapping the libstemmer library from the Snowball project in a Python
module. It also provides access to the classic Porter stemming algorithm
for English: although this has been superseded by an improved algorithm,
the original algorithm may be of interest to information retrieval
researchers wishing to reproduce results of earlier experiments.
WWW: http://pypi.python.org/pypi/PyStemmer/
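
The "cycling"/"cycles" example above can be sketched in a few lines of
Python. This is only an illustration under stated assumptions: toy_stem,
build_index and search are hypothetical helpers written for this note, and
toy_stem is a crude suffix stripper, not the Snowball algorithm that
PyStemmer wraps. With the port installed, a call such as
Stemmer.Stemmer('english').stemWord(word) would presumably take the place
of toy_stem.

```python
# Hypothetical toy stemmer for illustration only. It is NOT the Snowball
# algorithm shipped in libstemmer; it just strips a few common endings.
SUFFIXES = ("ing", "es", "ed", "s", "e")


def toy_stem(word):
    """Lower-case the word and strip the first matching suffix."""
    word = word.lower()
    for suffix in SUFFIXES:
        # Keep at least three characters so short words stay intact.
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word


def build_index(documents):
    """Map each stemmed token to the set of document ids containing it."""
    index = {}
    for doc_id, text in documents.items():
        for token in text.split():
            stem = toy_stem(token.strip('.,;:!?"'))
            index.setdefault(stem, set()).add(doc_id)
    return index


def search(index, query):
    """Return ids of documents that contain every stemmed query term."""
    results = None
    for token in query.split():
        hits = index.get(toy_stem(token), set())
        results = hits if results is None else results & hits
    return results or set()


# Made-up sample documents, not taken from the port.
documents = {
    "doc1": "Cycling is popular in the city.",
    "doc2": "The bus was late again.",
}
index = build_index(documents)
matches = search(index, "cycles")
print(sorted(matches))
```

Both the indexed text and the query pass through the same stemming
function, which is why a query for "cycles" can reach a document that only
contains "cycling".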
>How-To-Repeat:
>Fix:
Patch attached with submission follows:
# This is a shell archive. Save it in a file, remove anything before
# this line, and then unpack it by entering "sh file". Note, it may
# create directories; files and directories will be owned by you and
# have default permissions.
#
# This archive contains:
#
# pystemmer
# pystemmer/pkg-plist
# pystemmer/pkg-descr
# pystemmer/distinfo
# pystemmer/Makefile
#
echo c - pystemmer
mkdir -p pystemmer > /dev/null 2>&1
echo x - pystemmer/pkg-plist
sed 's/^X//' >pystemmer/pkg-plist << 'f6776613564d7cdaeeefb4698a1823b1'
X%%PYTHON_SITELIBDIR%%/Stemmer.so
f6776613564d7cdaeeefb4698a1823b1
echo x - pystemmer/pkg-descr
sed 's/^X//' >pystemmer/pkg-descr << '564f59275c4afb38392523b797a7af32'
XPyStemmer provides access to efficient algorithms for calculating a
X"stemmed" form of a word. This is a form with most of the common
Xmorphological endings removed, hopefully representing a common
Xlinguistic base form. This is most useful in building search engines
Xand information retrieval software; for example, a search with stemming
Xenabled should be able to find a document containing "cycling" given the
Xquery "cycles".
X
XPyStemmer provides algorithms for several (mainly European) languages,
Xby wrapping the libstemmer library from the Snowball project in a Python
Xmodule. It also provides access to the classic Porter stemming algorithm
Xfor English: although this has been superseded by an improved algorithm,
Xthe original algorithm may be of interest to information retrieval
Xresearchers wishing to reproduce results of earlier experiments.
X
XWWW: http://pypi.python.org/pypi/PyStemmer/
564f59275c4afb38392523b797a7af32
echo x - pystemmer/distinfo
sed 's/^X//' >pystemmer/distinfo << '71cf3afae5a1d2b9aca66434d3497a4b'
XMD5 (PyStemmer-1.0.1.tar.gz) = 5fb09517161de1792b2fc1fd8844038b
XSHA256 (PyStemmer-1.0.1.tar.gz) = 08f0d64e7c49b77cb0f44937c25a04ed17ea1b9db3d830d47a02d2717ae89ebe
XSIZE (PyStemmer-1.0.1.tar.gz) = 331212
71cf3afae5a1d2b9aca66434d3497a4b
echo x - pystemmer/Makefile
sed 's/^X//' >pystemmer/Makefile << 'caff683fe7d0e1d270effb50522ae638'
X# New ports collection makefile for: pystemmer
X# Date created: 2009-03-17
X# Whom: Wen Heping <wenheping at gmail.com>
X#
X# $FreeBSD$
X#
X
XPORTNAME= pystemmer
XPORTVERSION= 1.0.1
XCATEGORIES= textproc python
XMASTER_SITES= http://snowball.tartarus.org/wrappers/
XDISTNAME= PyStemmer-${PORTVERSION}
X
XMAINTAINER= wenheping@gmail.com
XCOMMENT= Snowball Stemming Algorithms for Information Retrieval
X
XUSE_PYTHON= yes
XUSE_PYDISTUTILS= yes
XPYDISTUTILS_PKGNAME= PyStemmer
X
X.include <bsd.port.mk>
caff683fe7d0e1d270effb50522ae638
exit
>Release-Note:
>Audit-Trail:
>Unformatted: