1Text::English(3)      User Contributed Perl Documentation     Text::English(3)
2
3
4

NAME

6       Text::English - Porter's stemming algorithm
7

SYNOPSIS

9           use Text::English;
10           @stems = Text::English::stem( @words );
11

DESCRIPTION

13       This routine applies the Porter Stemming Algorithm to its parameters,
14       returning the stemmed words.  It is derived from the C program
15       "stemmer.c" as found in freewais and elsewhere, which contains these
16       notes:
17
18          Purpose:    Implementation of the Porter stemming algorithm documented
19                      in: Porter, M.F., "An Algorithm For Suffix Stripping,"
20                      Program 14 (3), July 1980, pp. 130-137.
21          Provenance: Written by B. Frakes and C. Cox, 1986.
22
23       I have re-interpreted areas that use Frakes and Cox's "WordSize"
24       function. My version may misbehave on short words starting with "y",
25       but I can't think of any examples.
26
27       The step numbers correspond to Frakes and Cox, and are probably in
28       Porter's article (which I've not seen).  Porter's algorithm still has
29       rough spots (e.g current/currency, -ings words), which I've not
30       attempted to cure, although I have added support for the British -ise
31       suffix.
32

NOTES

34       This is version 0.1. I would welcome feedback, especially improvements
35       to the punctuation-stripping step.
36

AUTHOR

38       Ian Phillipps <ian@unipalm.pipex.com>
39
41       Copyright Public IP Exchange Ltd (PIPEX).  Available for use under the
42       same terms as perl.
43
44
45
46perl v5.30.0                      2019-07-26                  Text::English(3)
Impressum