1Text::English(3) User Contributed Perl Documentation Text::English(3)
2
3
4
6 Text::English - Porter's stemming algorithm
7
9 use Text::English;
10 @stems = Text::English::stem( @words );
11
13 This routine applies the Porter Stemming Algorithm to its parameters,
14 returning the stemmed words. It is derived from the C program
15 "stemmer.c" as found in freewais and elsewhere, which contains these
16 notes:
17
18 Purpose: Implementation of the Porter stemming algorithm documented
19 in: Porter, M.F., "An Algorithm For Suffix Stripping,"
20 Program 14 (3), July 1980, pp. 130-137.
21 Provenance: Written by B. Frakes and C. Cox, 1986.
22
23 I have re-interpreted areas that use Frakes and Cox's "WordSize"
24 function. My version may misbehave on short words starting with "y",
25 but I can't think of any examples.
26
27 The step numbers correspond to Frakes and Cox, and are probably in
28 Porter's article (which I've not seen). Porter's algorithm still has
29 rough spots (e.g current/currency, -ings words), which I've not
30 attempted to cure, although I have added support for the British -ise
31 suffix.
32
34 This is version 0.1. I would welcome feedback, especially improvements
35 to the punctuation-stripping step.
36
38 Ian Phillipps <ian@unipalm.pipex.com>
39
41 Copyright Public IP Exchange Ltd (PIPEX). Available for use under the
42 same terms as perl.
43
44
45
46perl v5.32.1 2021-01-27 Text::English(3)