PREP(1) General Commands Manual PREP(1)

NAME
prep - prepare text for statistical processing

SYNOPSIS
prep [ -diop ] file ...

DESCRIPTION
Prep reads each file in sequence and writes it on the standard output,
one `word' to a line. A word is a string of alphabetic characters and
embedded apostrophes, delimited by space or punctuation. Hyphenated
words are broken apart; a hyphen at the end of a line is removed and
the two parts of the word are joined. Strings of digits are discarded.
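
The word rules above can be approximated in a few lines of Python. This
is only a sketch of the behavior as described here, not the original
implementation. The function name prep_words is hypothetical, and the
sketch assumes ASCII letters and leaves letter case untouched, since
this page does not say how case is handled.

```python
import re

def prep_words(text):
    """Split text into words following the rules described on this page.

    A sketch under stated assumptions: ASCII letters only, case preserved.
    """
    # Rejoin a word split by a hyphen at the end of a line:
    # "sys-\ntem" becomes "system".
    text = re.sub(r"-[ \t]*\n[ \t]*", "", text)
    # A word is a run of letters that may contain embedded apostrophes.
    # Everything else (spaces, punctuation, hyphens, digits) separates words,
    # so hyphenated words fall apart and digit strings are dropped.
    return re.findall(r"[A-Za-z]+(?:'[A-Za-z]+)*", text)


if __name__ == "__main__":
    sample = "The state-of-the-art sys-\ntem isn't 42 bad."
    for word in prep_words(sample):
        print(word)
```

Run on the sample string, the sketch prints The, state, of, the, art,
system, isn't and bad, one per line.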

The following option letters may appear in any order:

-d  Print the word number (in the input stream) with each word.

-i  Take the next file as an `ignore' file. Words listed in it do not
    appear in the output, but they are still counted for purposes of
    the -d count.

-o  Take the next file as an `only' file. Only words listed in it
    appear in the output. All other words are still counted for
    purposes of the -d count.

-p  Include punctuation marks (single non-alphanumeric characters) as
    separate output lines. Punctuation marks are not counted for the
    -d count.

Ignore and only files contain words, one per line.
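
How the options interact with the word count can be illustrated with a
short Python sketch. It is not the original implementation. The function
name prep_lines and its keyword arguments are hypothetical stand-ins for
-d, -i, -o and -p. The output format "number, space, word", exact
case-sensitive matching against the ignore and only lists, and printing
punctuation without a number are all assumptions, since this page does
not specify them.

```python
import re

# A word (letters with embedded apostrophes) or a single punctuation mark.
TOKEN = re.compile(
    r"(?P<word>[A-Za-z]+(?:'[A-Za-z]+)*)|(?P<punct>[^A-Za-z0-9\s])"
)

def prep_lines(text, number=False, ignore=None, only=None, punct=False):
    """Return output lines for text; a sketch of the option semantics.

    number -> -d, ignore -> -i (set of words), only -> -o (set of words),
    punct -> -p. The output format is an assumption.
    """
    # Rejoin words split by a hyphen at the end of a line.
    text = re.sub(r"-[ \t]*\n[ \t]*", "", text)
    lines = []
    count = 0
    for match in TOKEN.finditer(text):
        token = match.group(0)
        if match.lastgroup == "punct":
            # Punctuation is emitted only with -p and never counted.
            if punct:
                lines.append(token)
            continue
        # Every word is counted, even if it is filtered out below.
        count += 1
        if ignore is not None and token in ignore:
            continue
        if only is not None and token not in only:
            continue
        lines.append(f"{count} {token}" if number else token)
    return lines


if __name__ == "__main__":
    text = "a cat, a dog."
    print(prep_lines(text, number=True, ignore={"a"}))
    print(prep_lines(text, number=True, only={"dog"}))
    print(prep_lines(text, punct=True))
```

With the sample text, ignoring "a" yields "2 cat" and "4 dog": the
ignored words still advance the count. Keeping only "dog" yields
"4 dog", and punct=True adds "," and "." as separate lines.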

SEE ALSO
deroff(1)