CSVJOIN(1)                       User Commands                      CSVJOIN(1)



       csvjoin - manual page for csvjoin 3.3.0

       usage: csvjoin [-h] [-t] [-j {inner,outer}] [-m MINSCORE] [-c COUNT]
                      [-w WARP]

                      left leftcolumn right rightcolumn outfile

       Left similarity join between two CSV files. For each row in the first
       file, take the specified join column and find similar rows in the
       second file, based on n-gram similarity to a specified column in the
       second file. For each resulting pair of rows, output a row consisting
       of the fields from the first file, a column with the similarity value,
       and then the fields from the second file.
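
       The join described above can be sketched in Python. This is only an
       illustration, not the csvjoin implementation: the helper names
       (ngrams, similarity, similarity_join), the padded trigram scheme, and
       the shared-over-all similarity measure are assumptions chosen to show
       the shape of the output rows.

```python
def ngrams(text, n=3):
    """Return the set of character n-grams of text (assumed scheme).

    The text is lower-cased and padded with '$' so that short strings
    still produce a few n-grams.
    """
    padded = "$" * (n - 1) + text.lower() + "$" * (n - 1)
    return {padded[i:i + n] for i in range(len(padded) - n + 1)}


def similarity(a, b, n=3):
    """Shared n-grams divided by all distinct n-grams of both strings."""
    grams_a, grams_b = ngrams(a, n), ngrams(b, n)
    union = grams_a | grams_b
    return len(grams_a & grams_b) / len(union) if union else 0.0


def similarity_join(left_rows, left_col, right_rows, right_col, minscore=0.24):
    """Pair each left row with every sufficiently similar right row.

    Each output row is: left fields, similarity value, right fields.
    Columns are given as 0-based indexes in this sketch.
    """
    out = []
    for left_row in left_rows:
        for right_row in right_rows:
            score = similarity(left_row[left_col], right_row[right_col])
            if score >= minscore:
                out.append(left_row + [score] + right_row)
    return out


left = [["1", "Jon Smith"]]
right = [["John Smith", "NYC"], ["Alice", "LA"]]
for row in similarity_join(left, 1, right, 0):
    print(row)
```

       In this sketch only the "John Smith" row pairs with "Jon Smith"; the
       "Alice" row shares no trigrams with it and falls below the threshold.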

   positional arguments:
       left   First CSV file

       leftcolumn
              Column in first CSV file

       right  Second CSV file

       rightcolumn
              Column in second CSV file

       outfile
              Output CSV file

   optional arguments:
       -h, --help
              Show this help message and exit.

       -t, --titles
              Input files have column titles.

       -j {inner,outer}, --join {inner,outer}
              The kind of left join to perform. An outer join outputs
              left-hand rows that have no right-hand match, while an inner
              join discards such rows. Default: outer

       -m MINSCORE, --minscore MINSCORE
              Minimum match score. Default: 0.24

       -c COUNT, --count COUNT
              Maximum number of rows to match (0 for all). Default: 0

       -w WARP, --warp WARP
              N-gram warp; higher values help short strings. Default: 1.0
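
       How the options above might interact can be sketched in Python. This
       is a hedged illustration rather than the csvjoin source. The function
       names are hypothetical, the warp formula is an assumption (one
       formulation in which a warp above 1.0 raises the score of partial
       matches), and the choices to keep the highest-scoring rows first and
       to emit an unmatched left row without padding are also assumptions.

```python
def warped_similarity(same, total, warp=1.0):
    """Assumed warp formula: (total**warp - different**warp) / total**warp.

    same  -- number of n-grams shared by the two strings
    total -- number of distinct n-grams across both strings
    With warp == 1.0 this reduces to same / total.
    """
    if total == 0:
        return 0.0
    different = total - same
    return (total ** warp - different ** warp) / total ** warp


def select_matches(left_row, scored, join="outer", minscore=0.24, count=0):
    """Build the output rows for one left-hand row.

    scored is a list of (score, right_row) pairs. Matches below minscore
    are dropped, the rest are ordered best first (an assumption), and
    count > 0 caps how many are kept (0 keeps all).
    """
    kept = sorted(
        (pair for pair in scored if pair[0] >= minscore),
        key=lambda pair: pair[0],
        reverse=True,
    )
    if count > 0:
        kept = kept[:count]
    if kept:
        return [left_row + [score] + right_row for score, right_row in kept]
    # No match: an outer join still emits the left row (padding is not
    # modelled here); an inner join discards it.
    return [left_row] if join == "outer" else []


candidates = [(0.9, ["a"]), (0.1, ["b"]), (0.5, ["c"])]
print(select_matches(["L"], candidates, count=1))
print(select_matches(["L"], [(0.1, ["b"])], join="inner"))
print(warped_similarity(2, 4, warp=1.0), warped_similarity(2, 4, warp=2.0))
```

       Under the assumed formula, two strings sharing 2 of 4 n-grams score
       0.5 at warp 1.0 and 0.75 at warp 2.0, which matches the intent that a
       higher warp helps short strings.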



csvjoin 3.3.0                      June 2018                        CSVJOIN(1)