1EXTRACT(1)                  General Commands Manual                 EXTRACT(1)
2
3
4

NAME

6       extract - determine meta-information about a file
7

SYNOPSIS

9       extract  [  -bghLnvV ] [ -H hash-algorithm ] [ -i ] [ -l library ] [ -p
10       type ] [ -x type ] file ...
11

DESCRIPTION

13       This manual page documents version 0.6.0 of the extract command.
14
15       extract tests each file specified in the argument list in an attempt to
16       infer  meta-information  from  it.   Each  file  is  subjected  to  the
17       meta-data extraction libraries from libextractor.
18
19       libextractor classifies meta-information (also referred to as keywords)
20       into types. A list of all types can be obtained with the -L option.
21
22

OPTIONS

24       -b      Display the output in BiBTeX format.
25
26       -g      Use  grep-friendly  output  (all  keywords on a single line for
27               each file).  Use the  verbose  option  to  print  the  filename
28               first,  followed by the keywords.  Use the verbose option twice
29               to also display the keyword types.  This option will not  print
30               keyword types or non-textual metadata.
31
32       -h      Print a brief summary of the options.
33
34       -i      Run plugins in-process (for debugging).  By default, each plug‐
35               in is run in its own process.
36
37       -l libraries
38               Use the specified libraries to extract  keywords.  The  general
39               format  of  libraries  is .I [[-]LIBRARYNAME[:[-]LIBRARYNAME]*]
40               where LIBRARYNAME is a libextractor compatible library and typ‐
41               ically  of  the  form  .Ijpeg. The minus before the libraryname
42               indicates that this library should be removed from the existing
43               list.   To  run only a few selected plugins, use -l in combina‐
44               tion with -n.
45
46       -L      Print a list of all known keyword types.
47
48       -n      Do not use the default set of extractors (typically  all  stan‐
49               dard extractors, currently mp3, ogg, jpg, gif, png, tiff, real,
50               html, pdf and mime-types), use only  the  extractors  specified
51               with the .B -l option.
52
53       -p  type
54               Print  only  the  keywords  matching  the  specified  type.  By
55               default, all keywords that are found and not removed as  dupli‐
56               cates are printed.
57
58       -v      Print the version number and exit.
59
60       -V      Be  verbose.   This  option  can be specified multiple times to
61               increase verbosity further.
62
63       -x  type
64               Exclude keywords of the specified  type  from  the  output.  By
65               default,  all keywords that are found and not removed as dupli‐
66               cates are printed.
67

SEE ALSO

69       libextractor(3) - description of the libextractor library
70

EXAMPLES

72       $ extract test/test.jpg
73       comment - (C) 2001 by Christian Grothoff, using gimp 1.2 1
74       mimetype - image/jpeg
75
76       $ extract -V -x comment test/test.jpg
77       Keywords for file test/test.jpg:
78       mimetype - image/jpeg
79
80       $ extract -p comment test/test.jpg
81       comment - (C) 2001 by Christian Grothoff, using gimp 1.2 1
82
83       $ extract -nV -l png.so -p comment test/test.jpg test/test.png
84       Keywords for file test/test.jpg:
85       Keywords for file test/test.png:
86       comment - Testing keyword extraction
87
88
90       libextractor and the extract tool are released under the  GPL.   libex‐
91       tractor is a GNU package.
92
93

BUGS

95       A couple of file-formats (on the order of 10^3) are not recognized...
96
97

AUTHORS

99       extract   was   originally   written   by  Christian  Grothoff  <chris‐
100       tian@grothoff.org> and Vidyut Samanta <vids@cs.ucla.edu>.  Use  <libex‐
101       tractor@gnu.org> to contact the current maintainer(s).
102
103

AVAILABILITY

105       You   can   obtain   the   original   author's   latest   version  from
106       http://www.gnu.org/software/libextractor/
107
108
109
110libextractor 0.6.0               Dec 20, 2009                       EXTRACT(1)
Impressum