zarafa-indexer.cfg(5)

1ZARAFA-INDEXER.CF(5)         Zarafa user reference        ZARAFA-INDEXER.CF(5)
2
3
4

NAME

6       zarafa-indexer.cfg - The Zarafa Indexer configuration file
7

SYNOPSIS

9       indexer.cfg
10

DESCRIPTION

12       The indexer.cfg is a configuration file for the Zarafa Indexer Service.
13       indexer.cfg contains instructions for the software to set up the
14       connection, indexing and logging systems.
15

FILE FORMAT

17       The file consists of one big section, but parameters can be grouped by
18       functionality.
19
20       The parameters are written in the form:
21
22       name = value
23
24       The file is line-based. Each newline-terminated line represents either
25       a comment, nothing, a parameter or a directive. A line beginning with
26       `#´ is considered a comment, and will be ignored by Zarafa. Parameter
27       names are case sensitive. Lines beginning with `!´ are directives.
28
29       Directives are written in the form:
30
31       !directive [argument(s)]
32
33       The following directives exist:
34
35       include
36           Include and process argument
37
38           Example: !include common.cfg
39

EXPLANATION OF THE SERVER SETTINGS PARAMETERS

41       server_socket
42           Socket to find the connection to the Zarafa server.
43
44           Default: file:///var/run/zarafa
45
46       run_as_user
47           After correctly starting, the server process will become this user,
48           dropping root privileges. Note that the log file needs to be
49           writeable by this user, and the directory too to create new
50           logfiles after logrotation. This can also be achieved by setting
51           the correct group and permissions.
52
53           Default value is empty, not changing the user after starting.
54
55       run_as_group
56           After correctly starting, the server process will become this
57           group, dropping root privileges.
58
59           Default value is empty, not changing the group after starting.
60
61       pid_file
62           Write the process ID number to this file. This is used by the
63           init.d script to correctly stop/restart the service.
64
65           Default: /var/run/zarafa-indexer.pid
66
67       running_path
68           Change directory to this path when running in daemonize mode. When
69           using the -F switch to run in the foreground the directory will not
70           be changed.
71
72           Default: /
73
74       cleanup_lockfiles
75           If for some unfortunate reason the zarafa-indexer is not cleanly
76           stopped, it may leave lockfiles behind on stores it was processing.
77           To continue the indexing on this possibly corrupt index, you need
78           to remove the lockfile. To restart the index of the store, you need
79           to remove the complete index directory. This directory will be
80           mentioned in the log. You may choose to automatically remove the
81           lockfiles by setting this option to yes. The index will be left on
82           disk.
83
84           Default: no
85
86       limit_results
87           Limit the number of results returned to the server. If set to a
88           nonzero value, only the N most relevant results will be returned to
89           the server. This in turn limits the number of results that the user
90           will see. If the limit is set to 0, all results will be returned.
91
92           Default:0
93

EXPLANATION OF THE SSL SETTINGS PARAMETERS

95       ssl_key_file
96           The file containing the private key and certificate. Please read
97           the SSL section in the zarafa-server(1) manual on how to create
98           this file.
99
100           Default: /etc/zarafa/ssl/indexer.pem
101
102       ssl_key_pass
103           Enter you password here when your key file contains a password to
104           be readable.
105
106           No default set.
107
108       ssl_private_key_file
109           The indexer will use this file as private key for SSL TLS. This
110           file can be created with: openssl genrsa -out
111           /etc/zarafa/indexer/privkey.pem 2048.
112
113           Default: /etc/zarafa/indexer/privkey.pem
114
115       ssl_certificate_file
116           The indexer will use this file as certificate for SSL TLS. A
117           self-signed certificate can be created with: openssl req -new -x509
118           -key /etc/zarafa/indexer/privkey.pem -out
119           /etc/zarafa/indexer/cert.pem -days 1095.
120
121           Default: /etc/zarafa/indexer/cert.pem
122

EXPLANATION OF THE LOGGING SETTINGS PARAMETERS

124       log_method
125           The method which should be used for logging. Valid values are:
126
127           syslog
128               Use the Linux system log. All messages will be written to the
129               mail facility. See also syslog.conf(5).
130
131           file
132               Log to a file. The filename will be specified in log_file.
133
134           Default: file
135
136       log_file
137           When logging to a file, specify the filename in this parameter. Use
138           - (minus sign) for stderr output.
139
140           Default: -
141
142       log_level
143           The level of output for logging in the range from 0 to 5. 0 means
144           no logging, 5 means full logging.
145
146           Default: 2
147
148       log_timestamp
149           Specify whether to prefix each log line with a timestamp in ´file´
150           logging mode.
151
152           Default: 1
153

EXPLANATION OF THE INDEXER SETTINGS PARAMETERS

155       server_bind_name
156           Connection path to which other processes can connect with the
157           zarafa-indexer for performing search queries.
158
159           Use http://0.0.0.0:port to listen as an HTTP service on all network
160           interfaces on the given port number.
161
162           Default: file:///var/run/zarafa-indexer
163
164       index_path
165           Base directory under which all index files will be placed, the
166           hierarchy below this folder will be:
167           ${index_path}/${server}/${user}/index
168
169           Default: /var/lib/zarafa/index/
170
171       index_sync_stream
172           Enable streaming synchronization. This can increase synchronization
173           speed depending on the setup. During streaming synchronization all
174           messages which will be synchronized will be downloaded in a single
175           call requiring more overall memory. When index_attachments is
176           disabled streaming will cause overhead because all attachments will
177           be downloaded to the client even though they will not be indexed.
178
179           This option requires enable_enhanced_ics to be enabled in the
180           zarafa-server.cfg(5), otherwise the indexing will still fallback to
181           the slower synchronization.
182
183           Default: yes
184
185       index_interval
186           Interval (in minutes) for indexing. During every interval the
187           zarafa-indexer will synchronize with the zarafa-server to collect
188           all changes for each store and update the index files for the
189           stores accordingly.
190
191           Default: 5 minutes
192
193       index_threads
194           Maximum number if indexing threads. In a multi-server environment
195           it is possible to distribute the work for different zarafa-servers
196           over different indexing threads. Note that the work for a single
197           server will be exclusively assigned to a single thread, thus the
198           number of started indexing threads will never exceed the number of
199           zarafa-servers.
200
201           Default: 1
202
203       index_max_field_length
204           Maximum number of words from a single message to index. Only the
205           first index_max_field_length from a single message will be indexed,
206           all words above this value will be discarded.
207
208           This value is used to control the amount of required memory during
209           the indexing process. More memory will be required during indexing
210           when index_max_field_length is set to a higher value.
211
212           Default: 10000
213
214       index_merge_factor
215           Number of index file segments per store before Lucene merges the
216           segments into a single file.
217
218           A low value will cause less memory to be used during indexing, but
219           the increased IO access to disk causes the indexing process to be
220           slower, while searching will be faster. A high value will speed up
221           the indexing process while searching will be slower
222
223           Batch indexing
224               If index_interval is set to a high value, set
225               index_merge_factor to a high value (> 10).
226
227           Interactive indexing
228               If index_interval is set to a low value, set index_merge_factor
229               to a low value (< 10).
230
231           Default: 10
232
233       index_max_buffered_docs
234           Max number of documents kept in memory before CLucene will write a
235           new index file segment to disk.
236
237           Larger values will increase memory usage but makes the indexing
238           process faster.
239
240           Default: 10
241
242       index_min_merge_docs
243           Minumum number of messages in a single store which are indexed in
244           memory before the index writer flushes the index to disk as new
245           index file segment.
246
247           Creating new index file segments often increases IO access to disk
248           but reduces the amount of memory required during the indexing
249           process.
250
251           Default: 10
252
253       index_max_merge_docs
254           Maximum number of documents in a index file segment, when an index
255           file segment contains index_max_merge_docs documents it will no
256           longer be merged with other index file segments.
257
258           This will limit the total size of an index file segment but will
259           trigger more index file segments to be created.
260
261           Batch indexing
262               If index_interval is set to a high value, set
263               index_max_merge_docs to a high value (> 10000).
264
265           Interactive indexing
266               If index_interval is set to a low value, set
267               index_max_merge_docs to a low value (< 10000).
268
269           Default: 2147483647 (INT32_MAX)
270
271       index_term_interval
272           The fraction of terms in the "dictionary" which should be stored in
273           memory. Smaller values use more memory, but make searching slightly
274           faster, while larger values use less memory and make searching
275           slightly slower. Searching is typically not dominated by dictionary
276           lookup, so tweaking this is rarely useful.
277
278           Default: 128
279
280       index_cache_timeout
281           Timeout value (in seconds) for purging all caches used during the
282           indexing process. If set to 0 caching will be disabled.
283
284           Default: 0
285

EXPLANATION OF THE ATTACHMENT INDEXER SETTINGS PARAMETERS

287       index_attachments
288           Enable indexing of attachments. When attachments are being indexed,
289           searching for keywords in the body of a message will automatically
290           cause the attachment to be searched as well.
291
292           This will slow down the indexing process, require more system
293           memory and increases index file size.
294
295           Default: yes
296
297       index_attachment_max_size
298           Maxiumum file size for attachments to be indexed. Any attachment
299           larger then this amount (in kilobytes) will not be indexed.
300
301           Default: 5120
302
303       index_attachment_parser
304           For indexing attachment each file must be converted into plain-text
305           by a parser. For this purpose the attachments_parser script has
306           been installed which can be configured to parse specific
307           mime-types.
308
309           It is highly advisable to check the attachments_parser.db file to
310           add or remove parser command for specific mime-types and/or
311           file-extensions.
312
313           Default: /etc/zarafa/indexerscripts/attachments_parser
314
315       index_attachment_parser_max_memory
316           Limit the maximum amount of memory (in bytes) the parser may use
317           for converting the attachment into plain-text. If this limit is
318           exceeded the parser will be killed and the attachment will not be
319           completely converted. If the value is set to 0 the limit is
320           considered as infinite.
321
322           The attachment parser depends on external tools to convert
323           attachments to plain-text, by limiting the amount of resources the
324           parser is allowed to use problems can be prevented when the
325           selected external tool misbehaves with certain attachments.
326
327           Default: 0 bytes
328
329       index_attachment_parser_max_cputime
330           Limit the maximum cpu time (in seconds) the parser may use for
331           converting the attachment into plain-text. If this limit is
332           exceeded the parser will be killed and the attachment will not be
333           completely converted. If the value is set to 0 the limit is
334           considered as infinite.
335
336           The attachment parser depends on external tools to convert
337           attachments to plain-text, by limiting the amount of resources the
338           parser is allowed to use problems can be prevented when the
339           selected external tool misbehaves with certain attachments.
340
341           Default: 0 seconds
342
343       index_attachment_mime_filter
344           Some attachments are not intresting to run through the parser. With
345           this option you can filter out attachment which have a specific
346           mimetype. Only the first part of the mimetype should be given here.
347           Good examples of a mimetype to filter is ´image´, ´audio´ and
348           ´video´. This field is space separated.
349
350           Default:
351
352       index_attachment_extension_filter
353           As the mime filter above, but works on the attachment extension.
354           This is only tested if the mimetype of an attachment was unknown.
355           Good examples of an extension to filter is ´gif´, ´jpeg´, ´jpg´ and
356           ´png´. This field is space separated.
357
358           Default:
359
360       index_max_clauses
361           This is the maximum number of clauses that can be present in a
362           Lucene search query, after expansion. You can raise this limit if
363           you are experiencing a Too Many Clauses error during searches.
364
365           Default: 50000 clauses
366

EXPLANATION OF THE INDEX FILTERS SETTINGS PARAMETERS

368       index_block_users
369           A list of usernames which should not be indexed by the indexing
370           service. Fields in the list are seperated by a SPACE.
371
372           Default: empty
373
374       index_block_companies
375           A list of tentants which should not be indexed by the indexing
376           service. All users and the public store belonging to these tentants
377           will not be indexed. Fields in the list are seperated by a SPACE.
378
379           Default: empty
380
381       index_allow_servers
382           A list of server names which should be indexed by the indexing
383           service. All stores located on these servers will be indexed, all
384           other servers will be ignored. If kept empty, no filtering is
385           applied and all detected servers are indexed. Fields in the list
386           are seperated by a SPACE.
387
388           Default: empty
389

RELOADING

391       The following options are reloadable by sending the zarafa-indexer
392       process a HUP signal:
393
394       log_level
395
396       index_threads
397
398       index_max_field_length
399
400       index_merge_factor
401
402       index_max_buffered_docs
403
404       index_min_merge_docs
405
406       index_max_merge_docs
407
408       index_term_interval
409
410       index_attachments
411
412       index_attachment_max_size
413
414       index_attachment_parser
415
416       index_attachment_parser_max_memory
417
418       index_attachment_parser_max_cputime
419
420       index_attachment_mime_filter
421
422       index_attachment_extension_filter
423

AUTHOR

425       Written by Zarafa.
426