1GE_CONF(5) Grid Engine File Formats GE_CONF(5)
2
3
4
6 ge_conf - Grid Engine configuration files
7
9 ge_conf defines the global and local Grid Engine configurations and can
10 be shown/modified by qconf(1) using the -sconf/-mconf options. Only
11 root or the cluster administrator may modify ge_conf.
12
13 At its initial start-up, ge_qmaster(8) checks to see if a valid Grid
14 Engine configuration is available at a well known location in the Grid
15 Engine internal directory hierarchy. If so, it loads that configura‐
16 tion information and proceeds. If not, ge_qmaster(8) writes a generic
17 configuration containing default values to that same location. The
18 Grid Engine execution daemons ge_execd(8) upon start-up retrieve their
19 configuration from ge_qmaster(8).
20
21 The actual configuration for both ge_qmaster(8) and ge_execd(8) is a
22 superposition of a global configuration and a local configuration per‐
23 tinent for the host on which a master or execution daemon resides. If
24 a local configuration is available, its entries overwrite the corre‐
25 sponding entries of the global configuration. Note: The local configu‐
26 ration does not have to contain all valid configuration entries, but
27 only those which need to be modified against the global entries.
28
29 Note: Grid Engine allows backslashes (\) be used to escape newline
30 (\newline) characters. The backslash and the newline are replaced with
31 a space (" ") character before any interpretation.
32
34 The paragraphs that follow provide brief descriptions of the individual
35 parameters that compose the global and local configurations for a Grid
36 Engine cluster:
37
38 execd_spool_dir
39 The execution daemon spool directory path. Again, a feasible spool
40 directory requires read/write access permission for root. The entry in
41 the global configuration for this parameter can be overwritten by exe‐
42 cution host local configurations, i.e. each ge_execd(8) may have a pri‐
43 vate spool directory with a different path, in which case it needs to
44 provide read/write permission for the root account of the corresponding
45 execution host only.
46
47 Under execd_spool_dir a directory named corresponding to the unquali‐
48 fied hostname of the execution host is opened and contains all informa‐
49 tion spooled to disk. Thus, it is possible for the execd_spool_dirs of
50 all execution hosts to physically reference the same directory path
51 (the root access restrictions mentioned above need to be met, however).
52
53 Changing the global execd_spool_dir parameter set at installation time
54 is not supported in a running system. If the change should still be
55 done it is required to restart all affected execution daemons. Please
56 make sure running jobs have finished before doing so, otherwise running
57 jobs will be lost.
58
59
60 The default location for the execution daemon spool directory is
61 $GE_ROOT/$GE_CELL/spool.
62
63 The global configuration entry for this value may be overwritten by the
64 execution host local configuration.
65
66 mailer
67 mailer is the absolute pathname to the electronic mail delivery agent
68 on your system. It must accept the following syntax:
69
70 mailer -s <subject-of-mail-message> <recipient>
71
72 Each ge_execd(8) may use a private mail agent. Changing mailer will
73 take immediate effect.
74
75 The default for mailer depends on the operating system of the host on
76 which the Grid Engine master installation was run. Common values are
77 /bin/mail or /usr/bin/Mail.
78
79 The global configuration entry for this value may be overwritten by the
80 execution host local configuration.
81
82 xterm
83 xterm is the absolute pathname to the X Window System terminal emula‐
84 tor, xterm(1).
85
86 Each ge_execd(8) may use a private mail agent. Changing xterm will take
87 immediate effect.
88
89 The default for xterm is /usr/bin/xterm.
90
91 The global configuration entry for this value may be overwritten by the
92 execution host local configuration.
93
94 load_sensor
95 A comma separated list of executable shell script paths or programs to
96 be started by ge_execd(8) and to be used in order to retrieve site con‐
97 figurable load information (e.g. free space on a certain disk parti‐
98 tion).
99
100 Each ge_execd(8) may use a set of private load_sensor programs or
101 scripts. Changing load_sensor will take effect after two load report
102 intervals (see load_report_time). A load sensor will be restarted auto‐
103 matically if the file modification time of the load sensor executable
104 changes.
105
106 The global configuration entry for this value may be overwritten by the
107 execution host local configuration.
108
109 In addition to the load sensors configured via load_sensor, ge_exec(8)
110 searches for an executable file named qloadsensor in the execution
111 host's Grid Engine binary directory path. If such a file is found, it
112 is treated like the configurable load sensors defined in load_sensor.
113 This facility is intended for pre-installing a default load sensor.
114
115 prolog
116 The executable path of a shell script that is started before execution
117 of Grid Engine jobs with the same environment setting as that for the
118 Grid Engine jobs to be started afterwards. An optional prefix "user@"
119 specifies the user under which this procedure is to be started. The
120 procedures standard output and the error output stream are written to
121 the same file used also for the standard output and error output of
122 each job. This procedure is intended as a means for the Grid Engine
123 administrator to automate the execution of general site specific tasks
124 like the preparation of temporary file systems with the need for the
125 same context information as the job. Each ge_execd(8) may use a pri‐
126 vate prolog script. Correspondingly, the execution host local configu‐
127 rations is can be overwritten by the queue configuration (see
128 queue_conf(5) ). Changing prolog will take immediate effect.
129
130 The default for prolog is the special value NONE, which prevents from
131 execution of a prolog script.
132
133 The following special variables expanded at runtime can be used
134 (besides any other strings which have to be interpreted by the proce‐
135 dure) to constitute a command line:
136
137 $host The name of the host on which the prolog or epilog procedures
138 are started.
139
140 $job_owner
141 The user name of the job owner.
142
143 $job_id
144 Grid Engine's unique job identification number.
145
146 $job_name
147 The name of the job.
148
149 $processors
150 The processors string as contained in the queue configuration
151 (see queue_conf(5)) of the master queue (the queue in which the
152 prolog and epilog procedures are started).
153
154 $queue The cluster queue name of the master queue instance, i.e. the
155 cluster queue in which the prolog and epilog procedures are
156 started.
157
158 $stdin_path
159 The pathname of the stdin file. This is always /dev/null for
160 prolog, pe_start, pe_stop and epilog. It is the pathname of the
161 stdin file for the job in the job script. When delegated file
162 staging is enabled, this path is set to $fs_stdin_tmp_path. When
163 delegated file staging is not enabled, it is the stdin pathname
164 given via DRMAA or qsub.
165
166 $stdout_path
167
168 $stderr_path
169 The pathname of the stdout/stderr file. This always points to
170 the output/error file. When delegated file staging is enabled,
171 this path is set to $fs_stdout_tmp_path/$fs_stderr_tmp_path.
172 When delegated file staging is not enabled, it is the std‐
173 out/stderr pathname given via DRMAA or qsub.
174
175 $merge_stderr
176 If merging of stderr and stdout is requested, this flag is "1",
177 otherwise it is "0". If this flag is 1, stdout and stderr are
178 merged in one file, the stdout file. Merging of stderr and std‐
179 out can be requested via the DRMAA job template attribute
180 'drmaa_join_files' (see drmaa_attributes(3) ) or the qsub param‐
181 eter '-j y' (see qsub(1) ).
182
183 $fs_stdin_host
184 When delegated file staging is requested for the stdin file,
185 this is the name of the host where the stdin file has to be
186 copied from before the job is started.
187
188 $fs_stdout_host
189
190 $fs_stderr_host
191 When delegated file staging is requested for the stdout/stderr
192 file, this is the name of the host where the stdout/stderr file
193 has to be copied to after the job has run.
194
195 $fs_stdin_path
196 When delegated file staging is requested for the stdin file,
197 this is the pathname of the stdin file on the host
198 $fs_stdin_host.
199
200 $fs_stdout_path
201
202 $fs_stderr_path
203 When delegated file staging is requested for the stdout/stderr
204 file, this is the pathname of the stdout/stderr file on the host
205 $fs_stdout_host/$fs_stderr_host.
206
207 $fs_stdin_tmp_path
208 When delegated file staging is requested for the stdin file,
209 this is the destination pathname of the stdin file on the execu‐
210 tion host. The prolog script must copy the stdin file from
211 $fs_stdin_host:$fs_stdin_path to localhost:$fs_stdin_tmp_path to
212 establish delegated file staging of the stdin file.
213
214 $fs_stdout_tmp_path
215
216 $fs_stderr_tmp_path
217 When delegated file staging is requested for the stdout/stderr
218 file, this is the source pathname of the stdout/stderr file on
219 the execution host. The epilog script must copy the stdout file
220 from localhost:$fs_stdout_tmp_path to $fs_stdout_host:$fs_std‐
221 out_path (the stderr file from localhost:$fs_stderr_tmp_path to
222 $fs_stderr_host:$fs_stderr_path) to establish delegated file
223 staging of the stdout/stderr file.
224
225 $fs_stdin_file_staging
226
227 $fs_stdout_file_staging
228
229 $fs_stderr_file_staging
230 When delegated file staging is requested for the stdin/std‐
231 out/stderr file, the flag is set to "1", otherwise it is set to
232 "0" (see in delegated_file_staging how to enable delegated file
233 staging).
234
235 These three flags correspond to the DRMAA job template attribute
236 'drmaa_transfer_files' (see drmaa_attributes(3) ).
237
238 The global configuration entry for this value may be overwritten by the
239 execution host local configuration.
240
241 Exit codes for the prolog attribute can be interpreted based on the
242 following exit values:
243 0: Success
244 99: Reschedule job
245 100: Put job in error state
246 Anything else: Put queue in error state
247
248 epilog
249 The executable path of a shell script that is started after execution
250 of Grid Engine jobs with the same environment setting as that for the
251 Grid Engine jobs that has just completed. An optional prefix "user@"
252 specifies the user under which this procedure is to be started. The
253 procedures standard output and the error output stream are written to
254 the same file used also for the standard output and error output of
255 each job. This procedure is intended as a means for the Grid Engine
256 administrator to automate the execution of general site specific tasks
257 like the cleaning up of temporary file systems with the need for the
258 same context information as the job. Each ge_execd(8) may use a pri‐
259 vate epilog script. Correspondingly, the execution host local configu‐
260 rations is can be overwritten by the queue configuration (see
261 queue_conf(5) ). Changing epilog will take immediate effect.
262
263 The default for epilog is the special value NONE, which prevents from
264 execution of a epilog script. The same special variables as for pro‐
265 log can be used to constitute a command line.
266
267 The global configuration entry for this value may be overwritten by the
268 execution host local configuration.
269
270 Exit codes for the epilog attribute can be interpreted based on the
271 following exit values:
272 0: Success
273 99: Reschedule job
274 100: Put job in error state
275 Anything else: Put queue in error state
276
277 shell_start_mode
278 Note: Deprecated, may be removed in future release.
279 This parameter defines the mechanisms which are used to actually invoke
280 the job scripts on the execution hosts. The following values are recog‐
281 nized:
282
283 unix_behavior
284 If a user starts a job shell script under UNIX interactively by
285 invoking it just with the script name the operating system's
286 executable loader uses the information provided in a comment
287 such as `#!/bin/csh' in the first line of the script to detect
288 which command interpreter to start to interpret the script. This
289 mechanism is used by Grid Engine when starting jobs if
290 unix_behavior is defined as shell_start_mode.
291
292 posix_compliant
293 POSIX does not consider first script line comments such a
294 `#!/bin/csh' as significant. The POSIX standard for batch queu‐
295 ing systems (P1003.2d) therefore requires a compliant queuing
296 system to ignore such lines but to use user specified or config‐
297 ured default command interpreters instead. Thus, if
298 shell_start_mode is set to posix_compliant Grid Engine will
299 either use the command interpreter indicated by the -S option of
300 the qsub(1) command or the shell parameter of the queue to be
301 used (see queue_conf(5) for details).
302
303 script_from_stdin
304 Setting the shell_start_mode parameter either to posix_compliant
305 or unix_behavior requires you to set the umask in use for
306 ge_execd(8) such that every user has read access to the
307 active_jobs directory in the spool directory of the correspond‐
308 ing execution daemon. In case you have prolog and epilog scripts
309 configured, they also need to be readable by any user who may
310 execute jobs.
311 If this violates your site's security policies you may want to
312 set shell_start_mode to script_from_stdin. This will force Grid
313 Engine to open the job script as well as the epilog and prolog
314 scripts for reading into STDIN as root (if ge_execd(8) was
315 started as root) before changing to the job owner's user
316 account. The script is then fed into the STDIN stream of the
317 command interpreter indicated by the -S option of the qsub(1)
318 command or the shell parameter of the queue to be used (see
319 queue_conf(5) for details).
320 Thus setting shell_start_mode to script_from_stdin also implies
321 posix_compliant behavior. Note, however, that feeding scripts
322 into the STDIN stream of a command interpreter may cause trouble
323 if commands like rsh(1) are invoked inside a job script as they
324 also process the STDIN stream of the command interpreter. These
325 problems can usually be resolved by redirecting the STDIN chan‐
326 nel of those commands to come from /dev/null (e.g. rsh host date
327 < /dev/null). Note also, that any command-line options associ‐
328 ated with the job are passed to the executing shell. The shell
329 will only forward them to the job if they are not recognized as
330 valid shell options.
331
332 Changes to shell_start_mode will take immediate effect. The default
333 for shell_start_mode is posix_compliant.
334
335 This value is a global configuration parameter only. It cannot be over‐
336 written by the execution host local configuration.
337
338 login_shells
339 UNIX command interpreters like the Bourne-Shell (see sh(1)) or the C-
340 Shell (see csh(1)) can be used by Grid Engine to start job scripts. The
341 command interpreters can either be started as login-shells (i.e. all
342 system and user default resource files like .login or .profile will be
343 executed when the command interpreter is started and the environment
344 for the job will be set up as if the user has just logged in) or just
345 for command execution (i.e. only shell specific resource files like
346 .cshrc will be executed and a minimal default environment is set up by
347 Grid Engine - see qsub(1)). The parameter login_shells contains a
348 comma separated list of the executable names of the command inter‐
349 preters to be started as login-shells. Shells in this list are only
350 started as login shells if the parameter shell_start_mode (see above)
351 is set to posix_compliant.
352
353 Changes to login_shells will take immediate effect. The default for
354 login_shells is sh,csh,tcsh,ksh.
355
356 This value is a global configuration parameter only. It cannot be over‐
357 written by the execution host local configuration.
358
359 min_uid
360 min_uid places a lower bound on user IDs that may use the cluster.
361 Users whose user ID (as returned by getpwnam(3)) is less than min_uid
362 will not be allowed to run jobs on the cluster.
363
364 Changes to min_uid will take immediate effect. The default for min_uid
365 is 0.
366
367 This value is a global configuration parameter only. It cannot be over‐
368 written by the execution host local configuration.
369
370 min_gid
371 This parameter sets the lower bound on group IDs that may use the clus‐
372 ter. Users whose default group ID (as returned by getpwnam(3)) is less
373 than min_gid will not be allowed to run jobs on the cluster.
374
375 Changes to min_gid will take immediate effect. The default for min_gid
376 is 0.
377
378 This value is a global configuration parameter only. It cannot be over‐
379 written by the execution host local configuration.
380
381 user_lists
382 The user_lists parameter contains a comma separated list of user access
383 lists as described in access_list(5). Each user contained in at least
384 one of the enlisted access lists has access to the cluster. If the
385 user_lists parameter is set to NONE (the default) any user has access
386 not explicitly excluded via the xuser_lists parameter described below.
387 If a user is contained both in an access list enlisted in xuser_lists
388 and user_lists the user is denied access to the cluster.
389
390 Changes to user_lists will take immediate effect
391
392 This value is a global configuration parameter only. It cannot be over‐
393 written by the execution host local configuration.
394
395 xuser_lists
396 The xuser_lists parameter contains a comma separated list of user
397 access lists as described in access_list(5). Each user contained in at
398 least one of the enlisted access lists is denied access to the cluster.
399 If the xuser_lists parameter is set to NONE (the default) any user has
400 access. If a user is contained both in an access list enlisted in
401 xuser_lists and user_lists (see above) the user is denied access to the
402 cluster.
403
404 Changes to xuser_lists will take immediate effect
405
406 This value is a global configuration parameter only. It cannot be over‐
407 written by the execution host local configuration.
408
409 administrator_mail
410 administrator_mail specifies a comma separated list of the electronic
411 mail address(es) of the cluster administrator(s) to whom internally-
412 generated problem reports are sent. The mail address format depends on
413 your electronic mail system and how it is configured; consult your sys‐
414 tem's configuration guide for more information.
415
416 Changing administrator_mail takes immediate effect. The default for
417 administrator_mail is an empty mail list.
418
419 This value is a global configuration parameter only. It cannot be over‐
420 written by the execution host local configuration.
421
422 projects
423 The projects list contains all projects which are granted access to
424 Grid Engine. User belonging to none of these projects cannot use Grid
425 Engine. If users belong to projects in the projects list and the xpro‐
426 jects list (see below), they also cannot use the system.
427
428 Changing projects takes immediate effect. The default for projects is
429 none.
430
431 This value is a global configuration parameter only. It cannot be over‐
432 written by the execution host local configuration.
433
434 xprojects
435 The xprojects list contains all projects that are denied access to Grid
436 Engine. User belonging to one of these projects cannot use Grid Engine.
437 If users belong to projects in the projects list (see above) and the
438 xprojects list, they also cannot use the system.
439
440 Changing xprojects takes immediate effect. The default for xprojects
441 is none.
442
443 This value is a global configuration parameter only. It cannot be over‐
444 written by the execution host local configuration.
445
446 load_report_time
447 System load is reported periodically by the execution daemons to
448 ge_qmaster(8). The parameter load_report_time defines the time inter‐
449 val between load reports.
450
451 Each ge_execd(8) may use a different load report time. Changing
452 load_report_time will take immediate effect.
453
454 Note: Be careful when modifying load_report_time. Reporting load too
455 frequently might block ge_qmaster(8) especially if the number of execu‐
456 tion hosts is large. Moreover, since the system load typically
457 increases and decreases smoothly, frequent load reports hardly offer
458 any benefit.
459
460 The default for load_report_time is 40 seconds.
461
462 The global configuration entry for this value may be overwritten by the
463 execution host local configuration.
464
465 reschedule_unknown
466 Determines whether jobs on hosts in unknown state are rescheduled and
467 thus sent to other hosts. Hosts are registered as unknown if ge_mas‐
468 ter(8) cannot establish contact to the ge_execd(8) on those hosts (see
469 max_unheard ). Likely reasons are a breakdown of the host or a break‐
470 down of the network connection in between, but also ge_execd(8) may not
471 be executing on such hosts.
472
473 In any case, Grid Engine can reschedule jobs running on such hosts to
474 another system. reschedule_unknown controls the time which Grid Engine
475 will wait before jobs are rescheduled after a host became unknown. The
476 time format specification is hh:mm:ss. If the special value 00:00:00 is
477 set, then jobs will not be rescheduled from this host.
478
479 Rescheduling is only initiated for jobs which have activated the rerun
480 flag (see the -r y option of qsub(1) and the rerun option of
481 queue_conf(5)). Parallel jobs are only rescheduled if the host on
482 which their master task executes is in unknown state. The behavior of
483 reschedule_unknown for parallel jobs and for jobs without the rerun
484 flag be set can be adjusted using the qmaster_params settings
485 ENABLE_RESCHEDULE_KILL and ENABLE_RESCHEDULE_SLAVE.
486
487 Checkpointing jobs will only be rescheduled when the when option of the
488 corresponding checkpointing environment contains an appropriate flag.
489 (see checkpoint(5)). Interactive jobs (see qsh(1), qrsh(1), qtcsh(1))
490 are not rescheduled.
491
492 The default for reschedule_unknown is 00:00:00
493
494 The global configuration entry for this value may be over written by
495 the execution host local configuration.
496
497 max_unheard
498 If ge_qmaster(8) could not contact or was not contacted by the execu‐
499 tion daemon of a host for max_unheard seconds, all queues residing on
500 that particular host are set to status unknown. ge_qmaster(8), at
501 least, should be contacted by the execution daemons in order to get the
502 load reports. Thus, max_unheard should by greater than the
503 load_report_time (see above).
504
505 Changing max_unheard takes immediate effect. The default for
506 max_unheard is 5 minutes.
507
508 This value is a global configuration parameter only. It cannot be over‐
509 written by the execution host local configuration.
510
511 loglevel
512 This parameter specifies the level of detail that Grid Engine compo‐
513 nents such as ge_qmaster(8) or ge_execd(8) use to produce informative,
514 warning or error messages which are logged to the messages files in the
515 master and execution daemon spool directories (see the description of
516 the execd_spool_dir parameter above). The following message levels are
517 available:
518
519 log_err
520 All error events being recognized are logged.
521
522 log_warning
523 All error events being recognized and all detected signs of
524 potentially erroneous behavior are logged.
525
526 log_info
527 All error events being recognized, all detected signs of poten‐
528 tially erroneous behavior and a variety of informative messages
529 are logged.
530
531 Changing loglevel will take immediate effect.
532
533 The default for loglevel is log_warning.
534
535 This value is a global configuration parameter only. It cannot be over‐
536 written by the execution host local configuration.
537
538 max_aj_instances
539 This parameter defines the maximum amount of array task to be scheduled
540 to run simultaneously per array job. An instance of an array task will
541 be created within the master daemon when it gets a start order from the
542 scheduler. The instance will be destroyed when the array task finishes.
543 Thus the parameter provides control mainly over the memory consumption
544 of array jobs in the master and scheduler daemon. It is most useful for
545 very large clusters and very large array jobs. The default for this
546 parameter is 2000. The value 0 will deactivate this limit and will
547 allow the scheduler to start as many array job tasks as suitable
548 resources are available in the cluster.
549
550 Changing max_aj_instances will take immediate effect.
551
552 This value is a global configuration parameter only. It cannot be over‐
553 written by the execution host local configuration.
554
555 max_aj_tasks
556 This parameter defines the maximum number of array job tasks within an
557 array job. ge_qmaster(8) will reject all array job submissions which
558 request more than max_aj_tasks array job tasks. The default for this
559 parameter is 75000. The value 0 will deactivate this limit.
560
561 Changing max_aj_tasks will take immediate effect.
562
563 This value is a global configuration parameter only. It cannot be over‐
564 written by the execution host local configuration.
565
566 max_u_jobs
567 The number of active (not finished) jobs which each Grid Engine user
568 can have in the system simultaneously is controlled by this parameter.
569 A value greater than 0 defines the limit. The default value 0 means
570 "unlimited". If the max_u_jobs limit is exceeded by a job submission
571 then the submission command exits with exit status 25 and an appropri‐
572 ate error message.
573
574 Changing max_u_jobs will take immediate effect.
575
576 This value is a global configuration parameter only. It cannot be over‐
577 written by the execution host local configuration.
578
579 max_jobs
580 The number of active (not finished) jobs simultaneously allowed in Grid
581 Engine is controlled by this parameter. A value greater than 0 defines
582 the limit. The default value 0 means "unlimited". If the max_jobs
583 limit is exceeded by a job submission then the submission command exits
584 with exit status 25 and an appropriate error message.
585
586 Changing max_jobs will take immediate effect.
587
588 This value is a global configuration parameter only. It cannot be over‐
589 written by the execution host local configuration.
590
591 max_advance_reservations
592 The number of active (not finished) Advance Reservations simultaneously
593 allowed in Grid Engine is controlled by this parameter. A value greater
594 than 0 defines the limit. The default value 0 means "unlimited". If the
595 max_advance_reservations limit is exceeded by an Advance Reservation
596 request then the submission command exits with exit status 25 and an
597 appropriate error message.
598
599 Changing max_advance_reservations will take immediate effect.
600
601 This value is a global configuration parameter only. It cannot be over‐
602 written by the execution host local configuration.
603
604 enforce_project
605 If set to true, users are required to request a project whenever sub‐
606 mitting a job. See the -P option to [22mqsub(1) for details.
607
608 Changing enforce_project will take immediate effect. The default for
609 enforce_project is false.
610
611 This value is a global configuration parameter only. It cannot be over‐
612 written by the execution host local configuration.
613
614 enforce_user
615 If set to true, a [22muser(5) must exist to allow for job submission. Jobs
616 are rejected if no corresponding user exists.
617
618 If set to auto, a [22muser(5) object for the submitting user will automati‐
619 cally be created during job submission, if one does not already exist.
620 The auto_user_oticket, auto_user_fshare, auto_user_default_project, and
621 auto_user_delete_time configuration parameters will be used as default
622 attributes of the new user(5) object.
623
624 Changing enforce_user will take immediate effect. The default for
625 enforce_user is auto.
626
627 This value is a global configuration parameter only. It cannot be over‐
628 written by the execution host local configuration.
629
630 auto_user_oticket
631 The number of override tickets to assign to automatically created
632 user(5) objects. User objects are created automatically if the
633 enforce_user attribute is set to auto.
634
635 Changing auto_user_oticket will affect any newly created user objects,
636 but will not change user objects created in the past.
637
638 This value is a global configuration parameter only. It cannot be over‐
639 written by the execution host local configuration.
640
641 auto_user_fshare
642 The number of functional shares to assign to automatically created
643 user(5) objects. User objects are created automatically if the
644 enforce_user attribute is set to auto.
645
646 Changing auto_user_fshare will affect any newly created user objects,
647 but will not change user objects created in the past.
648
649 This value is a global configuration parameter only. It cannot be over‐
650 written by the execution host local configuration.
651
652 auto_user_default_project
653 The default project to assign to automatically created user(5) objects.
654 User objects are created automatically if the enforce_user attribute is
655 set to auto.
656
657 Changing auto_user_default_project will affect any newly created user
658 objects, but will not change user objects created in the past.
659
660 This value is a global configuration parameter only. It cannot be over‐
661 written by the execution host local configuration.
662
663 auto_user_delete_time
664 The number of seconds of inactivity after which automatically created
665 user(5) objects will be deleted. User objects are created automatically
666 if the enforce_user attribute is set to auto. If the user has no active
667 or pending jobs for the specified amount of time, the object will auto‐
668 matically be deleted. A value of 0 can be used to indicate that the
669 automatically created user object is permanent and should not be auto‐
670 matically deleted.
671
672 Changing auto_user_delete_time will affect the deletion time for all
673 users with active jobs.
674
675 This value is a global configuration parameter only. It cannot be over‐
676 written by the execution host local configuration.
677
678 set_token_cmd
679 Note: Deprecated, may be removed in future release.
680 This parameter is only present if your Grid Engine system is licensed
681 to support AFS.
682
683 Set_token_cmd points to a command which sets and extends AFS tokens for
684 Grid Engine jobs. In the standard Grid Engine AFS distribution, it is
685 supplied as a script which expects two command line parameters. It
686 reads the token from STDIN, extends the token's expiration time and
687 sets the token:
688
689 <set_token_cmd> <user> <token_extend_after_seconds>
690
691 As a shell script this command will call the programs:
692
693 - SetToken
694 - forge
695
696 which are provided by your distributor as source code. The script looks
697 as follows:
698
699 --------------------------------
700 #!/bin/sh
701 # set_token_cmd
702 forge -u $1 -t $2 | SetToken
703 --------------------------------
704
705 Since it is necessary for forge to read the secret AFS server key, a
706 site might wish to replace the set_token_cmd script by a command, which
707 connects to a custom daemon at the AFS server. The token must be forged
708 at the AFS server and returned to the local machine, where SetToken is
709 executed.
710
711 Changing set_token_cmd will take immediate effect. The default for
712 set_token_cmd is none.
713
714 The global configuration entry for this value may be overwritten by the
715 execution host local configuration.
716
717 pag_cmd
718 Note: Deprecated, may be removed in future release.
719 This parameter is only present if your Grid Engine system is licensed
720 to support AFS.
721
722 The path to your pagsh is specified via this parameter. The ge_shep‐
723 herd(8) process and the job run in a pagsh. Please ask your AFS admin‐
724 istrator for details.
725
726 Changing pag_cmd will take immediate effect. The default for pag_cmd
727 is none.
728
729 The global configuration entry for this value may be overwritten by the
730 execution host local configuration.
731
732 token_extend_time
733 Note: Deprecated, may be removed in future release.
734 This parameter is only present if your Grid Engine system is licensed
735 to support AFS.
736
737 The token_extend_time is the time period for which AFS tokens are peri‐
738 odically extended. Grid Engine will call the token extension 30 minutes
739 before the tokens expire until jobs have finished and the corresponding
740 tokens are no longer required.
741
742 Changing token_extend_time will take immediate effect. The default for
743 token_extend_time is 24:0:0, i.e. 24 hours.
744
745 The global configuration entry for this value may be overwritten by the
746 execution host local configuration.
747
748 shepherd_cmd
749 Alternative path to the shepherd_cmd binary. Typically used to call the
750 shepherd binary by a wrapper script or command.
751
752 Changing shepherd_cmd will take immediate effect. The default for shep‐
753 herd_cmd is none.
754
755 The global configuration entry for this value may be overwritten by the
756 execution host local configuration.
757
758 gid_range
759 The gid_range is a comma separated list of range expressions of the
760 form n-m (n as well as m are integer numbers greater than 99), where m
761 is an abbreviation for m-m. These numbers are used in ge_execd(8) to
762 identify processes belonging to the same job.
763
764 Each ge_execd(8) may use a separate set up group ids for this purpose.
765 All number in the group id range have to be unused supplementary group
766 ids on the system, where the ge_execd(8) is started.
767
768 Changing gid_range will take immediate effect. There is no default for
769 gid_range. The administrator will have to assign a value for gid_range
770 during installation of Grid Engine.
771
772 The global configuration entry for this value may be overwritten by the
773 execution host local configuration.
774
775 qmaster_params
776 A list of additional parameters can be passed to the Grid Engine qmas‐
777 ter. The following values are recognized:
778
779 ENABLE_ENFORCE_MASTER_LIMIT
780 If this parameter is set then the s_rt, h_rt limit of a running
781 job are tested and executed by the ge_qmaster(8) when the
782 ge_execd(8) where the job is in unknown state.
783
784 After s_rt or h_rt limit of a job is expired then the master
785 daemon will wait additional time defined by DURATION_OFFSET (see
786 sched_conf(5)). If the execution daemon still cannot be con‐
787 tacted when this additional time is elapsed, then the master
788 daemon will force the deletion of the job (see -f of qdel(1)).
789
790 For jobs which will be deleted that way an accounting record
791 will be created. As usage the record will contain the last
792 reported online usage, when the execution daemon could contact
793 qmaster. The failed state in the record will be set to 37 to
794 indicate that the job was terminated by a limit enforcement of
795 master daemon.
796
797 After the restart of ge_qmaster(8) the limit enforcement will at
798 first be triggered after the double of the biggest
799 load_report_interval interval defined in sge_conf(5) has been
800 elapsed. This will give the execution daemons enough time to
801 reregister at master daemon.
802
803 ENABLE_FORCED_QDEL_IF_UNKNOWN
804 If this parameter is set then a deletion request for a job is
805 automatically interpreted as a forced deletion request (see -f
806 of qdel(1)) if the host, where the job is running is in unknown
807 state.
808
809 ENABLE_FORCED_QDEL
810 If this parameter is set, non-administrative users can force
811 deletion of their own jobs via the -f option of qdel(1). With‐
812 out this parameter, forced deletion of jobs is only allowed by
813 the Grid Engine manager or operator.
814
815 Note: Forced deletion for jobs is executed differently depending
816 on whether users are Grid Engine administrators or not. In case
817 of administrative users, the jobs are removed from the internal
818 database of Grid Engine immediately. For regular users, the
819 equivalent of a normal qdel(1) is executed first, and deletion
820 is forced only if the normal cancellation was unsuccessful.
821
822 FORBID_RESCHEDULE
823 If this parameter is set, re-queuing of jobs cannot be initiated
824 by the job script which is under control of the user. Without
825 this parameter jobs returning the value 99 are rescheduled. This
826 can be used to cause the job to be restarted at a different
827 machine, for instance if there are not enough resources on the
828 current one.
829
830 FORBID_APPERROR
831 If this parameter is set, the application cannot set itself to
832 error state. Without this parameter jobs returning the value
833 100 are set to error state (and therefore can be manually
834 rescheduled by clearing the error state). This can be used to
835 set the job to error state when a starting condition of the
836 application is not fulfilled before the application itself has
837 been started, or when a clean up procedure (e.g. in the epilog)
838 decides that it is necessary to run the job again, by returning
839 100 in the prolog, pe_start, job script, pe_stop or epilog
840 script.
841
842 DISABLE_AUTO_RESCHEDULING
843 Note: Deprecated, may be removed in future release.
844 If set to "true" or "1", the reschedule_unknown parameter is not
845 taken into account.
846
847 ENABLE_RESCHEDULE_KILL
848 If set to "true" or "1", the reschedule_unknown parameter
849 affects also jobs which have the rerun flag not activated (see
850 the -r y option of qsub(1) and the rerun option of
851 queue_conf(5)), but they are just finished as they can't be
852 rescheduled.
853
854 ENABLE_RESCHEDULE_SLAVE
855 If set to "true" or "1" Grid Engine triggers job rescheduling
856 also when the host where the slave tasks of a parallel job exe‐
857 cutes is in unknown state, if the reschedule_unknown parameter
858 is activated.
859
860 MAX_DYN_EC
861 Sets the max number of dynamic event clients (as used by qsub
862 -sync y and by Grid Engine DRMAA API library sessions). The
863 default is set to 99. The number of dynamic event clients
864 should not be bigger than half of the number of file descriptors
865 the system has. The number of file descriptors are shared among
866 the connections to all exec hosts, all event clients, and file
867 handles that the qmaster needs.
868
869 MONITOR_TIME
870 Specifies the time interval when the monitoring information
871 should be printed. The monitoring is disabled by default and can
872 be enabled by specifying an interval. The monitoring is per
873 thread and is written to the messages file or displayed by the
874 "qping -f" command line tool. Example: MONITOR_TIME=0:0:10 gen‐
875 erates and prints the monitoring information approximately every
876 10 seconds. The specified time is a guideline only and not a
877 fixed interval. The interval that is actually used is printed.
878 In this example, the interval could be anything between 9 sec‐
879 onds and 20 seconds.
880
881 LOG_MONITOR_MESSAGE
882 Monitoring information is logged into the messages files by
883 default. This information can be accessed via by qping(1). If
884 monitoring is always enabled, the messages files can become
885 quite large. This switch disables logging into the messages
886 files, making qping -f the only source of monitoring data.
887
888 PROF_SIGNAL
889 Profiling provides the user with the possibility to get system
890 measurements. This can be useful for debugging or optimization
891 of the system. The profiling output will be done within the mes‐
892 sages file.
893
894 Enables the profiling for qmaster signal thread. (e.g.
895 PROF_SIGNAL=true)
896
897 PROF_WORKER
898 Enables the profiling for qmaster worker threads. (e.g.
899 PROF_WORKER=true)
900
901 PROF_LISTENER
902 Enables the profiling for qmaster listener threads. (e.g.
903 PROF_LISTENER=true)
904
905 PROF_DELIVER
906 Enables the profiling for qmaster event deliver thread. (e.g.
907 PROF_DELIVER=true)
908
909 PROF_TEVENT
910 Enables the profiling for qmaster timed event thread. (e.g.
911 PROF_TEVENT=true)
912
913 Please note, that the cpu utime and stime values contained in the pro‐
914 filing output are not per thread cpu times. These cpu usage statistics
915 are per process statistics. So the printed profiling values for cpu
916 mean "cpu time consumed by sge_qmaster (all threads) while the reported
917 profiling level was active".
918
919 STREE_SPOOL_INTERVAL
920 Sets the time interval for spooling the sharetree usage. The
921 default is set to 00:04:00. The setting accepts colon-separated
922 string or seconds. There is no setting to turn the sharetree
923 spooling off. (e.g. STREE_SPOOL_INTERVAL=00:02:00)
924
925 MAX_JOB_DELETION_TIME
926 Sets the value of how long the qmaster will spend deleting jobs.
927 After this time, the qmaster will continue with other tasks and
928 schedule the deletion of remaining jobs at a later time. The
929 default value is 3 seconds, and will be used if no value is
930 entered. The range of valid values is > 0 and <= 5. (e.g.
931 MAX_JOB_DELETION_TIME=1)
932
933 gdi_timeout
934 Sets how long the communication will wait for gdi send/receive
935 operations. The default value is set to 60 seconds. After this
936 time, the communication library will retry, if "gdi_retries" is
937 configured, receiving the gdi request. In case of not configured
938 "gdi_retries" the communication will return with a "gdi receive
939 failure" (e.g. gdi_timeout=120 will set the timeout time to 120
940 sec) Configuring no gdi_timeout value, the value defaults to 60
941 sec.
942
943 gdi_retries
944 Sets how often the gdi receive call will be repeated until the
945 gdi receive error appears. The default is set to 0. In this case
946 the call will be done 1 time with no retry. Setting the value
947 to -1 the call will be done permanently. In combination with
948 gdi_timeout parameter it is possible to configure a system with
949 eg. slow NFS, to make sure that all jobs will be submitted.
950 (e.g. gdi_retries=4)
951
952 cl_ping
953 Turns on/off a communication library ping. This parameter will
954 create additional debug output. This output shows information
955 about the error messages which are returned by communication and
956 it will give information about the application status of the
957 qmaster. eg, if it's unclear what's the reason for gdi timeouts,
958 this may show you some useful messages. The default value is
959 false (off) (e.g cl_ping=false)
960
961 SCHEDULER_TIMEOUT
962 Setting this parameter allows the scheduler GDI event acknowl‐
963 edge timeout to be manually configured to a specific value. Cur‐
964 rently the default value is 10 minutes with the default sched‐
965 uler configuration and limited between 600 and 1200 seconds.
966 Value is limited only in case of default value. The default
967 value depends on the current scheduler configuration. The SCHED‐
968 ULER_TIMEOUT value is specified in seconds.
969
970 jsv_timeout
971 This parameter measures the response time of the server JSV. In
972 the event that the response time of the JSV is longer than the
973 timeout value specified, this will cause the JSV to be re-
974 started. The default value for the timeout is 10 seconds and if
975 modified, must be greater than 0. If the timeout has been reach,
976 the JSV will only try to re-start once, if the timeout is
977 reached again an error will occur.
978
979 jsv_threshold
980 The threshold of a JSV is measured as the time it takes to per‐
981 form a server job verification. If this value is greater than
982 the user defined value, it will cause logging to appear in the
983 qmaster messages file at the INFO level. By setting this value
984 to 0, all jobs will be logged in the qmaster messages file. This
985 value is specified in milliseconds and has a default value of
986 5000.
987
988 Changing qmaster_params will take immediate effect, except gdi_timeout,
989 gdi_retries, cl_ping, these will take effect only for new connections.
990 The default for qmaster_params is none.
991
992 This value is a global configuration parameter only. It cannot be over‐
993 written by the execution host local configuration.
994
995 execd_params
996 This is used for passing additional parameters to the Grid Engine exe‐
997 cution daemon. The following values are recognized:
998
999 ACCT_RESERVED_USAGE
1000 If this parameter is set to true, the usage of reserved
1001 resources is used for the accounting entries cpu, mem and io
1002 instead of the measured usage.
1003
1004 ENABLE_WINDOMACC
1005 If this parameter is set to true, Windows Domain accounts (Win‐
1006 DomAcc) are used on Windows hosts. These accounts require the
1007 use of sgepasswd(1) (see also sgepasswd(5)). If this parameter
1008 is set to false or is not set, local Windows accounts are used.
1009 On non-Windows hosts, this parameter is ignored.
1010
1011 KEEP_ACTIVE
1012 This value should only be set for debugging purposes. If set to
1013 true, the execution daemon will not remove the spool directory
1014 maintained by ge_shepherd(8) for a job.
1015
1016 PTF_MIN_PRIORITY, PTF_MAX_PRIORITY
1017 The maximum/minimum priority which Grid Engine will assign to a
1018 job. Typically this is a negative/positive value in the range
1019 of -20 (maximum) to 19 (minimum) for systems which allow setting
1020 of priorities with the nice(2) system call. Other systems may
1021 provide different ranges.
1022 The default priority range (varies from system to system) is
1023 installed either by removing the parameters or by setting a
1024 value of -999.
1025 See the "messages" file of the execution daemon for the prede‐
1026 fined default value on your hosts. The values are logged during
1027 the startup of the execution daemon.
1028
1029 PROF_EXECD
1030 Enables the profiling for the execution daemon. (e.g.
1031 PROF_EXECD=true)
1032
1033 NOTIFY_KILL
1034 The parameter allows you to change the notification signal for
1035 the signal SIGKILL (see -notify option of qsub(1)). The parame‐
1036 ter either accepts signal names (use the -l option of kill(1))
1037 or the special value none. If set to none, no notification sig‐
1038 nal will be sent. If it is set to TERM, for instance, or another
1039 signal name then this signal will be sent as notification sig‐
1040 nal.
1041
1042 NOTIFY_SUSP
1043 With this parameter it is possible to modify the notification
1044 signal for the signal SIGSTOP (see -notify parameter of
1045 qsub(1)). The parameter either accepts signal names (use the -l
1046 option of kill(1)) or the special value none. If set to none, no
1047 notification signal will be sent. If it is set to TSTP, for
1048 instance, or another signal name then this signal will be sent
1049 as notification signal.
1050
1051 SHARETREE_RESERVED_USAGE
1052 Note: Deprecated, may be removed in future release.
1053 If this parameter is set to true, the usage of reserved
1054 resources is taken for the Grid Engine share tree consumption
1055 instead of measured usage.
1056 Note: When running tightly integrated jobs with SHARE‐
1057 TREE_RESERVED_USAGE set, and with having accounting_summary
1058 enabled in the parallel environment, reserved usage will only be
1059 reported by the master task of the parallel job. No per paral‐
1060 lel task usage records will be sent from execd to qmaster, which
1061 can significantly reduce load on qmaster when running large
1062 tightly integrated parallel jobs.
1063
1064 USE_QSUB_GID
1065 If this parameter is set to true, the primary group id active
1066 when a job was submitted will be set to become the primary group
1067 id for job execution. If the parameter is not set, the primary
1068 group id as defined for the job owner in the execution host
1069 passwd(5) file is used.
1070 The feature is only available for jobs submitted via qsub(1),
1071 qrsh(1), qmake(1) and qtcsh(1). Also, it only works for qrsh(1)
1072 jobs (and thus also for qtcsh(1) and qmake(1)) if rsh and rshd
1073 components are used which are provided with Grid Engine (i.e.,
1074 the rsh_daemon and rsh_command parameters may not be changed
1075 from the default).
1076
1077 S_DESCRIPTORS, H_DESCRIPTORS, S_MAXPROC, H_MAXPROC,
1078
1079 S_MEMORYLOCKED, H_MEMORYLOCKED, S_LOCKS, H_LOCKS
1080 Specifies soft and hard resource limits as implemented by the
1081 setrlimit(2) system call. See this manual page on your system
1082 for more information. These parameters complete the list of lim‐
1083 its set by the RESOURCE LIMITS parameter of the queue configura‐
1084 tion as described in queue_conf(5). Unlike the resource limits
1085 in the queue configuration, these resource limits are set for
1086 every job on this execution host. If a value is not specified,
1087 the resource limit is inherited from the execution daemon
1088 process. Because this would lead to unpredicted results, if only
1089 one limit of a resource is set (soft or hard), the corresponding
1090 other limit is set to the same value.
1091 S_DESCRIPTORS and H_DESCRIPTORS specify a value one greater than
1092 the maximum file descriptor number that can be opened by any
1093 process of a job.
1094 S_MAXPROC and H_MAXPROC specify the maximum number of processes
1095 that can be created by the job user on this execution host
1096 S_MEMORYLOCKED and H_MEMORYLOCKED specify the maximum number of
1097 bytes of virtual memory that may be locked into RAM.
1098 S_LOCKS and H_LOCKS specify the maximum number of file locks any
1099 process of a job may establish.
1100 All of these values can be specified using the multiplier let‐
1101 ters k, K, m, M, g and G, see sge_types(1) for details.
1102
1103 INHERIT_ENV
1104 This parameter indicates whether the shepherd should allow the
1105 environment inherited by the execution daemon from the shell
1106 that started it to be inherited by the job it's starting. When
1107 true, any environment variable that is set in the shell which
1108 starts the execution daemon at the time the execution daemon is
1109 started will be set in the environment of any jobs run by that
1110 execution daemon, unless the environment variable is explicitly
1111 overridden, such as PATH or LOGNAME. If set to false, each job
1112 starts with only the environment variables that are explicitly
1113 passed on by the execution daemon, such as PATH and LOGNAME.
1114 The default value is true.
1115
1116 SET_LIB_PATH
1117 This parameter tells the execution daemon whether to add the
1118 Grid Engine shared library directory to the library path of exe‐
1119 cuted jobs. If set to true, and INHERIT_ENV is also set to
1120 true, the Grid Engine shared library directory will be prepended
1121 to the library path which is inherited from the shell which
1122 started the execution daemon. If INHERIT_ENV is set to false,
1123 the library path will contain only the Grid Engine shared
1124 library directory. If set to false, and INHERIT_ENV is set to
1125 true, the library path exported to the job will be the one
1126 inherited from the shell which started the execution daemon. If
1127 INHERIT_ENV is also set to false, the library path will be
1128 empty. After the execution daemon has set the library path, it
1129 may be further altered by the shell in which the job is exe‐
1130 cuted, or by the job script itself. The default value for
1131 SET_LIB_PATH is false.
1132
1133 ENABLE_ADDGRP_KILL
1134 If this parameter is set then Grid Engine uses the supplementary
1135 group ids (see gid_range) to identify all processes which are to
1136 be terminated when a job is deleted, or when ge_shepherd(8)
1137 cleans up after job termination.
1138
1139 PDC_INTERVAL
1140 This parameter defines the interval how often the PDC (Portable
1141 Data Collector) is executed by the execution daemon. The PDC is
1142 responsible for enforcing the resource limits s_cpu, h_cpu,
1143 s_vmem and h_vmem (see queue_conf(5)) and job usage collection.
1144 The parameter can be set to a time_specifier (see sge_types(5))
1145 , to PER_LOAD_REPORT or to NEVER.
1146
1147 ENABLE_BINDING
1148 If this parameter is set then Grid Engine enables the core bind‐
1149 ing module within the execution daemon to apply binding parame‐
1150 ters that are specified during submission time of a job. This
1151 parameter is not set per default and therefore all binding
1152 related information will be ignored. Find more information for
1153 job to core binding in the section -binding of qsub(1).
1154
1155 If this parameter is set to PER_LOAD_REPORT the PDC is triggered in the
1156 same interval as load_report_time (see above). If this parameter is set
1157 to NEVER the PDC run is never triggered. The default is 1 second.
1158 Note: A PDC run is quite compute extensive may degrade the performance
1159 of the running jobs. But if the PDC runs less often or never the online
1160 usage can be incomplete or totally missing (for example online usage of
1161 very short running jobs might be missing) and the resource limit
1162 enforcement is less accurate or would not happen if PDC is turned of
1163 completely.
1164
1165 Changing execd_params will take effect after it was propagated to the
1166 execution daemons. The propagation is done in one load report interval.
1167 The default for execd_params is none.
1168
1169 The global configuration entry for this value may be overwritten by the
1170 execution host local configuration.
1171
1172 reporting_params
1173 Used to define the behavior of reporting modules in the Grid Engine
1174 qmaster. Changes to the reporting_params takes immediate effect. The
1175 following values are recognized:
1176
1177 accounting
1178 If this parameter is set to true, the accounting file is writ‐
1179 ten. The accounting file is prerequisite for using the qacct
1180 command.
1181
1182 reporting
1183 If this parameter is set to true, the reporting file is written.
1184 The reporting file contains data that can be used for monitoring
1185 and analysis, like job accounting, job log, host load and con‐
1186 sumables, queue status and consumables and sharetree configura‐
1187 tion and usage. Attention: Depending on the size and load of
1188 the cluster, the reporting file can become quite large. Only
1189 activate the reporting file if you have a process running that
1190 will consume the reporting file! See reporting(5) for further
1191 information about format and contents of the reporting file.
1192
1193 flush_time
1194 Contents of the reporting file are buffered in the Grid Engine
1195 qmaster and flushed at a fixed interval. This interval can be
1196 configured with the flush_time parameter. It is specified as a
1197 time value in the format HH:MM:SS. Sensible values range from a
1198 few seconds to one minute. Setting it too low may slow down the
1199 qmaster. Setting it too high will make the qmaster consume large
1200 amounts of memory for buffering data.
1201
1202 accounting_flush_time
1203 Contents of the accounting file are buffered in the Grid Engine
1204 qmaster and flushed at a fixed interval. This interval can be
1205 configured with the accounting_flush_time parameter. It is
1206 specified as a time value in the format HH:MM:SS. Sensible val‐
1207 ues range from a few seconds to one minute. Setting it too low
1208 may slow down the qmaster. Setting it too high will make the
1209 qmaster consume large amounts of memory for buffering data.
1210 Setting it to 00:00:00 will disable accounting data buffering;
1211 as soon as data is generated, it will be written to the account‐
1212 ing file. If this parameter is not set, the accounting data
1213 flush interval will default to the value of the flush_time
1214 parameter.
1215
1216 joblog If this parameter is set to true, the reporting file will con‐
1217 tain job logging information. See reporting(5) for more informa‐
1218 tion about job logging.
1219
1220 sharelog
1221 The Grid Engine qmaster can dump information about sharetree
1222 configuration and use to the reporting file. The parameter
1223 sharelog sets an interval in which sharetree information will be
1224 dumped. It is set in the format HH:MM:SS. A value of 00:00:00
1225 configures qmaster not to dump sharetree information. Intervals
1226 of several minutes up to hours are sensible values for this
1227 parameter. See reporting(5) for further information about
1228 sharelog.
1229
1230 log_consumables
1231 This parameter controls writing of consumable resources to the
1232 reporting file. When set to (log_consumables=true) information
1233 about all consumable resources (their current usage and their
1234 capacity) will be written to the reporting file, whenever a con‐
1235 sumable resource changes either in definition, or in capacity,
1236 or when the usage of a consumable resource changes. When
1237 log_consumables is set to false (default), only those variables
1238 will be written to the reporting file, that are configured in
1239 the report_variables in the exec host configuration, see
1240 host_conf(5) for further information about report_variables.
1241
1242 finished_jobs
1243 Note: Deprecated, may be removed in future release.
1244 Grid Engine stores a certain number of just finished jobs to provide
1245 post mortem status information. The finished_jobs parameter defines the
1246 number of finished jobs stored. If this maximum number is reached, the
1247 eldest finished job will be discarded for every new job added to the
1248 finished job list.
1249
1250 Changing finished_jobs will take immediate effect. The default for
1251 finished_jobs is 100.
1252
1253 This value is a global configuration parameter only. It cannot be over‐
1254 written by the execution host local configuration.
1255
1256 qlogin_daemon
1257 This parameter specifies the mechanism that is to be started on the
1258 server side of a qlogin(1) request. Usually this is the builtin mecha‐
1259 nism. It's also possible to configure an external executable by speci‐
1260 fying the full qualified pathname, e.g. of the system's telnet daemon.
1261
1262 Changing qlogin_daemon will take immediate effect. The default value
1263 for qlogin_daemon is builtin.
1264
1265 The global configuration entry for this value may be overwritten by the
1266 execution host local configuration.
1267
1268 Examples for the two allowed kinds of attributes are:
1269 qlogin_daemon builtin
1270 or
1271 qlogin_daemon /usr/sbin/in.telnetd
1272
1273 qlogin_command
1274 This is the command to be executed on the client side of a qlogin(1)
1275 request. Usually this is the builtin qlogin mechanism. It's also pos‐
1276 sible to configure an external mechanism, usually the absolute pathname
1277 of the system's telnet client program. It is automatically started with
1278 the target host and port number as parameters.
1279
1280 Changing qlogin_command will take immediate effect. The default value
1281 for qlogin_command is builtin.
1282
1283 The global configuration entry for this value may be overwritten by the
1284 execution host local configuration.
1285
1286 Examples for the two allowed kinds of attributes are:
1287 qlogin_command builtin
1288 or
1289 qlogin_command /usr/bin/telnetd
1290
1291 rlogin_daemon
1292 This parameter specifies the mechanism that is to be started on the
1293 server side of a qrsh(1) request without a command argument to be exe‐
1294 cuted remotely. Usually this is the builtin mechanism. It's also pos‐
1295 sible to configure an external executable by specifying the absolute
1296 pathname, e.g. of the system's rlogin daemon.
1297
1298 Changing rlogin_daemon will take immediate effect. The default for
1299 rlogin_daemon is builtin.
1300
1301 The global configuration entry for this value may be overwritten by the
1302 execution host local configuration.
1303
1304 The allowed values are similar to the ones of the examples of qlo‐
1305 gin_daemon.
1306
1307 rlogin_command
1308 This is the mechanism to be executed on the client side of a qrsh(1)
1309 request without a command argument to be executed remotely. Usually
1310 this is the builtin mechanism. If no value is given, a specialized Grid
1311 Engine component is used. The command is automatically started with
1312 the target host and port number as parameters. The Grid Engine rlogin
1313 client has been extended to accept and use the port number argument.
1314 You can only use clients, such as ssh, which also understand this syn‐
1315 tax.
1316
1317 Changing rlogin_command will take immediate effect. The default value
1318 for rlogin_command is builtin.
1319
1320 The global configuration entry for this value may be overwritten by the
1321 execution host local configuration.
1322
1323 In addition to the examples of qlogin_command , this value is allowed:
1324 rsh_daemon none
1325
1326 rsh_daemon
1327 This parameter specifies the mechanism that is to be started on the
1328 server side of a qrsh(1) request with a command argument to be executed
1329 remotely. Usually this is the builtin mechanism. If no value is given,
1330 a specialized Grid Engine component is used.
1331
1332 Changing rsh_daemon will take immediate effect. The default value for
1333 rsh_daemon is builtin.
1334
1335 The global configuration entry for this value may be overwritten by the
1336 execution host local configuration.
1337
1338 In addition to the examples of qlogin_daemon , this value is allowed:
1339 rsh_daemon none
1340
1341 rsh_command
1342 This is the mechanism to be executed on the client side of a qrsh(1)
1343 request with a command argument to be executed remotely. Usually this
1344 is the builtin mechanism. If no value is given, a specialized Grid
1345 Engine component is used. The command is automatically started with the
1346 target host and port number as parameters like required for telnet(1)
1347 plus the command with its arguments to be executed remotely. The Grid
1348 Engine rsh client has been extended to accept and use the port number
1349 argument. You can only use clients, such as ssh, which also understand
1350 this syntax.
1351
1352 Changing rsh_command will take immediate effect. The default value for
1353 rsh_command is builtin.
1354
1355 The global configuration entry for this value may be overwritten by the
1356 execution host local configuration.
1357
1358 In addition to the examples of qlogin_command , this value is allowed:
1359 rsh_command none
1360
1361 delegated_file_staging
1362 This flag must be set to "true" when the prolog and epilog are ready
1363 for delegated file staging, so that the DRMAA attribute 'drmaa_trans‐
1364 fer_files' is supported. To establish delegated file staging, use the
1365 variables beginning with "$fs_..." in prolog and epilog to move the
1366 input, output and error files from one host to the other. When this
1367 flag is set to "false", no file staging is available for the DRMAA
1368 interface. File staging is currently implemented only via the DRMAA
1369 interface. When an error occurs while moving the input, output and
1370 error files, return error code 100 so that the error handling mechanism
1371 can handle the error correctly. (See also FORBID_APPERROR).
1372
1373 reprioritize
1374 Note: Deprecated, may be removed in future release.
1375 This flag enables or disables the reprioritization of jobs based on
1376 their ticket amount. The reprioritize_interval in sched_conf(5) takes
1377 effect only if reprioritize is set to true. To turn off job reprioriti‐
1378 zation, the reprioritize flag must be set to false and the repriori‐
1379 tize_interval to 0 which is the default.
1380
1381 This value is a global configuration parameter only. It cannot be over‐
1382 ridden by the execution host local configuration.
1383
1384 jsv_url
1385 This setting defines a server JSV instance which will be started and
1386 triggered by the sge_qmaster(8) process. This JSV instance will be used
1387 to verify job specifications of jobs before they are accepted and
1388 stored in the internal master database. The global configuration entry
1389 for this value cannot be overwritten by execution host local configura‐
1390 tions.
1391
1392 Find more details concerning JSV in jsv(1) and sge_request(1).
1393
1394 The syntax of the jsv_url is specified in sge_types(1).
1395
1396 jsv_allowed_mod
1397 If there is a server JSV script defined with jsv_url parameter, then
1398 all qalter(1) or qmon(1) modification requests for jobs are rejected by
1399 qmaster. With the jsv_allowed_mod parameter an administrator has the
1400 possibility to allow a set of switches which can then be used with
1401 clients to modify certain job attributes. The value for this parameter
1402 has to be a comma separated list of JSV job parameter names as they are
1403 documented in qsub(1) or the value none to indicate that no modifica‐
1404 tion should be allowed. Please note that even if none is specified the
1405 switches -w and -t are allowed for qalter.
1406
1407 libjvm_path
1408 libjvm_path is usually set during qmaster installation and points to
1409 the absolute path of libjvm.so. (or the corresponding library depend‐
1410 ing on your architecture - e.g. /usr/java/jre/lib/i386/server/lib‐
1411 jvm.so) The referenced libjvm version must be at least 1.5. It is
1412 needed by the jvm qmaster thread only. If the java vm needs additional
1413 starting parameters they can be set in additional_jvm_args. If the jvm
1414 thread is started at all can be defined in the bootstrap(5) file. If
1415 libjvm_path is empty or an incorrect path the jvm thread fails to
1416 start.
1417
1418 The global configuration entry for this value may be overwritten by the
1419 execution host local configuration.
1420
1421 additional_jvm_args
1422 additional_jvm_args is usually set during qmaster installation.
1423 Details about possible values additional_jvm_args can be found in the
1424 help output of the accompanying java command. This setting is normally
1425 not needed.
1426
1427 The global configuration entry for this value may be overwritten by the
1428 execution host local configuration.
1429
1431 ge_intro(1), csh(1), qconf(1), qsub(1), jsv(1), rsh(1), sh(1), getpw‐
1432 nam(3), drmaa_attributes(3), queue_conf(5), sched_conf(5),
1433 sge_types(1), ge_execd(8), ge_qmaster(8), ge_shepherd(8), cron(8), Grid
1434 Engine Installation and Administration Guide.
1435
1437 See ge_intro(1) for a full statement of rights and permissions.
1438
1439
1440
1441GE 6.2u5 $Date: 2009/11/13 14:26:06 $ GE_CONF(5)