1GE_SHADOWD(8) Grid Engine Administrative Commands GE_SHADOWD(8)
2
3
4
6 ge_shadowd - Grid Engine shadow master daemon
7
9 ge_shadowd
10
12 ge_shadowd is a "light weight" process which can be run on so-called
13 shadow master hosts in a Grid Engine cluster to detect failure of the
14 current Grid Engine master daemon, ge_qmaster(8), and to start-up a new
15 ge_qmaster(8) on the host on which the ge_shadowd runs. If multiple
16 shadow daemons are active in a cluster, they run a protocol which
17 ensures that only one of them will start-up a new master daemon.
18
19 The hosts suitable for being used as shadow master hosts must have
20 shared root read/write access to the directory $GE_ROOT/$GE_CELL/common
21 as well as to the master daemon spool directory (by default
22 $GE_ROOT/$GE_CELL/spool/qmaster). The names of the shadow master hosts
23 need to be contained in the file $GE_ROOT/$xQS_NAME_Sxx_CELL/com‐
24 mon/shadow_masters.
25
27 ge_shadowd may only be started by root.
28
30 GE_ROOT Specifies the location of the Grid Engine standard con‐
31 figuration files.
32
33 GE_CELL If set, specifies the default Grid Engine cell. To
34 address a Grid Engine cell ge_shadowd uses (in the order
35 of precedence):
36
37 The name of the cell specified in the environment
38 variable GE_CELL, if it is set.
39
40 The name of the default cell, i.e. default.
41
42
43 GE_DEBUG_LEVEL If set, specifies that debug information should be writ‐
44 ten to stderr. In addition the level of detail in which
45 debug information is generated is defined.
46
47 GE_QMASTER_PORT
48 If set, specifies the tcp port on which ge_qmaster(8) is
49 expected to listen for communication requests. Most
50 installations will use a services map entry for the ser‐
51 vice "sge_qmaster" instead to define that port.
52
53 GE_DELAY_TIME This variable controls the interval in which ge_shadowd
54 pauses if a takeover bid fails. This value is used only
55 when there are multiple ge_shadowd instances and they
56 are contending to be the master. The default is 600
57 seconds.
58
59 GE_CHECK_INTERVAL
60 This variable controls the interval in which the
61 ge_shadowd checks the heartbeat file (60 seconds by
62 default).
63
64 GE_GET_ACTIVE_INTERVAL
65 This variable controls the interval when a ge_shadowd
66 instance tries to take over when the heartbeat file has
67 not changed.
68
70 <ge_root>/<cell>/common
71 Default configuration directory
72 <ge_root>/<cell>/common/shadow_masters
73 Shadow master hostname file.
74 <ge_root>/<cell>/spool/qmaster
75 Default master daemon spool directory
76 <ge_root>/<cell>/spool/qmaster/heartbeat
77 The heartbeat file.
78
80 ge_intro(1), ge_conf(5), ge_qmaster(8), Grid Engine Installation and
81 Administration Guide.
82
84 See ge_intro(1) for a full statement of rights and permissions.
85
86
87
88GE 6.2u5 $Date: 2007/11/08 23:04:23 $ GE_SHADOWD(8)