1QHOST(1)                   Grid Engine User Commands                  QHOST(1)
2
3
4

NAME

6       qhost - show the status of Grid Engine hosts, queues, jobs
7

SYNTAX

9       qhost  [ -cb ] [ -F [resource_name,...]  ] [ -help ] [ -h host_list ] [
10       -j ] [ -l resource=val,...  ] [ -u user,...  ] [ -xml ].
11

DESCRIPTION

13       qhost shows the current status of  the  available  Grid  Engine  hosts,
14       queues and the jobs associated with the queues. Selection options allow
15       you to get information about specific hosts, queues, jobs or users.  If
16       multiple  selections are done a host is only displayed if all selection
17       criteria for a host are met.  Without any options qhost will display  a
18       list of all hosts without queue or job information.
19

OPTIONS

21       -F [ resource_name,... ]
22              qhost  will  present  a detailed listing of the current resource
23              availability per host with respect  to  all  resources  (if  the
24              option  argument  is omitted) or with respect to those resources
25              contained  in  the  resource_name  list.  Please  refer  to  the
26              description  of  the Full Format in section OUTPUT FORMATS below
27              for further detail.
28
29       -cb    This command line switch can be used since Grid  Engine  version
30              6.2u5  in combination with other qhost(1) command line switches.
31              In that case the output of the corresponding command  will  con‐
32              tain  information  concerning the added job to core binding fea‐
33              ture.
34
35              If this switch is not used then the command behaves as  in  ver‐
36              sion 6.2u4 and previous versions.
37
38              If this option is used then two additional columns will be shown
39              for each displayed host in the output. The first is  named  NSOC
40              and represents the number of available sockets on that host. The
41              second additional column is named NCOR  and  it  represents  the
42              number of cores that are available per socket on the correspond‐
43              ing machine.
44
45       -help  Prints a listing of all options.
46
47       -h host_list
48              Prints a list of all hosts contained in host_list.
49
50       -j     Prints all jobs running on the queues hosted by the shown hosts.
51              This switch calls -q implicitly.
52
53       -l resource[=value],...
54              Defines the resources to be granted by the hosts which should be
55              included in the host list output. Matching is performed on hosts
56              based  on  non-mutable  resource  availability information only.
57              That means load values are always ignored except  the  so-called
58              static   load  values  (i.e.  "arch",  "num_proc",  "mem_total",
59              "swap_total" and "virtual_total") ones. Also consumable utiliza‐
60              tion  is  ignored.   If  there are multiple -l resource requests
61              they will be concatenated by a logical  AND:  a  host  needs  to
62              match all resources to be displayed.
63
64       -q     Show  information  about the queues instances hosted by the dis‐
65              played hosts.
66
67       -u user,...
68              Display information only on those jobs and queues being  associ‐
69              ated with the users from the given user list.
70
71       -xml   This  option  can be used with all other options and changes the
72              output to XML. The used schemas are referenced in the  XML  out‐
73              put. The output is printed to stdout.
74
75              If  the  -xml parameter is combined with -cb then the XML output
76              will contain additional tags containing information about job to
77              core  binding.   You can find schema files with the suffix "_cb"
78              in  the  directory  $SGE_ROOT/util/resources/schemas/qhost  that
79              describe that changes.
80

OUTPUT FORMATS

82       Depending  on  the  presence  or  absence of the -q or -F and -j option
83       three output formats need to be differentiated.  PP
84
85   Default Format (without -q, -F and -j)
86       For each host one line is printed. The output consists of consisting of
87
88       ·  the Hostname
89
90       ·  the Architecture.
91
92       ·  the  Number of processors.
93
94       ·  the Load.
95
96       ·  the Total Memory.
97
98       ·  the Used Memory.
99
100       ·  the Total Swapspace.
101
102       ·  the Used Swapspace.
103
104       If the -q option is supplied, each host status line also contains extra
105       lines for every queue hosted by the host consisting of,
106
107       ·  the queue name.
108
109       ·  the  queue  type  -  one of B(atch), I(nteractive), C(heckpointing),
110          P(arallel), T(ransfer) or combinations thereof,
111
112       ·  the number of reserved, used and available job slots,
113
114       ·  the state of the queue -  one  of  u(nknown)  if  the  corresponding
115          ge_execd(8)  cannot  be  contacted, a(larm), A(larm), C(alendar sus‐
116          pended), s(uspended), S(ubordinate), d(isabled), D(isabled), E(rror)
117          or combinations thereof.
118
119       If the state is a(alarm) at least one of the load thresholds defined in
120       the load_thresholds list of the queue configuration (see queue_conf(5))
121       is  currently  exceeded, which prevents from scheduling further jobs to
122       that queue.
123
124       As opposed to this, the state A(larm) indicates that at  least  one  of
125       the  suspend  thresholds  of the queue (see queue_conf(5)) is currently
126       exceeded. This will result in jobs running in that queue being  succes‐
127       sively suspended until no threshold is violated.
128
129       The  states  s(uspended)  and  d(isabled) can be assigned to queues and
130       released via the qmod(1) command. Suspending a  queue  will  cause  all
131       jobs executing in that queue to be suspended.
132
133       The  states D(isabled) and C(alendar suspended) indicate that the queue
134       has been disabled or suspended automatically via the calendar  facility
135       of  Grid  Engine  (see calendar_conf(5)), while the S(ubordinate) state
136       indicates, that the queue has been suspend via subordination to another
137       queue (see queue_conf(5) for details). When suspending a queue (regard‐
138       less of the cause) all jobs executing in that queue are suspended too.
139
140       If an E(rror) state is displayed for a queue, ge_execd(8) on that  host
141       was  unable  to  locate  the  ge_shepherd(8) executable on that host in
142       order  to  start  a  job.  Please  check  the  error  logfile  of  that
143       ge_execd(8)  for leads on how to resolve the problem. Please enable the
144       queue afterwards via the -c option of the qmod(1) command manually.
145
146       If the -F option was used, resource availability information is printed
147       following  the  host  status line. For each resource (as selected in an
148       option argument to -F or for all resources if the option  argument  was
149       omitted) a single line is displayed with the following format:
150
151       ·  a  one  letter  specifier  indicating  whether  the current resource
152          availability value was dominated by either
153          `g' - a cluster global,
154          `h' - a host total or
155
156       ·  a second one letter specifier indicating the source for the  current
157          resource availability value, being one of
158          `l' - a load value reported for the resource,
159          `L' - a load value for the resource after administrator defined load
160          scaling has been applied,
161          `c' - availability derived from the  consumable  resources  facility
162          (see complexes(5)),
163          `f'  - a fixed availability definition derived from a non-consumable
164          complex attribute or a fixed resource limit.
165
166       ·  after a colon the name of the resource on which information is  dis‐
167          played.
168
169       ·  after an equal sign the current resource availability value.
170
171       The  displayed  availability  values  and  the  sources from which they
172       derive are always the minimum  values  of  all  possible  combinations.
173       Hence,  for example, a line of the form "qf:h_vmem=4G" indicates that a
174       queue currently has a maximum availability in virtual memory of 4 Giga‐
175       byte,  where  this value is a fixed value (e.g. a resource limit in the
176       queue configuration) and it is queue dominated, i.e. the host in  total
177       may have more virtual memory available than this, but the queue doesn't
178       allow for more. Contrarily a line "hl:h_vmem=4G" would also indicate an
179       upper  bound  of  4 Gigabyte virtual memory availability, but the limit
180       would be derived from a load value currently reported for the host.  So
181       while  the  queue  might  allow  for  jobs  with  higher virtual memory
182       requirements, the host on which this particular queue resides currently
183       only has 4 Gigabyte available.
184
185       After  the  queue  status line (in case of -j) a single line is printed
186       for each job running currently in this queue. Each job status line con‐
187       tains
188
189       ·  the job ID,
190
191       ·  the job name,
192
193       ·  the job owner name,
194
195       ·  the   status   of   the  job  -  one  of  t(ransfering),  r(unning),
196          R(estarted),  s(uspended),  S(uspended)  or  T(hreshold)  (see   the
197          Reduced Format section for detailed information),
198
199       ·  the start date and time and the function of the job (MASTER or SLAVE
200          - only meaningful in case of a parallel job) and
201
202       ·  the priority of the jobs.
203

ENVIRONMENTAL VARIABLES

205       GE_ROOT        Specifies the location of the Grid Engine standard  con‐
206                      figuration files.
207
208       GE_CELL        If  set,  specifies  the  default  Grid  Engine cell. To
209                      address a Grid Engine cell qhost uses (in the  order  of
210                      precedence):
211
212                             The name of the cell specified in the environment
213                             variable GE_CELL, if it is set.
214
215                             The name of the default cell, i.e. default.
216
217
218       GE_DEBUG_LEVEL If set, specifies that debug information should be writ‐
219                      ten  to stderr. In addition the level of detail in which
220                      debug information is generated is defined.
221
222       GE_QMASTER_PORT
223                      If set, specifies the tcp port on which ge_qmaster(8) is
224                      expected  to  listen  for  communication requests.  Most
225                      installations will use a services map entry for the ser‐
226                      vice "sge_qmaster" instead to define that port.
227

FILES

229       <ge_root>/<cell>/common/act_qmaster
230                      Grid Engine master host file
231

SEE ALSO

233       ge_intro(1), qalter(1), qconf(1), qhold(1), qmod(1), qstat(1), qsub(1),
234       queue_conf(5), ge_execd(8), ge_qmaster(8), ge_shepherd(8).
235
237       See ge_intro(1) for a full statement of rights and permissions.
238
239
240
241GE 6.2u5                 $Date: 2009/11/05 13:18:36 $                 QHOST(1)
Impressum