1
2ethhostadmin(8)              EFSFFCLIRG (Man Page)             ethhostadmin(8)
3
4
5

NAME

7       ethhostadmin
8
9
10
11       Performs  a  number  of multi-step host initialization and verification
12       operations, including upgrading software, rebooting  hosts,  and  other
13       operations.  In general, operations performed by ethhostadmin involve a
14       login to one or more host systems.
15

Syntax

17       ethhostadmin [-c] [-f  hostfile] [-h 'hosts']
18       [-r  release] [-I  install_options] [-U  upgrade_options] [-d  dir]
19       [-T  product] [-P  packages] [-S]  operation ...
20

Options

22       --help
23
24                 Produces full help text.
25
26
27       -c
28
29                 Overwrites the result files  from  any  previous  run  before
30                 starting this run.
31
32
33       -f hostfile
34
35                 Specifies  the file with the names of hosts in a cluster. De‐
36                 fault is /etc/eth-tools/hosts file.
37
38
39       -h hosts
40
41                 Specifies the list of hosts to execute the operation against.
42
43
44       -r release
45
46                 Specifies the software version to load/upgrade to. Default is
47                 the  version of Intel(R) Ethernet Fabric Suite Software pres‐
48                 ently being run on the server.
49
50
51       -d dir
52
53                 Specifies the directory to retrieve  product. release.tgz for
54                 load or upgrade.
55
56
57       -I install_options
58
59                 Specifies the software install options.
60
61
62       -U upgrade_options
63
64                 Specifies the software upgrade options.
65
66
67       -T product
68
69                 Specifies the product type to install. Options include:
70
71
72
73                 •      IntelEth-Basic. <distro> (default)
74
75                 •      IntelEth-FS. <distro>
76
77
78
79                 where   <distro>   is  the  distribution  and  CPU,  such  as
80                 RHEL81-x86_64.
81
82       -P packages
83
84                 Specifies the  packages  to  install.  Default  is  eth_tools
85                 eth_rdma  mpi.  Refer to INSTALL -C for compete list of pack‐
86                 ages.
87
88
89       -S
90
91                 Securely prompts for user password on remote system.
92
93
94       operation
95
96                 Performs the specified operation, which can be one or more of
97                 the following:
98
99
100
101
102                 load      Starts initial installation of all hosts.
103
104
105
106
107
108                 upgrade   Upgrades installation of all hosts.
109
110
111
112
113
114                 reboot    Reboots hosts, ensures they go down and come back.
115
116
117
118
119
120                 rping     Verifies this host can ping each host through RDMA.
121
122
123
124
125
126                 mpiperf   Verifies latency and bandwidth for each host.
127
128
129
130
131
132                 mpiperfdeviation
133                           Verifies   latency  and  bandwidth  for  each  host
134                           against a defined threshold (or relative to average
135                           host performance).
136
137
138

Example

140       ethhostadmin -c reboot
141
142       ethhostadmin upgrade
143
144       ethhostadmin -h 'elrond arwen' reboot
145
146       HOSTS='elrond arwen' ethhostadmin reboot
147

Details

149       ethhostadmin provides detailed logging of its results. During each run,
150       the following files are produced:
151
152       •      test.res : Appended with summary results of run.
153
154       •      test.log : Appended with detailed results of run.
155
156       •      save_tmp/ : Contains a directory per failed test  with  detailed
157              logs.
158
159       •      test_tmp*/ : Intermediate result files while test is running.
160
161       The -c option removes all log files.
162
163       Results from ethhostadmin are grouped into test suites, test cases, and
164       test items. A given run of ethhostadmin represents a single test suite.
165       Within a test suite, multiple test cases occur; typically one test case
166       per host being operated on. Some of the  more  complex  operations  may
167       have multiple test items per test case. Each test item represents a ma‐
168       jor step in the overall test case.
169
170       Each ethhostadmin run appends to test.res  and  test.log,  and  creates
171       temporary files in test_tmp$PID in the current directory. test.res pro‐
172       vides an overall summary of operations performed and their results. The
173       same  information  is  also  displayed while ethhostadmin is executing.
174       test.log contains detailed information about what  was  performed,  in‐
175       cluding  the  specific  commands executed and the resulting output. The
176       test_tmp directories contain temporary files  which  reflect  tests  in
177       progress  (or  killed).  The  logs  for  any failures are logged in the
178       save_temp directory with a directory per failed test case. If the  same
179       test  case fails more than once, save_temp retains the information from
180       the first failure. Subsequent runs  of  ethhostadmin  are  appended  to
181       test.log.  Intel  recommends reviewing failures and using the -c option
182       to remove old logs before subsequent runs of ethhostadmin.
183
184       ethhostadmin implicitly performs its operations in  parallel.  However,
185       as  for  the other tools, FF_MAX_PARALLEL can be exported to change the
186       degree of parallelism. 1000 parallel operations is the default.
187

Environment Variables

189       The following environment variables are also used by this command:
190
191       HOSTS
192
193                 List of hosts, used if -h option not supplied.
194
195
196       HOSTS_FILE
197
198                 File containing list of hosts, used in absence of -f and -h.
199
200
201       FF_MAX_PARALLEL
202
203                 Maximum concurrent operations are performed.
204
205
206       FF_SERIALIZE_OUTPUT
207
208                 Serialize output of parallel operations (yes or no).
209
210
211       FF_TIMEOUT_MULT
212
213                 Multiplier for all timeouts  associated  with  this  command.
214                 Used if the systems are slow for some reason.
215
216
217

ethhostadmin Operation Details

219       (Host)  Intel  recommends  that  you set up password SSH or SCP for use
220       during this operation. Alternatively, the -S option can be used to  se‐
221       curely  prompt  for a password, in which case the same password is used
222       for all hosts. Alternately, the password may be put in the  environment
223       or the ethfastfabric.conf file using FF_PASSWORD and FF_ROOTPASS.
224
225       load
226
227                 Performs  an initial installation of Intel(R) Ethernet Fabric
228                 Suite Software on a group of hosts. Any existing installation
229                 is  uninstalled and existing configuration files are removed.
230                 Subsequently, the hosts are installed with a default Intel(R)
231                 Ethernet  Fabric  Suite Software configuration. The -I option
232                 can be used to select different install packages. Default  is
233                 eth_tools eth_rdma mpi The -r option can be used to specify a
234                 release to install other than the one that this host is pres‐
235                 ently  running.  The  FF_PRODUCT. FF_PRODUCT_VERSION.tgz file
236                 (for example, IntelEth-Basic. version.tgz) is expected to ex‐
237                 ist  in the directory specified by -d. Default is the current
238                 working directory. The specified software is  copied  to  all
239                 the selected hosts and installed.
240
241
242       upgrade
243
244                 Upgrades  all  selected hosts without modifying existing con‐
245                 figurations. This operation is comparable to  the  -U  option
246                 when running ./INSTALL manually. The -r option can be used to
247                 upgrade to a release different from this host. The default is
248                 to upgrade to the same release as this host. The  FF_PRODUCT.
249                 FF_PRODUCT_VERSION.tgz  file  (for  example,  IntelEth-Basic.
250                 version.tgz)  is expected to exist in the directory specified
251                 by -d. The default is  the  current  working  directory.  The
252                 specified  software  is  copied  to all the end nodes and in‐
253                 stalled.
254
255
256
257
258       NOTE: Only components that are currently installed are  upgraded.  This
259       operation  fails  for  hosts  that do not have Intel(R) Ethernet Fabric
260       Suite Software installed.
261
262
263
264       reboot
265
266                 Reboots the given hosts and ensures they  go  down  and  come
267                 back  up  by pinging them during the reboot process. The ping
268                 rate is slow (5 seconds), so if the servers boot faster  than
269                 this, false failures may be seen.
270
271
272       rping
273
274                 Verifies  RDMA basic operation by ensuring that the nodes can
275                 ping each other through RDMA. To run this  command,  Intel(R)
276                 Ethernet Fabric software must be installed, RDMA must be con‐
277                 figured and running on the host, and  the  given  hosts,  and
278                 switches must be up.
279
280
281       mpiperf
282
283                 Verifies  that  MPI  is operational and checks MPI end-to-end
284                 latency and bandwidth between pairs of  nodes  (for  example,
285                 1-2,  3-4,  5-6). Use this to verify switch latency/hops, PCI
286                 bandwidth, and overall MPI  performance.  The  test.res  file
287                 contains the results of each pair of nodes tested.
288
289
290
291
292
293       NOTE:  This option is available for the Intel(R) Ethernet Host Software
294       OFA Delta packaging, but is not presently available for  other  packag‐
295       ings of OFED.
296
297
298
299              To  obtain  accurate  results, this test should be run at a time
300              when no other stressful applications (for example, MPI  jobs  or
301              high  stress  file  system  operations) are running on the given
302              hosts.
303
304              Bandwidth issues typically indicate server configuration  issues
305              (for  example,  incorrect slot used, incorrect BIOS settings, or
306              incorrect NIC model), or fabric issues (for example, symbol  er‐
307              rors,  incorrect  link  width, or speed). Assuming ethreport has
308              previously been used to check for link errors and link speed is‐
309              sues, the server configuration should be verified.
310
311              Note  that  BIOS  settings and differences between server models
312              can account for 10-20% differences in bandwidth.  For  more  de‐
313              tails  about  BIOS  settings, consult the documentation from the
314              server supplier and/or the server PCI chipset manufacturer.
315
316       mpiperfdeviation
317
318                 Specifies the enhanced version of mpiperf that  verifies  MPI
319                 performance.  Can  be used to verify switch latency/hops, PCI
320                 bandwidth, and overall MPI performance. It performs  assorted
321                 pair-wise bandwidth and latency tests, and reports pairs out‐
322                 side an acceptable tolerance range. The tool identifies  spe‐
323                 cific nodes that have problems and provides a concise summary
324                 of results. The test.res file contains the  results  of  each
325                 pair of nodes tested.
326
327
328                 By  default,  concurrent  mode is used to quickly analyze the
329                 fabric and host performance. Pairs that have 20%  less  band‐
330                 width  or 50% more latency than the average pair are reported
331                 as failures.
332
333                 The tool can be run in a sequential or a concurrent mode. Se‐
334                 quential mode runs each host against a reference host. By de‐
335                 fault, the reference host is selected based on the best  per‐
336                 formance  from a quick test of the first 40 hosts. In concur‐
337                 rent mode, hosts are paired up and all pairs are run  concur‐
338                 rently.  Since  there  may be fabric contention during such a
339                 run, any poor performing pairs are  then  rerun  sequentially
340                 against the reference host.
341
342                 Concurrent  mode  runs  the  tests  in the shortest amount of
343                 time, however, the results could be  slightly  less  accurate
344                 due  to  switch  contention. In heavily oversubscribed fabric
345                 designs, if concurrent mode  is  producing  unexpectedly  low
346                 performance, try sequential mode.
347
348
349
350
351       NOTE:  This option is available for the Intel(R) Ethernet Host Software
352       OFA Delta packaging, but is not presently available for  other  packag‐
353       ings of OFED.
354
355
356
357              To  obtain  accurate  results, this test should be run at a time
358              when no other stressful applications  (for  example,  MPI  jobs,
359              high  stress  file  system  operations) are running on the given
360              hosts.
361
362              Bandwidth issues typically indicate server configuration  issues
363              (for  example,  incorrect slot used, incorrect BIOS settings, or
364              incorrect NIC model), or fabric issues (for example, symbol  er‐
365              rors,  incorrect  link  width, or speed). Assuming ethreport has
366              previously been used to check for link errors and link speed is‐
367              sues, the server configuration should be verified.
368
369              Note  that  BIOS  settings and differences between server models
370              can account for 10-20% differences in bandwidth. A result  5-10%
371              below  the average is typically not cause for serious alarm, but
372              may reflect limitations in the server design or the chosen  BIOS
373              settings.
374
375              For  more details about BIOS settings, consult the documentation
376              from the server supplier and/or the server PCI chipset  manufac‐
377              turer.
378
379              The  deviation application supports a number of parameters which
380              allow for more precise control  over  the  mode,  benchmark  and
381              pass/fail  criteria. The parameters to use can be selected using
382              the FF_DEVIATION_ARGS  configuration  parameter  in  ethfastfab‐
383              ric.conf
384
385              Available parameters for deviation application:
386
387              [-bwtol bwtol] [-bwdelta MBs] [-bwthres MBs]
388
389              [-bwloop count] [-bwsize size] [-lattol latol]
390
391              [-latdelta usec] [-latthres usec] [-latloop count]
392
393              [-latsize size][-c] [-b] [-v] [-vv]
394
395              [-h reference_host]
396
397
398
399
400
401              -bwtol    Specifies the percent of bandwidth degradation allowed
402                        below average value.
403
404
405
406
407
408              -bwbidir  Performs a bidirectional bandwidth test.
409
410
411
412
413
414              -bwunidir Performs a unidirectional bandwidth test (Default).
415
416
417
418
419
420              -bwdelta  Specifies the limit in MB/s of  bandwidth  degradation
421                        allowed below average value.
422
423
424
425
426
427              -bwthres  Specifies  the  lower  limit  in MB/s of bandwidth al‐
428                        lowed.
429
430
431
432
433
434              -bwloop   Specifies the number of loops to  execute  each  band‐
435                        width test.
436
437
438
439
440
441              -bwsize   Specifies  the  size  of  message to use for bandwidth
442                        test.
443
444
445
446
447
448              -lattol   Specifies the percent of latency  degradation  allowed
449                        above average value.
450
451
452
453
454
455              -latdelta Specifies the imit in &#181;sec of latency degradation
456                        allowed above average value.
457
458
459
460
461
462              -latthres Specifies the lower limit in &#181;sec of latency  al‐
463                        lowed.
464
465
466
467
468
469              -latloop  Specifies  the number of loops to execute each latency
470                        test.
471
472
473
474
475
476              -latsize  Specifies the size of message to use for latency test.
477
478
479
480
481
482              -c        Runs test pairs concurrently instead of the default of
483                        sequential.
484
485
486
487
488
489              -b        When  comparing  results  against tolerance and delta,
490                        uses best instead of average.
491
492
493
494
495
496              -v        Specifies the verbose output.
497
498
499
500
501
502              -vv       Specifies the very verbose output.
503
504
505
506
507
508              -h        Specifies the reference host  to  use  for  sequential
509                        pairing.
510
511
512
513              Both bwtol and bwdelta must be exceeded to fail bandwidth test.
514
515              When bwthres is supplied, bwtol and bwdelta are ignored.
516
517              Both lattol and latdelta must be exceeded to fail latency test.
518
519              When latthres is supplied, lattol and latdelta are ignored.
520
521              For  consistency with OSU benchmarks, MB/s is defined as 1000000
522              bytes/s.
523
524
525
526Copyright(C) 2020              Intel Corporation               ethhostadmin(8)
Impressum