1
2ethhostadmin(8)              EFSFFCLIRG (Man Page)             ethhostadmin(8)
3
4
5

NAME

7       ethhostadmin
8
9
10
11       Performs  a  number  of multi-step host initialization and verification
12       operations, including upgrading software, rebooting  hosts,  and  other
13       operations.  In general, operations performed by ethhostadmin involve a
14       login to one or more host systems.
15

Syntax

17       ethhostadmin [-c] [-e] [-f  hostfile] [-h 'hosts']
18       [-r  release] [-I  install_options] [-U  upgrade_options] [-d  dir]
19       [-T  product] [-P  packages] [-S]  operation ...
20

Options

22       --help
23
24                 Produces full help text.
25
26
27       -c
28
29                 Overwrites the result files  from  any  previous  run  before
30                 starting this run.
31
32
33       -e
34
35                 exit after 1st operation which fails.
36
37
38       -f hostfile
39
40                 Specifies  the file with the names of hosts in a cluster. De‐
41                 fault is /etc/eth-tools/hosts file.
42
43
44       -h hosts
45
46                 Specifies the list of hosts to execute the operation against.
47
48
49       -r release
50
51                 Specifies the software version to load/upgrade to. Default is
52                 the  version of Intel(R) Ethernet Fabric Suite Software pres‐
53                 ently being run on the server.
54
55
56       -d dir
57
58                 Specifies the directory to retrieve  product. release.tgz for
59                 load or upgrade.
60
61
62       -I install_options
63
64                 Specifies the software install options.
65
66
67       -U upgrade_options
68
69                 Specifies the software upgrade options.
70
71
72       -T product
73
74                 Specifies the product type to install. Options include:
75
76
77
78                 •      IntelEth-Basic. <distro> (default)
79
80                 •      IntelEth-FS. <distro>
81
82
83
84                 where   <distro>   is  the  distribution  and  CPU,  such  as
85                 RHEL81-x86_64.
86
87       -P packages
88
89                 Specifies the packages to install. Default is  eth  eth_rdma.
90                 Refer to INSTALL -C for compete list of packages.
91
92
93       -S
94
95                 Securely prompts for user password on remote system.
96
97
98       operation
99
100                 Performs the specified operation, which can be one or more of
101                 the following:
102
103
104
105
106                 load      Starts initial installation of all hosts.
107
108
109
110
111
112                 upgrade   Upgrades installation of all hosts.
113
114
115
116
117
118                 reboot    Reboots hosts, ensures they go down and come back.
119
120
121
122
123
124                 rping     Verifies this host can ping each host through RDMA.
125
126
127
128
129
130                 pfctest   Verifies PFC works on all hosts.
131
132
133
134
135
136                 mpiperf   Verifies latency and bandwidth for each host.
137
138
139
140
141
142                 mpiperfdeviation
143                           Verifies  latency  and  bandwidth  for  each   host
144                           against a defined threshold (or relative to average
145                           host performance).
146
147
148

Example

150       ethhostadmin -c reboot
151
152       ethhostadmin upgrade
153
154       ethhostadmin -h 'elrond arwen' reboot
155
156       HOSTS='elrond arwen' ethhostadmin reboot
157

Details

159       ethhostadmin provides detailed logging of its results. During each run,
160       the following files are produced:
161
162       •      test.res : Appended with summary results of run.
163
164       •      test.log : Appended with detailed results of run.
165
166       •      save_tmp/  :  Contains a directory per failed test with detailed
167              logs.
168
169       •      test_tmp*/ : Intermediate result files while test is running.
170
171       The -c option removes all log files.
172
173       Results from ethhostadmin are grouped into test suites, test cases, and
174       test items. A given run of ethhostadmin represents a single test suite.
175       Within a test suite, multiple test cases occur; typically one test case
176       per  host  being  operated  on. Some of the more complex operations may
177       have multiple test items per test case. Each test item represents a ma‐
178       jor step in the overall test case.
179
180       Each  ethhostadmin  run  appends  to test.res and test.log, and creates
181       temporary files in test_tmp$PID in the current directory. test.res pro‐
182       vides an overall summary of operations performed and their results. The
183       same information is also displayed  while  ethhostadmin  is  executing.
184       test.log  contains  detailed  information about what was performed, in‐
185       cluding the specific commands executed and the  resulting  output.  The
186       test_tmp  directories  contain  temporary  files which reflect tests in
187       progress (or killed). The logs for  any  failures  are  logged  in  the
188       save_temp  directory with a directory per failed test case. If the same
189       test case fails more than once, save_temp retains the information  from
190       the  first  failure.  Subsequent  runs  of ethhostadmin are appended to
191       test.log. Intel recommends reviewing failures and using the  -c  option
192       to remove old logs before subsequent runs of ethhostadmin.
193
194       ethhostadmin  implicitly  performs its operations in parallel. However,
195       as for the other tools, FF_MAX_PARALLEL can be exported to  change  the
196       degree of parallelism. 1000 parallel operations is the default.
197

Environment Variables

199       The following environment variables are also used by this command:
200
201       HOSTS
202
203                 List of hosts, used if -h option not supplied.
204
205
206       HOSTS_FILE
207
208                 File containing list of hosts, used in absence of -f and -h.
209
210
211       FF_MAX_PARALLEL
212
213                 Maximum concurrent operations are performed.
214
215
216       FF_SERIALIZE_OUTPUT
217
218                 Serialize output of parallel operations (yes or no).
219
220
221       FF_TIMEOUT_MULT
222
223                 Multiplier  for  all  timeouts  associated with this command.
224                 Used if the systems are slow for some reason.
225
226
227

ethhostadmin Operation Details

229       (Host) Intel recommends that you set up password SSH  or  SCP  for  use
230       during  this operation. Alternatively, the -S option can be used to se‐
231       curely prompt for a password, in which case the same password  is  used
232       for  all hosts. Alternately, the password may be put in the environment
233       or the ethfastfabric.conf file using FF_PASSWORD and FF_ROOTPASS.
234
235       load
236
237                 Performs an initial installation of Intel(R) Ethernet  Fabric
238                 Suite Software on a group of hosts. Any existing installation
239                 is uninstalled and existing configuration files are  removed.
240                 Subsequently, the hosts are installed with a default Intel(R)
241                 Ethernet Fabric Suite Software configuration. The  -I  option
242                 can  be used to select different install packages. Default is
243                 eth_tools eth_rdma mpi The -r option can be used to specify a
244                 release to install other than the one that this host is pres‐
245                 ently running. The  FF_PRODUCT.  FF_PRODUCT_VERSION.tgz  file
246                 (for example, IntelEth-Basic. version.tgz) is expected to ex‐
247                 ist in the directory specified by -d. Default is the  current
248                 working  directory.  The  specified software is copied to all
249                 the selected hosts and installed.
250
251
252       upgrade
253
254                 Upgrades all selected hosts without modifying  existing  con‐
255                 figurations.  This  operation  is comparable to the -U option
256                 when running ./INSTALL manually. The -r option can be used to
257                 upgrade to a release different from this host. The default is
258                 to upgrade to the same release as this host. The  FF_PRODUCT.
259                 FF_PRODUCT_VERSION.tgz  file  (for  example,  IntelEth-Basic.
260                 version.tgz) is expected to exist in the directory  specified
261                 by  -d.  The  default  is  the current working directory. The
262                 specified software is copied to all the  end  nodes  and  in‐
263                 stalled.
264
265
266
267
268       NOTE:  Only  components that are currently installed are upgraded. This
269       operation fails for hosts that do not  have  Intel(R)  Ethernet  Fabric
270       Suite Software installed.
271
272
273
274       reboot
275
276                 Reboots  the  given  hosts  and ensures they go down and come
277                 back up by pinging them during the reboot process.  The  ping
278                 rate  is slow (5 seconds), so if the servers boot faster than
279                 this, false failures may be seen.
280
281
282       rping
283
284                 Verifies RDMA basic operation by ensuring that the nodes  can
285                 ping  each  other through RDMA. To run this command, Intel(R)
286                 Ethernet Fabric software must be installed, RDMA must be con‐
287                 figured  and  running  on  the host, and the given hosts, and
288                 switches must be up.
289
290
291       pfctest
292
293                 Empirical test which verifies PFC is working  right.  To  run
294                 this  command,  Intel(R) Ethernet Fabric software must be in‐
295                 stalled, PFC must be configured on both hosts  and  switches,
296                 and the given hosts and switches must be up.
297
298
299       mpiperf
300
301                 Verifies  that  MPI  is operational and checks MPI end-to-end
302                 latency and bandwidth between pairs of  nodes  (for  example,
303                 1-2,  3-4,  5-6). Use this to verify switch latency/hops, PCI
304                 bandwidth, and overall MPI  performance.  The  test.res  file
305                 contains the results of each pair of nodes tested.
306
307
308
309
310
311       NOTE:  This option is available for the Intel(R) Ethernet Host Software
312       OFA Delta packaging, but is not presently available for  other  packag‐
313       ings of OFED.
314
315
316
317              To  obtain  accurate  results, this test should be run at a time
318              when no other stressful applications (for example, MPI  jobs  or
319              high  stress  file  system  operations) are running on the given
320              hosts.
321
322              Bandwidth issues typically indicate server configuration  issues
323              (for  example,  incorrect slot used, incorrect BIOS settings, or
324              incorrect NIC model), or fabric issues (for example, symbol  er‐
325              rors,  incorrect  link  width, or speed). Assuming ethreport has
326              previously been used to check for link errors and link speed is‐
327              sues, the server configuration should be verified.
328
329              Note  that  BIOS  settings and differences between server models
330              can account for 10-20% differences in bandwidth.  For  more  de‐
331              tails  about  BIOS  settings, consult the documentation from the
332              server supplier and/or the server PCI chipset manufacturer.
333
334       mpiperfdeviation
335
336                 Specifies the enhanced version of mpiperf that  verifies  MPI
337                 performance.  Can  be used to verify switch latency/hops, PCI
338                 bandwidth, and overall MPI performance. It performs  assorted
339                 pair-wise bandwidth and latency tests, and reports pairs out‐
340                 side an acceptable tolerance range. The tool identifies  spe‐
341                 cific nodes that have problems and provides a concise summary
342                 of results. The test.res file contains the  results  of  each
343                 pair of nodes tested.
344
345
346                 By  default,  concurrent  mode is used to quickly analyze the
347                 fabric and host performance. Pairs that have 20%  less  band‐
348                 width  or 50% more latency than the average pair are reported
349                 as failures.
350
351                 The tool can be run in a sequential or a concurrent mode. Se‐
352                 quential mode runs each host against a reference host. By de‐
353                 fault, the reference host is selected based on the best  per‐
354                 formance  from a quick test of the first 40 hosts. In concur‐
355                 rent mode, hosts are paired up and all pairs are run  concur‐
356                 rently.  Since  there  may be fabric contention during such a
357                 run, any poor performing pairs are  then  rerun  sequentially
358                 against the reference host.
359
360                 Concurrent  mode  runs  the  tests  in the shortest amount of
361                 time, however, the results could be  slightly  less  accurate
362                 due  to  switch  contention. In heavily oversubscribed fabric
363                 designs, if concurrent mode  is  producing  unexpectedly  low
364                 performance, try sequential mode.
365
366
367
368
369       NOTE:  This option is available for the Intel(R) Ethernet Host Software
370       OFA Delta packaging, but is not presently available for  other  packag‐
371       ings of OFED.
372
373
374
375              To  obtain  accurate  results, this test should be run at a time
376              when no other stressful applications  (for  example,  MPI  jobs,
377              high  stress  file  system  operations) are running on the given
378              hosts.
379
380              Bandwidth issues typically indicate server configuration  issues
381              (for  example,  incorrect slot used, incorrect BIOS settings, or
382              incorrect NIC model), or fabric issues (for example, symbol  er‐
383              rors,  incorrect  link  width, or speed). Assuming ethreport has
384              previously been used to check for link errors and link speed is‐
385              sues, the server configuration should be verified.
386
387              Note  that  BIOS  settings and differences between server models
388              can account for 10-20% differences in bandwidth. A result  5-10%
389              below  the average is typically not cause for serious alarm, but
390              may reflect limitations in the server design or the chosen  BIOS
391              settings.
392
393              For  more details about BIOS settings, consult the documentation
394              from the server supplier and/or the server PCI chipset  manufac‐
395              turer.
396
397              The  deviation application supports a number of parameters which
398              allow for more precise control  over  the  mode,  benchmark  and
399              pass/fail  criteria. The parameters to use can be selected using
400              the FF_DEVIATION_ARGS  configuration  parameter  in  ethfastfab‐
401              ric.conf
402
403              Available parameters for deviation application:
404
405              [-bwtol bwtol] [-bwdelta MBs] [-bwthres MBs]
406
407              [-bwloop count] [-bwsize size] [-lattol latol]
408
409              [-latdelta usec] [-latthres usec] [-latloop count]
410
411              [-latsize size][-c] [-b] [-v] [-vv]
412
413              [-h reference_host]
414
415
416
417
418
419              -bwtol    Specifies the percent of bandwidth degradation allowed
420                        below average value.
421
422
423
424
425
426              -bwbidir  Performs a bidirectional bandwidth test.
427
428
429
430
431
432              -bwunidir Performs a unidirectional bandwidth test (Default).
433
434
435
436
437
438              -bwdelta  Specifies the limit in MB/s of  bandwidth  degradation
439                        allowed below average value.
440
441
442
443
444
445              -bwthres  Specifies  the  lower  limit  in MB/s of bandwidth al‐
446                        lowed.
447
448
449
450
451
452              -bwloop   Specifies the number of loops to  execute  each  band‐
453                        width test.
454
455
456
457
458
459              -bwsize   Specifies  the  size  of  message to use for bandwidth
460                        test.
461
462
463
464
465
466              -lattol   Specifies the percent of latency  degradation  allowed
467                        above average value.
468
469
470
471
472
473              -latdelta Specifies the imit in &#181;sec of latency degradation
474                        allowed above average value.
475
476
477
478
479
480              -latthres Specifies the lower limit in &#181;sec of latency  al‐
481                        lowed.
482
483
484
485
486
487              -latloop  Specifies  the number of loops to execute each latency
488                        test.
489
490
491
492
493
494              -latsize  Specifies the size of message to use for latency test.
495
496
497
498
499
500              -c        Runs test pairs concurrently instead of the default of
501                        sequential.
502
503
504
505
506
507              -b        When  comparing  results  against tolerance and delta,
508                        uses best instead of average.
509
510
511
512
513
514              -v        Specifies the verbose output.
515
516
517
518
519
520              -vv       Specifies the very verbose output.
521
522
523
524
525
526              -h        Specifies the reference host  to  use  for  sequential
527                        pairing.
528
529
530
531              Both bwtol and bwdelta must be exceeded to fail bandwidth test.
532
533              When bwthres is supplied, bwtol and bwdelta are ignored.
534
535              Both lattol and latdelta must be exceeded to fail latency test.
536
537              When latthres is supplied, lattol and latdelta are ignored.
538
539              For  consistency with OSU benchmarks, MB/s is defined as 1000000
540              bytes/s.
541
542
543
544Copyright(C) 2020-2022         Intel Corporation               ethhostadmin(8)
Impressum