1
2ethhostadmin(8)              EFSFFCLIRG (Man Page)             ethhostadmin(8)
3
4
5

NAME

7       ethhostadmin
8
9
10
11       Performs  a  number  of multi-step host initialization and verification
12       operations, including upgrading software, rebooting  hosts,  and  other
13       operations.  In general, operations performed by ethhostadmin involve a
14       login to one or more host systems.
15

Syntax

17       ethhostadmin [-c] [-f  hostfile] [-h 'hosts']
18       [-r  release] [-I  install_options] [-U  upgrade_options] [-d  dir]
19       [-T  product] [-P  packages] [-S]  operation ...
20

Options

22       --help
23
24                 Produces full help text.
25
26
27       -c
28
29                 Overwrites the result files  from  any  previous  run  before
30                 starting this run.
31
32
33       -f hostfile
34
35                 Specifies  the file with the names of hosts in a cluster. De‐
36                 fault is /etc/eth-tools/hosts file.
37
38
39       -h hosts
40
41                 Specifies the list of hosts to execute the operation against.
42
43
44       -r release
45
46                 Specifies the software version to load/upgrade to. Default is
47                 the  version of Intel(R) Ethernet Fabric Suite Software pres‐
48                 ently being run on the server.
49
50
51       -d dir
52
53                 Specifies the directory to retrieve  product. release.tgz for
54                 load or upgrade.
55
56
57       -I install_options
58
59                 Specifies the software install options.
60
61
62       -U upgrade_options
63
64                 Specifies the software upgrade options.
65
66
67       -T product
68
69                 Specifies the product type to install. Options include:
70
71
72
73                 •      IntelEth-Basic. <distro> (default)
74
75                 •      IntelEth-FS. <distro>
76
77
78
79                 where   <distro>   is  the  distribution  and  CPU,  such  as
80                 RHEL81-x86_64.
81
82       -P packages
83
84                 Specifies the packages to install. Default is  eth  eth_rdma.
85                 Refer to INSTALL -C for compete list of packages.
86
87
88       -S
89
90                 Securely prompts for user password on remote system.
91
92
93       operation
94
95                 Performs the specified operation, which can be one or more of
96                 the following:
97
98
99
100
101                 load      Starts initial installation of all hosts.
102
103
104
105
106
107                 upgrade   Upgrades installation of all hosts.
108
109
110
111
112
113                 reboot    Reboots hosts, ensures they go down and come back.
114
115
116
117
118
119                 rping     Verifies this host can ping each host through RDMA.
120
121
122
123
124
125                 pfctest   Verifies PFC works on all hosts.
126
127
128
129
130
131                 mpiperf   Verifies latency and bandwidth for each host.
132
133
134
135
136
137                 mpiperfdeviation
138                           Verifies  latency  and  bandwidth  for  each   host
139                           against a defined threshold (or relative to average
140                           host performance).
141
142
143

Example

145       ethhostadmin -c reboot
146
147       ethhostadmin upgrade
148
149       ethhostadmin -h 'elrond arwen' reboot
150
151       HOSTS='elrond arwen' ethhostadmin reboot
152

Details

154       ethhostadmin provides detailed logging of its results. During each run,
155       the following files are produced:
156
157       •      test.res : Appended with summary results of run.
158
159       •      test.log : Appended with detailed results of run.
160
161       •      save_tmp/  :  Contains a directory per failed test with detailed
162              logs.
163
164       •      test_tmp*/ : Intermediate result files while test is running.
165
166       The -c option removes all log files.
167
168       Results from ethhostadmin are grouped into test suites, test cases, and
169       test items. A given run of ethhostadmin represents a single test suite.
170       Within a test suite, multiple test cases occur; typically one test case
171       per  host  being  operated  on. Some of the more complex operations may
172       have multiple test items per test case. Each test item represents a ma‐
173       jor step in the overall test case.
174
175       Each  ethhostadmin  run  appends  to test.res and test.log, and creates
176       temporary files in test_tmp$PID in the current directory. test.res pro‐
177       vides an overall summary of operations performed and their results. The
178       same information is also displayed  while  ethhostadmin  is  executing.
179       test.log  contains  detailed  information about what was performed, in‐
180       cluding the specific commands executed and the  resulting  output.  The
181       test_tmp  directories  contain  temporary  files which reflect tests in
182       progress (or killed). The logs for  any  failures  are  logged  in  the
183       save_temp  directory with a directory per failed test case. If the same
184       test case fails more than once, save_temp retains the information  from
185       the  first  failure.  Subsequent  runs  of ethhostadmin are appended to
186       test.log. Intel recommends reviewing failures and using the  -c  option
187       to remove old logs before subsequent runs of ethhostadmin.
188
189       ethhostadmin  implicitly  performs its operations in parallel. However,
190       as for the other tools, FF_MAX_PARALLEL can be exported to  change  the
191       degree of parallelism. 1000 parallel operations is the default.
192

Environment Variables

194       The following environment variables are also used by this command:
195
196       HOSTS
197
198                 List of hosts, used if -h option not supplied.
199
200
201       HOSTS_FILE
202
203                 File containing list of hosts, used in absence of -f and -h.
204
205
206       FF_MAX_PARALLEL
207
208                 Maximum concurrent operations are performed.
209
210
211       FF_SERIALIZE_OUTPUT
212
213                 Serialize output of parallel operations (yes or no).
214
215
216       FF_TIMEOUT_MULT
217
218                 Multiplier  for  all  timeouts  associated with this command.
219                 Used if the systems are slow for some reason.
220
221
222

ethhostadmin Operation Details

224       (Host) Intel recommends that you set up password SSH  or  SCP  for  use
225       during  this operation. Alternatively, the -S option can be used to se‐
226       curely prompt for a password, in which case the same password  is  used
227       for  all hosts. Alternately, the password may be put in the environment
228       or the ethfastfabric.conf file using FF_PASSWORD and FF_ROOTPASS.
229
230       load
231
232                 Performs an initial installation of Intel(R) Ethernet  Fabric
233                 Suite Software on a group of hosts. Any existing installation
234                 is uninstalled and existing configuration files are  removed.
235                 Subsequently, the hosts are installed with a default Intel(R)
236                 Ethernet Fabric Suite Software configuration. The  -I  option
237                 can  be used to select different install packages. Default is
238                 eth_tools eth_rdma mpi The -r option can be used to specify a
239                 release to install other than the one that this host is pres‐
240                 ently running. The  FF_PRODUCT.  FF_PRODUCT_VERSION.tgz  file
241                 (for example, IntelEth-Basic. version.tgz) is expected to ex‐
242                 ist in the directory specified by -d. Default is the  current
243                 working  directory.  The  specified software is copied to all
244                 the selected hosts and installed.
245
246
247       upgrade
248
249                 Upgrades all selected hosts without modifying  existing  con‐
250                 figurations.  This  operation  is comparable to the -U option
251                 when running ./INSTALL manually. The -r option can be used to
252                 upgrade to a release different from this host. The default is
253                 to upgrade to the same release as this host. The  FF_PRODUCT.
254                 FF_PRODUCT_VERSION.tgz  file  (for  example,  IntelEth-Basic.
255                 version.tgz) is expected to exist in the directory  specified
256                 by  -d.  The  default  is  the current working directory. The
257                 specified software is copied to all the  end  nodes  and  in‐
258                 stalled.
259
260
261
262
263       NOTE:  Only  components that are currently installed are upgraded. This
264       operation fails for hosts that do not  have  Intel(R)  Ethernet  Fabric
265       Suite Software installed.
266
267
268
269       reboot
270
271                 Reboots  the  given  hosts  and ensures they go down and come
272                 back up by pinging them during the reboot process.  The  ping
273                 rate  is slow (5 seconds), so if the servers boot faster than
274                 this, false failures may be seen.
275
276
277       rping
278
279                 Verifies RDMA basic operation by ensuring that the nodes  can
280                 ping  each  other through RDMA. To run this command, Intel(R)
281                 Ethernet Fabric software must be installed, RDMA must be con‐
282                 figured  and  running  on  the host, and the given hosts, and
283                 switches must be up.
284
285
286       pfctest
287
288                 Empirical test which verifies PFC is working  right.  To  run
289                 this  command,  Intel(R) Ethernet Fabric software must be in‐
290                 stalled, PFC must be configured on both hosts  and  switches,
291                 and the given hosts and switches must be up.
292
293
294       mpiperf
295
296                 Verifies  that  MPI  is operational and checks MPI end-to-end
297                 latency and bandwidth between pairs of  nodes  (for  example,
298                 1-2,  3-4,  5-6). Use this to verify switch latency/hops, PCI
299                 bandwidth, and overall MPI  performance.  The  test.res  file
300                 contains the results of each pair of nodes tested.
301
302
303
304
305
306       NOTE:  This option is available for the Intel(R) Ethernet Host Software
307       OFA Delta packaging, but is not presently available for  other  packag‐
308       ings of OFED.
309
310
311
312              To  obtain  accurate  results, this test should be run at a time
313              when no other stressful applications (for example, MPI  jobs  or
314              high  stress  file  system  operations) are running on the given
315              hosts.
316
317              Bandwidth issues typically indicate server configuration  issues
318              (for  example,  incorrect slot used, incorrect BIOS settings, or
319              incorrect NIC model), or fabric issues (for example, symbol  er‐
320              rors,  incorrect  link  width, or speed). Assuming ethreport has
321              previously been used to check for link errors and link speed is‐
322              sues, the server configuration should be verified.
323
324              Note  that  BIOS  settings and differences between server models
325              can account for 10-20% differences in bandwidth.  For  more  de‐
326              tails  about  BIOS  settings, consult the documentation from the
327              server supplier and/or the server PCI chipset manufacturer.
328
329       mpiperfdeviation
330
331                 Specifies the enhanced version of mpiperf that  verifies  MPI
332                 performance.  Can  be used to verify switch latency/hops, PCI
333                 bandwidth, and overall MPI performance. It performs  assorted
334                 pair-wise bandwidth and latency tests, and reports pairs out‐
335                 side an acceptable tolerance range. The tool identifies  spe‐
336                 cific nodes that have problems and provides a concise summary
337                 of results. The test.res file contains the  results  of  each
338                 pair of nodes tested.
339
340
341                 By  default,  concurrent  mode is used to quickly analyze the
342                 fabric and host performance. Pairs that have 20%  less  band‐
343                 width  or 50% more latency than the average pair are reported
344                 as failures.
345
346                 The tool can be run in a sequential or a concurrent mode. Se‐
347                 quential mode runs each host against a reference host. By de‐
348                 fault, the reference host is selected based on the best  per‐
349                 formance  from a quick test of the first 40 hosts. In concur‐
350                 rent mode, hosts are paired up and all pairs are run  concur‐
351                 rently.  Since  there  may be fabric contention during such a
352                 run, any poor performing pairs are  then  rerun  sequentially
353                 against the reference host.
354
355                 Concurrent  mode  runs  the  tests  in the shortest amount of
356                 time, however, the results could be  slightly  less  accurate
357                 due  to  switch  contention. In heavily oversubscribed fabric
358                 designs, if concurrent mode  is  producing  unexpectedly  low
359                 performance, try sequential mode.
360
361
362
363
364       NOTE:  This option is available for the Intel(R) Ethernet Host Software
365       OFA Delta packaging, but is not presently available for  other  packag‐
366       ings of OFED.
367
368
369
370              To  obtain  accurate  results, this test should be run at a time
371              when no other stressful applications  (for  example,  MPI  jobs,
372              high  stress  file  system  operations) are running on the given
373              hosts.
374
375              Bandwidth issues typically indicate server configuration  issues
376              (for  example,  incorrect slot used, incorrect BIOS settings, or
377              incorrect NIC model), or fabric issues (for example, symbol  er‐
378              rors,  incorrect  link  width, or speed). Assuming ethreport has
379              previously been used to check for link errors and link speed is‐
380              sues, the server configuration should be verified.
381
382              Note  that  BIOS  settings and differences between server models
383              can account for 10-20% differences in bandwidth. A result  5-10%
384              below  the average is typically not cause for serious alarm, but
385              may reflect limitations in the server design or the chosen  BIOS
386              settings.
387
388              For  more details about BIOS settings, consult the documentation
389              from the server supplier and/or the server PCI chipset  manufac‐
390              turer.
391
392              The  deviation application supports a number of parameters which
393              allow for more precise control  over  the  mode,  benchmark  and
394              pass/fail  criteria. The parameters to use can be selected using
395              the FF_DEVIATION_ARGS  configuration  parameter  in  ethfastfab‐
396              ric.conf
397
398              Available parameters for deviation application:
399
400              [-bwtol bwtol] [-bwdelta MBs] [-bwthres MBs]
401
402              [-bwloop count] [-bwsize size] [-lattol latol]
403
404              [-latdelta usec] [-latthres usec] [-latloop count]
405
406              [-latsize size][-c] [-b] [-v] [-vv]
407
408              [-h reference_host]
409
410
411
412
413
414              -bwtol    Specifies the percent of bandwidth degradation allowed
415                        below average value.
416
417
418
419
420
421              -bwbidir  Performs a bidirectional bandwidth test.
422
423
424
425
426
427              -bwunidir Performs a unidirectional bandwidth test (Default).
428
429
430
431
432
433              -bwdelta  Specifies the limit in MB/s of  bandwidth  degradation
434                        allowed below average value.
435
436
437
438
439
440              -bwthres  Specifies  the  lower  limit  in MB/s of bandwidth al‐
441                        lowed.
442
443
444
445
446
447              -bwloop   Specifies the number of loops to  execute  each  band‐
448                        width test.
449
450
451
452
453
454              -bwsize   Specifies  the  size  of  message to use for bandwidth
455                        test.
456
457
458
459
460
461              -lattol   Specifies the percent of latency  degradation  allowed
462                        above average value.
463
464
465
466
467
468              -latdelta Specifies the imit in &#181;sec of latency degradation
469                        allowed above average value.
470
471
472
473
474
475              -latthres Specifies the lower limit in &#181;sec of latency  al‐
476                        lowed.
477
478
479
480
481
482              -latloop  Specifies  the number of loops to execute each latency
483                        test.
484
485
486
487
488
489              -latsize  Specifies the size of message to use for latency test.
490
491
492
493
494
495              -c        Runs test pairs concurrently instead of the default of
496                        sequential.
497
498
499
500
501
502              -b        When  comparing  results  against tolerance and delta,
503                        uses best instead of average.
504
505
506
507
508
509              -v        Specifies the verbose output.
510
511
512
513
514
515              -vv       Specifies the very verbose output.
516
517
518
519
520
521              -h        Specifies the reference host  to  use  for  sequential
522                        pairing.
523
524
525
526              Both bwtol and bwdelta must be exceeded to fail bandwidth test.
527
528              When bwthres is supplied, bwtol and bwdelta are ignored.
529
530              Both lattol and latdelta must be exceeded to fail latency test.
531
532              When latthres is supplied, lattol and latdelta are ignored.
533
534              For  consistency with OSU benchmarks, MB/s is defined as 1000000
535              bytes/s.
536
537
538
539Copyright(C) 2020-2021         Intel Corporation               ethhostadmin(8)
Impressum