1
2ethhostadmin(8) EFSFFCLIRG (Man Page) ethhostadmin(8)
3
4
5
7 ethhostadmin
8
9
10
11 Performs a number of multi-step host initialization and verification
12 operations, including upgrading software, rebooting hosts, and other
13 operations. In general, operations performed by ethhostadmin involve a
14 login to one or more host systems.
15
17 ethhostadmin [-c] [-f hostfile] [-h 'hosts']
18 [-r release] [-I install_options] [-U upgrade_options] [-d dir]
19 [-T product] [-P packages] [-S] operation ...
20
22 --help
23
24 Produces full help text.
25
26
27 -c
28
29 Overwrites the result files from any previous run before
30 starting this run.
31
32
33 -f hostfile
34
35 Specifies the file with the names of hosts in a cluster. De‐
36 fault is /etc/eth-tools/hosts file.
37
38
39 -h hosts
40
41 Specifies the list of hosts to execute the operation against.
42
43
44 -r release
45
46 Specifies the software version to load/upgrade to. Default is
47 the version of Intel(R) Ethernet Fabric Suite Software pres‐
48 ently being run on the server.
49
50
51 -d dir
52
53 Specifies the directory to retrieve product. release.tgz for
54 load or upgrade.
55
56
57 -I install_options
58
59 Specifies the software install options.
60
61
62 -U upgrade_options
63
64 Specifies the software upgrade options.
65
66
67 -T product
68
69 Specifies the product type to install. Options include:
70
71
72
73 • IntelEth-Basic. <distro> (default)
74
75 • IntelEth-FS. <distro>
76
77
78
79 where <distro> is the distribution and CPU, such as
80 RHEL81-x86_64.
81
82 -P packages
83
84 Specifies the packages to install. Default is eth eth_rdma.
85 Refer to INSTALL -C for compete list of packages.
86
87
88 -S
89
90 Securely prompts for user password on remote system.
91
92
93 operation
94
95 Performs the specified operation, which can be one or more of
96 the following:
97
98
99
100
101 load Starts initial installation of all hosts.
102
103
104
105
106
107 upgrade Upgrades installation of all hosts.
108
109
110
111
112
113 reboot Reboots hosts, ensures they go down and come back.
114
115
116
117
118
119 rping Verifies this host can ping each host through RDMA.
120
121
122
123
124
125 pfctest Verifies PFC works on all hosts.
126
127
128
129
130
131 mpiperf Verifies latency and bandwidth for each host.
132
133
134
135
136
137 mpiperfdeviation
138 Verifies latency and bandwidth for each host
139 against a defined threshold (or relative to average
140 host performance).
141
142
143
145 ethhostadmin -c reboot
146
147 ethhostadmin upgrade
148
149 ethhostadmin -h 'elrond arwen' reboot
150
151 HOSTS='elrond arwen' ethhostadmin reboot
152
154 ethhostadmin provides detailed logging of its results. During each run,
155 the following files are produced:
156
157 • test.res : Appended with summary results of run.
158
159 • test.log : Appended with detailed results of run.
160
161 • save_tmp/ : Contains a directory per failed test with detailed
162 logs.
163
164 • test_tmp*/ : Intermediate result files while test is running.
165
166 The -c option removes all log files.
167
168 Results from ethhostadmin are grouped into test suites, test cases, and
169 test items. A given run of ethhostadmin represents a single test suite.
170 Within a test suite, multiple test cases occur; typically one test case
171 per host being operated on. Some of the more complex operations may
172 have multiple test items per test case. Each test item represents a ma‐
173 jor step in the overall test case.
174
175 Each ethhostadmin run appends to test.res and test.log, and creates
176 temporary files in test_tmp$PID in the current directory. test.res pro‐
177 vides an overall summary of operations performed and their results. The
178 same information is also displayed while ethhostadmin is executing.
179 test.log contains detailed information about what was performed, in‐
180 cluding the specific commands executed and the resulting output. The
181 test_tmp directories contain temporary files which reflect tests in
182 progress (or killed). The logs for any failures are logged in the
183 save_temp directory with a directory per failed test case. If the same
184 test case fails more than once, save_temp retains the information from
185 the first failure. Subsequent runs of ethhostadmin are appended to
186 test.log. Intel recommends reviewing failures and using the -c option
187 to remove old logs before subsequent runs of ethhostadmin.
188
189 ethhostadmin implicitly performs its operations in parallel. However,
190 as for the other tools, FF_MAX_PARALLEL can be exported to change the
191 degree of parallelism. 1000 parallel operations is the default.
192
194 The following environment variables are also used by this command:
195
196 HOSTS
197
198 List of hosts, used if -h option not supplied.
199
200
201 HOSTS_FILE
202
203 File containing list of hosts, used in absence of -f and -h.
204
205
206 FF_MAX_PARALLEL
207
208 Maximum concurrent operations are performed.
209
210
211 FF_SERIALIZE_OUTPUT
212
213 Serialize output of parallel operations (yes or no).
214
215
216 FF_TIMEOUT_MULT
217
218 Multiplier for all timeouts associated with this command.
219 Used if the systems are slow for some reason.
220
221
222
224 (Host) Intel recommends that you set up password SSH or SCP for use
225 during this operation. Alternatively, the -S option can be used to se‐
226 curely prompt for a password, in which case the same password is used
227 for all hosts. Alternately, the password may be put in the environment
228 or the ethfastfabric.conf file using FF_PASSWORD and FF_ROOTPASS.
229
230 load
231
232 Performs an initial installation of Intel(R) Ethernet Fabric
233 Suite Software on a group of hosts. Any existing installation
234 is uninstalled and existing configuration files are removed.
235 Subsequently, the hosts are installed with a default Intel(R)
236 Ethernet Fabric Suite Software configuration. The -I option
237 can be used to select different install packages. Default is
238 eth_tools eth_rdma mpi The -r option can be used to specify a
239 release to install other than the one that this host is pres‐
240 ently running. The FF_PRODUCT. FF_PRODUCT_VERSION.tgz file
241 (for example, IntelEth-Basic. version.tgz) is expected to ex‐
242 ist in the directory specified by -d. Default is the current
243 working directory. The specified software is copied to all
244 the selected hosts and installed.
245
246
247 upgrade
248
249 Upgrades all selected hosts without modifying existing con‐
250 figurations. This operation is comparable to the -U option
251 when running ./INSTALL manually. The -r option can be used to
252 upgrade to a release different from this host. The default is
253 to upgrade to the same release as this host. The FF_PRODUCT.
254 FF_PRODUCT_VERSION.tgz file (for example, IntelEth-Basic.
255 version.tgz) is expected to exist in the directory specified
256 by -d. The default is the current working directory. The
257 specified software is copied to all the end nodes and in‐
258 stalled.
259
260
261
262
263 NOTE: Only components that are currently installed are upgraded. This
264 operation fails for hosts that do not have Intel(R) Ethernet Fabric
265 Suite Software installed.
266
267
268
269 reboot
270
271 Reboots the given hosts and ensures they go down and come
272 back up by pinging them during the reboot process. The ping
273 rate is slow (5 seconds), so if the servers boot faster than
274 this, false failures may be seen.
275
276
277 rping
278
279 Verifies RDMA basic operation by ensuring that the nodes can
280 ping each other through RDMA. To run this command, Intel(R)
281 Ethernet Fabric software must be installed, RDMA must be con‐
282 figured and running on the host, and the given hosts, and
283 switches must be up.
284
285
286 pfctest
287
288 Empirical test which verifies PFC is working right. To run
289 this command, Intel(R) Ethernet Fabric software must be in‐
290 stalled, PFC must be configured on both hosts and switches,
291 and the given hosts and switches must be up.
292
293
294 mpiperf
295
296 Verifies that MPI is operational and checks MPI end-to-end
297 latency and bandwidth between pairs of nodes (for example,
298 1-2, 3-4, 5-6). Use this to verify switch latency/hops, PCI
299 bandwidth, and overall MPI performance. The test.res file
300 contains the results of each pair of nodes tested.
301
302
303
304
305
306 NOTE: This option is available for the Intel(R) Ethernet Host Software
307 OFA Delta packaging, but is not presently available for other packag‐
308 ings of OFED.
309
310
311
312 To obtain accurate results, this test should be run at a time
313 when no other stressful applications (for example, MPI jobs or
314 high stress file system operations) are running on the given
315 hosts.
316
317 Bandwidth issues typically indicate server configuration issues
318 (for example, incorrect slot used, incorrect BIOS settings, or
319 incorrect NIC model), or fabric issues (for example, symbol er‐
320 rors, incorrect link width, or speed). Assuming ethreport has
321 previously been used to check for link errors and link speed is‐
322 sues, the server configuration should be verified.
323
324 Note that BIOS settings and differences between server models
325 can account for 10-20% differences in bandwidth. For more de‐
326 tails about BIOS settings, consult the documentation from the
327 server supplier and/or the server PCI chipset manufacturer.
328
329 mpiperfdeviation
330
331 Specifies the enhanced version of mpiperf that verifies MPI
332 performance. Can be used to verify switch latency/hops, PCI
333 bandwidth, and overall MPI performance. It performs assorted
334 pair-wise bandwidth and latency tests, and reports pairs out‐
335 side an acceptable tolerance range. The tool identifies spe‐
336 cific nodes that have problems and provides a concise summary
337 of results. The test.res file contains the results of each
338 pair of nodes tested.
339
340
341 By default, concurrent mode is used to quickly analyze the
342 fabric and host performance. Pairs that have 20% less band‐
343 width or 50% more latency than the average pair are reported
344 as failures.
345
346 The tool can be run in a sequential or a concurrent mode. Se‐
347 quential mode runs each host against a reference host. By de‐
348 fault, the reference host is selected based on the best per‐
349 formance from a quick test of the first 40 hosts. In concur‐
350 rent mode, hosts are paired up and all pairs are run concur‐
351 rently. Since there may be fabric contention during such a
352 run, any poor performing pairs are then rerun sequentially
353 against the reference host.
354
355 Concurrent mode runs the tests in the shortest amount of
356 time, however, the results could be slightly less accurate
357 due to switch contention. In heavily oversubscribed fabric
358 designs, if concurrent mode is producing unexpectedly low
359 performance, try sequential mode.
360
361
362
363
364 NOTE: This option is available for the Intel(R) Ethernet Host Software
365 OFA Delta packaging, but is not presently available for other packag‐
366 ings of OFED.
367
368
369
370 To obtain accurate results, this test should be run at a time
371 when no other stressful applications (for example, MPI jobs,
372 high stress file system operations) are running on the given
373 hosts.
374
375 Bandwidth issues typically indicate server configuration issues
376 (for example, incorrect slot used, incorrect BIOS settings, or
377 incorrect NIC model), or fabric issues (for example, symbol er‐
378 rors, incorrect link width, or speed). Assuming ethreport has
379 previously been used to check for link errors and link speed is‐
380 sues, the server configuration should be verified.
381
382 Note that BIOS settings and differences between server models
383 can account for 10-20% differences in bandwidth. A result 5-10%
384 below the average is typically not cause for serious alarm, but
385 may reflect limitations in the server design or the chosen BIOS
386 settings.
387
388 For more details about BIOS settings, consult the documentation
389 from the server supplier and/or the server PCI chipset manufac‐
390 turer.
391
392 The deviation application supports a number of parameters which
393 allow for more precise control over the mode, benchmark and
394 pass/fail criteria. The parameters to use can be selected using
395 the FF_DEVIATION_ARGS configuration parameter in ethfastfab‐
396 ric.conf
397
398 Available parameters for deviation application:
399
400 [-bwtol bwtol] [-bwdelta MBs] [-bwthres MBs]
401
402 [-bwloop count] [-bwsize size] [-lattol latol]
403
404 [-latdelta usec] [-latthres usec] [-latloop count]
405
406 [-latsize size][-c] [-b] [-v] [-vv]
407
408 [-h reference_host]
409
410
411
412
413
414 -bwtol Specifies the percent of bandwidth degradation allowed
415 below average value.
416
417
418
419
420
421 -bwbidir Performs a bidirectional bandwidth test.
422
423
424
425
426
427 -bwunidir Performs a unidirectional bandwidth test (Default).
428
429
430
431
432
433 -bwdelta Specifies the limit in MB/s of bandwidth degradation
434 allowed below average value.
435
436
437
438
439
440 -bwthres Specifies the lower limit in MB/s of bandwidth al‐
441 lowed.
442
443
444
445
446
447 -bwloop Specifies the number of loops to execute each band‐
448 width test.
449
450
451
452
453
454 -bwsize Specifies the size of message to use for bandwidth
455 test.
456
457
458
459
460
461 -lattol Specifies the percent of latency degradation allowed
462 above average value.
463
464
465
466
467
468 -latdelta Specifies the imit in µsec of latency degradation
469 allowed above average value.
470
471
472
473
474
475 -latthres Specifies the lower limit in µsec of latency al‐
476 lowed.
477
478
479
480
481
482 -latloop Specifies the number of loops to execute each latency
483 test.
484
485
486
487
488
489 -latsize Specifies the size of message to use for latency test.
490
491
492
493
494
495 -c Runs test pairs concurrently instead of the default of
496 sequential.
497
498
499
500
501
502 -b When comparing results against tolerance and delta,
503 uses best instead of average.
504
505
506
507
508
509 -v Specifies the verbose output.
510
511
512
513
514
515 -vv Specifies the very verbose output.
516
517
518
519
520
521 -h Specifies the reference host to use for sequential
522 pairing.
523
524
525
526 Both bwtol and bwdelta must be exceeded to fail bandwidth test.
527
528 When bwthres is supplied, bwtol and bwdelta are ignored.
529
530 Both lattol and latdelta must be exceeded to fail latency test.
531
532 When latthres is supplied, lattol and latdelta are ignored.
533
534 For consistency with OSU benchmarks, MB/s is defined as 1000000
535 bytes/s.
536
537
538
539Copyright(C) 2020-2021 Intel Corporation ethhostadmin(8)