1
2ethhostadmin(8) EFSFFCLIRG (Man Page) ethhostadmin(8)
3
4
5
7 ethhostadmin
8
9
10
11 Performs a number of multi-step host initialization and verification
12 operations, including upgrading software, rebooting hosts, and other
13 operations. In general, operations performed by ethhostadmin involve a
14 login to one or more host systems.
15
17 ethhostadmin [-c] [-f hostfile] [-h 'hosts']
18 [-r release] [-I install_options] [-U upgrade_options] [-d dir]
19 [-T product] [-P packages] [-S] operation ...
20
22 --help
23
24 Produces full help text.
25
26
27 -c
28
29 Overwrites the result files from any previous run before
30 starting this run.
31
32
33 -f hostfile
34
35 Specifies the file with the names of hosts in a cluster. De‐
36 fault is /etc/eth-tools/hosts file.
37
38
39 -h hosts
40
41 Specifies the list of hosts to execute the operation against.
42
43
44 -r release
45
46 Specifies the software version to load/upgrade to. Default is
47 the version of Intel(R) Ethernet Fabric Suite Software pres‐
48 ently being run on the server.
49
50
51 -d dir
52
53 Specifies the directory to retrieve product. release.tgz for
54 load or upgrade.
55
56
57 -I install_options
58
59 Specifies the software install options.
60
61
62 -U upgrade_options
63
64 Specifies the software upgrade options.
65
66
67 -T product
68
69 Specifies the product type to install. Options include:
70
71
72
73 • IntelEth-Basic. <distro> (default)
74
75 • IntelEth-FS. <distro>
76
77
78
79 where <distro> is the distribution and CPU, such as
80 RHEL81-x86_64.
81
82 -P packages
83
84 Specifies the packages to install. Default is eth_tools
85 eth_rdma mpi. Refer to INSTALL -C for compete list of pack‐
86 ages.
87
88
89 -S
90
91 Securely prompts for user password on remote system.
92
93
94 operation
95
96 Performs the specified operation, which can be one or more of
97 the following:
98
99
100
101
102 load Starts initial installation of all hosts.
103
104
105
106
107
108 upgrade Upgrades installation of all hosts.
109
110
111
112
113
114 reboot Reboots hosts, ensures they go down and come back.
115
116
117
118
119
120 rping Verifies this host can ping each host through RDMA.
121
122
123
124
125
126 mpiperf Verifies latency and bandwidth for each host.
127
128
129
130
131
132 mpiperfdeviation
133 Verifies latency and bandwidth for each host
134 against a defined threshold (or relative to average
135 host performance).
136
137
138
140 ethhostadmin -c reboot
141
142 ethhostadmin upgrade
143
144 ethhostadmin -h 'elrond arwen' reboot
145
146 HOSTS='elrond arwen' ethhostadmin reboot
147
149 ethhostadmin provides detailed logging of its results. During each run,
150 the following files are produced:
151
152 • test.res : Appended with summary results of run.
153
154 • test.log : Appended with detailed results of run.
155
156 • save_tmp/ : Contains a directory per failed test with detailed
157 logs.
158
159 • test_tmp*/ : Intermediate result files while test is running.
160
161 The -c option removes all log files.
162
163 Results from ethhostadmin are grouped into test suites, test cases, and
164 test items. A given run of ethhostadmin represents a single test suite.
165 Within a test suite, multiple test cases occur; typically one test case
166 per host being operated on. Some of the more complex operations may
167 have multiple test items per test case. Each test item represents a ma‐
168 jor step in the overall test case.
169
170 Each ethhostadmin run appends to test.res and test.log, and creates
171 temporary files in test_tmp$PID in the current directory. test.res pro‐
172 vides an overall summary of operations performed and their results. The
173 same information is also displayed while ethhostadmin is executing.
174 test.log contains detailed information about what was performed, in‐
175 cluding the specific commands executed and the resulting output. The
176 test_tmp directories contain temporary files which reflect tests in
177 progress (or killed). The logs for any failures are logged in the
178 save_temp directory with a directory per failed test case. If the same
179 test case fails more than once, save_temp retains the information from
180 the first failure. Subsequent runs of ethhostadmin are appended to
181 test.log. Intel recommends reviewing failures and using the -c option
182 to remove old logs before subsequent runs of ethhostadmin.
183
184 ethhostadmin implicitly performs its operations in parallel. However,
185 as for the other tools, FF_MAX_PARALLEL can be exported to change the
186 degree of parallelism. 1000 parallel operations is the default.
187
189 The following environment variables are also used by this command:
190
191 HOSTS
192
193 List of hosts, used if -h option not supplied.
194
195
196 HOSTS_FILE
197
198 File containing list of hosts, used in absence of -f and -h.
199
200
201 FF_MAX_PARALLEL
202
203 Maximum concurrent operations are performed.
204
205
206 FF_SERIALIZE_OUTPUT
207
208 Serialize output of parallel operations (yes or no).
209
210
211 FF_TIMEOUT_MULT
212
213 Multiplier for all timeouts associated with this command.
214 Used if the systems are slow for some reason.
215
216
217
219 (Host) Intel recommends that you set up password SSH or SCP for use
220 during this operation. Alternatively, the -S option can be used to se‐
221 curely prompt for a password, in which case the same password is used
222 for all hosts. Alternately, the password may be put in the environment
223 or the ethfastfabric.conf file using FF_PASSWORD and FF_ROOTPASS.
224
225 load
226
227 Performs an initial installation of Intel(R) Ethernet Fabric
228 Suite Software on a group of hosts. Any existing installation
229 is uninstalled and existing configuration files are removed.
230 Subsequently, the hosts are installed with a default Intel(R)
231 Ethernet Fabric Suite Software configuration. The -I option
232 can be used to select different install packages. Default is
233 eth_tools eth_rdma mpi The -r option can be used to specify a
234 release to install other than the one that this host is pres‐
235 ently running. The FF_PRODUCT. FF_PRODUCT_VERSION.tgz file
236 (for example, IntelEth-Basic. version.tgz) is expected to ex‐
237 ist in the directory specified by -d. Default is the current
238 working directory. The specified software is copied to all
239 the selected hosts and installed.
240
241
242 upgrade
243
244 Upgrades all selected hosts without modifying existing con‐
245 figurations. This operation is comparable to the -U option
246 when running ./INSTALL manually. The -r option can be used to
247 upgrade to a release different from this host. The default is
248 to upgrade to the same release as this host. The FF_PRODUCT.
249 FF_PRODUCT_VERSION.tgz file (for example, IntelEth-Basic.
250 version.tgz) is expected to exist in the directory specified
251 by -d. The default is the current working directory. The
252 specified software is copied to all the end nodes and in‐
253 stalled.
254
255
256
257
258 NOTE: Only components that are currently installed are upgraded. This
259 operation fails for hosts that do not have Intel(R) Ethernet Fabric
260 Suite Software installed.
261
262
263
264 reboot
265
266 Reboots the given hosts and ensures they go down and come
267 back up by pinging them during the reboot process. The ping
268 rate is slow (5 seconds), so if the servers boot faster than
269 this, false failures may be seen.
270
271
272 rping
273
274 Verifies RDMA basic operation by ensuring that the nodes can
275 ping each other through RDMA. To run this command, Intel(R)
276 Ethernet Fabric software must be installed, RDMA must be con‐
277 figured and running on the host, and the given hosts, and
278 switches must be up.
279
280
281 mpiperf
282
283 Verifies that MPI is operational and checks MPI end-to-end
284 latency and bandwidth between pairs of nodes (for example,
285 1-2, 3-4, 5-6). Use this to verify switch latency/hops, PCI
286 bandwidth, and overall MPI performance. The test.res file
287 contains the results of each pair of nodes tested.
288
289
290
291
292
293 NOTE: This option is available for the Intel(R) Ethernet Host Software
294 OFA Delta packaging, but is not presently available for other packag‐
295 ings of OFED.
296
297
298
299 To obtain accurate results, this test should be run at a time
300 when no other stressful applications (for example, MPI jobs or
301 high stress file system operations) are running on the given
302 hosts.
303
304 Bandwidth issues typically indicate server configuration issues
305 (for example, incorrect slot used, incorrect BIOS settings, or
306 incorrect NIC model), or fabric issues (for example, symbol er‐
307 rors, incorrect link width, or speed). Assuming ethreport has
308 previously been used to check for link errors and link speed is‐
309 sues, the server configuration should be verified.
310
311 Note that BIOS settings and differences between server models
312 can account for 10-20% differences in bandwidth. For more de‐
313 tails about BIOS settings, consult the documentation from the
314 server supplier and/or the server PCI chipset manufacturer.
315
316 mpiperfdeviation
317
318 Specifies the enhanced version of mpiperf that verifies MPI
319 performance. Can be used to verify switch latency/hops, PCI
320 bandwidth, and overall MPI performance. It performs assorted
321 pair-wise bandwidth and latency tests, and reports pairs out‐
322 side an acceptable tolerance range. The tool identifies spe‐
323 cific nodes that have problems and provides a concise summary
324 of results. The test.res file contains the results of each
325 pair of nodes tested.
326
327
328 By default, concurrent mode is used to quickly analyze the
329 fabric and host performance. Pairs that have 20% less band‐
330 width or 50% more latency than the average pair are reported
331 as failures.
332
333 The tool can be run in a sequential or a concurrent mode. Se‐
334 quential mode runs each host against a reference host. By de‐
335 fault, the reference host is selected based on the best per‐
336 formance from a quick test of the first 40 hosts. In concur‐
337 rent mode, hosts are paired up and all pairs are run concur‐
338 rently. Since there may be fabric contention during such a
339 run, any poor performing pairs are then rerun sequentially
340 against the reference host.
341
342 Concurrent mode runs the tests in the shortest amount of
343 time, however, the results could be slightly less accurate
344 due to switch contention. In heavily oversubscribed fabric
345 designs, if concurrent mode is producing unexpectedly low
346 performance, try sequential mode.
347
348
349
350
351 NOTE: This option is available for the Intel(R) Ethernet Host Software
352 OFA Delta packaging, but is not presently available for other packag‐
353 ings of OFED.
354
355
356
357 To obtain accurate results, this test should be run at a time
358 when no other stressful applications (for example, MPI jobs,
359 high stress file system operations) are running on the given
360 hosts.
361
362 Bandwidth issues typically indicate server configuration issues
363 (for example, incorrect slot used, incorrect BIOS settings, or
364 incorrect NIC model), or fabric issues (for example, symbol er‐
365 rors, incorrect link width, or speed). Assuming ethreport has
366 previously been used to check for link errors and link speed is‐
367 sues, the server configuration should be verified.
368
369 Note that BIOS settings and differences between server models
370 can account for 10-20% differences in bandwidth. A result 5-10%
371 below the average is typically not cause for serious alarm, but
372 may reflect limitations in the server design or the chosen BIOS
373 settings.
374
375 For more details about BIOS settings, consult the documentation
376 from the server supplier and/or the server PCI chipset manufac‐
377 turer.
378
379 The deviation application supports a number of parameters which
380 allow for more precise control over the mode, benchmark and
381 pass/fail criteria. The parameters to use can be selected using
382 the FF_DEVIATION_ARGS configuration parameter in ethfastfab‐
383 ric.conf
384
385 Available parameters for deviation application:
386
387 [-bwtol bwtol] [-bwdelta MBs] [-bwthres MBs]
388
389 [-bwloop count] [-bwsize size] [-lattol latol]
390
391 [-latdelta usec] [-latthres usec] [-latloop count]
392
393 [-latsize size][-c] [-b] [-v] [-vv]
394
395 [-h reference_host]
396
397
398
399
400
401 -bwtol Specifies the percent of bandwidth degradation allowed
402 below average value.
403
404
405
406
407
408 -bwbidir Performs a bidirectional bandwidth test.
409
410
411
412
413
414 -bwunidir Performs a unidirectional bandwidth test (Default).
415
416
417
418
419
420 -bwdelta Specifies the limit in MB/s of bandwidth degradation
421 allowed below average value.
422
423
424
425
426
427 -bwthres Specifies the lower limit in MB/s of bandwidth al‐
428 lowed.
429
430
431
432
433
434 -bwloop Specifies the number of loops to execute each band‐
435 width test.
436
437
438
439
440
441 -bwsize Specifies the size of message to use for bandwidth
442 test.
443
444
445
446
447
448 -lattol Specifies the percent of latency degradation allowed
449 above average value.
450
451
452
453
454
455 -latdelta Specifies the imit in µsec of latency degradation
456 allowed above average value.
457
458
459
460
461
462 -latthres Specifies the lower limit in µsec of latency al‐
463 lowed.
464
465
466
467
468
469 -latloop Specifies the number of loops to execute each latency
470 test.
471
472
473
474
475
476 -latsize Specifies the size of message to use for latency test.
477
478
479
480
481
482 -c Runs test pairs concurrently instead of the default of
483 sequential.
484
485
486
487
488
489 -b When comparing results against tolerance and delta,
490 uses best instead of average.
491
492
493
494
495
496 -v Specifies the verbose output.
497
498
499
500
501
502 -vv Specifies the very verbose output.
503
504
505
506
507
508 -h Specifies the reference host to use for sequential
509 pairing.
510
511
512
513 Both bwtol and bwdelta must be exceeded to fail bandwidth test.
514
515 When bwthres is supplied, bwtol and bwdelta are ignored.
516
517 Both lattol and latdelta must be exceeded to fail latency test.
518
519 When latthres is supplied, lattol and latdelta are ignored.
520
521 For consistency with OSU benchmarks, MB/s is defined as 1000000
522 bytes/s.
523
524
525
526Copyright(C) 2020 Intel Corporation ethhostadmin(8)