1
2ethfabricanalysis(8) EFSFFCLIRG (Man Page) ethfabricanalysis(8)
3
4
5
7 ethfabricanalysis
8
9
10
11 Performs analysis of the fabric.
12
14 ethfabricanalysis [-b|-e] [-s] [-d dir] [-c file] [-E file] [-p
15 planes] [-T topology_inputs] [-f host_files]
16
18 --help
19
20 Produces full help text.
21
22 -b
23
24 Specifies the baseline mode. Default is compare/check mode.
25
26 -e
27
28 Evaluates health only. Default is compare/check mode.
29
30 -s
31
32 Saves history of failures (errors/differences).
33
34 -d dir
35
36 Specifies the top-level directory for saving baseline and
37 history of failed checks. Default is /var/usr/lib/eth-
38 tools/analysis
39
40 -c file
41
42 Specifies the error thresholds config file.Default is
43 /etc/eth-tools/ethmon.conf
44
45 -E file
46
47 Specifies Ethernet Mgt configuration file. The default is
48 /etc/eth-tools/mgt_config.xml.
49
50
51 -p planes
52
53 Specifies Fabric planes separated by space. The default is
54 the first enabled plane defined in config file. Value 'ALL'
55 will use all enabled planes.
56
57
58 -f host_files
59
60 Hosts files separated by space. It overrides the HostsFiles
61 defined in Mgt config file for the corresponding planes.
62 Value 'DEFAULT' will use the HostFile defined in Mgt config
63 file for the corresponding plane
64
65
66 -T topology_inputs
67
68 Specifies the name of topology input filenames separated by
69 space. See Details and ethreport for more information.
70
72 ethfabricanalysis
73
74 ethfabricanalysis -p 'p1 p2' -f 'hosts1 DEFAULT'
75
76 The fabric analysis tool checks the following:
77
78 • Fabric links (both internal to switch and external cables)
79
80 • Fabric components (nodes, links, systems, and their configura‐
81 tion)
82
83 • Fabric error counters and link speed mismatches
84
85 NOTE: The comparison includes components on the fabric. Therefore, op‐
86 erations such as shutting down a server cause the server to no longer
87 appear on the fabric and are flagged as a fabric change or failure by
88 ethfabricanalysis.
89
90
92 The following environment variables are also used by this command:
93
94 FF_ANALYSIS_DIR
95
96 Top-level directory for baselines and failed health checks.
97
99 You can specify the topology_input file to be used with one of the fol‐
100 lowing methods:
101
102 • On the command line using the -T option.
103
104 • Using the TopologyFile specified in Ethernet Mgt config file.
105
106 If the specified file does not exist, no topology_input file is used.
107
108 For more information on topology_input, refer to ethreport
109
110 By default, the error analysis includes counters and slow links (that
111 is, links running below enabled speeds). You can change this using the
112 FF_FABRIC_HEALTH configuration parameter in ethfastfabric.conf. This
113 parameter specifies the ethreport options and reports to be used for
114 the health analysis.
115
116 When a topology_input file is used, it can also be useful to extend
117 FF_FABRIC_HEALTH to include fabric topology verification options such
118 as -o verifylinks.
119
120 The thresholds for counter analysis default to /etc/eth-tools/eth‐
121 mon.conf. However, you can specify an alternate configuration file for
122 thresholds using the -c option. The ethmon.si.conf file can also be
123 used to check for any non-zero values for signal integrity (SI) coun‐
124 ters.
125
126 All files generated by ethfabricanalysis start with fabric in their
127 file name.
128
129 The ethfabricanalysis tool generates files such as the following within
130 FF_ANALYSIS_DIR :
131
132 Health Check
133
134
135 • latest/fabric.<plane_name>.errors stdout of ethreport for errors
136 encountered during fabric error analysis.
137
138
139 • latest/fabric.<plane_name>.errors.stderr stderr of ethreport
140 during fabric error analysis.
141
142
143 Baseline
144
145
146 During a baseline run, the following files are also created in
147 FF_ANALYSIS_DIR/latest.
148
149 • baseline/fabric.<plane_name>.snapshot.xml ethreport snapshot of
150 complete fabric components and configuration.
151
152
153 • baseline/fabric.<plane_name>.comps ethreport summary of fabric
154 components and basic configuration.
155
156
157 • baseline/fabric.<plane_name>.links ethreport summary of internal
158 and external links.
159
160
161 Full Analysis
162
163
164 • latest/fabric.<plane_name>.snapshot.xml ethreport snapshot of
165 complete fabric components and configuration.
166
167
168 • latest/fabric.<plane_name>.snapshot.stderr stderr of ethreport
169 during snapshot.
170
171
172 • latest/fabric.<plane_name>.errors stdout of ethreport for errors
173 encountered during fabric error analysis.
174
175
176 • latest/fabric.<plane_name>.errors.stderr stderr of ethreport
177 during fabric error analysis.
178
179
180 • latest/fabric.<plane_name>.comps stdout of ethreport for fabric
181 components and configuration.
182
183
184 • latest/fabric.<plane_name>.comps.stderr stderr of ethreport for
185 fabric components.
186
187
188 • latest/fabric.<plane_name>.comps.diff diff of baseline and lat‐
189 est fabric components.
190
191
192 • latest/fabric.<plane_name>.links stdout of ethreport summary of
193 internal and external links.
194
195
196 • latest/fabric.<plane_name>.links.stderr stderr of ethreport sum‐
197 mary of internal and external links.
198
199
200 • latest/fabric.<plane_name>.links.diff diff of baseline and lat‐
201 est fabric internal and external links.
202
203
204 • latest/fabric.<plane_name>.links.changes.stderr stderr of ethre‐
205 port comparison of links.
206
207
208 • latest/fabric.<plane_name>.links.changes ethreport comparison of
209 links against baseline. This is typically easier to read than
210 the links.diff file and contains the same information.
211
212
213 • latest/fabric.<plane_name>.comps.changes.stderr stderr of ethre‐
214 port comparison of components.
215
216
217 • latest/fabric.<plane_name>.comps.changes ethreport comparison of
218 components against baseline. This is typically easier to read
219 than the comps.diff file and contains the same information.
220
221
222 The .diff and .changes files are only created if differences are de‐
223 tected.
224
225 If the -s option is used and failures are detected, files related to
226 the checks that failed are also copied to the time-stamped directory
227 name under FF_ANALYSIS_DIR.
228
230 Based on ethreport -o links:
231
232 • Unconnected/down/missing cables
233
234 • Added/moved cables
235
236 • Changes in link width and speed
237
238 • Changes to IfAddr in fabric (replacement of NIC or Switch hard‐
239 ware)
240
241 • Adding/Removing Nodes [NIC, Virtual NICs, Virtual Switches,
242 Physical Switches, Physical Switch internal switching cards
243 (leaf/spine)]
244
245 • Changes to server or switch names
246
247 Based on ethreport -o comps:
248
249 • Overlap with items from links report
250
251 • Changes in port MTU
252
253 • Changes in port speed/width enabled or supported
254
255 • Changes in NIC or switch device IDs/revisions/VendorID (for ex‐
256 ample, ASIC hardware changes)
257
258 • Changes in port Capability mask (which features/agents run on
259 port/server)
260
261 • Changes to IOUs/IOCs/IOC Services provided
262
263
264
266 Based on ethreport -s -o errors -o slowlinks:
267
268 • error counters on all Intel(R) Ethernet Fabric ports (NIC,
269 switch external and switch internal) checked against config‐
270 urable thresholds.
271
272 • Typically identifies potential fabric errors, such as symbol er‐
273 rors.
274
275 • May also identify transient congestion, depending on the coun‐
276 ters that are monitored.
277
278 • Link active speed/width as compared to Enabled speed.
279
280 • Identifies links whose active speed/width is < min (enabled
281 speed/width on each side of link).
282
283 • This typically reflects bad cables or bad ports or poor connec‐
284 tions.
285
286 • Side effect is the verification of fabric health.
287
288
289
290Copyright(C) 2020-2022 Intel Corporation ethfabricanalysis(8)