1LLVM-PROFDATA(1) LLVM LLVM-PROFDATA(1)
2
3
4
6 llvm-profdata - Profile data tool
7
9 llvm-profdata command [args...]
10
12 The llvm-profdata tool is a small utility for working with profile data
13 files.
14
16 • merge
17
18 • show
19
20 • overlap
21
23 SYNOPSIS
24 llvm-profdata merge [options] [filename...]
25
26 DESCRIPTION
27 llvm-profdata merge takes several profile data files generated by PGO
28 instrumentation and merges them together into a single indexed profile
29 data file.
30
31 By default profile data is merged without modification. This means that
32 the relative importance of each input file is proportional to the num‐
33 ber of samples or counts it contains. In general, the input from a
34 longer training run will be interpreted as relatively more important
35 than a shorter run. Depending on the nature of the training runs it may
36 be useful to adjust the weight given to each input file by using the
37 -weighted-input option.
38
39 Profiles passed in via -weighted-input, -input-files, or via positional
40 arguments are processed once for each time they are seen.
41
42 OPTIONS
43 --help Print a summary of command line options.
44
45 --output=<output>, -o
46 Specify the output file name. Output cannot be - as the result‐
47 ing indexed profile data can't be written to standard output.
48
49 --weighted-input=<weight,filename>
50 Specify an input file name along with a weight. The profile
51 counts of the supplied filename will be scaled (multiplied) by
52 the supplied weight, where weight is a decimal integer >= 1.
53 Input files specified without using this option are assigned a
54 default weight of 1. Examples are shown below.
55
56 --input-files=<path>, -f
57 Specify a file which contains a list of files to merge. The en‐
58 tries in this file are newline-separated. Lines starting with
59 '#' are skipped. Entries may be of the form <filename> or
60 <weight>,<filename>.
61
62 --remapping-file=<path>, -r
63 Specify a file which contains a remapping from symbol names in
64 the input profile to the symbol names that should be used in the
65 output profile. The file should consist of lines of the form
66 <input-symbol> <output-symbol>. Blank lines and lines starting
67 with # are skipped.
68
69 The llvm-cxxmap tool can be used to generate the symbol remap‐
70 ping file.
71
72 --instr (default)
73 Specify that the input profile is an instrumentation-based pro‐
74 file.
75
76 --sample
77 Specify that the input profile is a sample-based profile.
78
79 The format of the generated file can be generated in one of
80 three ways:
81
82 --binary (default)
83
84 Emit the profile using a binary encoding. For instrumenta‐
85 tion-based profile the output format is the indexed binary for‐
86 mat.
87
88 --extbinary
89
90 Emit the profile using an extensible binary encoding. This op‐
91 tion can only be used with sample-based profile. The extensible
92 binary encoding can be more compact with compression enabled and
93 can be loaded faster than the default binary encoding.
94
95 --text
96
97 Emit the profile in text mode. This option can also be used with
98 both sample-based and instrumentation-based profile. When this
99 option is used the profile will be dumped in the text format
100 that is parsable by the profile reader.
101
102 --gcc
103
104 Emit the profile using GCC's gcov format (Not yet supported).
105
106 --sparse[=true|false]
107 Do not emit function records with 0 execution count. Can only be
108 used in conjunction with -instr. Defaults to false, since it can
109 inhibit compiler optimization during PGO.
110
111 --num-threads=<N>, -j
112 Use N threads to perform profile merging. When N=0, llvm-prof‐
113 data auto-detects an appropriate number of threads to use. This
114 is the default.
115
116 --failure-mode=[any|all]
117 Set the failure mode. There are two options: 'any' causes the
118 merge command to fail if any profiles are invalid, and 'all'
119 causes the merge command to fail only if all profiles are in‐
120 valid. If 'all' is set, information from any invalid profiles is
121 excluded from the final merged product. The default failure mode
122 is 'any'.
123
124 --prof-sym-list=<path>
125 Specify a file which contains a list of symbols to generate pro‐
126 file symbol list in the profile. This option can only be used
127 with sample-based profile in extbinary format. The entries in
128 this file are newline-separated.
129
130 --compress-all-sections=[true|false]
131 Compress all sections when writing the profile. This option can
132 only be used with sample-based profile in extbinary format.
133
134 --use-md5=[true|false]
135 Use MD5 to represent string in name table when writing the pro‐
136 file. This option can only be used with sample-based profile in
137 extbinary format.
138
139 --gen-partial-profile=[true|false]
140 Mark the profile to be a partial profile which only provides
141 partial profile coverage for the optimized target. This option
142 can only be used with sample-based profile in extbinary format.
143
144 --supplement-instr-with-sample=<file>
145 Supplement an instrumentation profile with sample profile. The
146 sample profile is the input of the flag. Output will be in in‐
147 strumentation format (only works with -instr).
148
149 --zero-counter-threshold=<float>
150 For the function which is cold in instr profile but hot in sam‐
151 ple profile, if the ratio of the number of zero counters divided
152 by the total number of counters is above the threshold, the pro‐
153 file of the function will be regarded as being harmful for per‐
154 formance and will be dropped.
155
156 --instr-prof-cold-threshold=<int>
157 User specified cold threshold for instr profile which will over‐
158 ride the cold threshold got from profile summary.
159
160 --suppl-min-size-threshold=<int>
161 If the size of a function is smaller than the threshold, assume
162 it can be inlined by PGO early inliner and it will not be ad‐
163 justed based on sample profile.
164
165 --debug-info=<path>
166 Specify the executable or .dSYM that contains debug info for the
167 raw profile. When -debug-info-correlate was used for instrumen‐
168 tation, use this option to correlate the raw profile.
169
170 EXAMPLES
171 Basic Usage
172 Merge three profiles:
173
174 llvm-profdata merge foo.profdata bar.profdata baz.profdata -output merged.profdata
175
176 Weighted Input
177 The input file foo.profdata is especially important, multiply its
178 counts by 10:
179
180 llvm-profdata merge --weighted-input=10,foo.profdata bar.profdata baz.profdata --output merged.profdata
181
182 Exactly equivalent to the previous invocation (explicit form; useful
183 for programmatic invocation):
184
185 llvm-profdata merge --weighted-input=10,foo.profdata --weighted-input=1,bar.profdata --weighted-input=1,baz.profdata --output merged.profdata
186
188 SYNOPSIS
189 llvm-profdata show [options] [filename]
190
191 DESCRIPTION
192 llvm-profdata show takes a profile data file and displays the informa‐
193 tion about the profile counters for this file and for any of the speci‐
194 fied function(s).
195
196 If filename is omitted or is -, then llvm-profdata show reads its input
197 from standard input.
198
199 OPTIONS
200 --all-functions
201 Print details for every function.
202
203 --binary-ids
204 Print embedded binary ids in a profile.
205
206 --counts
207 Print the counter values for the displayed functions.
208
209 --show-format=<text|json|yaml>
210 Emit output in the selected format if supported by the provided
211 profile type.
212
213 --function=<string>
214 Print details for a function if the function's name contains the
215 given string.
216
217 --help Print a summary of command line options.
218
219 --output=<output>, -o
220 Specify the output file name. If output is - or it isn't speci‐
221 fied, then the output is sent to standard output.
222
223 --instr (default)
224 Specify that the input profile is an instrumentation-based pro‐
225 file.
226
227 --text Instruct the profile dumper to show profile counts in the text
228 format of the instrumentation-based profile data representation.
229 By default, the profile information is dumped in a more human
230 readable form (also in text) with annotations.
231
232 --topn=<n>
233 Instruct the profile dumper to show the top n functions with the
234 hottest basic blocks in the summary section. By default, the
235 topn functions are not dumped.
236
237 --sample
238 Specify that the input profile is a sample-based profile.
239
240 --memop-sizes
241 Show the profiled sizes of the memory intrinsic calls for shown
242 functions.
243
244 --value-cutoff=<n>
245 Show only those functions whose max count values are greater or
246 equal to n. By default, the value-cutoff is set to 0.
247
248 --list-below-cutoff
249 Only output names of functions whose max count value are below
250 the cutoff value.
251
252 --profile-version
253 Print profile version.
254
255 --showcs
256 Only show context sensitive profile counts. The default is to
257 filter all context sensitive profile counts.
258
259 --show-prof-sym-list=[true|false]
260 Show profile symbol list if it exists in the profile. This op‐
261 tion is only meaningful for sample-based profile in extbinary
262 format.
263
264 --show-sec-info-only=[true|false]
265 Show basic information about each section in the profile. This
266 option is only meaningful for sample-based profile in extbinary
267 format.
268
269 --debug-info=<path>
270 Specify the executable or .dSYM that contains debug info for the
271 raw profile. When -debug-info-correlate was used for instrumen‐
272 tation, use this option to show the correlated functions from
273 the raw profile.
274
275 --covered
276 Show only the functions that have been executed, i.e., functions
277 with non-zero counts.
278
280 SYNOPSIS
281 llvm-profdata overlap [options] [base profile file] [test profile file]
282
283 DESCRIPTION
284 llvm-profdata overlap takes two profile data files and displays the
285 overlap of counter distribution between the whole files and between any
286 of the specified functions.
287
288 In this command, overlap is defined as follows: Suppose base profile
289 file has the following counts: {c1_1, c1_2, ..., c1_n, c1_u_1, c2_u_2,
290 ..., c2_u_s}, and test profile file has {c2_1, c2_2, ..., c2_n, c2_v_1,
291 c2_v_2, ..., c2_v_t}. Here c{1|2}_i (i = 1 .. n) are matched counters
292 and c1_u_i (i = 1 .. s) and c2_v_i (i = 1 .. v) are unmatched counters
293 (or counters only existing in) base profile file and test profile file,
294 respectively. Let sum_1 = c1_1 + c1_2 + ... + c1_n + c1_u_1 + c2_u_2
295 + ... + c2_u_s, and sum_2 = c2_1 + c2_2 + ... + c2_n + c2_v_1 + c2_v_2
296 + ... + c2_v_t. overlap = min(c1_1/sum_1, c2_1/sum_2) +
297 min(c1_2/sum_1, c2_2/sum_2) + ... + min(c1_n/sum_1, c2_n/sum_2).
298
299 The result overlap distribution is a percentage number, ranging from
300 0.0% to 100.0%, where 0.0% means there is no overlap and 100.0% means a
301 perfect overlap.
302
303 Here is an example, if base profile file has counts of {400, 600}, and
304 test profile file has matched counts of {60000, 40000}. The overlap is
305 80%.
306
307 OPTIONS
308 --function=<string>
309 Print details for a function if the function's name contains the
310 given string.
311
312 --help Print a summary of command line options.
313
314 --output=<output>, -o
315 Specify the output file name. If output is - or it isn't speci‐
316 fied, then the output is sent to standard output.
317
318 --value-cutoff=<n>
319 Show only those functions whose max count values are greater or
320 equal to n. By default, the value-cutoff is set to max of un‐
321 signed long long.
322
323 --cs Only show overlap for the context sensitive profile counts. The
324 default is to show non-context sensitive profile counts.
325
327 llvm-profdata returns 1 if the command is omitted or is invalid, if it
328 cannot read input files, or if there is a mismatch between their data.
329
331 Maintained by the LLVM Team (https://llvm.org/).
332
334 2003-2023, LLVM Project
335
336
337
338
33916 2023-08-24 LLVM-PROFDATA(1)