PQ2-ANA-DIST(1)             General Commands Manual            PQ2-ANA-DIST(1)

NAME
       pq2-ana-dist - Analyse the file distribution of a dataset (or a set of
       datasets) from a dataset meta-repository based on ROOT files

SYNOPSIS
       pq2-ana-dist [options] datasets

DESCRIPTION
       This manual page briefly documents the pq2-ana-dist program.

       pq2-ana-dist is a script invoking the pq2 ROOT application to analyse
       the file distribution of a dataset (or a set of datasets) over the file
       servers, either in terms of number of files or of file sizes. The
       output is a text file listing the file movements needed to make the
       file distribution even in the chosen metric; it can be used, for
       example, as input to pq2-redistribute(1). Optionally, the internal
       objects can be saved so that they can be used as the starting point for
       a subsequent run. A histogram and a plot can also be saved to visualize
       the file distribution. The repository with the dataset information can
       be accessed via the local file system, a remote file server daemon, or
       a PROOF facility.

       More details about the underlying 'pq2' application can be found in the
       man page pq2(1).
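
       What "making the distribution even" means can be illustrated with a
       small Python sketch. It is not the algorithm implemented by pq2: the
       function plan_moves, its arguments and the greedy strategy are
       assumptions made purely for illustration. The sketch weighs each file
       either as one unit (metric F) or by its size (metric S) and moves files
       from the most loaded to the least loaded server for as long as a move
       narrows the gap between the two.

```python
from collections import defaultdict


def plan_moves(files, metric="F", extra_servers=()):
    """Return (file, source, destination) moves that even out the load.

    files: iterable of (name, server, size_in_bytes) tuples.
    metric: "F" weighs every file as 1, "S" weighs it by its size.
    extra_servers: hypothetical empty servers that may receive files.
    Illustrative greedy strategy only, not the pq2 implementation.
    """
    def weight(size):
        return 1 if metric == "F" else size

    movable = defaultdict(list)      # files still on their original server
    load = {server: 0 for server in extra_servers}
    for name, server, size in files:
        movable[server].append((name, size))
        load[server] = load.get(server, 0) + weight(size)

    moves = []
    while load:
        src = max(load, key=load.get)    # most loaded server
        dst = min(load, key=load.get)    # least loaded server
        gap = load[src] - load[dst]
        # Only a file with 0 < weight < gap narrows the gap when moved.
        candidates = [f for f in movable[src] if 0 < weight(f[1]) < gap]
        if not candidates:
            break
        # Prefer the file whose weight is closest to half of the gap.
        name, size = min(candidates, key=lambda f: abs(gap / 2 - weight(f[1])))
        movable[src].remove((name, size))    # each file is moved at most once
        load[src] -= weight(size)
        load[dst] += weight(size)
        moves.append((name, src, dst))
    return moves


if __name__ == "__main__":
    # Hypothetical file names, server names and sizes.
    sample = [
        ("f1.root", "srv1", 100),
        ("f2.root", "srv1", 100),
        ("f3.root", "srv1", 100),
        ("f4.root", "srv2", 100),
    ]
    for move in plan_moves(sample, metric="F"):
        print(*move)
```

       Each returned tuple has the same three fields as a line of the
       movement list described in this page: file, source server and
       destination server.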

ARGUMENTS
       datasets
              Comma-separated list of datasets to be analysed; the '*'
              wildcard can be used in the items (in such a case the full
              string, as shown by pq2-ls, should be given in quotes, e.g.
              '/default/ganis/h1-set5*').

OPTIONS
       -h, --help
              Display help information.

       -k, --keep
              Keep the temporary files created during the analysis under
              $TMPDIR.

       -v     Verbose mode.

       --dataset=<datasets>
              Alternative way to define the datasets to be analysed.

       -s <servers>, --servers=<servers>
              Comma-separated list of servers to be used in the analysis; a
              '+' in front of the list adds the specified servers to the
              existing ones: this can be useful when determining file
              movements to empty or new servers.

       -e <excsrvs>, --exclude=<excsrvs>
              Comma-separated list of servers to be excluded from the target
              servers; this can be used, for example, to determine the file
              movements needed to drain a server.

       -i <ignsrvs>, --ignore=<ignsrvs>
              Comma-separated list of servers to be ignored in the analysis;
              this can be used, for example, to skip the redirector.

       -m <metrics>, --metrics=<metrics>
              Metric to be used to calculate the degree of evenness:
                 F    use the number of files (default)
                 S    use the file size

       -f <filemv>, --filemv=<filemv>
              Defines the file where the result of the analysis is saved; by
              default the result is sent to the screen. The output contains
              one line for each file that needs to be moved, in the format
              'file source destination', where 'file' is the file name,
              'source' is the source server URL and 'destination' is the
              destination server URL. The file can be used as input, for
              example, to pq2-redistribute(1).

       --plot[=<fileplot.fmt>]
              Defines the file for the output plot, which shows the original
              distribution with the server names and the +-10% limits; the
              extension (if known) defines the format; the default format is
              'png' and the default name is 'plot.png'. The plot can also be
              obtained directly from a binary output file (saved with
              '--fout=<outfile>.root') by just specifying
              '--fin=<outfile>.root --plot'. The available formats are those
              known by ROOT: png (default), eps, ps, pdf, svg, gif, xpm, jpg,
              tiff.

       --fout[=<outfile>]
              Defines the file where the output of the analysis is saved in
              binary form (ROOT file); this output can be used as the
              starting point for a subsequent run, which allows many datasets
              to be processed in separate steps.

       --fin[=<infile>]
              Defines the ROOT file from which to fetch the output of a
              previous run (saved with --fout=infile).

       -u <serverurl>, --url=<serverurl>
              URL of the PROOF master or data server providing the
              information; for data servers, it must include the directory.
              Can also be specified via the environment variables PQ2PROOFURL
              or PQ2DSSRVURL (see ENVIRONMENT VARIABLES).

       -t <dir>, --tmpdir=<dir>
              Directory for temporary files; default is /tmp/<username>.
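
       As an illustration of how the options combine, the following Python
       sketch assembles a command line that analyses a dataset by file size
       while excluding one server from the targets, as one might do when
       draining it. Only long options documented above are used; the helper
       ana_dist_command, the server name srv3.example.org and the file name
       moves.txt are hypothetical. The sketch only builds and prints the
       command; it does not run it.

```python
import shlex


def ana_dist_command(datasets, metrics=None, exclude=None, filemv=None, url=None):
    """Build the argument list for a pq2-ana-dist call (hypothetical helper).

    Options come first and the comma-separated dataset list last,
    following the synopsis 'pq2-ana-dist [options] datasets'.
    """
    cmd = ["pq2-ana-dist"]
    if url:
        cmd.append(f"--url={url}")
    if metrics:
        cmd.append(f"--metrics={metrics}")      # "F" (files) or "S" (sizes)
    if exclude:
        cmd.append("--exclude=" + ",".join(exclude))
    if filemv:
        cmd.append(f"--filemv={filemv}")
    cmd.append(",".join(datasets))
    return cmd


# Hypothetical server and output file; the dataset pattern is the one
# quoted in this manual page.
cmd = ana_dist_command(
    ["/default/ganis/h1-set5*"],
    metrics="S",
    exclude=["srv3.example.org"],
    filemv="moves.txt",
)
# shlex.join quotes the wildcard so that the shell would not expand it.
print(shlex.join(cmd))
```

       On a host where pq2-ana-dist is installed, the list could be passed to
       subprocess.run(cmd, check=True); passing the arguments as a list avoids
       shell expansion of the '*' wildcard.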

FILES
       The output file consists of one line per file to be moved, with the
       following format:

          file-name source-server-URL destination-server-URL
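
       A file in this format is straightforward to post-process. The Python
       sketch below parses such lines and counts the planned moves per
       server. It assumes that the three fields are separated by whitespace
       and contain no embedded blanks, which this page does not state
       explicitly; the function names and the sample URLs are hypothetical.

```python
from collections import Counter


def parse_moves(lines):
    """Parse 'file source destination' lines into 3-tuples.

    Assumes whitespace-separated fields without embedded blanks.
    """
    moves = []
    for lineno, line in enumerate(lines, start=1):
        fields = line.split()
        if not fields:
            continue  # skip blank lines
        if len(fields) != 3:
            raise ValueError(f"line {lineno}: expected 3 fields, got {len(fields)}")
        moves.append(tuple(fields))
    return moves


def summarize(moves):
    """Count how many files leave and reach each server."""
    outgoing = Counter(src for _, src, _ in moves)
    incoming = Counter(dst for _, _, dst in moves)
    return outgoing, incoming


if __name__ == "__main__":
    # Hypothetical content of a file saved with --filemv.
    sample = [
        "/data/h1/f1.root root://srv1.example.org/ root://srv2.example.org/",
        "/data/h1/f2.root root://srv1.example.org/ root://srv3.example.org/",
    ]
    outgoing, incoming = summarize(parse_moves(sample))
    for server, count in outgoing.items():
        print(f"{server}: {count} file(s) to move away")
    for server, count in incoming.items():
        print(f"{server}: {count} file(s) to receive")
```

       A summary of this kind can serve as a sanity check before the list is
       handed to pq2-redistribute(1).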

ENVIRONMENT VARIABLES
       See setup-pq2(1).

SEE ALSO
       pq2(1), setup-pq2(1), pq2-ls(1), pq2-ls-files(1),
       pq2-ls-files-server(1), pq2-info-server(1), pq2-redistribute(1),
       pq2-verify(1), pq2-rm(1), pq2-cache(1)

       For more information on the ROOT system, please refer to
       http://root.cern.ch

ORIGINAL AUTHORS
       Gerardo Ganis for the ROOT team.

COPYRIGHT
       This library is free software; you can redistribute it and/or modify it
       under the terms of the GNU Lesser General Public License as published
       by the Free Software Foundation; either version 2.1 of the License, or
       (at your option) any later version.

       This library is distributed in the hope that it will be useful, but
       WITHOUT ANY WARRANTY; without even the implied warranty of
       MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU Lesser
       General Public License for more details.

       You should have received a copy of the GNU Lesser General Public
       License along with this library; if not, write to the Free Software
       Foundation, Inc., 51 Franklin St, Fifth Floor, Boston, MA 02110-1301
       USA

AUTHOR
       This manual page was originally written by Gerardo Ganis
       <gerardo.ganis@cern.ch>, for ROOT version 5.

ROOT                               Version 5                    PQ2-ANA-DIST(1)