condor_dagman_metrics_reporter(1)    General Commands Manual    condor_dagman_metrics_reporter(1)

condor_dagman_metrics_reporter - Report the statistics of a DAGMan run to
a central HTTP server

condor_dagman_metrics_reporter [-s] [-u URL] [-t maxtime] -f
/path/to/metrics/file

condor_dagman_metrics_reporter anonymously reports metrics from a DAGMan
workflow to a central server. The reporting of workflow metrics is only
enabled for DAGMan workflows run under Pegasus; metrics reporting has
been requested by Pegasus' funding sources: see
http://pegasus.isi.edu/wms/docs/latest/funding_citing_usage.php#usage_statistics
and
https://confluence.pegasus.isi.edu/display/pegasus/DAGMan+Metrics+Reporting
for the requirements to collect this data.

The data sent to the server is in JSON format. Here is an example of
what is sent:

{
  "client":"condor_dagman",
  "version":"8.1.0",
  "planner":"/lfs1/devel/Pegasus/pegasus/bin/pegasus-plan",
  "planner_version":"4.3.0cvs",
  "type":"metrics",
  "wf_uuid":"htcondor-test-job_dagman_metrics-A-subdag",
  "root_wf_uuid":"htcondor-test-job_dagman_metrics-A",
  "start_time":1375313459.603,
  "end_time":1375313491.498,
  "duration":31.895,
  "exitcode":1,
  "dagman_id":"26",
  "parent_dagman_id":"11",
  "rescue_dag_number":0,
  "jobs":4,
  "jobs_failed":1,
  "jobs_succeeded":3,
  "dag_jobs":0,
  "dag_jobs_failed":0,
  "dag_jobs_succeeded":0,
  "total_jobs":4,
  "total_jobs_run":4,
  "total_job_time":0.000,
  "dag_status":2
}

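As an illustration only, the following Python sketch parses a record of this shape and checks two relationships that hold in the example: duration equals end_time minus start_time, and jobs equals jobs_failed plus jobs_succeeded. These relationships are inferred from the example values, not from a documented schema, and the function name check_metrics is hypothetical.

```python
import json
import math

# A subset of the example record, with values copied from the example.
EXAMPLE = """
{
  "client": "condor_dagman",
  "type": "metrics",
  "start_time": 1375313459.603,
  "end_time": 1375313491.498,
  "duration": 31.895,
  "exitcode": 1,
  "jobs": 4,
  "jobs_failed": 1,
  "jobs_succeeded": 3
}
"""


def check_metrics(record):
    """Return a list of inconsistencies found in a metrics record.

    Both checks are assumptions inferred from the example record.
    """
    problems = []
    elapsed = record["end_time"] - record["start_time"]
    # Timestamps carry millisecond precision, so compare with a tolerance.
    if not math.isclose(elapsed, record["duration"], abs_tol=1e-3):
        problems.append("duration does not match end_time - start_time")
    if record["jobs_failed"] + record["jobs_succeeded"] != record["jobs"]:
        problems.append("jobs does not match jobs_failed + jobs_succeeded")
    return problems


record = json.loads(EXAMPLE)
print(check_metrics(record))
```

A check of this kind may be useful when inspecting a metrics file by hand before it is passed to the -f argument.
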
Metrics are sent only if the condor_dagman process has PEGASUS_METRICS
set to True in its environment, and the CONDOR_DEVELOPERS configuration
variable does not have the value NONE.

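The two conditions can be sketched as a single predicate in Python. This is not the condor_dagman implementation: the function name should_report is hypothetical, the value of CONDOR_DEVELOPERS is passed in as a plain string rather than read from the HTCondor configuration, and the case-insensitive comparisons are an assumption.

```python
def should_report(environ, condor_developers):
    """Return True if metrics would be sent under the two stated conditions.

    environ: mapping of environment variables (for example os.environ).
    condor_developers: value of the CONDOR_DEVELOPERS configuration
        variable, or None if it is not set.
    """
    # Assumption: "True" is matched without regard to case.
    enabled = environ.get("PEGASUS_METRICS", "").strip().lower() == "true"
    # Assumption: "NONE" is matched without regard to case.
    opted_out = (condor_developers or "").strip().upper() == "NONE"
    return enabled and not opted_out


print(should_report({"PEGASUS_METRICS": "True"}, None))
print(should_report({"PEGASUS_METRICS": "True"}, "NONE"))
```
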
Ordinarily, this program is run by condor_dagman, and users do not need
to interact with it. This program uses the following environment
variables:

PEGASUS_USER_METRICS_DEFAULT_SERVER

    The URL of the default server to which to send the data. It defaults
    to http://metrics.pegasus.isi.edu/metrics. It can be overridden at
    the command line with the -u option.

PEGASUS_USER_METRICS_SERVER

    A comma-separated list of URLs of servers that will receive the
    data, in addition to the default server.

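A minimal Python sketch of how the list of destination servers could be derived from these two variables and the -u option follows. The function name resolve_servers is hypothetical, and details such as trimming whitespace and skipping empty list entries are assumptions.

```python
DEFAULT_URL = "http://metrics.pegasus.isi.edu/metrics"


def resolve_servers(environ, url_override=None):
    """Return the servers to contact, default server first.

    environ: mapping of environment variables (for example os.environ).
    url_override: value given with -u, or None if the option is absent.
    """
    # -u takes precedence over PEGASUS_USER_METRICS_DEFAULT_SERVER,
    # which in turn takes precedence over the built-in default.
    default = url_override or environ.get(
        "PEGASUS_USER_METRICS_DEFAULT_SERVER", DEFAULT_URL
    )
    # Assumption: surrounding whitespace and empty entries are ignored.
    extra = [
        url.strip()
        for url in environ.get("PEGASUS_USER_METRICS_SERVER", "").split(",")
        if url.strip()
    ]
    return [default] + extra


print(resolve_servers({"PEGASUS_USER_METRICS_SERVER": "http://a.example/m, http://b.example/m"}))
```
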
The -f argument specifies the metrics file to be sent to the HTTP
server.

-s

    Sleep for a random number of seconds between 1 and 10 before
    attempting to send data. This option is used to space out the
    reporting from any sub-DAGs when a DAG is removed.

-u URL

    Overrides the setting of the environment variable
    PEGASUS_USER_METRICS_DEFAULT_SERVER. This option is unused by
    condor_dagman; it is for testing by developers.

-t maxtime

    The maximum time, in seconds, that this program will wait for
    communication from the server. The default is 100 seconds. A
    setting of zero results in a single attempt per server.
    condor_dagman retrieves this value from the
    DAGMAN_PEGASUS_REPORT_TIMEOUT configuration variable.

-f metrics_file

    The name of the file containing the metrics values to be reported.

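The interaction of -s and -t can be sketched in Python as follows. This is a sketch under stated assumptions, not the program's actual logic: the function report and its send callback are hypothetical, the time limit is applied per server, failed attempts are retried after a one-second pause, and the random delay is an integer from 1 to 10 inclusive.

```python
import random
import time


def report(servers, send, maxtime=100, spread=False,
           sleep=time.sleep, clock=time.monotonic):
    """Try to deliver metrics to each server; return {url: delivered}.

    send: hypothetical callable taking a URL and returning True on success.
    maxtime: time limit in seconds (the -t value); 0 means one attempt.
    spread: if True, sleep 1 to 10 seconds first (the -s behavior).
    """
    if spread:
        # Assumption: an integer delay, both bounds inclusive.
        sleep(random.randint(1, 10))
    results = {}
    for url in servers:
        # Assumption: the limit applies to each server separately.
        deadline = clock() + maxtime
        while True:
            delivered = send(url)
            # With maxtime == 0 the deadline has already passed,
            # so exactly one attempt is made.
            if delivered or clock() >= deadline:
                results[url] = delivered
                break
            sleep(1)  # Assumption: fixed pause between retries.
    return results
```

The sleep and clock parameters exist only so that the sketch can be exercised without real delays.
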
condor_dagman_metrics_reporter will exit with a status value of 0 (zero)
upon success, and it will exit with a value of 1 (one) upon failure.

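For a developer testing the program by hand, a command line matching the synopsis and a check of the exit status could be sketched in Python as below. The helper names build_command and succeeded are hypothetical, and the sketch only constructs the argument list; it does not run the program.

```python
def build_command(metrics_file, url=None, maxtime=None, sleep=False):
    """Build an argument list following the synopsis:
    [-s] [-u URL] [-t maxtime] -f /path/to/metrics/file
    """
    argv = ["condor_dagman_metrics_reporter"]
    if sleep:
        argv.append("-s")
    if url is not None:
        argv += ["-u", url]
    if maxtime is not None:
        argv += ["-t", str(maxtime)]
    argv += ["-f", metrics_file]
    return argv


def succeeded(returncode):
    """Interpret the exit status: 0 is success, 1 is failure."""
    return returncode == 0


print(build_command("/path/to/metrics/file", maxtime=0))
```

The resulting list could be passed to subprocess.run, whose returncode attribute would then be given to succeeded.
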
Center for High Throughput Computing, University of Wisconsin-Madison

Copyright (C) 1990-2019 Center for High Throughput Computing, Computer
Sciences Department, University of Wisconsin-Madison, Madison, WI. All
Rights Reserved. Licensed under the Apache License, Version 2.0.