1just-man-pages/condor_dagman(G1e)neral Commands Manjuuaslt-man-pages/condor_dagman(1)
2
3
4

Name

6       condor_dagman  meta  scheduler  of the jobs submitted as the nodes of a
7       DAG or DAGs
8

Synopsis

10       condor_dagman [ -debug level ] [ -rescue filename  ]  [  -maxidle  num‐
11       berOfJobs  ] [ -maxjobs numberOfJobs ] [ -maxpre NumberOfPREscripts ] [
12       -maxpost NumberOfPOSTscripts ] [ -noeventchecks ] [ -allowlogerror ]  [
13       -usedagdir ] -lockfile filename [ -waitfordebug ] [ -autorescue 0|1 ] [
14       -dorescuefrom number ] -csdversion version_string  [  -allowversionmis‐
15       match  ]  [  -DumpRescue  ]  -dag  dag_file [ -dag dag_file_2 ...  -dag
16       dag_file_n ]
17

Description

19       condor_dagman is a meta scheduler for the  Condor  jobs  within  a  DAG
20       (directed  acyclic  graph) (or multiple DAGs). In typical usage, a sub‐
21       mitter of jobs that are organized into a DAG submits the DAG using con‐
22       dor_submit_dag  .   condor_submit_dag does error checking on aspects of
23       the DAG and then submits condor_dagman as a Condor job.   condor_dagman
24       uses  log files to coordinate the further submission of the jobs within
25       the DAG.
26
27       As part of daemoncore , the set of command-line arguments given in sec‐
28       tion work for condor_dagman .
29
30       Arguments  to condor_dagman are either automatically set by condor_sub‐
31       mit_dag or they are specified as command-line arguments to  condor_sub‐
32       mit_dag  and passed on to condor_dagman . The method by which the argu‐
33       ments are set is given in their description below.
34
35       condor_dagman can run multiple, independent DAGs. This is done by spec‐
36       ifying  multiple -dag a rguments. Pass multiple DAG input files as com‐
37       mand-line arguments to condor_submit_dag .
38
39       Debugging output may be obtained by  using  the  -debug  level  option.
40       Level values and what they produce is described as
41
42          * level = 0; never produce output, except for usage info
43
44          * level = 1; very quiet, output severe errors
45
46          * level = 2; normal output, errors and warnings
47
48          * level = 3; output errors, as well as all warnings
49
50          * level = 4; internal debugging output
51
52          * level = 5; internal debugging output; outer loop debugging
53
54          * level = 6; internal debugging output; inner loop debugging
55
56          * level = 7; internal debugging output; rarely used
57

Options

59       -debug level
60
61          An  integer  level  of  debugging output.  level is an integer, with
62          values of 0-7 inclusive, where 7 is the most  verbose  output.  This
63          command-line  option to condor_submit_dag is passed to condor_dagman
64          or defaults to the value 3, as set by condor_submit_dag .
65
66
67
68       -rescue filename
69
70          Sets the file name of the rescue DAG to write in the case of a fail‐
71          ure.  As  passed by condor_submit_dag , the name of the file will be
72          the name of the DAG input file concatenated with the string .rescue.
73          This argument is now optional, and in general it is preferred to not
74          specify it. This allows condor_dagman to automatically  generate  an
75          appropriate rescue DAG name.
76
77
78
79       -maxidle NumberOfJobs
80
81          Sets  the  maximum  number of idle jobs allowed before condor_dagman
82          stops submitting more jobs. Once idle jobs start to run, condor_dag‐
83          man  will  resume submitting jobs.  NumberOfJobs is a positive inte‐
84          ger. This command-line option to condor_submit_dag is passed to con‐
85          dor_dagman . If not specified, the number of idle jobs is unlimited.
86
87
88
89       -maxjobs numberOfJobs
90
91          Sets  the maximum number of jobs within the DAG that will be submit‐
92          ted to Condor at one time.  numberOfJobs is a positive integer. This
93          command-line  option to condor_submit_dag is passed to condor_dagman
94          . If not specified, the default number of jobs is unlimited.
95
96
97
98       -maxpre NumberOfPREscripts
99
100          Sets the maximum number of PRE scripts within the DAG  that  may  be
101          running at one time.  NumberOfPREScripts is a positive integer. This
102          command-line option to condor_submit_dag is passed to  condor_dagman
103          . If not specified, the default number of PRE scripts is unlimited.
104
105
106
107       -maxpost NumberOfPOSTscripts
108
109          Sets  the  maximum number of POST scripts within the DAG that may be
110          running at one time.  NumberOfPOSTScripts  is  a  positive  integer.
111          This  command-line  option  to  condor_submit_dag  is passed to con‐
112          dor_dagman . If not specified, the default number of POST scripts is
113          unlimited.
114
115
116
117       -noeventchecks
118
119          This  argument is no longer used; it is now ignored. Its functional‐
120          ity is now implemented by the DAGMAN_ALLOW_EVENTSconfiguration macro
121          (see section ).
122
123
124
125       -allowlogerror
126
127          This  optional  argument  has condor_dagman try to run the specified
128          DAG, even in the case of detected errors in the user log  specifica‐
129          tion.  As of version 7.3.2, this argument has an effect only on DAGs
130          containing Stork job nodes.
131
132
133
134       -usedagdir
135
136          This optional argument causes condor_dagman to  run  each  specified
137          DAG  as  if  the  directory containing that DAG file was the current
138          working directory. This option is most useful when running  multiple
139          DAGs in a single condor_dagman .
140
141
142
143       -lockfile filename
144
145          Names  the  file created and used as a lock file. The lock file pre‐
146          vents execution of two of the same DAG, as defined by  a  DAG  input
147          file.  A default lock file ending with the suffix .dag.lockis passed
148          to condor_dagman by condor_submit_dag .
149
150
151
152       -waitfordebug
153
154          This optional argument causes condor_dagman to wait at startup until
155          someone  attaches  to  the  process  with  a  debugger  and sets the
156          wait_for_debug variable in main_init() to false.
157
158
159
160       -autorescue 0|1
161
162          Whether to automatically run the newest rescue DAG for the given DAG
163          file, if one exists (0 = false, 1 = true).
164
165
166
167       -dorescuefrom number
168
169          Forces  condor_dagman to run the specified rescue DAG number for the
170          given DAG. A value of 0 is the same as not specifying  this  option.
171          Specifying a non-existant rescue DAG is a fatal error.
172
173
174
175       -csdversion version_string
176
177          version_string  is  the version of the condor_submit_dag program. At
178          startup, condor_dagman checks for a version mismatch with  the  con‐
179          dor_submit_dag version in this argument.
180
181
182
183       -allowversionmismatch
184
185          This  optional argument causes condor_dagman to allow a version mis‐
186          match between condor_dagman itself and the .condor.subfile  produced
187          by  condor_submit_dag (or, in other words, between condor_submit_dag
188          and condor_dagman ). WARNING! This option should  be  used  only  if
189          absolutely  necessary.  Allowing version mismatches can cause subtle
190          problems when running DAGs. (Note that, starting with version 7.4.0,
191          condor_dagman  no  longer  requires  an  exact version match between
192          itself and the .condor.subfile. Instead, a "minimum compatible  ver‐
193          sion"  is  defined, and any .condor.subfile of that version or newer
194          is accepted.)
195
196
197
198       -DumpRescue
199
200          This optional argument causes condor_dagman to  immediately  dump  a
201          rescue  DAG  and  then exit, as opposed to actually running the DAG.
202          (This feature is mainly intended for testing.)
203
204
205
206       -dag filename
207
208          filename is the name of the DAG input file that is set as  an  argu‐
209          ment to condor_submit_dag , and passed to condor_dagman .
210
211
212

Exit Status

214       condor_dagman  will  exit with a status value of 0 (zero) upon success,
215       and it will exit with the value 1 (one) upon failure.
216

Examples

218       condor_dagman is normally not run directly, but submitted as  a  Condor
219       job by running condor_submit_dag. See the condor_submit_dag manual page
220       for examples.
221

Author

223       Condor Team, University of Wisconsin-Madison
224
226       Copyright (C) 1990-2009 Condor Team, Computer Sciences Department, Uni‐
227       versity   of  Wisconsin-Madison,  Madison,  WI.  All  Rights  Reserved.
228       Licensed under the Apache License, Version 2.0.
229
230       See  the  Condor  Version   7.4.2   Manual   or   http://www.condorpro
231       ject.org/licensefor additional notices. condor-admin@cs.wisc.edu
232
233
234
235                                     date      just-man-pages/condor_dagman(1)
Impressum