CARBON-C-RELAY(1)           General Commands Manual          CARBON-C-RELAY(1)

NAME

       carbon-c-relay - graphite relay, aggregator and rewriter

       https://travis-ci.org/grobian/carbon-c-relay

SYNOPSIS

       carbon-c-relay -f config-file [ options ... ]

DESCRIPTION

       carbon-c-relay accepts, cleanses, matches, rewrites, forwards and
       aggregates graphite metrics by listening for incoming connections
       and relaying the messages to other servers defined in its
       configuration. The core functionality is to route messages via
       flexible rules to the desired destinations.

       carbon-c-relay is a simple program that reads its routing
       information from a file. The command line arguments allow setting
       the location of this file, as well as the number of dispatchers
       (worker threads) to use for reading the data from incoming
       connections and passing them on to the right destination(s). The
       route file supports two main constructs: clusters and matches. The
       former define groups of hosts data metrics can be sent to, the
       latter define which metrics should be sent to which cluster.
       Aggregation rules are seen as matches.

       For every metric received by the relay, cleansing is performed. The
       following changes are made before any match, aggregate or rewrite
       rule sees the metric:

       ○   double dot elimination (necessary for correctly functioning
           consistent hash routing)

       ○   trailing/leading dot elimination

       ○   whitespace normalisation (this mostly affects output of the
           relay to other targets: metric, value and timestamp will be
           separated by a single space only, ever)

       ○   irregular char replacement with underscores (_); currently
           irregular is defined as not being in [0-9a-zA-Z-_:#], but this
           can be overridden on the command line. Note that tags (when
           present and allowed) are not processed this way.
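
       As an illustration of these cleansing steps, a hypothetical
       incoming line like

           .sys..cpu.user%.  5   1470000000

       would be relayed as

           sys.cpu.user_ 5 1470000000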

OPTIONS

       These options control the behaviour of carbon-c-relay.

       -v: Print version string and exit.

       -d: Enable debug mode. This prints statistics to stdout and prints
           extra messages about some situations encountered by the relay
           that normally would be too verbose to be enabled. When combined
           with -t (test mode) this also prints stub routes and
           consistent-hash ring contents.

       -s: Enable submission mode. In this mode, internal statistics are
           not generated. Instead, queue pressure and metric drops are
           reported on stdout. This mode is useful for a submission relay
           whose job is just to forward to (a set of) main relays.
           Statistics about the submission relays are not needed in that
           case, and could easily cause an undesired flood of metrics,
           e.g. when used locally on each and every host.

       -t: Test mode. This mode doesn't do any routing at all; instead it
           reads input from stdin and prints what actions would be taken
           given the loaded configuration. This mode is very useful for
           testing relay routes for regular expression syntax etc. It
           also gives insight into how routing is applied in complex
           configurations, for it shows rewrites and aggregates taking
           place as well. When -t is repeated, the relay will only test
           the configuration for validity and exit immediately
           afterwards. Any standard output is suppressed in this mode,
           making it ideal for start-scripts to test a (new)
           configuration.

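       For example, to check how a single metric would be routed, test
       mode can be fed a sample line on stdin (relay.conf here is just a
       placeholder name for a configuration file):

           echo "sys.cpu.loadavg 1.23 1470000000" | carbon-c-relay -f relay.conf -t
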
       -f config-file: Read configuration from config-file. A
           configuration consists of clusters and routes. See
           CONFIGURATION SYNTAX for more information on the options and
           syntax of this file.

       -l log-file: Use log-file for writing messages. Without this
           option, the relay writes both to stdout and stderr. When
           logging to file, all messages are prefixed with MSG when they
           were sent to stdout, and ERR when they were sent to stderr.

       -p port: Listen for connections on port port. The port number is
           used for TCP, UDP and UNIX sockets alike. In the latter case,
           the socket file contains the port number. The port defaults to
           2003, which is also used by the original carbon-cache.py. Note
           that this only applies to the defaults; when listen directives
           are present in the config, this setting is ignored.

       -w workers: Use workers number of threads. The default number of
           workers is equal to the number of detected CPU cores. It makes
           sense to reduce this number on many-core machines, or when the
           traffic is low.

       -b batchsize: Set the number of metrics that are sent to remote
           servers at once to batchsize. When the relay sends metrics to
           servers, it will retrieve batchsize metrics from the pending
           queue of metrics waiting for that server and send those one by
           one. The size of the batch has minimal impact on sending
           performance, but it controls the amount of lock-contention on
           the queue. The default is 2500.

       -q queuesize: Each server from the configuration that the relay
           sends metrics to has a queue associated with it. This queue
           allows disruptions and bursts to be handled. The queue holds
           queuesize metrics before it overflows and the relay starts
           dropping metrics. The larger the queue, the more metrics can
           be absorbed, but also the more memory the relay will use. The
           default queue size is 25000.

       -L stalls: Sets the maximum number of stalls to stalls before the
           relay starts dropping metrics for a server. When a queue fills
           up, the relay uses a mechanism called stalling to signal this
           event to the client (writing to the relay). In particular when
           the client sends a large amount of metrics in a very short
           time (burst), stalling can help to avoid dropping metrics,
           since the client just needs to slow down for a bit, which in
           many cases is possible (e.g. when catting a file with nc(1)).
           However, this behaviour can also obstruct, artificially
           stalling writers which cannot stop that easily. For this
           reason, stalls can be set from 0 to 15, where each stall can
           take around 1 second on the client. The default value is 4,
           which is aimed at the occasional disruption scenario and a
           best effort not to lose metrics while moderately slowing down
           clients.

       -C CAcertpath: Read CA certs (for use with TLS/SSL connections)
           from the given path or file. When not given, the default
           locations are used. Strict verification of the peer is
           performed, so when using self-signed certificates, be sure to
           include the CA cert in the default location, or provide the
           path to the cert using this option.

       -T timeout: Specifies the IO timeout in milliseconds used for
           server connections. The default is 600 milliseconds, but may
           need increasing when WAN links are used for target servers. A
           relatively low connection timeout allows the relay to quickly
           establish that a server is unreachable, and as such lets
           failover strategies kick in before the queue runs high.

       -c chars: Defines chars as the characters that, next to
           [A-Za-z0-9], are allowed in metrics. Any character not in this
           list is replaced by the relay with _ (underscore). The default
           list of allowed characters is -_:#.

       -m length: Limits metric names to at most length bytes. Any lines
           containing metric names larger than this will be discarded.

       -M length: Limits the input to lines of at most length bytes. Any
           lines exceeding this will be discarded. Note that -m needs to
           be smaller than this value.

       -H hostname: Override the hostname determined by a call to
           gethostname(3) with hostname. The hostname is used mainly in
           the statistics metrics carbon.relays.<hostname>.<...> sent by
           the relay.

       -B backlog: Sets the TCP connection listen backlog to backlog
           connections. The default value is 32, but on servers which
           receive many concurrent connections, this setting likely needs
           to be increased to avoid connection refused errors on the
           clients.

       -U bufsize: Sets the socket send/receive buffer sizes in bytes,
           for both TCP and UDP scenarios. When unset, the OS default is
           used. The maximum is also determined by the OS. The sizes are
           set using setsockopt with the flags SO_RCVBUF and SO_SNDBUF.
           Setting this size may be necessary for large volume scenarios,
           for which -B might also apply. Checking the Recv-Q and receive
           errors values from netstat gives a good hint about buffer
           usage.

       -E: Disable disconnecting idle incoming connections. By default
           the relay disconnects idle client connections after 10
           minutes. It does this to prevent resources clogging up when a
           faulty or malicious client keeps on opening connections
           without closing them. It typically prevents running out of
           file descriptors. For some scenarios, however, it is not
           desirable for idle connections to be disconnected, hence
           passing this flag will disable this behaviour.

       -D: Daemonise into the background after startup. This option
           requires the -l and -P flags to be set as well.

       -P pidfile: Write the pid of the relay process to a file called
           pidfile. This is particularly useful when daemonised, in
           combination with init managers.

       -O threshold: The minimum number of rules to find before trying to
           optimise the ruleset. The default is 50; to disable the
           optimiser use -1, and to always run the optimiser use 0. The
           optimiser tries to group rules to avoid spending excessive
           time on matching expressions.


CONFIGURATION SYNTAX

       The config file supports the following syntax, where comments
       start with a # character and can appear at any position on a line,
       suppressing input until the end of that line:

       ```
       cluster name
           <forward | any_of | failover [useall] |
            carbon_ch | fnv1a_ch | jump_fnv1a_ch [replication count] [dynamic]>
           <host[:port][=instance] [proto udp | tcp] [type linemode]
            [transport plain | gzip | lz4 | snappy [ssl]]> ... ;

       cluster name file [ip] </path/to/file> ... ;

       match <* | expression ...>
           [validate expression else log | drop]
           send to <cluster ... | blackhole> [stop] ;

       rewrite expression into replacement ;

       aggregate expression ...
           every interval seconds
           expire after expiration seconds
           [timestamp at <start | middle | end> of bucket]
           compute <sum | count | max | min | average | median |
                    percentile<%> | variance | stddev> write to metric
           [compute ...] [send to <cluster ...>] [stop] ;

       send statistics to <cluster ...> [stop] ;

       statistics
           [submit every interval seconds]
           [reset counters after interval]
           [prefix with prefix]
           [send to <cluster ...>] [stop] ;

       listen type linemode [transport plain | gzip | lz4 | snappy [ssl pemcert]]
           <<interface[:port] | port> proto udp | tcp> ...
           </path/to/file proto unix> ... ;

       include </path/to/file/or/glob> ;
       ```

   CLUSTERS
       Multiple clusters can be defined, and need not be referenced by a
       match rule. All clusters point to one or more hosts, except the
       file cluster, which writes to files in the local filesystem. host
       may be an IPv4 or IPv6 address, or a hostname. Since host is
       followed by an optional : and port, for IPv6 addresses not to be
       interpreted wrongly, either a port must be given, or the IPv6
       address must be surrounded by brackets, e.g. [::1]. Optional
       transport and proto clauses can be used to wrap the connection in
       a compression or encryption layer, or to specify the use of UDP or
       TCP to connect to the remote server. When omitted, the connection
       defaults to an unwrapped TCP connection. type can only be linemode
       at the moment.

       The forward and file clusters simply send everything they receive
       to all defined members (host addresses or files). The any_of
       cluster is a small variant of the forward cluster, but instead of
       sending to all defined members, it sends each incoming metric to
       one of the defined members. This is not very useful in itself, but
       since any of the members can receive each metric, it means that
       when one of the members is unreachable, the other members will
       receive all of the metrics. This can be useful when the cluster
       points to other relays. The any_of router tries to send the same
       metrics consistently to the same destination. The failover cluster
       is like the any_of cluster, but sticks to the order in which
       servers are defined, to implement a pure failover scenario between
       servers. The carbon_ch cluster sends the metrics to the member
       that is responsible according to the consistent hash algorithm (as
       used in the original carbon), or to multiple members if
       replication is set to more than 1. When dynamic is set, failure of
       any of the servers does not result in metrics being dropped for
       that server; instead the undeliverable metrics are sent to any
       other server in the cluster in order for the metrics not to get
       lost. This is most useful when replication is 1. The fnv1a_ch
       cluster is identical in behaviour to carbon_ch, but it uses a
       different hash technique (FNV1a) which is faster, but more
       importantly designed to get around a limitation of carbon_ch by
       using both host and port from the members. This is useful when
       multiple targets live on the same host, just separated by port.
       The instance that original carbon uses to get around this can be
       set by appending it after the port, separated by an equals sign,
       e.g. 127.0.0.1:2006=a for instance a. When using the fnv1a_ch
       cluster, this instance overrides the hash key in use. This allows
       for many things, including masquerading old IP addresses, but
       mostly to make the hash key location agnostic of the (physical)
       location of that key. For example, usage like
       10.0.0.1:2003=4d79d13554fa1301476c1f9fe968b0ac would allow
       changing the port and/or ip address of the server that receives
       data for the instance key. Obviously, this way migration of data
       can be dealt with much more conveniently. The jump_fnv1a_ch
       cluster is also a consistent hash cluster like the previous two,
       but it does not take the server information into account at all.
       Whether this is useful to you depends on your scenario. The jump
       hash has a much better balancing over the servers defined in the
       cluster, at the expense of not being able to remove any server but
       the last in order. What this means is that this hash is fine to
       use with ever growing clusters where older nodes are also replaced
       at some point. If you have a cluster where removal of old nodes
       takes place often, the jump hash is not suitable for you. Jump
       hash works with servers in an ordered list without gaps. To
       influence the ordering, the instance given to the server will be
       used as sorting key. Without instances, the order will be as given
       in the file. It is good practice to fix the order of the servers
       with instances such that it is explicit what the right nodes for
       the jump hash are.

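       For example, a jump_fnv1a_ch cluster with an explicit, gap-free
       ordering via instances could look like this (addresses are made
       up):

           cluster jump jump_fnv1a_ch
               10.0.0.1:2003=1
               10.0.0.2:2003=2
               10.0.0.3:2003=3
               ;
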
       DNS hostnames are resolved to a single address, according to the
       preference rules in RFC 3484 https://www.ietf.org/rfc/rfc3484.txt.
       The any_of, failover and forward clusters have an explicit useall
       flag that enables expansion for hostnames resolving to multiple
       addresses. Each address returned becomes a cluster destination.

   MATCHES
       Match rules are the way to direct incoming metrics to one or more
       clusters. Match rules are processed top to bottom as they are
       defined in the file. It is possible to define multiple matches in
       the same rule. Each match rule can send data to one or more
       clusters. Since match rules "fall through" unless the stop keyword
       is added, carefully crafted match expressions can be used to
       target multiple clusters or aggregations. This ability makes it
       possible to replicate metrics, as well as send certain metrics to
       alternative clusters, with careful ordering and usage of the stop
       keyword. The special cluster blackhole discards any metrics sent
       to it. This can be useful for weeding out unwanted metrics in
       certain cases. Because throwing metrics away is pointless if other
       matches would accept the same data, a match with the blackhole
       cluster as destination has an implicit stop. The validate clause
       adds a check on the data (what comes after the metric) in the form
       of a regular expression. When this expression matches, the match
       rule executes as if no validate clause was present. However, if it
       fails, the match rule is aborted and no metrics will be sent to
       destinations; this is the drop behaviour. When log is used, the
       metric is logged to stderr. Care should be taken with the latter
       to avoid log flooding. When a validate clause is present,
       destinations need not be present; this allows for applying a
       global validation rule. Note that the cleansing rules are applied
       before validation is done, thus the data will not have duplicate
       spaces. The route using clause is used to perform a temporary
       modification to the key used as input to the consistent hashing
       routines. The primary purpose is to route traffic so that
       appropriate data is sent to the needed aggregation instances.

   REWRITES
       Rewrite rules take a regular expression as input to match incoming
       metrics, and transform them into the desired new metric name. In
       the replacement, backreferences are allowed to match capture
       groups defined in the input regular expression. A match of
       server\.(x|y|z)\. allows using e.g. role.\1. in the substitution.
       A few caveats apply to the current implementation of rewrite
       rules. First, their location in the config file determines when
       the rewrite is performed. The rewrite is done in-place; as such, a
       match rule before the rewrite would match the original name, while
       a match rule after the rewrite no longer matches the original
       name. Care should be taken with the ordering, as multiple rewrite
       rules in succession can take place, e.g. a gets replaced by b, and
       b gets replaced by c in a succeeding rewrite rule. The second
       caveat with the current implementation is that the rewritten
       metric names are not cleansed, like newly incoming metrics are.
       Thus, double dots and potentially dangerous characters can appear
       if the replacement string is crafted to produce them. It is the
       responsibility of the writer to make sure the metrics are clean.
       If this is an issue for routing, one can consider having a
       rewrite-only instance that forwards all metrics to another
       instance that will do the routing. Obviously the second instance
       will cleanse the metrics as they come in. The backreference
       notation allows lowercasing and uppercasing the replacement string
       with the use of the underscore (_) and caret (^) symbols following
       directly after the backslash. For example, role.\_1. as
       substitution will lowercase the contents of \1. The dot (.) can be
       used in a similar fashion, or can follow the underscore or caret,
       to replace dots with underscores in the substitution. This can be
       handy for some situations where metrics are sent to graphite.

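       As a sketch, the following rule moves a captured server name under
       a different prefix and lowercases it using the \_ notation (the
       metric layout is made up):

           rewrite ^server\.([^.]+)\.(.+)$
               into hosts.\_1.\2
               ;
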
   AGGREGATIONS
       The aggregations defined take one or more input metrics expressed
       by one or more regular expressions, similar to the match rules.
       Incoming metrics are aggregated over a period of time defined by
       the interval in seconds. Since events may arrive a bit later in
       time, the expiration time in seconds defines when the aggregations
       should be considered final, as no new entries are allowed to be
       added any more. On top of an aggregation, multiple aggregates can
       be computed. They can be of the same or different aggregation
       types, but should write to a unique new metric. The metric names
       can include back references like in rewrite expressions, allowing
       for powerful single aggregation rules that yield many
       aggregations. When no send to clause is given, produced metrics
       are sent to the relay as if they were submitted from the outside,
       hence match and aggregation rules apply to those. Care should be
       taken that loops are avoided this way. For this reason, the use of
       the send to clause is encouraged, to direct the output traffic
       where possible. Like for match rules, it is possible to define
       multiple cluster targets. Also like match rules, the stop keyword
       applies to control the flow of metrics in the matching process.

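       A hypothetical rule that folds per-host request counters into
       cluster-wide aggregates every minute could read (the graphite
       cluster is assumed to be defined elsewhere):

           aggregate ^sys\.[^.]+\.requests$
               every 60 seconds
               expire after 75 seconds
               compute sum write to
                   sys.all.requests.sum
               compute max write to
                   sys.all.requests.max
               send to graphite
               stop
               ;
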
   STATISTICS
       The send statistics to construct is deprecated and will be removed
       in the next release. Use the special statistics construct instead.

       The statistics construct can control a couple of things about the
       (internal) statistics produced by the relay. The send to target
       can be used to avoid router loops by sending the statistics to
       certain destination cluster(s). By default the metrics are
       prefixed with carbon.relays.<hostname>, where hostname is
       determined on startup and can be overridden using the -H argument.
       This prefix can be set using the prefix with clause, similar to a
       rewrite rule target. The input match in this case is the pre-set
       regular expression ^(([^.]+)(\..*)?)$ on the hostname. As such,
       one can see that the default prefix is set by carbon.relays.\.1.
       Note that this uses the replace-dot-with-underscore replacement
       feature from rewrite rules. Given the input expression, the
       following match groups are available: \1 the entire hostname, \2
       the short hostname and \3 the domainname (with leading dot). It
       may make sense to replace the default by something like
       carbon.relays.\_2 for certain scenarios, to always use the
       lowercased short hostname, which following the expression doesn't
       contain a dot. By default, the metrics are submitted every 60
       seconds; this can be changed using the submit every <interval>
       seconds clause. To obtain a set of values more compatible with
       carbon-cache.py, use the reset counters after interval clause to
       make values non-cumulative, that is, they will report the change
       compared to the previous value.

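       Combining these clauses, a statistics construct could, as a
       sketch, look like this (the graphite cluster is an assumed
       destination):

           statistics
               submit every 60 seconds
               reset counters after interval
               prefix with carbon.relays.\_2
               send to graphite
               stop
               ;
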
   LISTENERS
       The ports and protocols the relay should listen on for incoming
       connections can be specified using the listen directive.
       Currently, all listeners need to be of the linemode type. An
       optional compression or encryption wrapping can be specified for
       the port and optional interface given by ip address, or for a unix
       socket given by file. When no interface is specified, the any
       interface on all available ip protocols is assumed. If no listen
       directive is present, the relay will use the default listeners for
       port 2003 on tcp and udp, plus the unix socket
       /tmp/.s.carbon-c-relay.2003. This typically expands to 5 listeners
       on an IPv6-enabled system. The default matches the behaviour of
       versions prior to v3.2.

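       For illustration, the default listeners written out as explicit
       listen directives would roughly look like:

           listen type linemode
               2003 proto tcp
               2003 proto udp
               /tmp/.s.carbon-c-relay.2003 proto unix
               ;
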
   INCLUDES
       In case the configuration becomes very long, or is managed better
       in separate files, the include directive can be used to read
       another file. The given file will be read in place and added to
       the router configuration at the time of inclusion. The end result
       is one big route configuration. Multiple include statements can be
       used throughout the configuration file. The positioning will
       influence the order of rules as normal. Beware that recursive
       inclusion (include from an included file) is supported, and
       currently no safeguards exist for an inclusion loop. For what it
       is worth, this feature is likely best used with simple
       configuration files (e.g. not having include in them).

EXAMPLES

       carbon-c-relay evolved over time, growing features on demand as
       the tool proved to be stable and fitting the job well. Below
       follow some annotated examples of constructs that can be used with
       the relay.

       Clusters can be defined as needed. They receive data from match
       rules, and their type defines which members of the cluster finally
       get the metric data. The simplest cluster form is a forward
       cluster:

       cluster send-through forward 10.1.0.1 ;

       Any metric sent to the send-through cluster would simply be
       forwarded to the server at IPv4 address 10.1.0.1. If we define
       multiple servers, all of those servers would get the same metric,
       thus:

       cluster send-through forward 10.1.0.1 10.2.0.1 ;

       The above results in a duplication of metrics sent to both
       machines. This can be useful, but most of the time it is not. The
       any_of cluster type is like forward, but it sends each incoming
       metric to any of the members. The same example with such a cluster
       would be:

       cluster send-to-any-one any_of 10.1.0.1:2010 10.1.0.1:2011;

       This would implement a multipath scenario, where two servers are
       used and the load between them is spread, but should any of them
       fail, all metrics are sent to the remaining one. This typically
       works well for upstream relays, or for balancing carbon-cache
       processes running on the same machine. Should any member become
       unavailable, for instance due to a rolling restart, the other
       members receive the traffic. If it is necessary to have true
       fail-over, where the secondary server is only used if the first is
       down, the following would implement that:

       cluster try-first-then-second failover 10.1.0.1:2010 10.1.0.1:2011;

       These types are different from the two consistent hash cluster
       types:

       cluster graphite carbon_ch 127.0.0.1:2006=a 127.0.0.1:2007=b
               127.0.0.1:2008=c ;

       If a member in this example fails, all metrics that would go to
       that member are kept in the queue, waiting for the member to
       return. This is useful for clusters of carbon-cache machines where
       it is desirable that the same metric always ends up on the same
       server. The carbon_ch cluster type is compatible with carbon-relay
       consistent hashing, and can be used for existing clusters
       populated by carbon-relay. For new clusters, however, it is better
       to use the fnv1a_ch cluster type, for it is faster and allows
       balancing over the same address but different ports without an
       instance number, in contrast to carbon_ch.

       Because we can use multiple clusters, we can also replicate
       without the use of the forward cluster type, in a more intelligent
       way:

       ``` cluster dc-old carbon_ch replication 2 10.1.0.1 10.1.0.2 10.1.0.3 ;
       cluster dc-new1 fnv1a_ch replication 2 10.2.0.1 10.2.0.2 10.2.0.3 ;
       cluster dc-new2 fnv1a_ch replication 2 10.3.0.1 10.3.0.2 10.3.0.3 ;

       match * send to dc-old ;
       match * send to dc-new1 dc-new2 stop ; ```

       In this example all incoming metrics are first sent to dc-old,
       then dc-new1 and finally to dc-new2. Note that the cluster type of
       dc-old is different. Each incoming metric will be sent to 2
       members of all three clusters, thus replicating to 6 destinations
       in total. For each cluster the destination members are computed
       independently. Failure of clusters or members does not affect the
       others, since all have individual queues. The above example could
       also be written using three match rules, one for each dc, or one
       match rule for all three dcs. The difference is mainly in
       performance: the number of times the incoming metric has to be
       matched against an expression. The stop rule in the dc-new match
       rule is not strictly necessary in this example, because no more
       match rules follow. However, if the match would target a specific
       subset, e.g. ^sys\., and more clusters would be defined, this
       could be necessary, as for instance in the following abbreviated
       example:

       ``` cluster dc1-sys ... ; cluster dc2-sys ... ;

       cluster dc1-misc ... ; cluster dc2-misc ... ;

       match ^sys. send to dc1-sys; match ^sys. send to dc2-sys stop;

       match * send to dc1-misc; match * send to dc2-misc stop; ```

       As can be seen, without the stop in dc2-sys' match rule, all
       metrics starting with sys. would also be sent to dc1-misc and
       dc2-misc. It may be that this is desired, of course, but in this
       example there is a dedicated cluster for the sys metrics.

       Suppose there is some unwanted metric that unfortunately gets
       generated, say by some bad or old software, and we do not want to
       store it. The blackhole cluster is suitable for that, when it is
       harder to actually whitelist all wanted metrics. Consider the
       following:

       match some_legacy1$ some_legacy2$ send to blackhole stop;

       This would throw away all metrics ending in some_legacy1 or
       some_legacy2 that would otherwise be hard to filter out. Since the
       order matters, it can be used in a construct like this:
521
       ```
       cluster old ... ;
       cluster new ... ;

       match * send to old;

       match unwanted send to blackhole stop;

       match * send to new;
       ```
529
       In this example the old cluster would receive the metric that is
       unwanted for the new cluster. So, the order in which the rules
       occur does matter for the execution.
533
       Validation can be used to ensure the data for metrics is as
       expected. A global validation for integer-only (no floating point)
       values could be:

       match * validate ^[0-9]+\ [0-9]+$ else drop ;
538
       (Note the escape of the space with a backslash \. You might be able
       to use \s or [:space:] instead; this depends on the configured
       regex implementation.)
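       As an illustration only (this is not relay code), the validation
       expression above can be exercised in Python, assuming the relay
       applies the expression to the data part of each line, i.e. "value
       timestamp":

```python
import re

# Sketch of the integer-only validation above; the pattern is applied
# to the data part of a metric line: "value timestamp".
pattern = re.compile(r'^[0-9]+ [0-9]+$')

def is_valid(data):
    # True when the data is an integer value followed by an integer
    # timestamp, separated by exactly one space.
    return pattern.match(data) is not None

print(is_valid("42 1595000000"))   # integer value: accepted
print(is_valid("4.2 1595000000"))  # floating point: rejected
```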
542
543       The  validation  clause can exist on every match rule, so in principle,
544       the following is valid:
545
       match ^foo
           validate ^[0-9]+\ [0-9]+$ else drop
           send to integer-cluster ;
       match ^foo
           validate ^[0-9.e+-]+\ [0-9.e+-]+$ else drop
           send to float-cluster stop ;
549
       Note that the behaviour differs between the previous two examples.
       When no send to clusters are specified, a validation failure makes
       the match behave as if the stop keyword were present; likewise,
       when validation passes, processing continues with the next rule.
       When destination clusters are present, the match respects the stop
       keyword as usual: processing always stops when it is specified.
       However, if validation fails, the rule does not send anything to
       the destination clusters; the metric will be dropped or logged, but
       never sent.
558
       The relay is capable of rewriting incoming metrics on the fly. This
       process is based on regular expressions with capture groups that
       allow substituting parts in a replacement string. Rewrite rules can
       be used to clean up metrics from applications, or to provide a
       migration path. In its simplest form a rewrite rule looks like
       this:
564
       rewrite ^server\.(.+)\.(.+)\.([a-zA-Z]+)([0-9]+)
           into server.\_1.\2.\3.\3\4 ;
567
       In this example a metric like server.DC.role.name123 would be
       transformed into server.dc.role.name.name123. As for matches, the
       order of rewrite rules matters. Hence, to build on top of the
       old/new cluster example given earlier, the following would store
       the original metric name in the old cluster, and the new metric
       name in the new cluster:
574
       ```
       match * send to old;

       rewrite ... ;

       match * send to new;
       ```
580
581       Note  that  after  the  rewrite,  the original metric name is no longer
582       available, as the rewrite happens in-place.
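       To illustrate the transformation, the rewrite rule above can be
       emulated in Python; the \_1 lowercasing of the first capture group
       is mimicked with .lower() (a sketch, not the relay's
       implementation):

```python
import re

# Illustration of the rewrite rule above; \_1 in carbon-c-relay's
# replacement syntax lowercases the first capture group, which is
# emulated here by hand.
pattern = re.compile(r'^server\.(.+)\.(.+)\.([a-zA-Z]+)([0-9]+)')

def rewrite(metric):
    m = pattern.match(metric)
    if m is None:
        return metric  # no match: metric passes through unchanged
    dc, role, name, num = m.groups()
    # replacement template: server.\_1.\2.\3.\3\4
    return "server.%s.%s.%s.%s%s" % (dc.lower(), role, name, name, num)

print(rewrite("server.DC.role.name123"))  # server.dc.role.name.name123
```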
583
584       Aggregations are probably the most complex part of carbon-c-relay.  Two
585       ways  of  specifying  aggregates  are  supported by carbon-c-relay. The
586       first, static rules, are handled by an optimiser which  tries  to  fold
587       thousands of rules into groups to make the matching more efficient. The
588       second, dynamic rules, are very powerful compact definitions with  pos‐
589       sibly  thousands  of internal instantiations. A typical static aggrega‐
590       tion looks like:
591
       aggregate
               ^sys\.dc1\.somehost-[0-9]+\.somecluster\.mysql\.replication_delay
               ^sys\.dc2\.somehost-[0-9]+\.somecluster\.mysql\.replication_delay
           every 10 seconds
           expire after 35 seconds
           timestamp at end of bucket
           compute sum write to
               mysql.somecluster.total_replication_delay
           compute average write to
               mysql.somecluster.average_replication_delay
           compute max write to
               mysql.somecluster.max_replication_delay
           compute count write to
               mysql.somecluster.replication_delay_metric_count
           ;
599
       In this example, four aggregations are produced from the incoming
       matching metrics. The two match expressions could have been written
       as one, but for demonstration purposes they were not. Obviously
       they can refer to different metrics, if that makes sense. The every
       10 seconds clause specifies in what interval the aggregator can
       expect new metrics to arrive. This interval is used to produce the
       aggregations, so every 10 seconds 4 new metrics are generated from
       the data received so far. Because data may be in transit for some
       reason, or generation stalled, the expire after clause specifies
       how long the data should be kept before considering a data bucket
       (which is aggregated) to be complete. In the example, 35 was used,
       which means the first aggregates are produced after 35 seconds. It
       also means that metrics can arrive 35 seconds late and still be
       taken into account. The exact time at which the aggregate metrics
       are produced is random, between 0 and interval (10 in this case)
       seconds after the expiry time. This is done to prevent thundering
       herds of metrics for large aggregation sets. The timestamp used for
       the aggregations can be specified to be the start, middle or end of
       the bucket. The original carbon-aggregator.py uses start, while
       carbon-c-relay's default has always been end. The compute clauses
       demonstrate that a single aggregation rule can produce multiple
       aggregates, as often is the case. Internally, this comes for free,
       since all possible aggregates are always calculated, whether or not
       they are used. The produced new metrics are resubmitted to the
       relay, so matches defined earlier in the configuration can match
       the output of the aggregator. It is important to avoid loops that
       can be created this way. In general, it is good practice to split
       aggregations out to their own carbon-c-relay instance, such that
       the produced metrics can easily be forwarded to another relay
       instance.
628
629       The previous example could also be written as follows to be dynamic:
630
       aggregate
               ^sys\.dc[0-9]\.(somehost-[0-9]+)\.([^.]+)\.mysql\.replication_delay
           every 10 seconds
           expire after 35 seconds
           compute sum write to
               mysql.host.\1.replication_delay
           compute sum write to
               mysql.host.all.replication_delay
           compute sum write to
               mysql.cluster.\2.replication_delay
           compute sum write to
               mysql.cluster.all.replication_delay
           ;
637
       Here a single match results in four aggregations, each of a
       different scope. In this example, aggregations based on hostname
       and on cluster are made, as well as the more general all targets,
       which in this example both have identical values. Note that with
       this single aggregation rule, per-cluster, per-host and total
       aggregations are all produced. Obviously, the input metrics
       determine which host and cluster aggregates are produced.
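       For illustration, with a hypothetical input metric (the host and
       cluster names are invented), the captures of the dynamic rule above
       expand to the following four output names:

```python
import re

# Hypothetical input metric; shows the four output names the \1 (host)
# and \2 (cluster) captures of the dynamic rule above produce.
pattern = re.compile(
    r'^sys\.dc[0-9]\.(somehost-[0-9]+)\.([^.]+)\.mysql\.replication_delay')
m = pattern.match('sys.dc1.somehost-7.clusterA.mysql.replication_delay')
host, cluster = m.groups()
outputs = [
    'mysql.host.%s.replication_delay' % host,        # per-host
    'mysql.host.all.replication_delay',              # all hosts
    'mysql.cluster.%s.replication_delay' % cluster,  # per-cluster
    'mysql.cluster.all.replication_delay',           # all clusters
]
print(outputs)
```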
645
       With use of the send to clause, aggregations can be made more
       intuitive and less error-prone. Consider the example below:
648
       ```
       cluster graphite fnv1a_ch ip1 ip2 ip3;

       aggregate ^sys.somemetric
           every 60 seconds
           expire after 75 seconds
           compute sum write to sys.somemetric
           send to graphite
           stop
           ;

       match * send to graphite;
       ```
655
       This sends all incoming metrics to the graphite cluster, except the
       sys.somemetric ones, which it replaces with a sum of all the
       incoming ones. Without a stop in the aggregate, this causes a loop,
       and without the send to, the metric could not keep its original
       name, for the aggregate output now goes directly to the cluster.
661

STATISTICS

       When carbon-c-relay is run without the -d or -s arguments,
       statistics will be produced. By default they are sent to the relay
       itself in the form of carbon.relays.<hostname>.*. See the
       statistics construct to override this prefix, the sending interval
       and the values produced. While many metrics have a similar name to
       what carbon-cache.py would produce, their values are likely
       different. By default, most values are running counters which only
       increase over time. Graphite's nonNegativeDerivative() function is
       useful with these.
671
672       The  following  metrics are produced under the carbon.relays.<hostname>
673       namespace:
674
675       ○   metricsReceived
676
677           The number of metrics that were received  by  the  relay.  Received
678           here  means  that  they  were seen and processed by any of the dis‐
679           patchers.
680
681       ○   metricsSent
682
           The number of metrics that were sent from the relay. This is a
           total count for all servers combined. When incoming metrics are
           duplicated by the cluster configuration, this counter will
           include all those duplications. In other words, it is the
           number of metrics that were successfully sent to other systems.
           Note that metrics that are processed (received) but still in
           the sending queue (queued) are not included in this counter.
690
691       ○   metricsDiscarded
692
693           The number of input lines that were not considered to  be  a  valid
694           metric.  Such  lines  can  be empty, only containing whitespace, or
695           hitting the limits given for max input  length  and/or  max  metric
696           length (see -m and -M options).
697
698       ○   metricsQueued
699
700           The  total  number  of metrics that are currently in the queues for
701           all the server targets. This metric is not cumulative, for it is  a
702           sample  of  the  queue size, which can (and should) go up and down.
703           Therefore you should not use the derivative function for this  met‐
704           ric.
705
706       ○   metricsDropped
707
           The total number of metrics that had to be dropped due to
           server queues overflowing. A queue typically overflows when the
           server it tries to send its metrics to is not reachable, or too
           slow in ingesting the amount of metrics queued. This can be
           network or resource related, and also greatly depends on the
           rate of metrics being sent to the particular server.
714
715       ○   metricsBlackholed
716
717           The number of metrics that did not match any  rule,  or  matched  a
718           rule  with  blackhole as target. Depending on your configuration, a
719           high value might be an indication of a misconfiguration  somewhere.
720           These  metrics were received by the relay, but never sent anywhere,
721           thus they disappeared.
722
723       ○   metricStalls
724
           The number of times the relay had to stall a client to indicate
           that the downstream server cannot handle the stream of metrics.
           A stall is only performed when the queue is full and the server
           is actually receptive to metrics, but just too slow at the
           moment. Stalls typically happen during micro-bursts, where the
           client is unaware that it should stop sending more data, while
           it is able to do so.
732
733       ○   connections
734
735           The number of connect requests handled. This is an ever  increasing
736           number just counting how many connections were accepted.
737
738       ○   disconnects
739
           The number of disconnected clients. A disconnect happens either
           because the client goes away, or due to an idle timeout in the
           relay. The difference between this metric and connections is
           the number of connections actively held by the relay. In normal
           situations this number remains within reasonable bounds. Many
           connections but few disconnects typically indicate a possible
           connection leak in the client. The relay's idle-connection
           disconnect guards against resource drain in such scenarios.
748
749       ○   dispatch_wallTime_us
750
           The number of microseconds spent by the dispatchers to do their
           work. Particularly on multi-core systems this value can be
           confusing; it indicates how long the dispatchers were busy
           handling clients. It includes everything they do, from reading
           data from a socket and cleaning up the input metric, to adding
           the metric to the appropriate queues. The larger the
           configuration, and the more complex in terms of matches, the
           more time the dispatchers will spend on the cpu. But time they
           do not spend on the cpu is included in this number as well: it
           is the pure wallclock time the dispatcher was serving a client.
761
762       ○   dispatch_sleepTime_us
763
           The number of microseconds spent by the dispatchers sleeping
           while waiting for work. When this value gets small (or even
           zero), the dispatcher has so much work that it no longer
           sleeps, and likely can no longer process the work in a timely
           fashion. This value plus the wallTime from above roughly sums
           up to the total uptime of this dispatcher. Therefore,
           expressing the wallTime as a percentage of this sum gives the
           busyness percentage, climbing all the way up to 100% if
           sleepTime goes to 0.
772
773       ○   server_wallTime_us
774
775           The number of microseconds spent by the servers to send the metrics
776           from their queues. This value includes connection creation, reading
777           from the queue, and sending metrics over the network.
778
779       ○   dispatcherX
780
           For each individual dispatcher, the metrics received and
           blackholed, plus the wall clock time. The values are as
           described above.
783
784       ○   destinations.X
785
786           For  all known destinations, the number of dropped, queued and sent
787           metrics plus the wall clock time spent. The values are as described
788           above.
789
790       ○   aggregators.metricsReceived
791
           The number of metrics that matched an aggregator rule and were
           accepted by the aggregator. When a metric matches multiple
           aggregators, this value will reflect that. A metric is not
           counted when it is considered syntactically invalid, e.g. when
           no value was found.
796
797       ○   aggregators.metricsDropped
798
           The number of metrics that were sent to an aggregator, but did
           not fit timewise, because the metric was too far in the past or
           the future. The expire after clause in aggregate statements
           controls how long in the past metric values are accepted.
803
804       ○   aggregators.metricsSent
805
806           The  number  of  metrics that were sent from the aggregators. These
807           metrics were produced and are the actual results of aggregations.
808
809
810

BUGS

812       Please report them at: https://github.com/grobian/carbon-c-relay/issues
813

AUTHOR

815       Fabian Groffen <grobian@gentoo.org>
816

SEE ALSO

818       All other utilities from the graphite stack.
819
       This project aims to be a fast replacement of the original Carbon
       relay (http://graphite.readthedocs.org/en/1.0/carbon-daemons.html#carbon-relay-py),
       delivering performance and configurability. Carbon is single
       threaded, and sending metrics to multiple consistent-hash clusters
       requires chaining of relays. This project provides a multithreaded
       relay which can address multiple targets and clusters for each and
       every metric, based on pattern matches.
827
       There are a couple more replacement projects out there:
       carbon-relay-ng (https://github.com/graphite-ng/carbon-relay-ng)
       and graphite-relay
       (https://github.com/markchadwick/graphite-relay).
831
       Compared to carbon-relay-ng, this project does provide carbon's
       consistent-hash routing. graphite-relay, which does provide that,
       does not do metric-based matches to direct the traffic, which this
       project does. To date, carbon-c-relay can also do aggregations,
       failover targets and more.
837

ACKNOWLEDGEMENTS

839       This program was originally developed for Booking.com,  which  approved
840       that  the code was published and released as Open Source on GitHub, for
841       which the author would like to express his gratitude.  Development  has
842       continued since with the help of many contributors suggesting features,
843       reporting bugs, adding patches and more  to  make  carbon-c-relay  into
844       what it is today.
845
846
847
848                                   July 2020                 CARBON-C-RELAY(1)