snapraid(1)

1SnapRAID Backup For Disk ArraGyesn(e1r)al CommandsSMnaanpuRaAlID Backup For Disk Arrays(1)
2
3
4

NAME

6       snapraid - SnapRAID Backup For Disk Arrays
7

SYNOPSIS

9       snapraid [-c, --conf CONFIG]
10            [-f, --filter PATTERN] [-d, --filter-disk NAME]
11            [-m, --filter-missing] [-e, --filter-error]
12            [-a, --audit-only] [-h, --pre-hash] [-i, --import DIR]
13            [-p, --plan PERC|bad|new|full]
14            [-o, --older-than DAYS] [-l, --log FILE]
15            [-Z, --force-zero] [-E, --force-empty]
16            [-U, --force-uuid] [-D, --force-device]
17            [-N, --force-nocopy] [-F, --force-full]
18            [-R, --force-realloc]
19            [-S, --start BLKSTART] [-B, --count BLKCOUNT]
20            [-L, --error-limit NUMBER]
21            [-v, --verbose] [-q, --quiet]
22            status|smart|up|down|diff|sync|scrub|fix|check|list|dup
23            |pool|devices|touch|rehash
24       snapraid [-V, --version] [-H, --help] [-C, --gen-conf CONTENT]

DESCRIPTION

26       SnapRAID is a backup program for disk arrays. It stores parity informa‐
27       tion of your data and it recovers from up to six disk failures.
28
29       SnapRAID is mainly targeted for a home media center, with a lot of  big
30       files that rarely change.
31
32       Beside  the  ability  to  recover from disk failures, other features of
33       SnapRAID are:
34       ·      All your data is hashed to ensure data integrity  and  to  avoid
35              silent corruption.
36       ·      If  the  failed disks are too many to allow a recovery, you lose
37              the data only on the failed disks.  All the data  in  the  other
38              disks is safe.
39       ·      If you accidentally delete some files in a disk, you can recover
40              them.
41       ·      You can start with already filled disks.
42       ·      The disks can have different sizes.
43       ·      You can add disks at any time.
44       ·      It doesn´t lock-in your data. You can stop using SnapRAID at any
45              time without the need to reformat or move data.
46       ·      To  access a file, a single disk needs to spin, saving power and
47              producing less noise.
48
49       The official site of SnapRAID is:
50
51           http://www.snapraid.it/

LIMITATIONS

53       SnapRAID is in between a RAID and a Backup program trying  to  get  the
54       best  benefits  of them. Although it also has some limitations that you
55       should consider before using it.
56
57       The main one is that if a disk fails, and you haven´t recently  synced,
58       you may be unable to do a complete recover.  More specifically, you may
59       be unable to recover up to the size of the amount  of  the  changed  or
60       deleted  files  from the last sync operation.  This happens even if the
61       files changed or deleted are not in the failed disk. This is the reason
62       because SnapRAID is better suited for data that rarely change.
63
64       Instead the new added files don´t prevent the recovering of the already
65       existing files. You may only lose the just added files, if they are  on
66       the failed disk.
67
68       Other limitations are:
69       ·      You have different file-systems for each disk.  Using a RAID you
70              have only a big file-system.
71       ·      It doesn´t stripe data.  With RAID you get a  speed  boost  with
72              striping.
73       ·      It  doesn´t  support  real-time  recovery.  With RAID you do not
74              have to stop working when a disk fails.
75       ·      It´s able to recover damages  only  from  a  limited  number  of
76              disks.   With  a  Backup you are able to recover from a complete
77              failure of the whole disk array.
78       ·      Only file, time-stamps, symlinks and hardlinks are saved.   Per‐
79              missions, ownership and extended attributes are not saved.
80

GETTING STARTED

82       To use SnapRAID you need to first select one disk of your disk array to
83       dedicate at the "parity" information. With one disk for parity you will
84       be able to recover from a single disk failure, like RAID5.
85
86       If  you want to be able to recover from more disk failures, like RAID6,
87       you must reserve additional disks for  parity.  Any  additional  parity
88       disk allow to recover from one more disk failure.
89
90       As  parity  disks,  you have to pick the biggest disks in the array, as
91       the parity information may grow in size as the biggest data disk in the
92       array.
93
94       These  disks will be dedicated to store the "parity" files.  You should
95       not store your data in them.
96
97       Then you have to define the "data" disks that you want to protect  with
98       SnapRAID.  The protection is more effective if these disks contain data
99       that rarely change. For this reason it´s better to DO NOT  include  the
100       Windows C:\ disk, or the Unix /home, /var and /tmp disks.
101
102       The  list  of  files is saved in the "content" files, usually stored in
103       the data, parity or boot disks.  These files  contain  the  details  of
104       your  backup,  with  all  the  check-sums to verify its integrity.  The
105       "content" file is stored in multiple copies, and each one must be in  a
106       different  disk,  to ensure that in even in case of multiple disk fail‐
107       ures at least one copy is available.
108
109       For example, suppose that you are interested only at one  parity  level
110       of protection, and that your disks are present in:
111
112           /mnt/diskp <- selected disk for parity
113           /mnt/disk1 <- first disk to protect
114           /mnt/disk2 <- second disk to protect
115           /mnt/disk3 <- third disk to protect
116       you  have  to create the configuration file /etc/snapraid.conf with the
117       following options:
118
119           parity /mnt/diskp/snapraid.parity
120           content /var/snapraid/snapraid.content
121           content /mnt/disk1/snapraid.content
122           content /mnt/disk2/snapraid.content
123           data d1 /mnt/disk1/
124           data d2 /mnt/disk2/
125           data d3 /mnt/disk3/
126       If you are in Windows, you should use the  Windows  path  format,  with
127       drive letters and backslashes instead of slashes.
128
129           parity E:\snapraid.parity
130           content C:\snapraid\snapraid.content
131           content F:\array\snapraid.content
132           content G:\array\snapraid.content
133           data d1 F:\array\
134           data d2 G:\array\
135           data d3 H:\array\
136       If you have many disks, and you run out of drive letters, you can mount
137       disks directly in sub folders. See:
138
139           https://www.google.com/search?q=Windows+mount+point
140       At this point you are ready to start the "sync" command  to  build  the
141       parity information.
142
143           snapraid sync
144       This  process may take some hours the first time, depending on the size
145       of the data already present in the disks. If the disks  are  empty  the
146       process is immediate.
147
148       You  can  stop  it  at any time pressing Ctrl+C, and at the next run it
149       will start where interrupted.
150
151       When this command completes, your data is SAFE.
152
153       Now you can start using your array as you like, and periodically update
154       the parity information running the "sync" command.
155
156   Scrubbing
157       To  periodically  check the data and parity for errors, you can run the
158       "scrub" command.
159
160           snapraid scrub
161       This command verifies the data in your array comparing it with the hash
162       computed in the "sync" command.
163
164       Every run of the command checks about the 8% of the array, but not data
165       already scrubbed in the previous 10 days.  You can use the  -p,  --plan
166       option  to  specify a different amount, and the -o, --older-than option
167       to specify a different age in days.  For example, to check  5%  of  the
168       array older than 20 days use:
169
170           snapraid -p 5 -o 20 scrub
171       If  during  the  process,  silent or input/output errors are found, the
172       corresponding blocks are marked as  bad  in  the  "content"  file,  and
173       listed in the "status" command.
174
175           snapraid status
176       To  fix  them,  you  can use the "fix" command filtering for bad blocks
177       with the -e, --filter-error options:
178
179           snapraid -e fix
180       At the next "scrub" the errors will disappear from the "status"  report
181       if  really  fixed.  To  make  it fast, you can use -p bad to scrub only
182       blocks marked as bad.
183
184           snapraid -p bad scrub
185       Take care that running "scrub" on a not  synced  array  may  result  in
186       errors  caused  by removed or modified files. These errors are reported
187       in the "scrub" result, but related blocks are not marked as bad.
188
189   Pooling
190       To have all the files in your array shown in the same  directory  tree,
191       you  can  enable  the  "pooling"  feature.  It  consists  in creating a
192       read-only virtual view of all the files in your  array  using  symbolic
193       links.
194
195       You  can  configure  the  "pooling" directory in the configuration file
196       with:
197
198           pool /pool
199       or, if you are in Windows, with:
200
201           pool C:\pool
202       and then run the "pool" command to create or update the virtual view.
203
204           snapraid pool
205       If you are using a Unix platform and you want to share  such  directory
206       in  the  network  to either Windows or Unix machines, you should add to
207       your /etc/samba/smb.conf the following options:
208
209           # In the global section of smb.conf
210           unix extensions = no
211           # In the share section of smb.conf
212           [pool]
213           comment = Pool
214           path = /pool
215           read only = yes
216           guest ok = yes
217           wide links = yes
218           follow symlinks = yes
219       In Windows the  same  sharing  operation  is  not  so  straightforward,
220       because  Windows  shares  the  symbolic  links  as  they  are, and that
221       requires the network clients to resolve them remotely.
222
223       To make it working, besides sharing in the network the pool  directory,
224       you  must also share all the disks independently, using as share points
225       the disk names as defined in the  configuration  file.  You  must  also
226       specify  in  the  "share" option of the configure file, the Windows UNC
227       path that remote clients needs to use to access such shared disks.
228
229       For example, operating from a server named "darkstar", you can use  the
230       options:
231
232           data d1 F:\array\
233           data d2 G:\array\
234           data d3 H:\array\
235           pool C:\pool
236           share \\darkstar
237       and share the following dirs in the network:
238
239           \\darkstar\pool -> C:\pool
240           \\darkstar\d1 -> F:\array
241           \\darkstar\d2 -> G:\array
242           \\darkstar\d3 -> H:\array
243       to allow remote clients to access all the files at \\darkstar\\pool.
244
245       You may also need to configure remote clients enabling access at remote
246       symlinks with the command:
247
248           fsutil behavior set SymlinkEvaluation L2L:1 R2R:1 L2R:1 R2L:1
249   Undeleting
250       SnapRAID is more like a backup program than a RAID system, and  it  can
251       be  used to restore or undelete files to their previous state using the
252       -f, --filter option :
253
254           snapraid fix -f FILE
255       or for a directory:
256
257           snapraid fix -f DIR/
258       You can also use it to recover only accidentally deleted files inside a
259       directory  using  the  -m,  --filter-missing option, that restores only
260       missing files, leaving untouched all the others.
261
262           snapraid fix -m -f DIR/
263       Or to recover all the deleted files in all the drives with:
264
265           snapraid fix -m
266   Recovering
267       The worst happened, and you lost one or more disks!
268
269       DO NOT PANIC! You will be able to recover them!
270
271       The first thing you have to do is to avoid further changes at your disk
272       array.   Disable  any  remote  connection to it, any scheduled process,
273       including any scheduled SnapRAID nightly sync or scrub.
274
275       Then proceed with the following steps.
276
277   STEP 1 -> Reconfigure
278       You need some space to recover, even better if you already  have  addi‐
279       tional spare disks, but in case, also an external USB or remote disk is
280       enough.
281
282       Change the SnapRAID configuration file to make the "data"  or  "parity"
283       option  of  the failed disk to point to the place where you have enough
284       empty space to recover the files.
285
286       For example, if you have that disk "d1" failed, you can change from:
287
288           data d1 /mnt/disk1/
289       to:
290
291           data d1 /mnt/new_spare_disk/
292       If the disk to recover is a parity disk, change the  appropriate  "par‐
293       ity"  option.  If you have more broken disks, change all their configu‐
294       ration options.
295
296   STEP 2 -> Fix
297       Run the fix command, storing the log in an external file with:
298
299           snapraid -d NAME -l fix.log fix
300       Where NAME is the name of the disk, like "d1" as in our previous  exam‐
301       ple.   In  case the disk to recover is a parity disk, use the "parity",
302       "2-parity" names.  If you have  more  broken  disks,  use  multiple  -d
303       options to specify all of them.
304
305       This command will take a long time.
306
307       Take  care  that  you need also few gigabytes free to store the fix.log
308       file.  Run it from a disk with some free space.
309
310       Now you have recovered all the recoverable. If some file  is  partially
311       or  totally  unrecoverable,  it will be renamed adding the ".unrecover‐
312       able" extension.
313
314       You can get a detailed list of all  the  unrecoverable  blocks  in  the
315       fix.log file checking all the lines starting with "unrecoverable:"
316
317       If  you  are  not satisfied of the recovering, you can retry it as many
318       time you wish.
319
320       For example, if you have removed files from the array  after  the  last
321       "sync",  this  may  result  in some other files not recovered.  In this
322       case, you can retry the "fix" using the -i, --import option, specifying
323       where  these  files  are  now,  to include them again in the recovering
324       process.
325
326       If you are satisfied of the recovering, you can  now  proceed  further,
327       but  take  care  that  after syncing you cannot retry the "fix" command
328       anymore!
329
330   STEP 3 -> Check
331       As paranoid check, you can now run a "check"  command  to  ensure  that
332       everything is OK on the recovered disk.
333
334           snapraid -d NAME -a check
335       Where  NAME is the name of the disk, like "d1" as in our previous exam‐
336       ple.
337
338       The options -d and -a tell SnapRAID to check only the  specified  disk,
339       and ignore all the parity data.
340
341       This  command  will  take a long time, but if you are not paranoid, you
342       can skip it.
343
344   STEP 4 -> Sync
345       Run the "sync" command to resynchronize the array with the new disk.
346
347           snapraid sync
348       If everything is recovered, this command is immediate.
349

COMMANDS

351       SnapRAID provides a few simple commands that allow to:
352       ·      Prints the status of the array -> "status"
353       ·      Controls the disks -> "smart", "up", "down"
354       ·      Makes a backup/snapshot -> "sync"
355       ·      Periodically checks data -> "scrub"
356       ·      Restore the last backup/snapshot -> "fix".
357
358       Take care that the commands have to be written in lower case.
359
360   status
361       Prints a summary of the state of the disk array.
362
363       It includes information about the parity fragmentation, how old are the
364       blocks without checking, and all the recorded silent errors encountered
365       while scrubbing.
366
367       Note that the information presented refers at the latest time  you  run
368       "sync". Later modifications are not taken into account.
369
370       If  bad  blocks  were detected, their block numbers are listed.  To fix
371       them, you can use the "fix -e" command.
372
373       It also shows a graph representing the the last  time  each  block  was
374       scrubbed  or  synced. Scrubbed blocks are shown with ´*´, blocks synced
375       but not yet scrubbed with ´o´.
376
377       Nothing is modified.
378
379   smart
380       Prints a SMART report of all the disks of the array.
381
382       It includes an estimation of the probability of  failure  in  the  next
383       year  allowing  to plan maintenance replacements of the disks that show
384       suspicious attributes.
385
386       This probability estimation obtained correlating the  SMART  attributes
387       of the disks, with the Backblaze data available at:
388
389           https://www.backblaze.com/hard-drive-test-data.html
390       If SMART reports that a disk is failing, "FAIL" or "PREFAIL" is printed
391       for that disk, and SnapRAID returns with an error.   In  this  case  an
392       immediate replacement of the disk is highly recommended.
393
394       Other possible strings are:
395           logfail In the past some attributes were lower than the threshold.
396           logerr The device error log contains errors.
397           selferr The device self-test log contains errors.
398
399       If  the -v, --verbose option is specified a deeper statistical analysis
400       is provided. This analysis can help you to decide if you need  more  or
401       less parity.
402
403       This  command  uses  the  "smartctl"  tool,  and it´s equivalent to run
404       "smartctl -a" on all the devices.
405
406       If your devices are not auto-detected correctly, you  can  configure  a
407       custom command using the "smartctl" option in the configuration file.
408
409       Nothing is modified.
410
411   up
412       Spins up all the disks of the array.
413
414       You  can  spin-up  only some specific disks using the -d, --filter-disk
415       option.
416
417       Take care that spinniup-up all the disks at the same time needs  a  lot
418       of power.  Ensure that your power-supply can sustain that.
419
420       Nothing is modified.
421
422   down
423       Spins down all the disks of the array.
424
425       This  command  uses  the  "smartctl"  tool,  and it´s equivalent to run
426       "smartctl -s standby,now" on all the devices.
427
428       You can spin-down only some specific disks using the -d,  --filter-disk
429       option.
430
431       Nothing is modified.
432
433   diff
434       Lists  all  the  files  modified from the last "sync" that need to have
435       their parity data recomputed.
436
437       This command doesn´t check the file data, but only the file  time-stamp
438       size and inode.
439
440       At  the  end  of  the command, you´ll get a summary of the file changes
441       grouped by:
442           equal Files equal at before.
443           added Files added that were not present before.
444           removed Files removed.
445           updated Files with a different size  or  time-stamp,  meaning  that
446               they were modified.
447           moved  Files moved to a different directory of the same disk.  They
448               are identified by having the same name,  size,  time-stamp  and
449               inode, but different directory.
450           copied  Files copied in the same or different disk. Note that if in
451               true they are moved to a different disk, you´ll also have  them
452               counted  in  "removed".  They are identified by having the same
453               name, size, and time-stamp. But if the sub-second time-stamp is
454               zero, then the full path should match, and not only the name.
455           restored  Files  with  a  different  inode  but with name, size and
456               time-stamp matching. These are  usually  files  restored  after
457               being deleted.
458
459       If  a  "sync" is required, the process return code is 2, instead of the
460       default 0. The return code 1 is instead for a generic error condition.
461
462       Nothing is modified.
463
464   sync
465       Updates the parity information. All the  modified  files  in  the  disk
466       array are read, and the corresponding parity data is updated.
467
468       You  can  stop this process at any time pressing Ctrl+C, without losing
469       the work already done.  At the next run the "sync" process  will  start
470       where interrupted.
471
472       If  during  the  process,  silent or input/output errors are found, the
473       corresponding blocks are marked as bad.
474
475       Files are identified by path and/or  inode  and  checked  by  size  and
476       time-stamp.   If  the file size or time-stamp are different, the parity
477       data is recomputed for the whole file.  If the file is moved or renamed
478       in the same disk, keeping the same inode, the parity is not recomputed.
479       If the file is moved to another disk, the parity is recomputed, but the
480       previously computed hash information is kept.
481
482       The  "content" and "parity" files are modified if necessary.  The files
483       in the array are NOT modified.
484
485   scrub
486       Scrubs the array, checking for silent or input/output  errors  in  data
487       and parity disks.
488
489       For  each command invocation, about the 8% of the array is checked, but
490       nothing that was already scrubbed in the last 10 days.  This means that
491       scrubbing  once  a week, every bit of data is checked at least one time
492       every three months.
493
494       You can define a different scrub plan or amount using  the  -p,  --plan
495       option  that  takes  as argument: bad - Scrub blocks marked bad.  new -
496       Scrub just synced blocks not yet scrubbed.  full  -  Scrub  everything.
497       0-100 - Scrub the exact percentage of blocks.
498
499       If  you  specify  a  percentage  amount,  you  can  also  use  the  -o,
500       --older-than option to define how old the block should be.  The  oldest
501       blocks  are  scrubbed  first ensuring an optimal check.  If instead you
502       want to scrub the just synced blocks, not yet scrubbed, you should  use
503       the "-p new" option.
504
505       To get the details of the scrub status use the "status" command.
506
507       For any silent or input/output error found the corresponding blocks are
508       marked as bad in the "content" file.  These bad blocks  are  listed  in
509       "status",  and  can be fixed with "fix -e".  After the fix, at the next
510       scrub they will be rechecked, and if found corrected, the bad mark will
511       be  removed.   To  scrub only the bad blocks, you can use the "scrub -p
512       bad" command.
513
514       It´s recommended to run "scrub" only on a synced  array,  to  avoid  to
515       have  reported  error  caused by unsynced data. These errors are recog‐
516       nized as not being silent errors, and the blocks are not marked as bad,
517       but such errors are reported in the output of the command.
518
519       Files are identified only by path, and not by inode.
520
521       The  "content" file is modified to update the time of the last check of
522       each block, and to mark bad blocks.  The "parity" files are  NOT  modi‐
523       fied.  The files in the array are NOT modified.
524
525   fix
526       Fix all the files and the parity data.
527
528       All  the files and the parity data are compared with the snapshot state
529       saved in the last "sync".  If a difference is found, it´s  reverted  to
530       the stored snapshot.
531
532       The  "fix" command doesn´t differentiate between errors and intentional
533       modifications. It unconditionally reverts the file state  at  the  last
534       "sync".
535
536       If  no  other option is specified the full array is processed.  Use the
537       filter options to select a subset of files or disks to operate on.
538
539       To only fix the blocks marked bad during "sync" and  "scrub",  use  the
540       -e,  --filter-error  option.   As difference from other filter options,
541       with this one the fixes are applied only to files that are not modified
542       from the the latest "sync".
543
544       All  the files that cannot be fixed are renamed adding the ".unrecover‐
545       able" extension.
546
547       Before fixing, the full array is scanned to find any moved file,  after
548       the  last  "sync"  operation.   These  files  are  identified  by their
549       time-stamp, ignoring their name and directory,  and  are  used  in  the
550       recovering process if necessary.  If you moved some of them outside the
551       array, you can use the -i, --import option to specify additional direc‐
552       tories to scan.
553
554       Files are identified only by path, and not by inode.
555
556       The "content" file is NOT modified.  The "parity" files are modified if
557       necessary.  The files in the array are modified if necessary.
558
559   check
560       Verify all the files and the parity data.
561
562       It works like "fix", but it only simulates a recovery and no change  is
563       written in the array.
564
565       This  command is mostly intended for manual verifications, like after a
566       recovery process or in other  special  conditions.   For  periodic  and
567       scheduled checks uses "scrub".
568
569       If  you use the -a, --audit-only option, only the file data is checked,
570       and the parity data is ignored for a faster run.
571
572       Files are identified only by path, and not by inode.
573
574       Nothing is modified.
575
576   list
577       Lists all the files contained in the array at  the  time  of  the  last
578       "sync".
579
580       Nothing is modified.
581
582   dup
583       Lists  all  the  duplicate  files. Two files are assumed equal if their
584       hashes are matching. The file data is not read, but  only  the  precom‐
585       puted hashes are used.
586
587       Nothing is modified.
588
589   pool
590       Creates or updates in the "pooling" directory a virtual view of all the
591       files of your disk array.
592
593       The files are not really copied here, but just  linked  using  symbolic
594       links.
595
596       When  updating, all the present symbolic links and empty subdirectories
597       are deleted and replaced with the new view of the array. Any other reg‐
598       ular file is left in place.
599
600       Nothing is modified outside the pool directory.
601
602   devices
603       Prints the low level devices used by the array.
604
605       This command prints the devices associations in place in the array, and
606       it´s mainly intended as a script interface.
607
608       The first two columns are the low level device id and path.   The  next
609       two  columns  are the high level device id and path.  The latest column
610       if the disk name in the array.
611
612       In most cases you have one low level device for each disk in the array,
613       but  in  some  more  complex  configurations, you may have multiple low
614       level devices used by a single disk in the array.
615
616       Nothing is modified.
617
618   touch
619       Sets arbitrarely the sub-second timestamp of all the files that have it
620       at zero.
621
622       This  improves  the  SnapRAID  capability to recognize moved and copied
623       files as it makes the timestamp almost unique, removing possible dupli‐
624       cates.
625
626       More  specifically, if the sub-second timestamp is not zero, a moved or
627       copied file is identified as such if it  matches  the  name,  size  and
628       timestamp. If instead the sub-second timestamp is zero, it´s considered
629       a copy only if it matches the full path, size and timestamp.
630
631       Note that the second precision timestamp is not modified, and  all  the
632       dates and times of your files will be maintained.
633
634   rehash
635       Schedules a rehash of the whole array.
636
637       This  command changes the hash kind used, typically when upgrading from
638       a 32 bits system to a 64 bits one, to switch from  MurmurHash3  to  the
639       faster SpookyHash.
640
641       If  you  are  already using the optimal hash, this command does nothing
642       and tells you that nothing has to be done.
643
644       The rehash isn´t done immediately, but  it  takes  place  progressively
645       during "sync" and "scrub".
646
647       You can get the rehash state using "status".
648
649       During the rehash, SnapRAID maintains full functionality, with the only
650       exception of "dup" not able to detect duplicated files using a  differ‐
651       ent hash.
652

OPTIONS

654       SnapRAID provides the following options:
655
656       -c, --conf CONFIG
657              Selects  the configuration file to use. If not specified in Unix
658              it´s used the file "/usr/local/etc/snapraid.conf" if it  exists,
659              or  "/etc/snapraid.conf"  otherwise.   In  Windows it´s used the
660              file "snapraid.conf" in the same directory of "snapraid.exe".
661
662       -f, --filter PATTERN
663              Filters the files to process in "check"  and  "fix".   Only  the
664              files  matching  the entered pattern are processed.  This option
665              can be used many  times.   See  the  PATTERN  section  for  more
666              details in the pattern specifications.  In Unix, ensure to quote
667              globbing chars if used.  This  option  can  be  used  only  with
668              "check"  and "fix".  Note that it cannot be used with "sync" and
669              "scrub", because they always process the whole array.
670
671       -d, --filter-disk NAME
672              Filters the disks to process in "check", "fix", "up" and "down".
673              You must specify a disk name as named in the configuration file.
674              You can also specify parity  disks  with  the  names:  "parity",
675              "2-parity",  "3-parity",  ... to limit the operations a specific
676              parity disk.  If you combine more  --filter,  --filter-disk  and
677              --filter-missing  options,  only  files  matching all the set of
678              filters are selected.  This option can be used many times.  This
679              option  can  be  used only with "check", "fix", "up" and "down".
680              Note that it cannot be used with  "sync"  and  "scrub",  because
681              they always process the whole array.
682
683       -m, --filter-missing
684              Filters  the  files  to  process in "check" and "fix".  Only the
685              files missing/deleted from the array are processed.   When  used
686              with  "fix",  this is a kind of "undelete" command.  If you com‐
687              bine more --filter, --filter-disk and --filter-missing  options,
688              only  files  matching all the set of filters are selected.  This
689              option can be used only with "check" and "fix".   Note  that  it
690              cannot  be  used  with  "sync"  and "scrub", because they always
691              process the whole array.
692
693       -e, --filter-error
694              Filters the blocks to process in "check"  and  "fix".   It  pro‐
695              cesses only the blocks marked with silent or input/output errors
696              during "sync" and "scrub", and listed in "status".  This  option
697              can be used only with "check" and "fix".
698
699       -p, --plan PERC|bad|new|full
700              Selects  the  scrub  plan.  If PERC is a numeric value from 0 to
701              100, it´s interpreted as the  percentage  of  blocks  to  scrub.
702              Instead  of  a  percentage,  you  can also specify a plan: "bad"
703              scrubs bad blocks, "new" the blocks not yet scrubbed, and "full"
704              for everything.  This option can be used only with "scrub".
705
706       -o, --older-than DAYS
707              Selects  the  older the part of the array to process in "scrub".
708              DAYS is the minimum age in days for  a  block  to  be  scrubbed,
709              default is 10.  Blocks marked as bad are always scrubbed despite
710              this option.  This option can be used only with "scrub".
711
712       -a, --audit-only
713              In "check" verifies the hash of the files without doing any kind
714              of  check on the parity data.  If you are interested in checking
715              only the file data this option can speedup a  lot  the  checking
716              process.  This option can be used only with "check".
717
718       -h, --pre-hash
719              In  "sync"  runs a preliminary hashing phase of all the new data
720              to have an additional verification before  the  parity  computa‐
721              tion.  Usually in "sync" no preliminary hashing is done, and the
722              new data is hashed just before the parity computation when  it´s
723              read  for  the  first time.  Unfortunately, this process happens
724              when the system is under heavy load, with all disks spinning and
725              with  a busy CPU.  This is an extreme condition for the machine,
726              and if it has  a  latent  hardware  problem,  it´s  possible  to
727              encounter silent errors what cannot be detected because the data
728              is not yet hashed.  To avoid  this  risk,  you  can  enable  the
729              "pre-hash"  mode  and have all the data read two times to ensure
730              its integrity.  This option also verifies the files moved inside
731              the  array, to ensure that the move operation went successfully,
732              and in case to block the sync and to allow to run a  fix  opera‐
733              tion.  This option can be used only with "sync".
734
735       -i, --import DIR
736              Imports  from  the specified directory any file that you deleted
737              from the array after the last "sync".  If you  still  have  such
738              files,  they  could  be used by "check" and "fix" to improve the
739              recover process.  The files are read also in subdirectories  and
740              they  are  identified regardless of their name.  This option can
741              be used only with "check" and "fix".
742
743       -Z, --force-zero
744              Forces the insecure operation of syncing a file with  zero  size
745              that  before  was not.  If SnapRAID detects a such condition, it
746              stops proceeding unless you specify this option.  This allows to
747              easily  detect  when  after  a system crash, some accessed files
748              were truncated.  This is a possible condition in Linux with  the
749              ext3/ext4  file-systems.   This  option  can  be  used only with
750              "sync".
751
752       -E, --force-empty
753              Forces the insecure operation of syncing a  disk  with  all  the
754              original  files missing.  If SnapRAID detects that all the files
755              originally present in the disk  are  missing  or  rewritten,  it
756              stops proceeding unless you specify this option.  This allows to
757              easily detect when a data  file-system  is  not  mounted.   This
758              option can be used only with "sync".
759
760       -U, --force-uuid
761              Forces  the  insecure  operation of syncing, checking and fixing
762              with disks that have changed their UUID.   If  SnapRAID  detects
763              that  some  disks  have changed UUID, it stops proceeding unless
764              you specify this option.  This allows to detect when your  disks
765              are  mounted  in the wrong mount points.  It´s anyway allowed to
766              have a single UUID change with single parity, and more with mul‐
767              tiple  parity,  because  it´s the normal case of replacing disks
768              after a recovery.  This option can be  used  only  with  "sync",
769              "check" or "fix".
770
771       -D, --force-device
772              Forces the insecure operation of fixing with inaccessible disks,
773              or with disks on the same physical device.  Like if you lost two
774              data  disks, and you have a spare disk to recover only the first
775              one, and you want to ignore the second inaccessible disk.  Or if
776              you  want to recover a disk in the free space left in an already
777              used disk, sharing the same physical device.  This option can be
778              used only with "fix".
779
780       -N, --force-nocopy
781              In "sync", "check and "fix", disables the copy detection heuris‐
782              tic.  Without this option SnapRAID assumes that files with  same
783              attributes,  like  name, size and time-stamp are copies with the
784              same data.  This allows to identify copied or moved  files  from
785              one  disk  to  another,  and  to reuse the already computed hash
786              information to detect silent errors or to recover missing files.
787              This  behavior,  in  some  rare cases, may result in false posi‐
788              tives, or in a slow process due the many hash verifications, and
789              this  option  allows  to  resolve them.  This option can be used
790              only with "sync", "check" and "fix".
791
792       -F, --force-full
793              In "sync" forces a full rebuild of the parity.  This option  can
794              be used when you add a new parity level, or if you reverted back
795              to an old content file using a more recent parity data.  Instead
796              of recomputing the parity from scratch, this allows to reuse the
797              hashes present in the content file  to  validate  data,  and  to
798              maintain  data  protection  during  the "sync" process using the
799              parity data you have.  This option can be used only with "sync".
800
801       -R, --force-realloc
802              In "sync" forces a full reallocation of files and rebuild of the
803              parity.   This  option  can be used to completely reallocate all
804              the files removing the fragmentation,  but  reusing  the  hashes
805              present  in  the content file to validate data.  Compared to -F,
806              --force-full, this option reallocates all the parity not  having
807              data  protection  during the operation.  This option can be used
808              only with "sync".
809
810       -l, --log FILE
811              Write a detailed log in the specified file.  If this  option  is
812              not  specified,  unexpected  errors  are  printed on the screen,
813              likely resulting in too much output in case of many errors. When
814              -l, --log is specified, on the screen, go only fatal errors that
815              makes SnapRAID to stop progress.  If the path starts  with  ´>>´
816              the  file is opened in append mode. Occurrences of ´%D´ and ´%T´
817              in the name are replaced with the date and time  in  the  format
818              YYYYMMDD  and  HHMMSS.  Note that in Windows batch files, you´ll
819              have to double the ´%´ char, like  result-%%D.log.  And  to  use
820              ´>>´  you´ll have to enclose the name in ", like ">>result.log".
821              To output the log to standard output or standard error, you  can
822              use respectively ">&1" and ">&2".
823
824       -L, --error-limit
825              Sets  a  new  error limit before stopping execution.  By default
826              SnapRAID stops if  it  encounters  more  than  100  Input/Output
827              errors,  meaning  that  likely  a  disk  is  going to die.  This
828              options affects "sync" and "scrub", that are allowed to continue
829              after the first bunch of disk errors, to try to complete at most
830              their operations.  Instead, "check" and "fix" always stop at the
831              first error.
832
833       -S, --start BLKSTART
834              Starts  the processing from the specified block number. It could
835              be useful to retry to check or fix some specific block, in  case
836              of  a  damaged  disk.   It´s  present mainly for advanced manual
837              recovering.
838
839       -B, --count BLKCOUNT
840              Processes only the specified number  of  blocks.   It´s  present
841              mainly for advanced manual recovering.
842
843       -C, --gen-conf CONTENT_FILE
844              Generates  a  dummy  configuration file from an existing content
845              file.  The configuration file is written in the standard output,
846              and  it  doesn´t  overwrite an existing one.  This configuration
847              file also contains the information  needed  to  reconstruct  the
848              disk mount points, in case you lose the entire system.
849
850       -v, --verbose
851              Prints  more  information on the screen.  If specified one time,
852              it prints excluded files and more stats.   This  option  has  no
853              effect on the log files.
854
855       -q, --quiet
856              Prints  less  information on the screen.  If specified one time,
857              removes the progress bar, if two times, the running  operations,
858              three  times, the info messages, four times the status messages.
859              Fatal errors are always printed on the screen.  This option  has
860              no effect on the log files.
861
862       -H, --help
863              Prints a short help screen.
864
865       -V, --version
866              Prints the program version.
867

CONFIGURATION

869       SnapRAID requires a configuration file to know where your disk array is
870       located, and where storing the parity information.
871
872       In Unix it´s used the file "/usr/local/etc/snapraid.conf" if it exists,
873       or  "/etc/snapraid.conf"  otherwise.   In  Windows  it´s  used the file
874       "snapraid.conf" in the same directory of "snapraid.exe".
875
876       It should contain the following options (case sensitive):
877
878   parity FILE [,FILE] ...
879       Defines the files to use to store the parity information.   The  parity
880       enables the protection from a single disk failure, like RAID5.
881
882       You  can  specify  multiples  files  that should be in different disks.
883       When a file cannot grow anymore, the next one is used.  The total space
884       available must be as big as the biggest data disk in the array.
885
886       You  can  add  additional  parity  files  at later time, but you cannot
887       reorder or remove them.
888
889       Leaving the parity disks reserved for parity ensures  that  it  doesn´t
890       get fragmented, improving the performance.
891
892       In  Windows  256  MB  are left unused in each disk to avoid the warning
893       about full disks.
894
895       This option is mandatory and it can be used only one time.
896
897   (2,3,4,5,6)-parity FILE [,FILE] ...
898       Defines the files to use to store extra parity information.
899
900       For each parity  specified,  one  additional  level  of  protection  is
901       enabled:
902       ·      2-parity enables RAID6 dual parity.
903       ·      3-parity enables triple parity
904       ·      4-parity enables quad (four) parity
905       ·      5-parity enables penta (five) parity
906       ·      6-parity enables hexa (six) parity
907
908       Each parity level requires the precence of all the previous parity lev‐
909       els.
910
911       The same considerations of the ´parity´ option apply.
912
913       These options are optional and they can be used only one time.
914
915   z-parity FILE [,FILE] ...
916       Defines an alternate file and format to store the triple parity.
917
918       This option is an alternative at ´3-parity´ mainly intended for low-end
919       CPUs  like ARM or AMD Phenom, Athlon and Opteron that don´t support the
920       SSSE3 instructions set. In such cases it provides a better performance.
921
922       This format is similar, but faster, at the one used by the ZFS  RAIDZ3.
923       Like ZFS, it doesn´t work beyond triple parity.
924
925       When using ´3-parity´ you will be warned if it´s recommended to use the
926       ´z-parity´ format for a performance improvement.
927
928       It´s possible to convert from one format to another, adjusting the con‐
929       figuration  file  with  the wanted z-parity or 3-parity file, and using
930       ´fix´ to recreate it.
931
932   content FILE
933       Defines the file to use to store the list and  check-sums  of  all  the
934       files present in your disk array.
935
936       It  can  be placed in the disk used to store data, parity, or any other
937       disk available.  If you use a data disk,  this  file  is  automatically
938       excluded from the "sync" process.
939
940       This  option  is  mandatory  and it can be used more times to save more
941       copies of the same files.
942
943       You have to store at least one copy for each parity disk used plus one.
944       Using some more doesn´t hurt.
945
946   data NAME DIR
947       Defines  the  name  and the mount point of the data disks of the array.
948       NAME is used to identify the disk, and it must be unique.  DIR  is  the
949       mount point of the disk in the file-system.
950
951       You  can  change the mount point as you like, as long you keep the NAME
952       fixed.
953
954       You should use one option for each data disk of the array.
955
956       You can rename later a disk, changing the NAME directly in the configu‐
957       ration  file,  and  then run a ´sync´ command.  In the rename case, the
958       association is done using the stored UUID of the disks.
959
960   nohidden
961       Excludes all the hidden files and directory.  In Unix hidden files  are
962       the ones starting with ".".  In Windows they are the ones with the hid‐
963       den attribute.
964
965   exclude/include PATTERN
966       Defines the file or directory patterns to exclude and  include  in  the
967       sync process.  All the patterns are processed in the specified order.
968
969       If  the  first  pattern  that  matches is an "exclude" one, the file is
970       excluded. If it´s an "include" one, the file is included.  If  no  pat‐
971       tern  matches, the file is excluded if the last pattern specified is an
972       "include", or included if the last pattern specified is an "exclude".
973
974       See the PATTERN section for more details in the pattern specifications.
975
976       This option can be used many times.
977
978   blocksize SIZE_IN_KIBIBYTES
979       Defines the basic block size in kibi bytes of  the  parity.   One  kibi
980       bytes is 1024 bytes.
981
982       The default blocksize is 256 and it should work for most cases.
983
984       WARNING!  This  option is for experts only, and it´s highly recommended
985       to not change it. To change again this value in future you´ll  have  to
986       recreate the whole parity!
987
988       A  reason  to  use  a  different hashsize is if you have a lot of small
989       files. In the order of many millions.
990
991       For each file, even of few bytes, a whole block of parity is allocated,
992       and  with  many  files this may result in a lot of unused parity space.
993       And when you completely fill the parity disk, you are  not  allowed  to
994       add  more  files  in the data disks.  Anyway, the wasted parity doesn´t
995       sum between data disks. Wasted space resulting from a  high  number  of
996       files  in a data disk, limits only the amount of data in such data disk
997       and not in others.
998
999       As approximation, you can assume that half of the block size is  wasted
1000       for each file. For example, with 100000 files and a 256 KiB block size,
1001       you are going to waste 13 GB of parity, that may result in 13  GB  less
1002       space available in the data disk.
1003
1004       You  can  get  the  amount of wasted space in each disk using "status".
1005       This is the amount of space that you must leave free in the data disks,
1006       or use for files not included in the array.  If this value is negative,
1007       it means that your are near to fill the parity, and it  represents  the
1008       space you can still waste.
1009
1010       To  avoid  the problem, you can use a bigger partition for parity.  For
1011       example, if you have the parity partition bigger than 13 GB  than  data
1012       disks, you have enough extra space to handle up to 100000 files in each
1013       data disk.
1014
1015       A trick to get a bigger parity partition in Linux, is to format it with
1016       the command:
1017
1018           mkfs.ext4 -m 0 -T largefile4 DEVICE
1019       This  results in about 1.5% of extra space. Meaning about 60 GB for a 4
1020       TB disk, that allows about 460000 files in each data disk  without  any
1021       wasted space.
1022
1023   hashsize SIZE_IN_BYTES
1024       Defines the hash size in bytes of the saved blocks.
1025
1026       The  default  hashsize  is  16 bytes (128 bits), and it should work for
1027       most cases.
1028
1029       WARNING! This option is for experts only, and it´s  highly  recommended
1030       to  not  change it. To change again this value in future you´ll have to
1031       recreate the whole parity!
1032
1033       A reason to use a different hashsize is if your system has  small  mem‐
1034       ory.  As  a rule of thumb SnapRAID usually requires 1 GiB of RAM memory
1035       for each 16 TB of data in the array.
1036
1037       Specifically, to store the hashes of the data, SnapRAID requires  about
1038       TS*(1+HS)/BS  bytes of RAM memory.  Where TS is the total size in bytes
1039       of your disk array, BS is the block size in bytes, and HS is  the  hash
1040       size in bytes.
1041
1042       For  example  with 8 disks of 4 TB and a block size of 256 KiB (1 KiB =
1043       1024 bytes), and an hash size of 16, you get:
1044
1045       RAM = (8 * 4 * 10^12) * (1+16) / (256 * 2^10) = 1.93 GiB
1046       Switching to a hash size of 8, you get:
1047
1048       RAM = (8 * 4 * 10^12) * (1+8) / (256 * 2^10) = 1.02 GiB
1049       Switching to a block size of 512, you get:
1050
1051       RAM = (8 * 4 * 10^12) * (1+16) / (512 * 2^10) = 0.96 GiB
1052       Switching to both a hash size of 8, and a block size of 512 you get:
1053
1054       RAM = (8 * 4 * 10^12) * (1+8) / (512 * 2^10) = 0.51 GiB
1055   autosave SIZE_IN_GIGABYTES
1056       Automatically save the state when syncing or scrubbing after the speci‐
1057       fied amount of GB processed.  This option is useful to avoid to restart
1058       from scratch long "sync" commands interrupted by a  machine  crash,  or
1059       any other event that may interrupt SnapRAID.
1060
1061   pool DIR
1062       Defines  the pooling directory where the virtual view of the disk array
1063       is created using the "pool" command.
1064
1065       The directory must already exist.
1066
1067   share UNC_DIR
1068       Defines the Windows UNC path required to access the disks remotely.
1069
1070       If this option is specified, the symbolic links  created  in  the  pool
1071       directory  use  this UNC path to access the disks.  Without this option
1072       the symbolic links generated use only  local  paths,  not  allowing  to
1073       share the pool directory in the network.
1074
1075       The  symbolic links are formed using the specified UNC path, adding the
1076       disk name as specified in the "disk" option,  and  finally  adding  the
1077       file dir and name.
1078
1079       This option is only required for Windows.
1080
1081   smartctl DISK/PARITY OPTIONS...
1082       Defines  a  custom  smartctl command to obtain the SMART attributes for
1083       each disk. This may be required for RAID controllers and for  some  USB
1084       disk that cannot be auto-detected.
1085
1086       DISK  is  the same disk name specified in the "disk" option.  PARITY is
1087       one of the parity name as "parity,(1,2,3,4,5,6,z)-parity".
1088
1089       In the specified OPTIONS, the "%s" string is  replaced  by  the  device
1090       name. Note that in case of RAID controllers the device is likely fixed,
1091       and you don´t have to use "%s".
1092
1093       Refers at the smartmontools documentation about the possible options:
1094
1095           https://www.smartmontools.org/wiki/Supported_RAID-Controllers
1096           https://www.smartmontools.org/wiki/Supported_USB-Devices
1097   Examples
1098       An example of a typical configuration for Unix is:
1099
1100           parity /mnt/diskp/snapraid.parity
1101           content /mnt/diskp/snapraid.content
1102           content /var/snapraid/snapraid.content
1103           data d1 /mnt/disk1/
1104           data d2 /mnt/disk2/
1105           data d3 /mnt/disk3/
1106           exclude /lost+found/
1107           exclude /tmp/
1108           smartctl d1 -d sat %s
1109           smartctl d2 -d usbjmicron %s
1110           smartctl parity -d areca,1/1 /dev/sg0
1111           smartctl 2-parity -d areca,2/1 /dev/sg0
1112       An example of a typical configuration for Windows is:
1113
1114           parity E:\snapraid.parity
1115           content E:\snapraid.content
1116           content C:\snapraid\snapraid.content
1117           data d1 G:\array\
1118           data d2 H:\array\
1119           data d3 I:\array\
1120           exclude Thumbs.db
1121           exclude \$RECYCLE.BIN
1122           exclude \System Volume Information
1123           smartctl d1 -d sat %s
1124           smartctl d2 -d usbjmicron %s
1125           smartctl parity -d areca,1/1 /dev/arcmsr0
1126           smartctl 2-parity -d areca,2/1 /dev/arcmsr0

PATTERN

1128       Patterns are used to select a subset of files to exclude or include  in
1129       the process.
1130
1131       There are four different types of patterns:
1132
1133       FILE   Selects any file named as FILE. You can use any globbing charac‐
1134              ter like * and ?, and char classes like [a-z].  This pattern  is
1135              applied only to files and not to directories.
1136
1137       DIR/   Selects  any directory named DIR and everything inside.  You can
1138              use any globbing character  like  *  and  ?.   This  pattern  is
1139              applied only to directories and not to files.
1140
1141       /PATH/FILE
1142              Selects  the exact specified file path. You can use any globbing
1143              character like * and ? but they never match a  directory  slash.
1144              This pattern is applied only to files and not to directories.
1145
1146       /PATH/DIR/
1147              Selects  the  exact  specified  directory  path  and  everything
1148              inside. You can use any globbing character like * and ? but they
1149              never  match a directory slash.  This pattern is applied only to
1150              directories and not to files.
1151
1152       Note that when you specify an  absolute  path  starting  with  /,  it´s
1153       applied  at  the  array  root dir and not at the local file-system root
1154       dir.
1155
1156       In Windows you can use the backslash \ instead of the forward slash  /.
1157       Note  that Windows system directories, junctions, mount points, and any
1158       other Windows special directory are treated just as files, meaning that
1159       to exclude them you must use a file rule, and not a directory one.
1160
1161       If  the  file  name you want to use really contains a ´*´, ´?´, ´[´, or
1162       ´]´ char, you have to escape it to avoid to have interpreted as a glob‐
1163       bing  character.  In  Unix the escape char is ´\´, in Windows it´s ´^´.
1164       Note that when the pattern is on the command line, you have  to  double
1165       the  escape  character  to  avoid to have it interpreted by the command
1166       shell.
1167
1168       In the configuration file, you can use different strategies  to  filter
1169       the  files to process.  The simplest one is to use only "exclude" rules
1170       to remove all the files and directories you do not want to process. For
1171       example:
1172
1173           # Excludes any file named "*.unrecoverable"
1174           exclude *.unrecoverable
1175           # Excludes the root directory "/lost+found"
1176           exclude /lost+found/
1177           # Excludes any sub-directory named "tmp"
1178           exclude tmp/
1179       The  opposite way is to define only the file you want to process, using
1180       only "include" rules. For example:
1181
1182           # Includes only some directories
1183           include /movies/
1184           include /musics/
1185           include /pictures/
1186       The final way, is to mix "exclude" and "include" rules.  In  this  case
1187       take care that the order of rules is important. Previous rules have the
1188       precedence over the later ones.  To get things simpler  you  can  first
1189       have all the "exclude" rules and then all the "include" ones. For exam‐
1190       ple:
1191
1192           # Excludes any file named "*.unrecoverable"
1193           exclude *.unrecoverable
1194           # Excludes any sub-directory named "tmp"
1195           exclude tmp/
1196           # Includes only some directories
1197           include /movies/
1198           include /musics/
1199           include /pictures/
1200       On the command line, using the -f option, you can  only  use  "include"
1201       patterns. For example:
1202
1203           # Checks only the .mp3 files.
1204           # Note the "" use to avoid globbing expansion by the shell in Unix.
1205           snapraid -f "*.mp3" check
1206       In  Unix,  when  using  globbing chars in the command line, you have to
1207       quote them. Otherwise the shell will try to expand them.
1208

CONTENT

1210       SnapRAID stores the list and check-sums of your files  in  the  content
1211       file.
1212
1213       It´s  a  binary file, listing all the files present in your disk array,
1214       with all the check-sums to verify their integrity.
1215
1216       This file is read and written by the "sync" and "scrub"  commands,  and
1217       read by "fix", "check" and "status".
1218

PARITY

1220       SnapRAID  stores  the  parity  information  of your array in the parity
1221       files.
1222
1223       They are binary files, containing the computed parity of all the blocks
1224       defined in the "content" file.
1225
1226       These  files are read and written by the "sync" and "fix" commands, and
1227       only read by "scrub" and "check".
1228

ENCODING

1230       SnapRAID in Unix ignores any encoding. It reads  and  stores  the  file
1231       names with the same encoding used by the file-system.
1232
1233       In  Windows  all  the names read from the file-system are converted and
1234       processed in the UTF-8 format.
1235
1236       To have the file names printed correctly you have to  set  the  Windows
1237       console  in  the  UTF-8  mode, with the command "chcp 65001", and use a
1238       TrueType font like "Lucida Console" as console font.  Note that it  has
1239       effect only on the printed file names, if you redirect the console out‐
1240       put to a file, the resulting file is always in the UTF-8 format.
1241

COPYRIGHT

1243       This file is Copyright (C) 2011 Andrea Mazzoleni
1244