1srf2fastq(1) Staden io_lib srf2fastq(1)
2
3
4
6 srf2fastq - Converts SRF files to Sanger fastq format
7
8
10 srf2fastq [options] srf_archive ...
11
12
14 srf2fastq extracts sequences and qualities from one or more SRF ar‐
15 chives and writes them in Sanger fastq format to stdout.
16
17 Note that Illumina also have a fastq format (used in the GERALD direc‐
18 tories) which differs slightly in the use of log-odds scores for the
19 quality values. The format described here is using the traditional
20 Phred style of quality encoding.
21
22
24 -c Outputs calibrated confidence values using the ZTR CNF1 chunk
25 type for a single quality per base. Without this use the origi‐
26 nal Illumina _prb.txt files consisting of four quality values
27 per base, stored in the ZTR CNF4 chunks.
28
29 -C Masks out sequences tagged as bad quality.
30
31 -s root
32 Generates files on disk with filenames starting root, one file
33 per non-explicit element in the SRF/ZTR region (REGN) chunk.
34 Typically this results in two files for paired end runs. The
35 filename suffixes come from the names listed in the SRF region
36 chunks. This option conflicts with the -S parameter.
37
38 -S Splits sequences into regions, but sequentially lists each
39 sequence region to stdout instead of splitting to separate files
40 on disk. This option conflicts with the -s parameter.
41
42 -n When using -s the filename suffixes are simply numbered (start‐
43 ing with 1) instead of using the names listed in the SRF region
44 chunks.
45
46 -a Appends region index to the sequence names. Ie generate "name/1"
47 and "name/2" for a paired read.
48
49 -e Include any explicit sequence (ZTR region chunk of type 'E') in
50 the sequence output. The explicit sequence is also included in
51 the quality line too. Currently this is utilised by ABI SOLiD to
52 store the last base of the primer.
53
54 -r region list
55 Reverse complements the sequence and reverses the quality values
56 for all regions in the region list. This is a comma separated
57 list of integer values enumerating the regions, starting from 1.
58 Note that this option only works when either -s or -S are speci‐
59 fied.
60
61
63 To extract only the good quality sequences from all srf files in the
64 current directory using calibrated confidence values (if available).
65
66 srf2fastq -c -C *.srf > runX.fastq
67
68 To extract a paired end run into two separate files with sequences
69 named name/1 and name/2.
70
71 srf2fastq -s runX -a -n runX.srf
72
73 To extract a paired end run as a single file, alternating forward and
74 reverse sequences, with the second read being reverse complemented.
75
76 srf2fastq -S -r 2 runX.srf > runX.fastq
77
79 James Bonfield, Steven Leonard - Wellcome Trust Sanger Institute
80
81
82
83 December 10 srf2fastq(1)