1BP_MASK_BY_SEARCH(1)  User Contributed Perl Documentation BP_MASK_BY_SEARCH(1)
2
3
4

NAME

6       mask_by_search - mask sequence(s) based on its alignment results
7

SYNOPSIS

9         mask_by_search.pl -f blast genomefile blastfile.bls > maskedgenome.fa
10

DESCRIPTION

12       Mask sequence based on significant alignments of another sequence.  You
13       need to provide the report file and the entire sequence data which you
14       want to mask.  By default this will assume you have done a TBLASTN (or
15       TFASTY) and try and mask the hit sequence assuming you've provided the
16       sequence file for the hit database.  If you would like to do the
17       reverse and mask the query sequence specify the -t/--type query flag.
18
19       This is going to read in the whole sequence file into memory so for
20       large genomes this may fall over.  I'm using DB_File to prevent keeping
21       everything in memory, one solution is to split the genome into pieces
22       (BEFORE you run the DB search though, you want to use the exact file
23       you BLASTed with as input to this program).
24
25       Below the double dash (--) options are of the form --format=fasta or
26       --format fasta or you can just say -f fasta
27
28       By -f/--format I mean either are acceptable options.  The =s or =n or
29       =c specify these arguments expect a 'string'
30
31       Options:
32           -f/--format=s    Search report format (fasta,blast,axt,hmmer,etc)
33           -sf/--sformat=s  Sequence format (fasta,genbank,embl,swissprot)
34           --hardmask       (booelean) Hard mask the sequence
35                            with the maskchar [default is lowercase mask]
36           --maskchar=c     Character to mask with [default is N], change
37                            to 'X' for protein sequences
38           -e/--evalue=n    Evalue cutoff for HSPs and Hits, only
39                            mask sequence if alignment has specified evalue
40                            or better
41           -o/--out/
42           --outfile=file   Output file to save the masked sequence to.
43           -t/--type=s      Alignment seq type you want to mask, the
44                            'hit' or the 'query' sequence. [default is 'hit']
45           --minlen=n       Minimum length of an HSP for it to be used
46                            in masking [default 0]
47           -h/--help        See this help information
48

AUTHOR - Jason Stajich

50       Jason Stajich, jason-at-bioperl-dot-org.
51
52
53
54perl v5.12.0                      2010-04-29              BP_MASK_BY_SEARCH(1)
Impressum