1GMX-PDB2GMX(1)                      GROMACS                     GMX-PDB2GMX(1)
2
3
4

NAME

6       gmx-pdb2gmx  -  Convert  coordinate  files to topology and FF-compliant
7       coordinate files
8

SYNOPSIS

10          gmx pdb2gmx [-f [<.gro/.g96/...>]] [-o [<.gro/.g96/...>]] [-p [<.top>]]
11                      [-i [<.itp>]] [-n [<.ndx>]] [-q [<.gro/.g96/...>]]
12                      [-chainsep <enum>] [-merge <enum>] [-ff <string>]
13                      [-water <enum>] [-[no]inter] [-[no]ss] [-[no]ter]
14                      [-[no]lys] [-[no]arg] [-[no]asp] [-[no]glu] [-[no]gln]
15                      [-[no]his] [-angle <real>] [-dist <real>] [-[no]una]
16                      [-[no]ignh] [-[no]missing] [-[no]v] [-posrefc <real>]
17                      [-vsite <enum>] [-[no]heavyh] [-[no]deuterate]
18                      [-[no]chargegrp] [-[no]cmap] [-[no]renum] [-[no]rtpres]
19

DESCRIPTION

21       gmx pdb2gmx reads a .pdb (or .gro) file,  reads  some  database  files,
22       adds  hydrogens  to  the molecules and generates coordinates in GROMACS
23       (GROMOS), or optionally .pdb, format and a topology in GROMACS  format.
24       These files can subsequently be processed to generate a run input file.
25
26       gmx  pdb2gmx  will  search  for  force  fields  by looking for a force‐
27       field.itp file in subdirectories <forcefield>.ff of the current working
28       directory  and  of  the  GROMACS library directory as inferred from the
29       path of the binary or the GMXLIB environment variable.  By default  the
30       forcefield  selection is interactive, but you can use the -ff option to
31       specify one of the short names in the list on the command line instead.
32       In  that  case  gmx  pdb2gmx  just  looks for the corresponding <force‐
33       field>.ff directory.
34
35       After choosing a force field, all files will be read only from the cor‐
36       responding  force  field  directory.   If  you  want to modify or add a
37       residue types, you can copy the force field directory from the  GROMACS
38       library directory to your current working directory. If you want to add
39       new protein residue types, you will need to modify residuetypes.dat  in
40       the  library  directory  or copy the whole library directory to a local
41       directory and set the environment variable GMXLIB to the name  of  that
42       directory.   Check  Chapter  5 of the manual for more information about
43       file formats.
44
45       Note that a .pdb file is nothing more than a file format, and  it  need
46       not necessarily contain a protein structure. Every kind of molecule for
47       which there is support in the database can be converted.  If  there  is
48       no support in the database, you can add it yourself.
49
50       The  program  has  limited  intelligence, it reads a number of database
51       files, that allow it to make special bonds (Cys-Cys,  Heme-His,  etc.),
52       if necessary this can be done manually. The program can prompt the user
53       to select which kind of LYS, ASP, GLU, CYS or HIS residue  is  desired.
54       For Lys the choice is between neutral (two protons on NZ) or protonated
55       (three protons, default), for Asp and  Glu  unprotonated  (default)  or
56       protonated, for His the proton can be either on ND1, on NE2 or on both.
57       By default these selections are done automatically.  For His,  this  is
58       based  on  an optimal hydrogen bonding conformation. Hydrogen bonds are
59       defined based on a simple geometric criterion, specified by the maximum
60       hydrogen-donor-acceptor  angle  and  donor-acceptor distance, which are
61       set by -angle and -dist respectively.
62
63       The protonation state of N- and C-termini can be  chosen  interactively
64       with  the  -ter  flag.   Default  termini  are ionized (NH3+ and COO-),
65       respectively.  Some force fields support zwitterionic forms for  chains
66       of  one  residue,  but  for  polypeptides  these  options should NOT be
67       selected.  The AMBER force fields have unique forms  for  the  terminal
68       residues,  and these are incompatible with the -ter mechanism. You need
69       to prefix your N- or C-terminal residue names with “N” or  “C”  respec‐
70       tively  to  use these forms, making sure you preserve the format of the
71       coordinate file. Alternatively, use named  terminating  residues  (e.g.
72       ACE, NME).
73
74       The  separation  of  chains is not entirely trivial since the markup in
75       user-generated PDB files frequently varies and sometimes it  is  desir‐
76       able  to  merge entries across a TER record, for instance if you want a
77       disulfide bridge or distance restraints between two protein  chains  or
78       if  you  have  a HEME group bound to a protein.  In such cases multiple
79       chains should be contained in a  single  moleculetype  definition.   To
80       handle  this,  gmx pdb2gmx uses two separate options.  First, -chainsep
81       allows you to choose when a new chemical chain should start,  and  ter‐
82       mini  added when applicable. This can be done based on the existence of
83       TER records, when the chain id changes, or combinations  of  either  or
84       both  of  these. You can also do the selection fully interactively.  In
85       addition, there is a -merge option that controls  how  multiple  chains
86       are merged into one moleculetype, after adding all the chemical termini
87       (or not).  This can be turned off (no merging),  all  non-water  chains
88       can  be  merged  into  a  single molecule, or the selection can be done
89       interactively.
90
91       gmx pdb2gmx will also check the occupancy field of the .pdb  file.   If
92       any  of  the  occupancies  are not one, indicating that the atom is not
93       resolved well in the structure, a warning message is  issued.   When  a
94       .pdb  file does not originate from an X-ray structure determination all
95       occupancy fields may be zero. Either way, it is up to the user to  ver‐
96       ify the correctness of the input data (read the article!).
97
98       During processing the atoms will be reordered according to GROMACS con‐
99       ventions. With -n an index file can  be  generated  that  contains  one
100       group  reordered  in  the same way. This allows you to convert a GROMOS
101       trajectory and coordinate file to  GROMOS.  There  is  one  limitation:
102       reordering  is done after the hydrogens are stripped from the input and
103       before new hydrogens are added. This means  that  you  should  not  use
104       -ignh.
105
106       The .gro and .g96 file formats do not support chain identifiers. There‐
107       fore it is useful to enter a .pdb file name at the -o option  when  you
108       want to convert a multi-chain .pdb file.
109
110       The  option -vsite removes hydrogen and fast improper dihedral motions.
111       Angular and out-of-plane motions can be removed by  changing  hydrogens
112       into  virtual sites and fixing angles, which fixes their position rela‐
113       tive to neighboring atoms. Additionally,  all  atoms  in  the  aromatic
114       rings  of  the standard amino acids (i.e. PHE, TRP, TYR and HIS) can be
115       converted into virtual sites, eliminating the  fast  improper  dihedral
116       fluctuations  in  these  rings  (but this feature is deprecated).  Note
117       that in this case all other hydrogen atoms are also converted  to  vir‐
118       tual  sites.  The  mass  of  all  atoms that are converted into virtual
119       sites, is added to the heavy atoms.
120
121       Also slowing down of dihedral motion can be done with -heavyh  done  by
122       increasing  the  hydrogen-mass  by a factor of 4. This is also done for
123       water hydrogens to slow down  the  rotational  motion  of  water.   The
124       increase in mass of the hydrogens is subtracted from the bonded (heavy)
125       atom so that the total mass of the system remains the same.
126

OPTIONS

128       Options to specify input files:
129
130       -f [<.gro/.g96/…>] (protein.pdb)
131              Structure file: gro g96 pdb brk ent esp tpr
132
133       Options to specify output files:
134
135       -o [<.gro/.g96/…>] (conf.gro)
136              Structure file: gro g96 pdb brk ent esp
137
138       -p [<.top>] (topol.top)
139              Topology file
140
141       -i [<.itp>] (posre.itp)
142              Include file for topology
143
144       -n [<.ndx>] (index.ndx) (Optional)
145              Index file
146
147       -q [<.gro/.g96/…>] (clean.pdb) (Optional)
148              Structure file: gro g96 pdb brk ent esp
149
150       Other options:
151
152       -chainsep <enum> (id_or_ter)
153              Condition in PDB files  when  a  new  chain  should  be  started
154              (adding termini): id_or_ter, id_and_ter, ter, id, interactive
155
156       -merge <enum> (no)
157              Merge  multiple  chains  into  a single [moleculetype]: no, all,
158              interactive
159
160       -ff <string> (select)
161              Force field, interactive by default. Use -h for information.
162
163       -water <enum> (select)
164              Water model to use:  select,  none,  spc,  spce,  tip3p,  tip4p,
165              tip5p, tips3p
166
167       -[no]inter (no)
168              Set the next 8 options to interactive
169
170       -[no]ss (no)
171              Interactive SS bridge selection
172
173       -[no]ter (no)
174              Interactive termini selection, instead of charged (default)
175
176       -[no]lys (no)
177              Interactive lysine selection, instead of charged
178
179       -[no]arg (no)
180              Interactive arginine selection, instead of charged
181
182       -[no]asp (no)
183              Interactive aspartic acid selection, instead of charged
184
185       -[no]glu (no)
186              Interactive glutamic acid selection, instead of charged
187
188       -[no]gln (no)
189              Interactive glutamine selection, instead of charged
190
191       -[no]his (no)
192              Interactive histidine selection, instead of checking H-bonds
193
194       -angle <real> (135)
195              Minimum hydrogen-donor-acceptor angle for a H-bond (degrees)
196
197       -dist <real> (0.3)
198              Maximum donor-acceptor distance for a H-bond (nm)
199
200       -[no]una (no)
201              Select  aromatic  rings  with  united CH atoms on phenylalanine,
202              tryptophane and tyrosine
203
204       -[no]ignh (no)
205              Ignore hydrogen atoms that are in the coordinate file
206
207       -[no]missing (no)
208              Continue when atoms are missing and bonds cannot be  made,  dan‐
209              gerous
210
211       -[no]v (no)
212              Be slightly more verbose in messages
213
214       -posrefc <real> (1000)
215              Force constant for position restraints
216
217       -vsite <enum> (none)
218              Convert atoms to virtual sites: none, hydrogens, aromatics
219
220       -[no]heavyh (no)
221              Make hydrogen atoms heavy
222
223       -[no]deuterate (no)
224              Change the mass of hydrogens to 2 amu
225
226       -[no]chargegrp (yes)
227              Use charge groups in the .rtp file
228
229       -[no]cmap (yes)
230              Use cmap torsions (if enabled in the .rtp file)
231
232       -[no]renum (no)
233              Renumber the residues consecutively in the output
234
235       -[no]rtpres (no)
236              Use .rtp entry names as residue names
237

SEE ALSO

239       gmx(1)
240
241       More     information    about    GROMACS    is    available    at    <‐
242       http://www.gromacs.org/>.
243
245       2019, GROMACS development team
246
247
248
249
2502019.4                           Oct 02, 2019                   GMX-PDB2GMX(1)
Impressum