Shell Scripts and Awk Programs

cartopdbq

 
Usage
: cartopdbq lig.car > lig.pdbq
Converts from Biosym InsightII ".car" format to PDBQ format

check-qs

 
Usage
: check-qs lig

Needs:
 lig.pdbq

Creates:
 lig.err
Checks partial atomic charges in PDBQ file; any non-integral charges are reported.

checkqs

 
Usage
: checkqs lig

Needs:
 lig.pdbq

Creates:
 lig.err
Sorts the input PDBQ file by residue number before running the result through check-qs .

clamp

 
Usage
: clamp grid.map > grid.map.NEW
Clamps any AutoGrid map values that exceed ECLAMP (normally set to 1000.0)

cnvmol2topdbq

 
Usage
: cnvmol2topdbq lig.mol2 > lig.pdbq
Needs:
 lig.mol2
Creates:
 lig.pdbq
This converts from (fixed format) Tripos SYBYL mol2 fiormat into PDBQ format, but stores all the residues' chain-IDs specified by the SUBSTRUCTURE records in the mol2 file. These chain-IDs are then output when the PDBQ lines are written.

deftors

 
Usage
: deftors lig.mol2

Creates
: lig.pdbq
, lig.err
Sets up rotatable bonds for AutoDock . This script launches AutoTors , with the -A +15.0, -a, -h and -m flags; it also checks the charges in the output PDBQ file, with check-qs .

dpf3gen

 
Usage
: dpf3gen lig.pdbq > lig.dpf
This is normally used by mkdpf3, so you should not use this script by hand.

It generates a pre-cursor to a default AutoDockdocking parameter file. You must edit the file before using it. This reads in the small molecule PDBQ file, detects all atom types present in the lig.pdbq; and creates a docking parameter file for AutoDock . Note the user must replace the tags <lig> and <macromol> by appropriate filename stems.

This uses equilibrium separations and well depths to define pairwise energy potentials, rather than coefficients.

 
Usage: 
get-coords lig.vol > lig.txt
This is used as part of prepare, prepare-gpf+dpf, prepare IIand prepare III. It takes the .volfile created by pdb-volumeand creates a line that can be used in the grid parameter file to specify the center of the maps.

get-docked

 
Usage
: get-docked lig.macro.dlg
Creates
: lig.macro.dlg.pdb
This extracts the docked records from a docking log file. This is very useful when wanting to view the results of a docking in a molecular modelling program or molecular viewer. It is essentially the same as the ` dockedtopdb ' awk program.

gethis

 
Usage
: gethis lig.macro.dlg

getready

 
Usage
: getready lig.pdb

Needs:
        pdbinfo, pdbsplitchains, pdbwaters, pdbdewater

Creates:
 lig.info, lig_.atm.pdb, lig_.het.pdb, lig_.wat.pdb

gpf3gen

 
Usage
: gpf3gen lig.pdbq[s] > lig.gpf
This is used by mkgpf3 , and should not be used by hand; otherwise the user must edit certain tags by hand before this can be used by AutoGrid .

This generates a precursor to a grid parameter file. It takes lig.pdbq as its input file, detects all atom types present, and creates the properly formatted parameter file for AutoGrid . It uses equilibrium separations and well depths to define pairwise energy potential. It also assigns atomic solvation parameters, based on Stouten, P.F.W., FrÖmmel, C., Nakamura, H., and Sander, C. (1993), "An effective solvation term based on atomic occupancies for use in protein simulations", Molecular Simulation , 10 , 97-120.

histable

 
Usage
: histable lig.macro.dlg
Creates
: lig.macro.dlg.tbl
This extracts the histogram from the docking log file, and counts all the `#' symbols, writing the result in a table file. This is suitable for input to a variety of graph drawing programs and spreadsheets.

job3

 
Usage
: job3 lig.macro > lig.macro.joblog &
Launches a single AutoDock 3.0 job. It assumes that " lig.macro.dpf " exists, and executes AutoDock using the arguments:
 
autodock3 -p lig.macro.dpf -l lig.macro.dlg
You must edit this script the first time you use it, so that the environment variables $root, $bin and $sh are correctly set equal to, respectively: the path to the root of AutoDock tree, the architecture-dependent binary subdirectory and the Unix scripts subdirectory. The file lig.macro.joblog contains the output from the job script.

makebox

 
Usage
: makebox macro.gpf >! macro.gpf.box.pdb

Creates
: macro.gpf.box.pdb
If you colour the `box molecule' by atom type, i.e. red for oxygen, green for carbon, and blue for nitrogen, then the edges of this box will be coloured-coded to indicate the Cartesian axes. R,G,B will correspond to x,y,z, respectively. Your molecule viewer must obey the CONECT records in the ` macro.gpf.box.pdb ' file, even though the corresponding bonds may appear too long, otherwise the edges of the grid box will not be displayed.

mkbox

 
Usage
: mkbox macro.gpf >! macro.gpf.box.pdb

Creates
: macro.gpf.box.pdb
This is very similar to ` makebox ', except that this puts a phosphorus atom at the minimum x , minimum y and minimum z coordinates of the box. This helps to convey which directions are +x, +y and +z . Once again, if oxygen is red, carbon is green and nitrogen is blue, then R,G,B will correspond to x,y,z, respectively.

mkdlgfld

 
Usage
: mkdlgfld lig.macro.dlg

Needs
: lig.macro.dlg

Creates
: lig.macro.dlg.fld
Only needed for AVS users.

This extracts the "AVSFLD" records from an AutoDock log file, and puts them in lig.macro.dlg.fld. These "AVSFLD" descriptors must be removed before the file can be used in AVS.

mkdpf3

 
Usage
: mkdpf3 lig.pdbq macromol.pdbqs

Needs
: dpf3gen, dpf3gen.awk (AWK program)
Creates
: lig.macro.dpf
This creates a default docking parameter file for AutoDock 3.0; it needs the ligand in PDBQ format and the macromolecule in PDBQS format. It uses the script dpf3gen , which in turn calls the awk program ` dpf3gen.awk '. The lig.macro.dpf docking parameter file is based on the atom types detected in the input lig.pdbq file. See dpf3gen above.

mkgpf3

 
Usage
: mkgpf3 lig.pdbq macromol.pdbqs
Needs
: gpf3gen, gpf3gen.awk (AWK program), 
pdbcen (AWK program)
Creates
: macro.gpf
This creates a default grid parameter file for AutoGrid 3.0; it needs gpf3gen.awk and pdbcen , both awk programs. See gpf3gen above.

mol2fftopdbq

 
Usage
: mol2fftopdbq lig.mol2 > lig.pdbq

Needs:
 lig.mol2

Creates:
 lig.pdbq
Converts from free formatted SYBYL mol2 into AutoDock PDBQ format. Chain-IDs specified in the mol2 file by the SUBSTRUCTURE records are incorporated into the PDBQ file.

mol2topdbq

 
Usage
: mol2topdbq lig.mol2

Needs:
 lig.mol2

Creates:
 lig.pdbq
Converts from fixed-format SYBYL mol2 into AutoDock PDBQ format, and automatically names the output based on the stem of the input mol2 file. Do not use " mol2topdbq lig.mol2 > lig.pdbq ", because " lig.pdbq " is automatically created.

mol2topdbqs

 
Usage
: mol2topdbqs lig.mol2

Needs:
 lig.mol2

Creates:
 lig.pdbqs

Converts from SYBYL mol2 format into AutoGrid 3.0 PDBQS format, by calling mol2topdbq then running addsol on the intermediate PDBQ file. Like mol2topdbq , it also removes any lone pairs (using " rem-lp "), and automatically names the output based on the stem of the input mol2 file. There is no need to use " mol2topdbq lig.mol2 > lig.pdbq ", because " lig.pdbqs " is automatically created.

pdbcen

 
Usage
: pdbcen lig.pdb
Creates
: a "gridcenter" line in AutoGrid GPF format, holding the x,y,z coordinates of the molecule.
This calculates the center of a molecule supplied in PDB format, and outputs a line holding the x,y,z coordinates of the molecule for inclusion in an AutoGrid 3.0 grid parameter file (GPF).

pdb-center

 
Usage
: pdb-center [ lig.pdb | lig.pdbq ] > lig2.pdb
Calculates the center of mass of each residue; writes these coordinates out using REMARK records.

pdb-center-all

 
Usage
: pdb-center-all [ lig.pdb | lig.pdbq ] > lig2.pdb
Calculates the center of mass of each residue; writes these coordinates out using REMARK records.Also calculates the center of all the residues.

pdb-distance

 
Usage
: pdb-distance macro.pdb
The first line of the macro.pdb file defines the center of the distance profile. It is just a copy of the line containing the atom of interest, which will be the origin for the distance calculations. However, it must have the ATOM or HETATM record replaced with a non-PDB tag, `FROM'. The x,y,z coordinates in this FROM line will then be used to calculate the distance to the center of each residue in the protein. Finally, this awk program outputs a bar chart using `#' symbols, showing the distance from this point to each residue. This can be useful to identify all the residues nearest a particular ligand atom, or near an active site.

pdbdewater

 
Usage
: pdbdewater macro.pdb >! macro.dry.pdb
This removes any water records from a PDB file.

pdbinfo

 
Usage
: pdbinfo macro.pdb
Builds a summary of the contents of a PDB file.

pdb-volume

 
Usage
: pdb-volume [ lig.pdb | lig.pdbq ] > lig2.pdb
Calculates the center of mass of each residue. Writes out REMARKs showing these coordinates. Draws ASCII diagram showing volume extents of each residue.

pdbqtobnd

Usage : pdbqtobnd lig

Needs:lig.pdbq
Creates:lig.bnd

Creates " lig.bnd " from the existing " lig.pdbq " ligand PDBQ file. Note this script needs just the stem of the file name. This script executes " pdbqtoatm " and " atmtobnd ": the latter is an executable, not a script, so it must be compiled for each architecture and operating system used.

pdbqtopdb

 
Usage
: pdbqtopdb lig.pdbq > lig.pdb
Converts from AutoDock PDBQ to PDB format.

pdbsplitchains

 
Usage
: pdbsplitchains macro.pdb
Creates separate PDB files that contain each of the chains in macro.pdb . The chain IDs are used to name the new PDB files. If there is no chain ID, the underscore character, `_' is used.

pdbtoatm

Usage : pdbtoatm lig.pdbq > lig.atm

This creates a Connolly ATM formatted file " lig.atm " from the ligand PDBQ file, " lig.pdbq ". This is used to create input for the utility program atmtobnd to generate a bond connectivity file.

pdbdewaters

 
Usage
: pdbwaters macro.pdb > macro.wet.pdb

prepare

 
Usage:
 prepare m s
where: m.pdb and s.pdbq contain the receptor and ligand respectively. Prepare performs the following eight steps. The macromolecule `.pdb ' filename stem is represented by " m ", and the ligand `.pdbq ' filename stem by "s":
1. Extracts all ATOM and TER records from m.pdb into m.enz;

2. Renumbers residues to avoid problems in protonate -step;
3. Adds polar hydrogens to m.enz , creating m.polH ;
4. Somewhat crudely assigns partial atomic charges to m.polH , creating m.pdbq ;
5. Checks charges in m.pdbq , all errors held in m.err;
6. Creates s.gpf , a parameter file for AutoGrid , based on ligand file s.pdbq ;
7. Creates s.vol , a volume dimensions file; and finally,
8. Creates s.dpf , a parameter file for AutoDock, based on ligand file s.pdbq ;

Its arguments are the stem of the filename of the macromolecule ` .pdb ' file and that of the ligand PDBQ file. See the flowchart below for more details. It shows what files are created by ` prepare ', and which scripts or programs are used. Steps 1.-4. are better carried out with a reliable molecular modeling system: these steps can produce some odd results unless carefully checked.

The user must check the m.err error file to ensure there are no non-integral charges, either on any residue in the macromolecule, or on the macromolecule as a whole. If there are, then the user must repair the m.pdbq file. This problem can arise if there are atoms for which no coordinates were assigned by the crystallographer, e.g . due to ambiguous electron density. Assuming there were no problems, s.gpf and s.dpf should be successfully produced.

prepare-gpf+dpf

 
Usage
: prepare-gpf+dpf macro lig
Executes only steps 6. through 8.

rem-lp

 
Usage
: rem-lp lig.pdbq
or
: rem-lp lig.mol2
Creates:
 lig.pdbq
or
: lig.mol2
This removes the lone-pairs (atom name = LP) added by some molecular modelling programs, such as SYBYL, and adds their partial charges on to that of the atom to which they were attached (SG in cysteines and SD in methionines). Otherwise, AutoDock treats lone-pairs as carbon atoms. (Note: if you need lone-pairs, you can force AutoGrid to calculate a grid map for "LP" atoms, using the atom code "L" in the "types" commands of AutoGrid and AutoDock).

renumberatoms

 
Usage
: lig.pdb > lig2.pdb
 
Usage
: lig.pdb > lig.rnm
Used by prepare to renumber residues in the macromolecule contiguously. This step is needed prior to using protonate , which may fail if there are gaps in the residue numbers.

resrange

 
Usage
: resrange lig.pdb
This is handy to summarise the range(s) of residues in a given protein PDB file.

runtrj

 
Usage
: runtrj lig

Needs:
 lig.dpf
, lig.trj

Creates:
 lig.tcom
, lig.tlg
 and
 lig.tout
This creates an AutoDock command file, lig.tcom , which is then used to convert the trajectory written in state variables ( lig.trj ), into a trajectory written in cartesian coordinates. lig.trj is created by an earlier run of AutoDock , in which trjfrq was set to a non-zero value.

stats

 
Usage
: stats columns.dat
This is a very useful, general awk program. Use it to calculate the minimum, maximum, mean and standard deviation for each column of numbers in an input file, here ` columns.dat '. Any alphanumeric columns will be ignored.