My favorites | Sign in
Project Home Downloads Wiki Issues Source
Project Information
Members
Featured
Downloads

Introduction

pdb-tools is a set of command line python scripts that manipulate wwPDB macromolecule structure files. There are many programs, both open source and proprietary, that perform similar tasks; however, most of these tools are buried within programs of larger functionality. Thus, relatively simple calculations often involve learning a new program, compiling modules, and installing libraries. To fill a niche (and get the tasks done that I needed done), I started writing my own toolset. This has evolved into the pdb-tools suite. The suite of programs is characterized by the following philosophy:

  1. Each program should run as a stand-alone application with a standard, GNU/POSIX style command line interface.
  2. Each program should be written in such a way to allow it to be used as a library of functions for more complex programs.
  3. Programs should require a minimum of external dependencies.

Most of the scripts will run "out of the box" using a python interpreter. The command line parser is designed to be flexible. It will take an arbitrarily long list of pdb files, pdb ids, text files with pdb ids, or some mixture of all three. If the pdb file or id is not in the working directory, scripts will attempt to download the pdb file from RCSB. Depending on the type of operation being done, a program will either write output files in the working directory or will print to stdout. All structure outputs are written in standard pdb format. All data outputs are in fixed-width column format. They were designed to be read by the statistics package R; however, they should be easily parsed by other graphing programs.

Note: These scripts are only compatible with Python version 2.4 and above.

Current functions

Structure-based calculations

Geometry

Energy calculation

  • calculate coulomb energy: pdb_coulomb.py
  • calculate the dipole moment of the protein: pdb_moment.py
  • calculate pKa of ionizable groups using the Solvent-Accessibility-modified Tanford-Kirkwood method pdb_satk.py (requires fortran compiler)

Structure properties

  • extract structure experiment properties: pdb_exper.py
  • extract protein sequence from structure: pdb_seq.py
  • calculate theoretical pI, MW, fraction titratable residues, charge: pdb_param.py

File/structure manipulation

Some of the programs are written as interfaces to other programs: CHARMM, DSSP and NACCESS, which must be downloaded and installed separately if their functions are desired. To use pdb_satk.py, a set of fortran packages must be compiled.

Project Contributors

  • Mike Harms
  • Marcin Cieślik
Powered by Google Project Hosting