Welcome

Overview

Research

Services

Home

Theoretical Biophysics Group
Welcome
NIH Resource for Macromolecular Modeling and Bioinformatics

To Students Contemplating Research in Theoretical Biophysics

(For information on how to apply to UIUC go here )

Klaus Schulten

The intellectual mastering of life at the level of biomolecules and their assemplies is the research goal of the Theoretical Biophysics Group. In pursuing this goal the group follows the example of physics, the discipline that has been so eminently successful in carrying the torch of mathematics into the natural sciences.

The Theoretical Biophysics Group is aware that there are differences as well as similarities in the quests towards physical and biological theory. The most important difference in the pursuit of biological theory is that the latter is about advancing biology and not about advancing physics or other fields. One must also realize that theory is not as welcome by experimentalists in biology as it is in physics, since successes of theory in biology have been few, the field of evolutionary and hereditary biology being the notable exception. Recently, though, the standing of theory in biology has dramatically improved, both due to challenges in the biological sciences which beg for theoretical approaches as well as due to the increasing and extremely supportive role of computation, e.g., for sequence analysis and structure determination.

Many challenges for theoretical work exist. Most pressing are needs for concepts and algorithms to handle and analyze the rapidly increasing genetic data bases. The complete genomes of many biological species have recently become available and many more will become available soon. For many proteins, variants for numerous biological species are known with rich and still unearthed information underlying the conservation and variability of amino acids. The problem of predicting the structure of proteins from genetic sequences, the so-called protein folding problem, begs a solution with immeasurable opportunities once a solution is at hand. A further challenge is the structure--function relationship of proteins, a problem with extremely wide scope and characterized both through universality and diversity. It is fair to say that nearly half a decade after the first discovery of the atomic structure of a protein scientist have not yet unveiled the mechanisms by which proteins achieve their many functions: to catalyze reactions, to receive and generate signals, to endow cells with shape and elasticity, to move cells and their internal cargo, and to control genetic expression. The complexity of proteins demands still utmost respect from theoretical scientists.

Much of the research in the Theoretical Biophysics Group at Beckman Institute focuses on proteins. Theoretical studies of proteins proliferated with the advent of sufficiently powerful computers to simulate large particle systems and with the explosive increase of atomic resolution structures of proteins. The latter structures, though necessary prerequisites, in themselves are not sufficient for any physical theory of protein function; the motions in a protein play an equal role. On the methodological side the field is concerned with providing accurate, yet simple, force fields which govern the atomic motion of proteins. Ultimately, force fields will be determined in combined classical (for the nuclear motion)/quantum chemical (for the valence electrons) calculations.

Researchers are investing currently strong efforts in developing efficient computational methods for classical dynamics of proteins involving tens to hundred thousands of atoms; a serious hurdle, for example, is the calculation of Coulomb forces since they need to be evaluated for all pairs of atoms for all time steps of the classical motion. Suitable integration schemes can economize the costly update of forces, in particular the Coulomb forces, with a resulting boost in computational efficiency. At present a practitioner of the theory of proteins needs to be extremely competent in scientific computing with an understanding of massively parallel computing holding a particular promise for further success.

On the conceptual side an entry to protein dynamics is provided by studying normal modes. Since the classical Hamiltonian describing atomic motion is significantly non-harmonic and also extremely heterogeneous, conventional normal mode analysis as applied, e.g., for crystals, is not suitable. A quasi-harmonic description derives normal modes from a principal component analysis, i.e., from a diagonalization of the covariance matrix of all atomic positions, averaged over time. However, a gliding average with a, say, 100~ps window, reveals that modes derived in such way vary in time due to significant conformational transitions and disorder in proteins. Protein motion needs to be characterized also on spatial scales involving multi-atom segments of proteins. In fact, many proteins exhibit conformational changes which can be described as rotations of segments around hinges or as motions of flaps formed by secondary structure elements, e.g., alpha-helices or loops between alpha-helices. Finally, the structure of proteins can undergo melting-type transitions when the environment changes, e.g., through binding of a charged substrate, a feature that is exploited by proteins involved in cellular signaling.

The abstraction of functional properties from molecular dynamics simulations still remains an important challenge. Following established approaches one can identify correlation functions and susceptibilities that provide essential characteristics of protein dynamics and can be related to observation and function. Examples are the following: a correlation function called the dynamic structure function is the Fourier transform of the motion of a protein's constituents and is observable through neutron diffraction or Moessbauer spectroscopy; the correlation function of the energy difference between two quantum states with diagonal coupling to the protein matrix accounts for the transition rate between the two states, e.g.,for the rate of electron transfer; the dielectric susceptibility and thermal susceptibility, determined through monitoring dipolar or energy fluctuations account for dielectric properties and the degree of order of water associated with proteins.

The theory of proteins, as a relatively young field, can benefit tremendously from related and already established fields. The closest relative is the theory of liquids since solvent molecules, though not connected into a polymer and much more homogeneous in structure, are subject to similar forces and to similar disorder phenomena. On larger length and longer time scales, molecular hydrodynamics can provide much guidance to gain understanding of low frequency motion of protein segments encompassing many atoms. Condensed matter theory of disordered materials likewise deals with systems, e.g., glasses, of great conceptual similarity. Condensed matter theory can also serve as a reminder that the primary role of theory is not quantitative description, but rather qualitative understanding; anybody suspecting that not much useful can come of such a role should have a close look at the triumphs of condensed matter theory.

The beauty of theoretical protein science stems from its rapidly increasing treasure of new structures and functions; one could hardly imagine a science with more relevance to the existence and well-being of humans, and with greater riches in new discoveries and, consequently, new challenges.

The greatest such challenge considers the interplay of many proteins in biological cells. This interplay is the core attribute of living systems, one may even say that it answers the question of the origin of life. Living systems are all made of many molecular components that self-assemble, control each other and self-replicate, and dealing with such systems has been a problem for scientists who were good at taking the molecular machines in living cells apart and learnt what they are made of. But how are these machines assembled? How can it be that proteins, describable by the laws of physics, assemble themselves into cellular machines and structures, these into complete living cells, and the latter into whole organisms that require a whole new language for their description, that of biology.

Where in this hierarchy of assemblies does the step from inorganic matter to living systems occur? Certainly single cells must be considered alive, and half a cell does not remain alive. What is the simplest living cell made of?

But is the cell the lowest organization that typifies living systems or can one glance the key characteristics of living systems already from its constituents. Would such attempt run counter to understanding life or could it provide a more gradual route to develop such understanding on the basis of what we know well. The answer cannot be known beforehand, but going this path scientists need to guide their search and analysis by keeping in mind the eventual goal of understanding how life comes about through the self-organization of innate matter.

Are we aware of principles that govern such self-organization? We definitely can state three principles.

One is that the sustenance of the order of systems in living cells requires a constant consumption of energy. Living systems cannot be static and in keeping alive consume resources. Important is that this principle holds down to any level of order in cells.

Second, self-organization is based on interactions of many components. Mathematically, this is reflected by the fact that nonlinear dynamical systems that account for interactions of components exhibit emergent properties in developing order from homogeneous initial conditions. Biologically, this is reflected in the many control components found in living cells: genes are controlled often through numerous other genes and molecules; the cell has developed also many signaling loops for the control of its metabolism and movement.

Lastly, one needs to account for the empirical fact that living systems are robust against minor perturbations such that they seek to employ networks of interactions that allow operation under perturbations as opposed to using network types that function only in a narrow range of physical parameters.

The principles of self-organization have been established during the past decades, but the real being of cells is molecular and there is still a wide gap in our knowledge. We need to build bridges between the molecular level of cells and higher organizational forms. To see where such bridges can be built is the most difficult part in this endeavor.

The Theoretical Biophysics Group at Beckman has developed a research program that assumes that suitable bridges have been identified. For example, the group investigates a cellular membrane in archaebacteria (purple membrane) that absorbs light and converts its energy into a membrane potential. Likewise, the group investigates an apparatus, called the photosynthetic unit, that exists in the cell membrane of certain photosynthetic bacteria and is more complex than the purple membrane. This apparatus contains six types of multi-protein complexes: the photosynthetic reaction center, the light harvesting complex 1, the light harvesting complex 2, a cytochrome, a bc1 complex, and an ATPase. All proteins are structurally quite well known so that explanations can be based on molecular level information.

The website of the Theoretical Biophysics Group provides further examples of our recent work that reflect the research opportunities available to students who choose a career in Theoretical Biophysics. Students with good backgrounds in physics, mathematics, and computing, with a strong motivation to discover the molecular organization of life, and the interest to work in a team of physicists, chemists, biologists and computer scientists are welcome to join us.

For information on how to apply to UIUC go here .

	Overview	Research	Development	Dissemination	Services

This document was last modified on Thursday, 07-Sep-2000 10:53:23 CDT
Material on this page is copyrighted
Contact Webmaster for more information
6120 accesses since 03 Nov 2000