Info

 

It is a good morning exercise for a research scientist to discard a pet hypothesis every day before breakfast. It keeps him young.

Konrad Lorenz

 

 

 
 
Overview
The true essence of all science is modeling; to develop a visual, mathematical, or statistical picture of how the world works. Model is a simplified version of reality that allows us to explore and understand the aspects of a particular phenomena. Simplification comes from making assumptions, so, of course, models have clear and obvious weaknesses. They are often incomplete or only simulate one aspect of a particular behavior. For example, projectile motion can reasonably determine the flight of a cannonball. But Galileo Galilei’s earliest models did not include air resistance or the aerodynamics of the cannonball’s shape or even the effects of varying gravitational fields. Including these characteristics gave more and more accurate results, so today we can very accurately predict the path of a spaceship headed for the Moon and beyond.

Figure 1: Modeling in physics

But like any large-scale construction project (like building a skyscraper or launching a rocket to the Moon), science is constructed from the “ground up”. Each piece of the structure rests on that which came before it. In this tutorial, we stress that each individual tutorials is the foundation for that which follows. And that’s the way science succeeds . . . one step at a time.

 Cycle of modeling

Biological modeling is no different that what we do with physics; the only difference is that we are used to describing physical phenomenon in mathematical terms. But modern biology – and especially the increased interdisciplinary nature of science – requires more and more mathematics. So biological modeling is not really all that different from that done with physics for the past 400 years.

3 4 2 1 5

Figure 2: Cycle of modeling (click the numbers to read more)

Take, for example, a simple real-world problem. The first step is to rough out an approximate biological model of the problem. What is happening here? How long does it take? What steps appear to take place as the behavior progresses? After the basic schematic of the problem is laid out, we now approach it from a mathematical or computation point-of-view. We attempt to mathematically define each of the previously-stated questions from a quantitative standpoint. Then we develop a computer simulation – using each of the mathematical rules for the model – and let it run.

We then evaluate the result and compare it to the biological model. If the model reasonably matches or predicts that model, we analyze our model with respect to the original real-world problem. And, again, if the results are reasonably accurate, we know that we are on the right track. Of course, no model is complete . . . life has too many variables (and surprises) for us to incorporate fully. Instead, we modify and merge and tweak and twist our computational model into shape, always comparing and contrasting the model to that of the real world. And if we are able to develop a predictive model that works, science progresses. If not, we take our licks, pick ourselves up again, and prepare to do battle with nature once more.

So how does the cycle of modeling – and biological modeling, in particular – work?

Going from a Real World problem to a Biological Model

The first step is to rough out an approximate biological model of the problem. What is happening here? How long does it take? What steps appear to take place as the behavior progresses? What assumptions and/or simplifications can we make in order to focus on the behavior under study?


Going from a Biological Model to a Mathematical and Computational Model

After the basic schematic of the problem is laid out, we now approach it from a mathematical or computation point-of-view. We attempt to mathematically define each of the previously-stated questions from a quantitative standpoint. Then we develop a computer simulation – using each of the mathematical rules for the model – and let it run.


Analysis

Now we can employ a variety of mathematical and computation tools; most notably, statistics and probability. Here’s where the use of random numbers, Monte Carlo simulations, and Markov chains play a key role in examining the model.


Conclusions and Predictions

What are our results? Are they valid (in a mathematical sense; that is, are they statistically significant? Can they be reproduced with little or no variation?)?


Interpreting Mathematical Conclusions and Predictions in the Biological Context

After evaluating the success (or failure) of our mathematical model, it is time to compare it to the biological model we first developed.


Interpreting Biological Conclusions and Predictions in terms of a Real life Problem

So how did we do? How does the biological model compare (and contrast) to that of the real life problem we started with? How relevant and appropriate were the original assumptions we made in order to model our real life problem? If the results are reasonably accurate, we know that we are on the right track. If not, we start the process all over again.


Once we have analyzed the model and compared it to the real life problem, it’s time to modify and adjust and improve our model. We make new assumptions, or change one or more parameters, or possibly throw out the original model as we seek to improve our understanding of the problem.

Of course, no model is complete . . . life has too many variables (and surprises) for us to incorporate fully. Instead, we modify and merge and tweak and twist our computational model into shape, always comparing and contrasting the model to that of the real world. And if we are able to develop a predictive model that works, science progresses. If not, we take our licks, pick ourselves up again, and prepare to do battle with nature once more.

Mathematical Foundations and Biological Applications

The understanding of biological processes begins with observation. We start be collecting and recording observations. Once we have gathered enough data we try to come up with the rules governing that biological process: we hypothesize. Here is where modeling comes into play.

For instance, we may be interested in understanding how a disease develops in humans. (Of course, we cannot do experiments with humans.) We could start by studying how this disease develops in animals or in individual cells or in test tubes. These approaches are called animal modeling, in vivo modeling and in vitro modeling respectively.

Which approach is better than which? They are all good, but they are applied depending on the scenario. In general, animal modeling is used for system-wide studies, in vivo modeling for cellular level studies and in vitro for molecular scale studies. Animal modeling is ideal to study, for instance, immunological response (tissue wide) to an infection or how hormones regulate mood, body weight and sexual behavior. Modeling in vivo is ideal for studying, for example, cell movement and division, cellular localization of biomolecules and organelle rearrangements. Modeling in vitro is best to study, for example, protein folding, interactions within biomolecular complexes, and molecular mechanisms. Usually these approaches are used in combination.

All these three types of modeling share something in common: they are experimental approaches that we use to test our understanding (our hypothesis). For instance, we can propose that the presence of a certain protein is essential for progression of a particular disease. That would be our hypothesis or rule; now we need to test it. What we can do in the laboratory is deplete the protein from cells and see if this actually stops the disease. If it does, the hypothesis is ratified, if it does not, the hypothesis is rectified.

What if the experiment is too difficult or impossible to do? For instance, imagine a simple experiment that must be done in the absence of gravity; that would require going to space. At a molecular scale there are also examples: making a protein to be completely rigid or move only when we want it is impossible. What about studies of natural disasters and spread of contagious diseases? There must better alternatives than destroying levees and infecting populations for the sake of doing experiments. Mathematical modeling can help in all the examples above.

Furthermore, a very valuable feature of mathematical modeling is that it is inherently quantitative. It can potentially tell us not only what happens but how much and when. In the case of a contagious disease outbreak, we may need to predict not only whether a  whole city will be hit but also how many hospital beds will be needed and how long it will take to reach the crisis peak. Because we have the equations and we can keep track of the numbers, we essentially know it all in a mathematical simulation. In experimental approaches, we can only record the things we purposely measure and not everything is measurable.

So, when computational or numerical modeling comes into play? This approach is handy at least in too scenarios. One situation could be when the biological process is too complicated to be fully defined mathematically in a small set of equations. The second situation would be when the mathematical equations are rather simple but there are too many equations. For instance, let's imagine that we are trying to model ant colony behavior. The behavior of a single ant is perhaps easy to model but colonies have thousands of ants. Imagine modeling potential germ carriers in a city of millions.

Also in this category there is a biological process that we address in further detail in this work: microtubules. We model these dynamical biopolymers individually in relatively simple terms using tools such as random numbers, Monte Carlo and Markov chains. But we are not interested in modeling only single microtubules but also their collective properties. Cells may have several hundred microtubules and then the computational approach suites well.

In general, the choice of underlying mathematical tool is made based on the nature of the biological process and questions being asked. For example, if we are studying protein folding and we are only interested in the most stable structure, a Monte Carlo approach may be the most appropriate tool. Another example could be the study of the molecular evolution of genes. In this case, random nucleotide mutations obeying laws of a Markov process would be the best choice. The Ising and Potts models on their part have been successfully used to understand patterning and cell migration respectively and a variety of other biological puzzels.

In summary, the purpose of this work is to introduce these mathematical tools and encourage its application to understand biological problems.

microtubules TBA Biology Ising Monte TBA Biology TBA Math Markov Walk Random stats

Figure 2: Structure of out online tutorials

In he tutorial section of this site we present yet another example of a natural process that can be modeled mathematically: microtubule dynamics. We see in this example that this network of biological polymers can be meaningfully modeled using simple mathematical tools such as random number generators, Monte Carlo computations and Markov chains.

Future work

In these tutorials, we start with the very basic mathematics you will need to build biological models. Statistics and probability rule here (much as they do in quantum mechanics in physics). After a brief introduction to random numbers and walks, Monte Carlo techniques, and Markov chains, you will study a few of the basic modeling techniques used. From there, you will proceed into new and largely unexplored territory in mathematical biology.

In the future, you can expect tutorials on a number of applications currently under study by biologists, mathematicians, physicists, and computer scientists. These include models of termite behavior, flocking and swarming, cell aggregation, myxobacteria, and even how the leopard got his spots (or, perhaps, how the zebra got her stripes).

 
 
 

Nome

University of Notre Dame
Notre Dame, Indiana 46556
Phone: 574-631-5000
Copyright © 2006 University of Notre Dame
Last modified: July 08, 2007 12:50 PM