Evolutionary Systems Lab


News:

Check out the new Qbio@NCState site.

Congratulations to Jaells Naranjo for completing his M.S. in Bioinformatics!

POInTbrowse is now online at wgd.statgen.ncsu.edu

Affiliations



Research Interests:

Our primary research interest is in understanding the origins of novel features in evolution and ecosystems, in particular how new genetic features appear and are altered by natural selection. Below are a few examples and links to related publications. Areas we study in the lab include:


Modeling and orthology inference in polyploids

We have developed POInT: the Polyploidy Orthology Inference Tool (Genetics, 2008,PLoS Genetics, 2018), which uses phylogenetic models of duplicate gene loss after polyploidy (see left) to understand the process duplicate loss after polyploidy and to identify orthologous genes between different polyploid taxa. Those orthology inferences are avalaible from our POInTbrowse online tool.



Gene duplication and biological networks

A major route to evolutionary innovation is gene duplication, which can occur from the level of the single gene all the way up to the entire genome (Review). Gene duplication can have a large impact on biological networks, including regulatory networks and metabolic networks. In this second paper, we inferred metabolic networks for several mammalian genomes and explored the patterns of gene duplication in them, finding that duplications are localized to regions of the network involved in traits of interest, such as milk production in cattle. Understanding these copy-number variations in mammals is also has an important role in understanding diseases including cancer, Alzheimer's and Down syndrome. We have also studied whether small "motifs" in regulatory networks are derived from duplication or convergent evolution.
Dark dots are genes with different numbers of copies in humans and cows



Evolutionary transitions through genome duplication

I have also studied why bakers' yeast is able to turn sugar into alcohol with such efficiency. Although we still do not understand all the details of this story, genes that produce ethanol have been duplicated in the yeast genome through a whole-genome duplication (WGD). We think these duplications contribute to the process of evolving new functions in two important ways. Firstly, it is sometimes enough to have a second copy of a gene to obtain a new function. The reason is that genes are the templates from which the machinery of the cell is made. Having an extra copy of a gene can therefore allow more copies of a given protein to be made, potentially allowing the cell to grow faster. Perhaps more importantly, these extra copies of genes can be altered such that they perform a new function that is related to their original function, much as computer users often open an old letter and then modify it to make a new one. In addition to gene duplication, bakers' yeast has also changed in other ways: the timing with which certain genes are turned on and off has also been altered, allowing yeast to more closely control how it uses the sugar it finds in its environment. The result is an evolutionary advantage based on the fast, inefficent use of glucose.

Simulations of competition between slow and fast fermenters


We have made similar observations in plants, finding that genes surviving in duplicate after genome duplication tend to act in key reactions (i.e., reactions with high metabolic flux. There is also a tendency toward the clustering of duplicates, in keeping with the idea that genome duplications, unlike single gene duplications, can duplicate highly connected genes in biological networks.
Duplications in the Arabidopsis thaliana metabolic network


More generally, WGD also altered the global patterns of gene expression in yeast. We developed an algorithm for partitioning the yeast coexpression network in two groups based on the WGD. Using this program, we found evidence for partitioning of expression into, for instance, one group of genes involved in stress response and another (composed of duplicates of the first) that had more general functions.
A partitioned network



Metagenomics and systems biology of ruminants

In collaboration with Prof. Kristi Cammack at the Univeristy of Wyoming and Prof. Bill Lamberson at MU, we are interested in the structure of the ecosystems found in ruminant mammals. The microbes that dwell in these animals' rumens allow the host animal to live off of a diet of plant matter that would not support life in other metazoans. In addition to studying the distribtion of taxonomic groups in animals fed differing diets, we are interested in the systems biology of these systems. Using the MetaCyc database, we have mapped reads from ruminal microbes into a metabolic network. Then, using comparative genomics, we defined an interface between that network and the metabolism of the host animal. We are currently exploring the structure of these systems with a variety of network-based approaches.


The microbial (right) and host (left) metabolic networks from two steers: nodes are reactions and edges join reactions sharing a metabolite. Nodes on the right are color-coded by the number of reads found for that reaction (see scale bar). The color-coded compounds are interface metabolites that we believe to be interchanged between microbes and the host.


Orthology inference

Orthology refers to a biological structure in two different species that was shared in the common ancestor. We have developed tools for the inference of orthology inference among genes for mammals. Our synteny approach allowed us to explore the evolution of the mammalian placenta.



Metabolism and cell structure

We are also interested in how metabolism is spatially structured: we have shown that scaffolding proteins help organize metabolism in yeast.

In this picture, the grey lines join enzymes (blue dots) that use the same metabolites. The superimposed red lines are cases where those two proteins are also connected by a third protein, that is not an enzyme (see right).


Bioinformatics software

We also develop software for the bioinformatics/biology community. Our most recent packages are: