Bioinformatics computer programs help scientists unlock the secrets of intrinsically disordered proteins, which have key roles in biology

Image of Dr. Kurgan in front of a computer screen displaying a digital model

Photo: Lucasz Kurgan, Ph.D., vice chair and Qimonda Professor in the Department of Computer Science, has designed computer programs that are shedding light on the workings of proteins for researchers in 96 countries.

By Leah Small

--

Proteins are the building blocks of life and biological agents. They are drivers of growth and development and the spread of viruses and bacteria, and have key roles in disease pathways and virtually all cellular functions.  As scientists gain knowledge about proteins, the mechanisms behind biological mysteries are revealed.

To help shed light on the workings of proteins, VCU researcher Lukasz Kurgan, Ph.D., vice chair and Qimonda Professor in the Computer Science Department in the School of Engineering, has developed a series of bioinformatics programs to assist biologists in developing insights into the functions of intrinsically disordered proteins. This group of proteins lacks a fixed structure, which means they are totally or partially flexible and amorphous.

Over the last several decades, scientists have sequenced 85 million unique proteins, structured and unstructured alike, but still don’t know what the vast majority of these proteins do. As more proteins are discovered, more sophisticated computer programs must be developed to help determine their functions.

“We have manually curated less than one percent of these proteins in order to better understand them, and right now there’s over 80 million to solve. A program can solve these proteins faster than a single human and can help researchers speed up hypothesis generation,” Kurgan said.

Determining a protein’s function becomes even more challenging when a protein is completely or partially disordered. When a protein does have a defined structure, researchers use prior knowledge and bioinformatics programs to first decipher that structure, which then helps determine function. If the protein is disordered, biologists turn to programs built by Kurgan and other computer scientists that use predictive models to generate workable hypotheses on the protein’s function.

Solving the puzzle

Since 2008, Kurgan has developed a number of programs for this purpose. This spring, Kurgan’s team was awarded a $500,000 grant from the National Science Foundation to continue their innovative work. So far, Kurgan’s programs have over 7,000 users from over 1,300 cities in 96 countries.

Kurgan has also developed additional programs that determine whether a protein is disordered or not. In 2012, his MFDp program was ranked third out of 28 participants in the biannual worldwide CASP10 experiment, which evaluates the effectiveness of computer and human predictors of intrinsic disorder. In 2014, Kurgan’s lab released DisoRDPbind, the first program to predict multiple functions of intrinsically disordered proteins.

Kurgan’s programs use existing collections of data on proteins whose functions have been determined, to build predictive models to map out the functions of unknown intrinsically disordered proteins.

“The details are not easy. Building these models takes a little bit of art, theory and experience,” Kurgan said.

New ideas for disorder

It’s a commonly accepted fact among scientists that disordered proteins, similar to their structured counterparts, have essential functions. This assertion was at first met with disbelief, as is initially common with many scientific discoveries.

“About 30 years ago, when disordered proteins were discovered, there were a lot of deniers. Some people said that this is just noise in the protein structures” Kurgan said. “Now, disorder as a mechanism of biology is an accepted fact. Just because a protein has no defined structure, doesn’t mean it’s useless. It just works in a different way.”

Now, deciphering disorder is a collaborative effort in the scientific community, and various programs from multiple entities are brought together to provide different approaches to determine the functions of disordered proteins. Kurgan has worked with researchers from University of South Florida, Indiana University, and Tianjin and Nankai Universities in China, on a study that used his programs to discover the incidence of intrinsic disorder in close to 1000 species from all kingdoms of life. Several other collaborative studies have focused on the functional roles of intrinsic disorder in HIV, Hepatitis C and Dengue viruses.

“Collectively we can push the boundaries of what is being done,” Kurgan said. “It’s not based on the efforts of one specific researcher or group. Collectively we help each other.”