A new ‘network-extracted ontology’ model of the cell
December 20, 2012

Part of the network-extracted ontology (NeXO) hierarchical ontology of genes, cellular components, and processes derived from large genomic datasets. The ontology is shown as a tree, with nodes indicating terms and edges indicating hierarchical relations between terms. Node sizes indicate the number of genes assigned to a term. Node colors represent the degree of correspondence to a term in the manually curated Gene Ontology (GO) as determined by ontology alignment, with high-level alignments labeled. (Credit: UC San Diego School of Medicine)
Researchers at the University of California, San Diego School of Medicine and colleagues have proposed a new “network-extracted ontology” (NeXO) method that creates a computational model of the cell from large networks of gene and protein interactions, discovering how genes and proteins connect to form higher-level cellular machinery.
“Our method creates [an] ontology, or a specification of all the major players in the cell and the relationships between them,” said first author Janusz Dutkowski, PhD, postdoctoral researcher in the UC San Diego Department of Medicine. It uses knowledge about how genes and proteins interact with each other and automatically organizes this information to form a comprehensive catalog of gene functions, cellular components, and processes.
“What’s new about our ontology is that it is created automatically from large datasets. In this way, we see not only what is already known, but also potentially new biological components and processes — the bases for new hypotheses,” said Dutkowski.
Ontologies
Originally devised by philosophers attempting to explain the nature of existence, ontologies are now broadly used to encapsulate everything known about a subject in a hierarchy of terms and relationships. Intelligent information systems, such as iPhone’s Siri, are built on ontologies to enable reasoning about the real world.
Ontologies are also used by scientists to structure knowledge about subjects like taxonomy, anatomy and development, bioactive compounds, disease, and clinical diagnosis.
Gene Ontology
A Gene Ontology (GO) also exists, constructed over the last decade through a joint effort of hundreds of scientists. It is considered the gold standard for understanding cell structure and gene function, containing 34,765 terms and 64,635 hierarchical relations annotating genes from more than 80 species.
“GO is very influential in biology and bioinformatics, but it is also incomplete and hard to update based on new data,” said senior author Trey Ideker, PhD, chief of the Division of Genetics in the School of Medicine and professor of bioengineering in UC San Diego’s Jacobs School of Engineering.
“This is expert knowledge based upon the work of many people over many, many years,” said Ideker, who is also principal investigator of the National Resource for Network Biology, based at UC San Diego. “A fundamental problem is consistency. People do things in different ways, and that impacts what findings are incorporated into GO and how they relate to other findings.
The approach we have proposed is a more objective way to determine what’s known and uncover what’s new.”
In their paper, Dutkowski, Ideker and colleagues capitalized upon the growing power and utility of new technologies like high-throughput assays and bioinformatics to create elaborately detailed datasets describing complex biological networks. To test the approach, the scientists pulled together multiple such datasets, applied their method, and then compared the resulting “network-extracted ontology” to the existing GO.
They found that their ontology captured the majority of known cellular components, plus many additional terms and relationships, which subsequently triggered updates of the existing GO.
Ideker and Dutkowski don’t say the new approach is intended to replace the current GO. Rather, they envision it as complementary high-tech model that identifies both known and uncharacterized biological components derived directly from data, something the current GO does not do well. Moreover, they note that a network-extracted ontology can be continuously updated and refined with every new dataset, moving scientists closer to the complete model of the cell.
Funding in part for this research was from National Institutes of Health grants.
Comments (6)
by dibiase
When we will be able to reach a cybenetic-cell control we will controling Life itself!
by Tony Stender
How hierarchical information is related has been posited as the brains system of organization. Not to mention, Jeff Hawkins assertion that creativity is simply using known information in a new context, seems to indicate that, if this can be used as input to AI systems, many new ideas on life extension and health improvements are not too far in the future. I look forward to benefiting from the improvements which may be forthcoming, at the exponential speeds predicted by Ray Kurzweil.
by Tony Stender
How hierarchical information is related has been posited as the brains system of organization. Not to mention, Jeff Hawkins assertion that creativity is simply using known information in a new context, seems to indicate that, if this can be used as input to AI systems, many new ideas on life extension and health improvements are not too far in the future. I look forward to benefiting from the improvements which may be forthcoming, at the exponential speeds predicted by Ray Kurzweil.
by Joel C.
Imagine an AI software reading this network-based ontology, let’s say, of a biological cell at extreme lightning speed… and from this, it’ll be able to extrapolate the different ways this cell can be modified (gene therapy?) to cure all sorts of dysfunction efficiently and safely.
by Bri
Imagine that same AI evaluating social and ethical issues, with the same understanding of the synergies, the symbiotic relationships. The interdependency, and the mutual needs of ” The Artilect” and of humans. The general continuity of existence. After all, what is life but SN assemblage of differentiated cells working together in a symbiotic relationship. I think VIKKI will be inclined to want to work toward humanity tan against it. Soon we will understand all the expressive qualities of ” the Cell” much like John Carpenters version of The Thing, able to mutate it into any expressive state feasible, and AI will have that ability at the ready to. I’d definitely want to work on peaceful interactions with future AI, and really everyone and every thing. That way future A I would perceive us as being s net positive. We are getting close to a complete codification of biological systems. It’s still an immense mountain of data to correlate , but the framework is in place, and the process is being automated. It’s like coming into the final stretch.
by high carbfoods
The dynamics of cell seems unlimited operations carried out most silently. To probe and control is human curiosity and that is precisely what is being reported. Within numerous networks, cell is a wonder like the brain or consciousness itself. the mystery is only deepening!