* 'Junk' DNA runs the genes, $196 million international study finds

* Project offers 'guidebook to the human genome'

* May transform personal DNA analysis, drug discovery

By Sharon Begley

NEW YORK, Sept 5 (Reuters) - In the largest single batch of discoveries about human DNA since the completion of the human genome project in 2003, 442 scientists in labs across three continents released 30 studies jam-packed with finds on Wednesday.

The discoveries, representing what the journal Nature calls the "guidebook to the human genome," range from the esoteric - what is a gene? - to the practical - that just 20 gene switches may underlie 17 seemingly unrelated cancers, giving companies a workable number of drug targets.

The studies come from a $196 million project called the Encyclopedia of DNA Elements, or ENCODE, whose goal is to take the babel produced by the human genome project - the sequence of 3.2 billion chemical "bases" or "letters" that constitute the human genome - and make sense of it.

"We understood the meaning of only a small percentage of the genome's letters," said Dr. Eric Green, director of the National Human Genome Research Institute, which paid for the bulk of the study.


ENCODE was launched in 2003 to build a complete "parts list" for Homo sapiens by identifying and pinpointing the location of every stretch of the genome that does something - "a reference map of all the functional elements in the human genome," said geneticist Joseph Ecker of the Salk Institute for Biological Studies in La Jolla, Calif.

The best-known elements in the genome are the 21,000 or so genes that specify what proteins a cell makes. The dopamine receptor gene makes dopamine receptors in brain cells, for instance, and the insulin gene makes insulin in the pancreas.

Only about 1 percent of the genome codes for proteins, however, and the challenge has been to figure out the function of the other 99 percent, which for years was termed "junk DNA" because it did not code for proteins.

The ENCODE scientists are biology's version of the Occupy movement, said Mark Gerstein of Yale University in New Haven, Conn., who led one of the ENCODE teams: "For years everyone focused on the 1 percent. ENCODE looks at the 99 percent."

In examining the overlooked part of the genome, the ENCODE scientists discovered that about 80 percent of the DNA once dismissed as junk performs a biological function. Primarily, the not-so-junky DNA constitutes the most sophisticated control panel this side of NASA's, with some 4 million bits of DNA controlling all the rest.

"The 'junk' DNA, the 99 percent, is actually in charge of running the genes," said Gerstein.

This regulation can influence both normal genes and aberrant ones, affecting the likelihood of disease.

That's because "transcription factors" and other regulatory elements - proteins made by this controlling DNA - hopscotch across each cell's double helix, binding to it in a way that turns genes on and off or up and down like a toddler who has just discovered light switches and dimmer dials.


The power of the gene-control elements may explain why simple personal DNA sequencing sometimes concludes that people are at risk for diseases they never get or misses the warning signs of those they do develop. If the switches quiet an unhealthy gene, "it might reduce levels of proteins that have some nasty effect," said Ecker. But if they mess up a normal gene, then someone can develop a DNA-based illness nevertheless.

How complicated is the genetic regulatory system? There are nearly 4 million gene switches in the major human organs, with about 200,000 acting in any given kind of cell, such as in heart muscle.

"Our genome is simply alive with switches: millions of places that determine whether a gene is switched on or off," said Ewan Birney of the European Molecular Biology Laboratory-European Bioinformatics Institute and one of the leaders of ENCODE.

The gene-control system means drug companies may have to look in new locations for influential genes. In one paper, scientists at the University of Washington in Seattle found that most of the DNA variants previously linked to 400-plus diseases lie in regulatory regions often far from the "disease gene."

"Disease-associated genetic changes are concentrated in the switches," UW's John Stamatoyannopoulos told the press briefing.

As a result, genome analyses that look for glitches only in "diabetes genes" or "cancer genes" or any other "disease genes" are likely to miss those that cause disease by changing when, where and how genes are turned on.

That discovery "will change how we understand the genetic basis for disease and open up new avenues for therapy," said Stamatoyannopoulos. For instance, in 17 very different cancers "just 20 regulatory factors turn up over and over," he said. That suggests that drugs targeting only those, and not the hundreds of targets drug companies are now pursuing, might treat many cancers.

ENCODE may also simplify the search for therapies for Crohn's disease. Genetic studies have identified about 100 places in the genome that affect a person's risk of developing the autoimmune disorder, way too many for drug companies to investigate, Birney said. But only a handful of transcription factors control those regions, presenting a workable number of targets by which to silence disease-causing DNA.

The UW group also found that some seemingly unrelated diseases share regulatory circuitry, including rheumatoid arthritis, type-1 diabetes, lupus and other autoimmune diseases. That suggests that one drug might be effective against all of them.

With ENCODE, said Stamatoyannopoulos, "we're exposing previously hidden connections between diseases."

ENCODE has also shown that a gene is not the simple stretch of DNA that makes a protein, as students are taught. Instead, the functional unit is an amalgam of sequences from both strands of the double helix, interleaved like two halves of a deck of cards in the hands of a Vegas dealer.

Nature is making all of the ENCODE research freely available, at www.nature.com/encode and through an iPad app.

(Reporting by Sharon Begley; Editing by Julie Steenhuysen and Douglas Royalty)

Correction: A previous version of this article used the term "dopamine" instead of "dopamine receptor." This error has been corrected.

Also on HuffPost:

Loading Slideshow...
  • Glow-in-the-dark cats

    In 2007, South Korean scientists altered a cat’s DNA to make it glow in the dark and then took that DNA and cloned other cats from it — creating a set of fluffy, <a href="http://www.theguardian.com/science/2011/sep/11/genetically-modified-glowing-cats" target="_blank">fluorescent felines</a>. Here’s how they did it: The researchers took skin cells from Turkish Angora female cats and used a virus to insert genetic instructions for making red fluorescent protein. Then they put the gene-altered nuclei into the eggs for cloning, and the cloned embryos were implanted back into the donor cats — making the cats the surrogate mothers for their own clones. What’s the point of creating a pet that doubles as a nightlight? Scientists say the ability to engineer animals with fluorescent proteins will enable them to artificially create animals with human genetic diseases.

  • Enviropig

    The <a href="http://news.nationalgeographic.com/news/2010/03/100330-bacon-pigs-enviropig-dead-http://news.nationalgeographic.com/news/2010/03/100330-bacon-pigs-enviropig-dead-zones/" target="_hplink">Enviropig</a>, or “Frankenswine,” as critics call it, is a pig that’s been genetically altered to better digest and process phosphorus. Pig manure is high in phytate, a form of phosphorus, so when farmers use the manure as fertilizer, the chemical enters the watershed and causes algae blooms that deplete oxygen in the water and kill marine life. So scientists added an E. Coli bacteria and mouse DNA to a pig embryo. This modification decreases a pig’s phosphorous output by as much as 70 percent — making the pig more environmentally friendly.

  • Pollution-fighting plants

    Scientists at the University of <a href="http://www.mnn.com/local-reports/washington" target="_hplink">Washington</a> are <a href="http://wa.water.usgs.gov/pubs/fs/fs082-98/" target="_hplink">engineering poplar trees that can clean up contamination sites</a> by absorbing groundwater pollutants through their roots. The plants then break the pollutants down into harmless byproducts that are incorporated into their roots, stems and leaves or released into the air. In laboratory tests, the transgenic plants are able to remove as much as 91 percent of trichloroethylene — the most common groundwater contaminant at U.S. Superfund sites — out of a liquid solution. Regular poplar plants removed just 3 percent of the contaminant.

  • Venomous cabbage

    Scientists have recently taken the gene that programs poison in scorpion tails and combined it with cabbage. Why would they want to create <a href="http://www.nature.com/cr/journal/v12/n2/full/7290120a.html" target="_hplink">venomous cabbage</a>? To limit pesticide use while still preventing caterpillars from damaging cabbage crops. These genetically modified cabbages produce scorpion poison that kills caterpillars when they bite leaves — but the toxin is modified so it isn’t harmful to humans.

  • Web-spinning goats

    Strong, flexible spider silk is one of the most valuable materials in nature, and it could be used to make an array of products — from artificial ligaments to parachute cords — if we could just produce it on a commercial scale. In 2000, Nexia Biotechnologies announced it had the answer: <a href="http://www.physorg.com/news194539934.html" target="_hplink">a goat that produced spiders’ web protein</a> in its milk. Researchers inserted a spiders’ dragline silk gene into the goats’ DNA in such a way that the goats would make the silk protein only in their milk. This “silk milk” could then be used to manufacture a web-like material called Biosteel.

  • Fast-growing salmon

    AquaBounty’s genetically modified salmon grows twice as fast as the conventional variety — the photo shows two same-age salmon with the genetically altered one in the rear. The company says the fish has the same flavor, texture, color and odor as a regular salmon; however, the debate continues over whether the fish is safe to eat. <a href="http://www.aquabounty.com/products/products-295.aspx" target="_hplink">Genetically engineered Atlantic salmon</a> has an added growth hormone from a Chinook salmon that allows the fish to produce growth hormone year-round. Scientists were able to keep the hormone active by using a gene from an eel-like fish called an ocean pout, which acts as an “on switch” for the hormone. If the FDA approves the sale of the salmon, it will be the first time the government has allowed modified animals to be marketed for human consumption. According to federal guidelines, the fish would not have to be labeled as genetically modified.

  • Flavr Savr tomato

    The <a href="http://californiaagriculture.ucanr.org/landingpage.cfm?article=ca.v054n04p6&fulltext=yes" target="_hplink">Flavr Savr tomato</a> was the first commercially grown genetically engineered food to be granted a license for human consumption. By adding an antisense gene, the <a href="http://www.mnn.com/local-reports/california" target="_hplink">California</a>-based company Calgene hoped to slow the ripening process of the tomato to prevent softening and rotting, while allowing the tomato to retain its natural flavor and color. The FDA approved the Flavr Savr in 1994; however, the tomatoes were so delicate that they were difficult to transport, and they were off the market by 1997. On top of production and shipping problems, the tomatoes were also reported to have a very bland taste: “The Flavr Savr tomatoes didn’t taste that good because of the variety from which they were developed. There was very little flavor to save,” said Christ Watkins, a horticulture professor at Cornell University.

  • <a href="http://www.mnn.com/green-tech/research-innovations/photos/12-bizarre-examples-of-genetic-engineering/banana-vaccines" target="_hplink"><strong>CLICK HERE</strong></a> to continue on to <a href="http://www.mnn.com" target="_hplink">Mother Nature Network</a> to see the rest of these bizarre genetically engineered creations, including <a href="http://www.mnn.com/green-tech/research-innovations/photos/12-bizarre-examples-of-genetic-engineering/banana-vaccines" target="_hplink">banana vaccines</a>, <a href="http://www.mnn.com/green-tech/research-innovations/photos/12-bizarre-examples-of-genetic-engineering/less-flatulent-cow" target="_hplink">less-flatulent cows</a>, <a href="http://www.mnn.com/green-tech/research-innovations/photos/12-bizarre-examples-of-genetic-engineering/medicinal-eggs" target="_hplink">medicinal eggs</a> and more!