On 2014 Jul 25, Claudiu Bandea commented:
Everlasting confusion on ‘functional DNA’ and ‘junk DNA’
As discussed by Doolittle (1), addressing and defining the functionality of genomic DNA can be challenging. However, our common sense combined with our philosophical instinct - we know a function when we see one - should allow us to sensibly address the biology of our genome. After all, the ENCODE ‘function fiasco’ was not the result of misunderstanding the concept of biological function, nor was it due to scientific incompetence as suggested by others (2). On the contrary, because it conflicted with some of the project’s objectives and with its significance, there was a concerted effort not to bring this concept forward (3); indeed, as clearly shown in a recent ENCODE publication (4), at least some ENCODE members seem well aware of the scientific rationale and criteria for addressing putative biological functions for genomic DNA.
Nevertheless, as concluded by Doolittle (1), poor use of words in communicating scientific observations, and lack of attention to detail have led to significant misrepresentations and confusion. Here are a few examples spanning more than four decades.
In a recent study entitled “Multiple knockout mouse models reveal lncRNAs are required for life and brain,” which addressed putative biological functions of long noncoding RNAs (lncRNAs), it was concluded that “This study demonstrates that lncRNAs play critical roles in vivo…” (5). Unfortunately, both the title and the conclusion misrepresent the results; an accurate interpretation of the results is that “Multiple knockout mouse models reveal that some (or a few) lncRNAs are required for life and brain,” and that “This study demonstrates that some (or a few) lncRNAs play critical roles in vivo….” As a matter of fact, based on the results of the study, which showed that only 5 of the 18 lncRNAs, which have been selected from hundreds sequences as the best candidates for being functional, a more appropriate scientific interpretation would be: “This study demonstrates that most lncRNAs (i.e. 13 out of 18) do not appear to play critical roles in vivo….” Moreover, if the goal is to evaluate this study at the highest scientific standards (as presumably required by the journal eLife, where it was published), then, I would suggest that, due to lack of appropriate control experiments such as duplicated knockout experiments, we don’t know if the observed dysfunctions were associated with specific lncRNAs, or they were caused by untargeted genome modifications introduced accidentally during the procedure of generating knockout mice, which in fact questions the validly of the entire study. As previously noted (3), all these problems reflect the deficiencies of the current limited and closed peer review system; indeed, a document reviewed by a few peers (usually, 2 or 3) can hardly be considered peer-reviewed.
The next three examples illustrate some of the confusion surrounding the concepts of biological function and junk DNA (jDNA), albeit in a more subtle way.
In one of the first papers addressing the notion of jDNA (6), David Comings writes: “These considerations suggest that up to 20% of the genome is actively used and the remaining 80+% is junk. But being junk doesn’t mean it is entirely useless. Common sense suggests that anything that is completely useless would be discarded.”
In their iconic paper “Selfish DNA: the ultimate parasite” (7), Leslie Orgel and Francis Crick, summarize their thoughts as follows: “The DNA of higher organisms usually falls into two classes, one specific and the other comparatively nonspecific. It seems plausible that most of the latter originated by the spreading of sequences which had little or no effect on the phenotype.”
More recently, in a publication addressing the ENCODE project (8), Sean Eddy, remarked: “These data support a view that eukaryotic genomes contain a substantial fraction of DNA that serves little useful purpose for the organism, much of which has originated from the replication of transposable (selfish) elements.”
Although seemingly innocent, these citations are relevant examples of poor use of words. Some of this confusion might dissipate by recognizing that: (i) if genomic sequences are not entirely useless, (ii) if they have little effect on the phenotype, or (iii) if they serve little useful purpose for the organism, then, these DNA sequences are functional, period.
In the next example, the protagonists are Michael Eisen, the host of a popular blog suggestively named “it is NOT junk”, and Ryan Gregory, the host of another popular blog “Genomicron”. Interestingly, Eisen was the PNAS editor in charge of Doolittle’s article on jDNA (1), and, according to Doolittle, Gregory is “now the principal C-value theorist” (1). Undoubtedly, Eisen and Gregory are among the most knowledgeable and versed communicators on genome biology and jDNA and, therefore, their ‘words’ are representative of the thinking in the field.
Immediately after the publication of ENCODE’s flurry of articles and the associated publicity stunt orchestrated by a few ENCODE scientists, both Eisen and Gregory reacted forcefully, but apparently from opposite perspectives. In a post in which he blasts ENCODE as “a carefully orchestrated spectacle” (9), Eisen writes: “nobody actually thinks that non-coding DNA is ‘junk’ any more. It’s an idea that pretty much only appears in the popular press… It is dishonest – nobody can credibly claim this to be a finding of ENCODE….”
In a prompt response (10), entitled “Michael Eisen’s take on ENCODE — there’s no junk?”, Gregory goes into a detailed and cynical questioning of Eisen’s perspective. Fortunately, due to the open communication platform offered by these blogs, this scientific ‘drama’ ended within hours, when Eisen apparently clarified: “I was not saying that everybody knows that 100% of the genome is functional! I was saying that nobody thinks that 100% of non-coding DNA is non-functional” (10).
Is there an end to this distressing confusion in the field of genome biology? Unlikely, if the funding system continues to focus primarily on generating data and observations, while neglecting their interpretation and integration into productive conceptual frameworks; in other words, you only get what you pay for.
References
(1) Doolittle WF. 2013. Is junk DNA bunk? A critique of ENCODE. Proc Natl Acad Sci USA., 110:5294-300. Doolittle WF, 2013
(2) Graur D et al., 2013. On the immortality of television sets: "function" in the human genome according to the evolution-free gospel of ENCODE. Genome Biol Evol., 5:578-90. Graur D, 2013
(3) Bandea CI. 2014. Closing the gap between ‘words’ and ‘facts’ in evaluating genome biology and the ENCODE project. PubMed Commons (National Library of Medicine; Bethesda, MD). Comment on: Doolittle WF, 2013
(4) Kellis M. et al., 2014. Defining functional DNA elements in the human genome. Proc Natl Acad Sci USA., 111:6131-8. Kellis M, 2014
(5) Sauvageau M. et al., 2014. Multiple knockout mouse models reveal lincRNAs are required for life and brain development. eLife (DOI: 10.7554/eLife.01749) Sauvageau M, 2013
(6) Comings DE.1972. The structure and function of chromatin. Adv Hum Genet. 3:237-431. Comings DE, 1972
(7) Orgel LE, Crick FH. 1980. Selfish DNA: the ultimate parasite. Nature. 284:604-7. Orgel LE, 1980
(8) Eddy SR. 2012. The C-value paradox, junk DNA and ENCODE. Curr Biol. 22:898-9. Eddy SR, 2012
(9) Eisen M. 2012. This 100,000 word post on the ENCODE media bonanza will cure cancer. Blog: it is NOT junk. http://www.michaeleisen.org/blog/?p=1167
(10) Gregory TR. 2012. Michael Eisen’s take on ENCODE — there’s no junk? Blog: Genomicron. http://www.genomicron.evolverzone.com/2012/09/michael-eisens-take-on-encode-theres-no-junk/
This comment, imported by Hypothesis from PubMed Commons, is licensed under CC BY.