This is the user sandbox of Genome42. A user sandbox is a subpage of the user's user page. It serves as a testing spot and page development space for the user and is not an encyclopedia article. Create or edit your own sandbox here.Other sandboxes: Main sandbox | Template sandbox Finished writing a draft article? Are you ready to request review of it by an experienced editor for possible inclusion in Wikipedia? Submit your draft for review!

Junk DNA

Junk DNA is DNA that does not have a function; therefore, it is important to define "function." Proponents of junk DNA define functional DNA as DNA that is currently under purifying selection. This is the definition used by Dan Gaur in his textbook "Molecular and Genome Evolution."

"Functional DNA refers to any segment in the genome whose selected-effect function is that for which it was selected and/or by which it is maintained. Most functional sequences are maintained by purifying selection."^[1]

This definition of function is called the maintenance function.^[2]^[3] From this it follows that nonfunctional DNA, or junk DNA, is any segment in the genome that is NOT maintained by purifying selection. Many similar definitions have been published but they all have in common the idea that junk DNA is DNA that does not have a function and this means that it is not under negative selective pressure.^[4]^[5]^[6]

How much of the human genome is junk?

Most of this article is about the human genome but the arguments for function and junk apply to other genomes.

The data on functional and nonfunctional DNA elements in the human genome is covered in many other articles so this is just a brief summary.

Genes (main article genes) There are approximately 20,000 protein-coding genes in the human genome. The number of noncoding genes is disputed with values ranging from about 5,000 to more than 100,000.

Arguments against junk DNA

Some scientists are convinced that junk DNA does not exist. For example, Peter Larsen declared in 2018 that,

"There is no such thing as 'junk DNA.' Indeed, a suite of discoveries made over the past few decades have put to rest this misnomer and have identified many important roles that so-called junk DNA provides to both genome and function."^[7]

This is a widely held point of view although most of these authors don't explain why obvious examples of junk DNA, such as pseudogenes and broken bits of transposons, don't qualify as junk DNA.

Mutation load

The idea of excess DNA in some species started with the realization that the expected number of mutations in a species was would lead to extinction if the entire genome were full of functional DNA. This is a reference to mutation load or [genetic load].

By the late 1960s it was apparent that much of the DNA in humans had to be invisible to mutations and only a small percentage could be devoted to genes and other functional elements. The connection between the mutation load argument and junk DNA appeared in the paper by Susumu Ohno in 1972 where he said,

"All in all, it appears that the calculations made by Muller, Kimura and others are not far off the mark and that at least 90% of our genomic DNA is 'junk' or 'garbage' of various sorts."^[8]

The C-Value Paradox

How much of the human genome is junk?

Some scientists are convinced that junk DNA does not exist. For example, Peter Larsen declared in 2018 that,

"There is no such thing as 'junk DNA.' Indeed, a suite of discoveries made over the past few decades have put to rest this misnomer and have identified many important roles that so-called junk DNA provides to both genome and function."^[9]

This is a widely held point of view although most of these authors don't explain why obvious examples of junk DNA, such as pseudogenes and broken bits of transposons, don't qualify as junk DNA.

Mutation load

The connection between the mutation load argument and junk DNA appeared in a paper by Susumu Ohno in 1972 where he said,

"All in all, it appears that the calculations made by Muller, Kimura and others are not far off the mark and that at least 90% of our genomic DNA is 'junk' or 'garbage' of various sorts."^[10]

.... see "Gene: Mutation" ....

Most mutations are due to DNA replication errors. The DNA replication complex is highly accurate and newly replicated DNA will only have only about one error for every 10 billion base pairs replicated (10^-10 per bp per replication.) - the estimates in various publications range from 10^-9 to 10^-11. ^[11]^[12]^[13]^[14]^[15] The overall replication error rate is the product of (1) the intrinsic error rate of the polymerization reaction, (2) the errors that are corrected by proofreading, and (3) the errors that are corrected by repair enzymes following DNA replication.^[11]

The extraordinary accuracy of DNA replication means that mutations will be rare in unicellular organisms with small genomes, such as bacteria. However, when mutations occur they will likely affect genes since genes take up a large part of the bacterial genome. Such mutations have a good chance of being deleterious.

The overall DNA replication error rate applies to all cell divisions in multicellular organisms.^[16] This means a much greater chance of a mutation being passed on the the daughter cells in various tissues in species with large genomes. In humans, for example, an overall error rate of 10^-10 means that there will be 0.62 mutations every time a cell divides (assuming cells are diploid and a genome size of 3.1 x 10⁹). Spontaneous somatic cell mutations are responsible for many human diseases, including cancer.^[17]^{[Note 1]}

In multicellular species, the mutation rate per generation can be calculated from the DNA replication error rate knowing the number of cell divisions that occur in germline cells.^[18]^[12] It can also be observed directly by sequencing the genomes of each parent and their offspring. These two values agree in humans, leading to an estimate of about 100 new mutations in every newborn baby.^[14]^{[Note 2]} Mutation rates can also be calculated by comparing the genome sequences of two closely related sequences, such as humans and chimpanzees, and these rates are roughly the same as those obtained by the two other methods.^[19]^[20]^[21]^[22]^[23]

Since the phylogenetic rate only measures the neutral mutation rate, the agreement of the three estimates means that most of the human and chimpanzee genomes is evolving at the neutral rate - an observation that's consistent with the idea that most of the genome is junk .

Genes occupy about 45% of the human genome so in every newborn child there will be approximately 45 new mutations in genes and 55 new mutations elsewhere . If a large fraction of those mutations were deleterious then human species could not survive such a mutation load (genetic load). This lead to predictions in the late 1940s by one of the founders of population genetics,J.B.S. Haldane, and by Nobel laureate, Hermann Muller, that only a small percentage of the human genome contains functional DNA elements that can be destroyed by mutation.^[24]^[25]

In 1966 Muller reviewed these prediction and concluded that the human genome could only contain about 30,000 genes based on the known mutation rate and the number of deleterious mutations that the species could tolerate . ^[26] Similar predictions were made by other leading experts in molecular evolution who concluded that the human genome could not contain more than 40,000 genes and that less than 10% of the genome was functional.^[18]^[27] ^[28] These predictions were confirmed with the publication of the human genome sequence.

The connection between the mutation load argument and junk DNA appeared in a paper by Susumu Ohno in 1972 where he said,

"All in all, it appears that the calculations made by Muller, Kimura and others are not far off the mark and that at least 90% of our genomic DNA is 'junk' or 'garbage' of various sorts."^[29]

Several hundred thousand human genome have been sequenced making it possible to analyze the regions that are subject to purifying selection, that is, sequences that seem to be protected from mutations because such mutations are very deleterious. The results show that only a small percentage of the genome (less than 10%) seems to be functional by this criterion. Less than half of the sites subject to purifying selection lie within genes and these are concentrated in coding regions, the regions specifying functional non-codong RNAs, and intron splice sites. Other sites subject to purifying selection include regulatory sequences.^[30]^[31]^[32]

Notes

Transposon-related sequences

Molecular evolution

DRAFT SECTION

Junk DNA stub for Non-Coding DNA

Origin of introns (Feb. 25, 2023)

Remove Prokaryotic Cell Diagram

Are introns mostly junk?

Highly repetitive DNA

Untranslated regions

Defining the genome

Conflicting definitions 'gene'

There are many different ways to use the term "gene" based on different aspects of their inheritance, selection, biological function, or molecular structure but most of the definitions fall into two categories, the Mendelian gene or the molecular gene. (12 = Orgogozo et al. (2016) ^[55] ^[56]^[57]

The Mendelian gene is the classical gene of genetics and it refers to any heritable trait. This is the gene described in "The Selfish Gene" 14 = Dawkins). More thorough discussions of this version of a gene can be found in the articles on Genetics and Gene-centered view of evolution. This article focuses on the molecular gene—the gene that's described in terms of DNA sequence. There are many different different definitions of this gene - some of which are mispleading or incorrect. Cite error: A <ref> tag is missing the closing </ref> (see the help page)..

There are lots of different ways to use the term "gene." Richard Dawkins, for example, wrote a book called "The Selfish Gene"^[58] where 'gene' simply meant any part of the chromosome that was subject to natural selection. This 'gene' is often referred to as the "Mendelian gene" whereas the physical gene described in this article is called the "molecular gene." ^[59]

The very first edition of the textbook "Molecular Biology of the Gene" (1965) described two kinds of molecular gene: protein-coding genes and those that specified functional RNA molecules such as ribosomal RNA and tRNA (noncoding genes).^[60] But the idea of two kinds of genes dates back to the late 1950's when Jacob and Monod speculated that regulatory genes might produce repressor RNAs.^[61]

This idea of two kinds of genes is still part of the definition of a gene in most textbooks. For example,

"The primary function of the genome is to produce RNA molecules. Selected portions of the DNA nucleotide sequence are copied into a corresponding RNA nucleotide sequence, which either encodes a protein (if it is an mRNA) or forms a 'structural' RNA, such as a transfer RNA (tRNA) or ribosomal RNA (rRNA) molecule. Each region of the DNA helix that produces a functional RNA molecule constitutes a gene."^[62]

"We define a gene as a DNA sequence that is transcribed. This definition includes genes that do not encode proteins (not all transcripts are messenger RNA). The definition normally excludes regions of the genome that control transcription but are not themselves transcribed. We will encounter some exceptions to our definition of a gene - surprisingly, there is no definition that is entirely satisfactory."^[63]

"A gene is a DNA sequence that codes for a diffusible product. This product may be protein (as is the case in the majority of genes) or may be RNA (as is the case of genes that code for tRNA and rRNA). The crucial feature is that the product diffuses away from its site of synthesis to act elsewhere."^[64]

The important parts of such definitions are: (1) that a gene corresponds to a transcription unit; (2) that genes produce both mRNA and noncoding RNAs; and (3) regulatory sequences control gene expression but are not part of the gene itself. However, there's one other important part of the definition and it is emphasized in Kostas Kampourakis' book "Making Sense of Genes."

"Therefore in this book I will consider genes as DNA sequences encoding information for functional products, be it proteins or RNA molecles. With 'encoding information,' I mean that the DNA sequence is used as a template for the production of an RNA molecule or a protein that performs some function.'^[55]

The emphasis on function is essential because there are stretches of DNA that produce non-functional transcripts and they don't qualify as genes. These include obvious examples such as transcribed pseudogenes as well as less obvious examples such as junk RNA produced as noise due to transcription errors. In order to qualify as a true gene, by this definition, one has to prove that the transcript has a biological function.^[55]

Early speculations on the size of a typical gene were based on high resolution genetic mapping and on the size of proteins and RNA molecules. A length of 1500 base pairs seemed reasonable at the time (1965).^[60] This was based on the idea that the gene was the DNA that was directly responsible for production of the functional product. The discovery of introns in the 1970s meant that many eukaryotic genes were much larger than the size of the functional product would imply. Typical mammalian protein-coding genes, for example, are about 62,000 base pairs in length (transcribed region) and since there are about 20,000 of them they occupy about 35-40% of the mammalian genome (including the human genome).^[65]^[66]^[67]

In spite of the fact that both protein-coding genes and noncoding genes have been known for more than 50 years, there are still a number of textbooks, websites, and scientific publications that define a gene as a DNA sequence that specifies a protein. In other words, the definition is restricted to protein-coding genes. Here's an example from a recent article in American Scientist.

What Is a Gene, Really?

... to truly assess the potential significance of de novo genes, we relied on a strict definition of the word "gene" with which nearly every expert can agree. First, in order for a nucleotide sequence to be considered a true gene, an open reading frame (ORF) must be present. The ORF can be thought of as the "gene itself"; it begins with a starting mark common for every gene and ends with one of three possible finish line signals. One of the key enzymes in this process, the RNA polymerase, zips along the strand of DNA like a train on a monorail, transcribing it into its messenger RNA form. This point brings us to our second important criterion: A true gene is one that is both transcribed and translated. That is, a true gene is first used as a template to make transient messenger RNA, which is then translated into a protein.^[68]

This restricted definition is so common that it has spawned many recent articles that criticize this "standard definition" and call for a new expanded definition that includes noncoding genes.^[69]^[70]^[71] However, this so-called "new" definition has been around for more than half a century and it's not clear why some modern writers are ignoring noncoding genes.

There are exceptions to the standard definition of a gene; for example, some viruses have an RNA genome. The one important exception concerns bacterial operons where a contiguous stretch of DNA containing multiple protein-coding regions is transcribed into one large mRNA. Scientists usually refer to each of the coding regions as separate genes in this case. The only significant controversy over the definition of a gene is whether to include the regulatory sequences that control transcription of the gene. The general consensus among scientists is that regulatory elements control the expression of a gene but are not part of the gene.

Repeat sequences, transposons and viral elements

Virus DNA

There are two main types of viruses, DNA viruses and RNA viruses. Some RNA viruses are called retroviruses in eukaryotes because the RNA is 'retrotranscribed' into DNA as part of the life cycle. In prokaryotes, these viruses are called bacteriophage or phage.

Sometimes the viral genome can become incorporated into the host genome, either as part of the normal life cycle or by accident. The viral sequence will then be passed on to daughter cells following DNA replication and cell division. If the insertion occurs in the germ line of multicellular species then the viral genome will be inherited in the next generation and the viral DNA may become fixed in the genome by random genetic drift.^[72]

The viral genome usually contains virus-specific genes that are transcribed and translated, which means that this DNA doesn't qualify as 'non-coding' in the strictest sense of the word, but, with some exceptions, the viral DNA evolves at the neutral rate of evolution^[73] so it soon becomes non-functional and qualifies as junk DNA. The exceptions include a few retroviral genes that have secondarily become essential in the life of the host.^[72]

DNA viruses and their degenerate descendants occupy about 3-4% of the human genome and RNA virus fragments take up about 9%.^[74] Viral DNAs have inserted into introns and also the spaces between genes (intergenic DNA). Since introns take up a substantial portion of the genome, the viral DNA elements are about equally distributed between introns and intergenic DNA. ^[75]

Transposons and retrotransposons are mobile genetic elements. Retrotransposon repeated sequences, which include long interspersed nuclear elements (LINEs) and short interspersed nuclear elements (SINEs), account for a large proportion of the genomic sequences in many species. Alu sequences, classified as a short interspersed nuclear element, are the most abundant mobile elements in the human genome. Some examples have been found of SINEs exerting transcriptional control of some protein-encoding genes.^[76]^[77]^[78]

Endogenous retrovirus sequences are the product of reverse transcription of retrovirus genomes into the genomes of germ cells. Mutation within these retro-transcribed sequences can inactivate the viral genome.^[79]

Over 8% of the human genome is made up of (mostly decayed) endogenous retrovirus sequences, as part of the over 42% fraction that is recognizably derived of retrotransposons, while another 3% can be identified to be the remains of DNA transposons. Much of the remaining half of the genome that is currently without an explained origin is expected to have found its origin in transposable elements that were active so long ago (> 200 million years) that random mutations have rendered them unrecognizable.^[80] Genome size variation in at least two kinds of plants is mostly the result of retrotransposon sequences.^[81]^[82]

Protein-coding genes

Biochemical activity

Another criterion that has been used to estimate functional elements is biochemical activity. Biochemical activity includes whether a given locus is transcribed or whether it binds a transcription factor.

In a series of papers published in 2012 the Encyclopedia of DNA Elements (ENCODE) project reported that detectable biochemical activity was observed in regions covering at least 80% of the human genome.^[90] These conclusions were promoted by a publicity campaign announcing the demise of junk DNA.^[91]^[92]

The ENCODE conclusions were challenged in a series of publications over the next few years. The challengers suggested that many transcripts are spurious transcripts that do not necessarily come from functional regions of the genome. They also suggested that many transcription factor binding sites are nonfunctional sites that occur by chance in large genomes.^[93]^[5]^[94]^[95]^[96]^[36]^[4]^[97]^[98]^[6]

The challengers argued that biochemical activity is not a reliable indicator of function and in 2014 the ENCODE researchers agreed with the challengers and abandoned their claim that 80% of the human genome was functional. They also presented evidence for junk DNA that was missing in their 2012 papers.^[99]

The most recent attempt to define function using biochemical activity focuses on identifying which transcripts have a function and which transcription factor binding sites are true regulatory sequences.^[100] One way of distinguishing between true functional biochemical activity and spurious nonfunctional biochemical activity is to look for evidence of sequence conservation or purifying selection. Opponents of junk DNA argue that biochemical activity detects functional regions of the genome that are not identified by sequence conservation or purifying selection.^[101] ^[102]^[103]

Kellis et al. (2014)

According to the ENCODE researchers, the genetic approach looks at the phenotypic effects of mutations in order to identify functional regions of the genome. They maintain that the genetic approach is the "gold standard" for defining function. We know that there can be mutations in non-functional (junk) DNA that cause genetic diseases, for example by creating spurious splice sites, so the genetic approach cannot be a definitive criterion for identifying function.

The question is whether there are clear examples where the genetic approach identifies function elements of the genome that are not under purifying selection. In the absence of such examples, purifying selection is the only reasonable criterion.

The ENCODE researchers recognize that sequence conservation is indicative of purifying selection and they seem to implicitly accept that the definitive criterion is purifying selection and not just sequence conservation. They highlight technical difficulties in detecting sequence conservation but they don't discuss methods of detecting purifying selection.

They point out that human-specific elements will not be conserved but they fail to mention that they will still be subject to purifying selection, which is the preferred definition of function for that very reason.

They conclude that "absence of conservation cannot be interpreted as evidence for the lack of function." There are two problems with that statement. First, it's not conservation that defines function; it's purifying selection. Second, the goal is to identify function and not to provide evidence that a given region of DNA does not have a function (proving the negative). What we need is solid evidence that there are functional regions of the genome that are not under purifying selection so we can use another criterion to identify function if such a criterion exists.

The ENCODE researchers say that the biochemical approach identifies "candidate" functional elements. This is correct as long as it is understood to mean that it detects only a subset of true functional elements. The important point is that the biochemical approach by itself does not identify 'actual' functional elements but only 'possible' (candidate) functional elements. This is a retraction of their 2012 claim that all sequence with biochemical activity are functional.

They now point out that regions exhibiting biochemical function "are not always deterministic evidence of function, but can occur stochastically." This is exactly the point made by critics of their 2012 claim that 80% of the genome is functional.

The ENCODE researchers have conceded that not all regions of biochemical activity are functional but in order for biochemical activity to be a useful addition in identifying function there would have to be examples of true functional elements with biochemical activity that are not under purifying selection. Otherwise, the purifying selection definition supersedes biochemical activity in all cases.

In the context of the junk DNA debate, it is important to identify functional regions of the genome whether or not we know the exact type of function the the region specifies. The ENCODE researchers state that "Our results reinforce the principle that each approach [genetic, biochemical, evolutionary] provided complementary information and that we need to use combinations of all three to elucidate genome function in human biology and disease." Unfortunately, they have not provided a single example where biochemical activity or the genetic approach identifies function (i.e. not junk) in the absence of purifying selection but there are examples where the genetic and biochemical approaches identify regions that are not functional and assumed to be junk. It's difficult to see why all three approaches are said to be complimentary.

References

For references with author credit

((cite web)): Empty citation (help)

For references without author credit

((cite web)): Empty citation (help) access-date= 2023-02-28

Citing a symposium volume.

Bloggs, Fred (January 1, 2001). "Chapter 2: The History of the Bloggs Family". In Doe, John (ed.). Big Compilation Book with Many Chapters and Distinct Chapter Authors. Book Publishers. pp. 100–110.

Link to subsections within an article. Junk DNA section of Non-coding_DNA

This is the first citation to Alberts et al. 1994 textbook.^[104] This is the second citation.^[104]

Shortened footnote template (sfn). Refers to the first reference in the list that corresponds to the same author name and date (e.g. Gould (2002) pp. 1-10)^[105]

Alberts et al. 1994 textbook^[106]

Amaral et al. (2023) (human genome catalogue) ^[86]

Abascal et al. (2018) ^[83]

Besenbacher et al. (2019) (mutation rates in great apes)^[107]

Bishop (1974)^[108]

Britten and Davidson (1969)^[109]

Britten and Kohne (1968)^[110]

Brown (2018) (Genomes 4)^[111]

Brown (2018) (Genomes 4: Chapt. 12 Transcriptomics)^[112]

Brunet and Doolittle (2014)^[96]

Brzović and Šustar (2020)^[113]

Casane et al. (2015)^[92]

Cavalier-Smith (1978)^[114]

Cavalier-Smith (1980)^[115]

Cavalier-Smith (1991) (introns)^[116]

Christmas et al. 2023^[117]

Comings (1972) (book)^[118]

Comings (1972) (book review)^[119]

Coyne (2009) ^[120]

Crick (1978)(introns)^[121]

Dawkins (1976) (The Selfish Gene)^[122]

Dawkins and Wong (2016)^[123]

De Parseval and Heidmann (2005) (ERVs)^[124]

Doolittle (1978)(introns)^[125]

Doolittle (1991) (origin of inrons) ^[126]

Doolittle (2013)^[95]

Doolittle and Sapienza (1980)^[127]

Doolittle et al. (2014)^[97]

Dover (1980)^[128]

Dover and Doolittle (1980)^[129]

Dukler et al. (2022) (genetic load)^[130]

Echols and Goodman (1991) (DNA replication)^[131]

Eddy (2012)^[5]

Eddy (2013)^[94]

Elliot et al. (2014)^[132]

ENCODE (2012)^[133]

ENCODE cartoon^[134]

ENCODE EMBL video^[135]

ENCODE The Guardian video^[136]

ENCODE Maher blog (2012)^[137]

Ensemble Homo sapiens^[138]

Francis and Wörheide (2017) (50% genes)^[139]

Galeota-Sprung et al. (2020)^[140]

Gericke and Hagberg (2007) (gene definitions)^[141]

Germain et al. (2014)^[142]

Gil and Latorre (2012) (junk DNA in bacteria)^[143]

Gilbert (1978)(introns)^[144]

Gilbert (1985)(introns)^[145]

Gould (2002) ^[146]

Graur (2016) (textbook) ^[147]

Graur (2017)^[148]

Graur et al. (2013)^[4]

Graur et al. (2015)^[149]

Gregory (2005)^[150]

Gymrek et al. (2016) (STRs)^[151]

Haldane (1949)^[24]

Halldorson et al. (2022)(genetic load)^[152]

Häsler et al. (2007) (Alus not junk)^[153]

Hatje et al. (2019)^[89]

Haerty and Ponting (2014)^[154]

Hopkin (2009) (gene definition)^[155]

Hoyt et al. (2022) (T2T sequence)^[156]

Hubé and Francastel (2015) (introns)^[157]

Irimia and Roy (2014) (origin of introns)^[42]

Jain (1980)^[158]

Jensen (2001) (orthologs and paralogs)^[159]

Jensen et al. (2013) (pervasive transcription)^[160]

Johnson (2019) (ERVs)^[161]

Judson (1996) (The Eight Day of Creation)^[162]

Jukes (1979) (letter to Crick)^[163]

Kampourakis (2017)^[55]

Keightly (2012) (mutation rates)^[164]

Kimura (1968)^[18]

Kimura and Ohta (1971)^[165]

King and Jukes (1969)^[27]

Kirchberger et al. (2020) (bacterial genomes)^[166]

Kronenberg et al. (2018) (great ape genomes)Cite error: The <ref> tag has too many names (see the help page).

Kunkel (2009) (DNA replication)^[167]

Lander et al. (2001) (human genome)^[168]

Larsen (2018)^[169]

Lewin (1974)^[170]

Lewin (1974b)^[171]

Lewin (1974c)(Cell editorial)^[172]

Lewin (2004) (Genes VIII)Cite error: The <ref> tag has too many names (see the help page).

Leypold and Speicher (2021) (sequence conservation)^[173]

Linquist (2022)^[174]

Linquist et al. (2020)^[3]

Lynch (2016) (~100 mutations per newborn)^[175]

Lynch et al. (2016) (mutation rate)^[176]

Mattick (2023)^[102]

Mattick (2023b)^[103]

Mattick and Dinger (2013)^[101]

McHughen (2020)^[177]

Moorjani et al. 2016) (primate molecular clock)^[178]

Moran et al. (2012) Principles of Biochemistry)Cite error: The <ref> tag has too many names (see the help page).

Morange (2014) (junk DNA controversy)^[98]

Morange (2020)(intron history)^[179]

Mortola and Long (2021) (gene definition/birth)^[180]

Muller (1950)^[25]

Muller, H.J. (1966)^[26]

Nachman (2004) (mutation rate history)^[181]

Neil and Faribrother (2019) (intron function)^[182]

Nelson et al. (2004)^[183]

Nowak and Waclaw (2017) (review of mutations cause cancer)^[184]

Niu and Jiang (2013)^[6]

O'Brian (1973)^[185]

Ohno (1972) (So much 'Junk' DNA)^[186]

Ohno (1972) (Genetic Simplicity)^[28]

Ohno (1972) (regulatory sequences)^[187]

Ohno and Yomo (1991)^[188]

Ohta (1973) ^[189]

Ohta (1998) ^[190]

Ohta and Kimura (1971)(30,000 genes)^[191]

Omenn et al. (2020) ^[85]

Orgel and Crick (1980)^[192]

Orgel, Crick and Sapienza (1980)^[193]

Orgogozo et al. (2016) (Mendelian vs Molecular Gene)Cite error: The <ref> tag has too many names (see the help page).

Palazzo and Gregory (2014)^[36]

Palazzo and Kejiou (2022) (molecular biologists)^[194]

Palazzo and Lee (2015)^[195]

Pearson (2006) (gene definition)^[196]

Pennisi (2007) (gene definition)^[197]

Pennisi (2012)^[91]

Piovesan et al. (2019)^[198]

Pioveasan et al. (2919) (length weight of human genome)^[199]

Ponicson et al. (2010) (SINE function)^[200]

Ponting (2017)^[201]

Ponting and Hardison (2011)^[202]

Ponting and Haerty (2022)^[203]

Ségurel et al. (2014) (mutation rates)^[204]

Scally (2016) (human mutation rate)^[205]

Scally and Durbin (2012) (human mutation rate)^[206]

Sharp (1991) ("Five easy pieces")^[207]

Sverdlov (2017) (junk RNA)^[208]

Sweet (2022)(junk DNA history thesis)^[209]

Thomas (1971) (C-value Paradox)^[210]

Yu et al. (2002) (minimal introns not junk)^[211]

van Bakel et al. (2011) (pervasive transcription)^[212]

Wade and Grainger (2018) (spurious transcription)^[213]

Walters et al. (2009) (SINE functions)^[214]

Watson (1965) (Molecular Biology of he Gene)^[215]

Wong et al. (2000) (are introns junk?)^[216]

Zhou et al. (2021) (DNA replication)^[217]

Press release: Yale 2012 ^[218]

^ Graur D (2016). Molecular and Genome Evolution. Sunderland MA (USA): Sinauer Associates, Inc. ISBN 9781605354699.
^ Linquist S (2022). "Causal-role myopia and the functional investigation of junk DNA". Biology & Philosophy. 37: 1–23. doi:10.1007/s10539-022-09853-2.
^ ^a ^b Linquist S, Doolittle WF, and Palazzo AF (2020). "Getting clear about the F-word in genomics". PLOS Genetics. 16: e1008702. doi:10.1371/journal.pgen.1008702.((cite journal)): CS1 maint: unflagged free DOI (link)
^ ^a ^b ^c Graur D, Zheng Y, Price N, Azevedo RB, Zufall RA, Elhaik E (2013). "On the immortality of television sets: "function" in the human genome according to the evolution-free gospel of ENCODE". Genome Biology and Evolution. 5 (3): 578–590. doi:10.1093/gbe/evt028. PMC 3622293. PMID 23431001.
^ ^a ^b ^c Eddy SR (2012). "The C-value paradox, junk DNA and ENCODE". Current Biology. 22: R898. doi:10.1016/j.cub.2012.10.002.
^ ^a ^b ^c Niu DK, Jiang L (2013). "Can ENCODE tell us how much junk DNA we carry in our genome?". Biochemical and biophysical research communications. 430: 1340–1343. doi:10.1016/j.bbrc.2012.12.074.
^ Larsen PA (2018). "Transposable elements and the multidimensional genome". Chromosome Research. 26: 1–3. doi:10.1007/s10577-018-9575-2.
^ Ohno S (1972). "An argument for the genetic simplicity of man and other mammals". Journal of Human Evolution. 1: 651–662. doi:10.1016/0047-2484(72)90011-5.
^ Larsen PA (2018). "Transposable elements and the multidimensional genome". Chromosome Research. 26: 1–3. doi:10.1007/s10577-018-9575-2.
^ Ohno S (1972). "An argument for the genetic simplicity of man and other mammals". Journal of Human Evolution. 1: 651–662. doi:10.1016/0047-2484(72)90011-5.
^ ^a ^b Echols H, and Goodman MF (1991). "Fidelity mechanisms in DNA replication". Annual review of biochemistry. 60: 477–511. doi:10.1146/annurev.bi.60.070191.002401.
^ ^a ^b Kunkel, TA (2009). "Evolving views of DNA replication (in) fidelity". Cold Spring Harbor Symposia on Quantitative Biology. 74: 91–101. doi:10.1101/sqb.2009.74.027.
^ Keightley, PD (2012). "Rates and fitness consequences of new mutations in humans". Genetics. 190: 295–304. doi:10.1534/genetics.111.134668.
^ ^a ^b Michael, Lynch (2016). "Mutation and Human Exceptionalism: Our Future Genetic Load". Genetics. 202: 869–875. doi:10.1534/genetics.115.180471.
^ Zhou ZX, Lujan SA, Burkholder AB, StCharles J, Dahl J, Farrell CE, Williams JS, and Kunkel TA (2021). "How asymmetric DNA replication achieves symmetrical fidelity". Nature structural & molecular biology. 28: 1020–1028. doi:10.1038/s41594-021-00691-6.
^ Lynch M, Ackerman MS, Gout JF, Long H, Sung W, Thomas WK, and Foster PL (2016). "Genetic drift, selection and the evolution of the mutation rate". Nature Reviews Genetics. 17: 704–714. doi:10.1038/nrg.2016.104.
^ Nowak MA, and Waclaw B (2017). "Genes, environment, and "bad luck"". Science. 355: 1266–1267. doi:10.1126/science.aam9746.
^ ^a ^b ^c Kimura, Mootoo (1968). "Evolutionary rate at the molecular level" (PDF). Nature. 217: 624–626.
^ Ségurel L, Wyman MJ, and Przeworski M (2014). "Determinants of mutation rate variation in the human germline". Annual review of genomics and human genetics. 15: 47–70. doi:10.1146/annurev-genom-031714-125740.
^ Scally A, and Durbin R (2012). "Revising the human mutation rate: implications for understanding human evolution". Nature Reviews Genetics. 13: 745–753. doi:10.1038/nrg3295.
^ Moorjani P, Gao Z, and Przeworski M (2016). "Human germline mutation and the erratic evolutionary clock". PLoS Biology. 14: e2000744. doi:10.1371/journal.pbio.2000744.((cite journal)): CS1 maint: unflagged free DOI (link)
^ Scally, Aylwyn (2016). "The mutation rate in human evolution and demographic inference". Current opinion in genetics & development. 41: 36–43. doi:10.1016/j.gde.2016.07.008.
^ Besenbacher S, Hvilsom C, Marques-Bonet T, Mailund T, and Schierup M (2019). "Direct estimation of mutations in great apes reconciles phylogenetic dating". Nature ecology & evolution. 3: 286–292. doi:10.1038/s41559-018-0778-x.
^ ^a ^b Haldane, JBS (1949). "The rate of mutation of human genes". Hereditas. 35: 267–273. doi:10.1111/j.1601-5223.1949.tb03339.x.
^ ^a ^b Muller, Hermann J (1950). "Our load of mutations" (PDF). American journal of human genetics. 2: 111–175.
^ ^a ^b Muller HJ (1966). "The gene material as the initiator and the organizing basis of life". American Naturalist. 100: 493–517.
^ ^a ^b King JL, and Jukes TH (1969). "Non-Darwinian evolution". Science. 164: 788–798.
^ ^a ^b Ohno S (1972). "An argument for the genetic simplicity of man and other mammals". Journal of Human Evolution. 1: 651–662. doi:10.1016/0047-2484(72)90011-5.
^ Ohno S (1972). "An argument for the genetic simplicity of man and other mammals". Journal of Human Evolution. 1: 651–662. doi:10.1016/0047-2484(72)90011-5.
^ Halldorsson BV, Eggertsson HP, Moore KH, Hauswedell H, Eiriksson O, Ulfarsson MO, Palsson G, Hardarson MT, Oddsson A, Jensson BO, et al. (2022). "The sequences of 150,119 genomes in the UK biobank". Nature. 607: 732–740. doi:10.1038/s41586-022-04965-x.
^ Galeota-Sprung B, Sniegowski P, and Ewens W (2020). "Mutational load and the functional fraction of the human genome". Genome Biology and Evolution. 12: 273–281. doi:10.1093/gbe/evaa040.
^ Dukler N, Mughal MR, Ramani R, Huang YF, and Siepel A (2022). "Extreme purifying selection against point mutations in the human genome". Nature communications. 13: 4312. doi:10.1038/s41467-022-31872-6.
^ Ohno S, Yomo T (1991). "The grammatical rule for all DNA: junk and coding sequences". Electrophoresis. 12: 103–108. doi:10.1002/elps.1150120203.
^ "Thomas Jukes letter to Francis Crick". The Francis Crick Papers, National Library of Medicine (USA). Retrieved May 17, 2022.
^ Gil R, and Latorre A (2012). "Factors behind junk DNA in bacteria". Genes. 3: 634–650. doi:10.3390/genes3040634.((cite journal)): CS1 maint: unflagged free DOI (link)
^ ^a ^b ^c Palazzo AF, Gregory TR (May 2014). "The case for junk DNA". PLoS Genetics. 10 (5): e1004351. doi:10.1371/journal.pgen.1004351. PMC 4014423. PMID 24809441.((cite journal)): CS1 maint: unflagged free DOI (link) Cite error: The named reference "PalazzoGregory2014" was defined multiple times with different content (see the help page).
^ Morange, Michel (2014). "Genome as a Multipurpose Structure Built by Evolution". Perspectives in Biology and Medicine. 57: 162–171. doi:10.1353/pbm.2014.0008.
^ Haerty W, and Ponting CP (2014). "No Gene in the Genome Makes Sense Except in the Light of Evolution". Annual Review of Genomics and Human Genetics. 25: 71–92. doi:10.1146/annurev-genom-090413-025621.
^ Cavalier-Smith T (1991). "Intron phylogeny: a new hypothesis". Trends in Genetics. 7: 145–148. doi:10.1016/0168-9525(91)90377-3.
^ Doolittle WF (1991). "The origins of introns". Current Biology. 1: 145–146. doi:10.1016/0960-9822(91)90214-h.
^ Sharp PA (1991). ""Five easy pieces."(role of RNA catalysis in cellular processes)". Science. 254: 663–664.
^ ^a ^b ^c Irimia M, and Roy SW (2014). "Origin of spliceosomal introns and alternative splicing". Cold Spring Harbor perspectives in biology. 6: a016071. doi:10.1101/cshperspect.a016071.
^ Wong GK, Passey DA, Huang YZ, Yang Z, and Yu J (2000). "Is "junk" DNA mostly intron DNA?". Genome Research. 10: 1672–1678. doi:10.1101/gr.148900.
^ Yu J, Yang Z, Kibukawa M, Paddock M, Passey DA, and Wong GK (2002). "Minimal introns are not "junk"". Genome Research. 12: 1185–1189. doi:10.1101/gr.224602.
^ Neil CR, and Fairbrother WG (2019). "Intronic RNA: Ad 'junk'mediator of post-transcriptional gene regulation". Biochimica et Biophysica Acta (BBA)-Gene Regulatory Mechanisms. 1862: 194439. doi:10.1016/j.bbagrm.2019.194439.
^ Gymrek M, Willems T, Guilmatre A, Zeng H, Markus B, Georgiev S, Daly MJ, Price AL, Pritchard JK, Sharp AJ, Erlich Y (2016). "Abundant contribution of short tandem repeats to gene expression variation in humans". Nature Genetics. 48: 22–29. doi:10.1038/ng.3461.
^ Kronenberg ZN, Fiddes IT, Gordon D, Murali S, Cantsilieris S, Meyerson OS, Underwood JG, Nelson BJ, Chaisson MJ, Dougherty ML (2018). "High-resolution comparative analysis of great ape genomes". Science. 360: 1085. doi:10.1126/science.aar6343.
^ Alberts B, Bray D, Lewis J, Raff M, Roberts K, Watson JD (1994). Molecular Biology of the Cell, 3rd edition. London, UK: Garland Publishing Inc.
^ Lewin B (2004). Genes VIII. Upper Saddle River, NJ, USA: Pearson/Prentice Hall.
^ Moran L, Horton HR, Scrimgeour KG, Perry MD (2012). Principles of Biochemistry Fifth Edition. Upper Saddle River, NJ, USA: Pearson.
^ Kirchberger PC, Schmidt ML, and Ochman H (2020). "The ingenuity of bacterial genomes". Annual review of microbiology. 74: 815–834. doi:10.1146/annurev-micro-020518-115822.
^ Graur D (2016). Molecular and Genome Evolution. Sunderland MA, USA: Sinauer Associates, Inc. ISBN 9781605354699.
^ Brown, TA (2018). Genomes 4. New York, NY, USA: Garland Science. ISBN 9780815345084.
^ "Ensembl Human Assembly and gene annotation (GRCh38)". Ensembl. Retrieved May 30, 2022.
^ ^a ^b ^c ^d Kampourakis K (2017). Making Sense of Genes. Cambridge, UK: Cambridge University Press.
^ Gericke NM, Hagberg M (2007). "Definition of historical models of gene function and their relation to students' understanding of genetics". Science & Education. 16: 849–881. doi:10.1007/s11191-006-9064-4.
^ Meunier, Robert (2022). "Stanford Encyclopedia of Philosophy: Gene". Stanford Encyclopedia of Philosophy. Retrieved 2023/02/28. ((cite web)): Check date values in: |access-date= (help)
^ Dawkins R (1976). The selfish gene. Oxford, UK: Oxford University Press.
^ <cite journal | vauthors = Orgogozo V, Peluffo AE, Morizot B | date = 2016 | title = Chapter One-The “Mendelian Gene” and the “Molecular Gene”: Two Relevant Concepts of Genetic Units | journal = Current topics in developmental biology | volume = 119 pages = 1-26 | doi = 10.1016/bs.ctdb.2016.03.002))
^ ^a ^b Watson JD (1965). Molecular Biology of the Gene. New York, NY, USA: W.A. Benjamin, Inc.
^ Judson HF (1996). The Eight Day of Creation (Expanded Edition). Plainview, NY (USA): Cold Spring Harbor Laboratory Press.((cite book)): CS1 maint: extra punctuation (link)
^ Alberts B, Bray D, Lewis J, Raff M, Roberts K, Watson JD (1994). Molecular Biology of the Cell: Third Edition. London, UK: Garland Publishing, Inc. ISBN 0-8153-1619-4.
^ Moran LA, Horton HR, Scrimgeour KG, Perry MD (2012). Principles of Biochemistry: Fifth Edition. Upper Saddle River, NJ, USA: Pearson.
^ Lewin B (2004). Genes VIII. Upper Saddle River, NJ, USA: Pearson/Prentice Hall.
^ Piovesan A, Pelleri MC, Antonaros F, Strippoli P, Caracausi M, and Vitale L (2019). "On the length, weight and GC content of the human genome". BMC Research Notes. 12: 106–173. doi:10.1186/s13104-019-4137-z.((cite journal)): CS1 maint: unflagged free DOI (link)
^ Hubé F, and Francastel C (2015). "Mammalian Introns: When the Junk Generates Molecular Diversity". International journal of molecular sciences. 16: 4429–4452. doi:10.3390/ijms16034429.((cite journal)): CS1 maint: unflagged free DOI (link)
^ Francis WR, and Wörheide G (2017). "Similar ratios of introns to intergenic sequence across animal genomes". Genome biology and evolution. 9: 1582–1598. doi:10.1093/gbe/evx103.
^ Mortola E, Long M (2021). "Turning Junk into Us: How Genes Are Born". American Scientist. 109: 174–182.
^ Hopkin K (2009). "The Evolving Definition of a Gene: With the discovery that nearly all of the genome is transcribed, the definition of a "gene" needs another revision". BioScience. 59: 928–931. doi:10.1525/bio.2009.59.11.3.
^ Pearson H (2006). "What Is a Gene?". Nature. 441: 399–401.
^ Pennisi E (2007). "DNA study forces rethink of what it means to be a gene". Science. 316: 1556–1557. doi:10.1126/science.316.5831.1556.
^ ^a ^b Johnson WE (2019). "Origins and evolutionary consequences of ancient endogenous retroviruses". Nature Reviews Microbiology. 17: 355–370. doi:10.1038/s41579-019-0189-2.
^ De Parseval N, and Heidmann T (2005). "Human endogenous retroviruses: from infectious elements to human genes". Cytogenetic and genome research. 110: 318–332. doi:10.1159/000084964.
^ Hoyt SJ, Storer JM, Hartley GA, Grady PG, Gershman A, de Lima LG, Limouse C, Halabian R, Wojenski L, and Rodriguez M (2022 From telomere to telomere: the transcriptional and epigenetic state of human repeat elements). Science: 57. doi:10.1126/science.abk3112. ((cite journal)): Check date values in: |date= (help); Missing or empty |title= (help); Unknown parameter |voume= ignored (help)
^ Francis WR, and Wörheide G (2017). "Similar ratios of introns to intergenic sequence across animal genomes". Genome Biology and Evolution. 9: 1582–1598. doi:10.1093/gbe/evx103.
^ Ponicsan SL, Kugel JF, Goodrich JA (April 2010). "Genomic gems: SINE RNAs regulate mRNA production". Current Opinion in Genetics & Development. 20 (2): 149–155. doi:10.1016/j.gde.2010.01.004. PMC 2859989. PMID 20176473.
^ Häsler J, Samuelsson T, Strub K (July 2007). "Useful 'junk': Alu RNAs in the human transcriptome". Cellular and Molecular Life Sciences (Submitted manuscript). 64 (14): 1793–1800. doi:10.1007/s00018-007-7084-0. PMID 17514354. S2CID 5938630.
^ Walters RD, Kugel JF, Goodrich JA (August 2009). "InvAluable junk: the cellular impact and function of Alu and B2 RNAs". IUBMB Life. 61 (8): 831–837. doi:10.1002/iub.227. PMC 4049031. PMID 19621349.
^ Nelson PN, Hooley P, Roden D, Davari Ejtehadi H, Rylance P, Warren P, et al. (October 2004). "Human endogenous retroviruses: transposable elements with potential?". Clinical and Experimental Immunology. 138 (1): 1–9. doi:10.1111/j.1365-2249.2004.02592.x. PMC 1809191. PMID 15373898.
^ Lander ES, Linton LM, Birren B, Nusbaum C, Zody MC, Baldwin J, et al. (February 2001). "Initial sequencing and analysis of the human genome". Nature. 409 (6822): 860–921. Bibcode:2001Natur.409..860L. doi:10.1038/35057062. PMID 11237011.
^ Piegu B, Guyot R, Picault N, Roulin A, Sanyal A, Saniyal A, et al. (October 2006). "Doubling genome size without polyploidization: dynamics of retrotransposition-driven genomic expansions in Oryza australiensis, a wild relative of rice". Genome Research. 16 (10): 1262–1269. doi:10.1101/gr.5290206. PMC 1581435. PMID 16963705.
^ Hawkins JS, Kim H, Nason JD, Wing RA, Wendel JF (October 2006). "Differential lineage-specific amplification of transposable elements is responsible for genome size variation in Gossypium". Genome Research. 16 (10): 1252–1261. doi:10.1101/gr.5282906. PMC 1581434. PMID 16954538.
^ ^a ^b Abascal F, Juan D, Jungreis I, Martinez L, Rigau M, Rodriguez JM, Vazquez J, and Tress ML (2018). "Loose ends: almost one in five human genes still have unresolved coding status". Nucleic Acids Research. doi:10.1093/nar/gky587.
^ Hatje K, Mühlhausen S, Simm D, Killmar M (2019). "The Protein-Coding Human Genome: Annotating High-Hanging Fruits". BioEssays. 11: 1900066. doi:10.1002/bies.201900066.
^ ^a ^b Omenn GS, Lane L, Overall CM, Cristea IM, Corrales FJ, Lindskog C, Paik YK, Van Eyk JE, Liu S, and Pennington SR (2020). "Research on the human proteome reaches a major milestone:> 90% of predicted human proteins now credibly detected, according to the HUPO human proteome project". Journal of Proteome Research. 19: 4735–4746. doi:10.1021/acs.jproteome.0c00485.
^ ^a ^b Amaral P, Carbonell-Sala S, De La Vega FM, Faial T, Frankish A, Gingeras T, Guigo R, Harrow JL, Hatzigeorgiou AG, and Johnson R (2023). "The status of the human gene catalogue". Nature. 622: 41–47. doi:10.1038/s41586-023-06490-x.
^ ^a ^b Piovesan A, Antonaros F, Vitale L, Strippoli P, Pelleri MC, Caracausi M (2019). "Human protein-coding genes and gene feature statistics in 2019". BMC research notes. 12: 315. doi:10.1186/s13104-019-4343-8.((cite journal)): CS1 maint: unflagged free DOI (link)
^ Francis WR, Wörheide G (June 2017). "Similar Ratios of Introns to Intergenic Sequence across Animal Genomes". Genome Biology and Evolution. 9 (6): 1582–1598. doi:10.1093/gbe/evx103. PMC 5534336. PMID 28633296.
^ ^a ^b Hatje K, Mühlhausen S, Simm D, Killmar M (2019). "The Protein-Coding Human Genome: Annotating High-Hanging Fruits". BioEssays. 11: 1900066. doi:10.1002/bies.201900066.
^ Cite error: The named reference Nature489p57 was invoked but never defined (see the help page).
^ ^a ^b Pennisi E (2012). "ENCODE Project Writes Eulogy for Junk DNA". Science. 337: 1159–1161. doi:10.1126/science.337.6099.1159.
^ ^a ^b Casane D, Fumey J, and Laurenti P (2015). "L'apophénie d'ENCODE ou Pangloss examine le génome humain". médecine/sciences. 31: 680–686. doi:10.1051/medsci/20153106023.
^ McKie R (February 24, 2013). "Scientists attacked over claim that 'junk DNA' is vital to life". The Observer.
^ ^a ^b Eddy SR (2013). "The ENCODE project: missteps overshadowing a success". Current Biology. 23: R259–R261. doi:10.1016/j.cub.2013.03.023.
^ ^a ^b Doolittle WF (April 2013). "Is junk DNA bunk? A critique of ENCODE". Proceedings of the National Academy of Sciences of the United States of America. 110 (14): 5294–5300. Bibcode:2013PNAS..110.5294D. doi:10.1073/pnas.1221376110. PMC 3619371. PMID 23479647. Cite error: The named reference "doolittle2013" was defined multiple times with different content (see the help page).
^ ^a ^b Brunet TD, and Doolittle WF (2014). "Getting "function" right". Proceedings of the National Academy of Sciences (USA). 111: E3365–E3365. doi:10.1073/pnas.1409762111.
^ ^a ^b Doolittle WF, Brunet TD, Linquist S, and Gregory TR (2014). "Distinguishing between "function" and "effect" in genome biology". Genome Biology and Evolution. 6: 1234–1237. doi:10.1093/gbe/evu098.
^ ^a ^b Morange, Michel (2014). "Genome as a Multipurpose Structure Built by Evolution". Perspectives in Biology and Medicine. 57: 162–171. doi:10.1353/pbm.2014.0008.
^ Cite error: The named reference kellis was invoked but never defined (see the help page).
^ Abascal F, Acosta R, Addleman NJ, Adrian J, et al. (July 30, 2020). "Expanded Encyclopaedias of DNA elements in the Human and Mouse Genomes". Nature. 583 (7818): 699–710. Bibcode:2020Natur.583..699E. doi:10.1038/s41586-020-2493-4. PMC 7410828. PMID 32728249. Operationally, functional elements are defined as discrete, linearly ordered sequence features that specify molecular products (for example, protein-coding genes or noncoding RNAs) or biochemical activities with mechanistic roles in gene or genome regulation (for example, transcriptional promoters or enhancers).
^ ^a ^b Mattick JS, and Dinger ME (2013). "The extent of functionality in the human genome". The HUGO Journal. 7: 2. doi:10.1186/1877-6566-7-2.((cite journal)): CS1 maint: unflagged free DOI (link)
^ ^a ^b Mattick, John S (2023). "RNA out of the mist". TRENDS in Genetics. 39: 187–207. doi:10.1016/j.tig.2022.11.001.
^ ^a ^b Mattick JS (2023). "A Kuhnian revolution in molecular biology: Most genes in complex organisms express regulatory RNAs". BioEssays. 2300080. doi:10.1002/bies.202300080.
^ ^a ^b Alberts B, Bray D, Lewis J, Raff M, Roberts K, Watson JD (1994). Molecular Biology of the Cell: Third Edition. London, UK: Garland Publishing, Inc. ISBN 0-8153-1619-4.
^ Gould 2002, pp. 1–10.
^ Alberts B, Bray D, Lewis J, Raff M, Roberts K, Watson JD (1994). Molecular Biology of the Cell: Third Edition. London, UK: Garland Publishing, Inc. ISBN 0-8153-1619-4.
^ Besenbacher S, Hvilsom C, Marques-Bonet T, Mailund T, and Schierup M (2019). "Direct estimation of mutations in great apes reconciles phylogenetic dating". Nature ecology & evolution. 3: 286–292. doi:10.1038/s41559-018-0778-x.
^ Bishop, J.O. (1974). "The gene numbers game". Cell. 2: 81–86. doi:10.1016/0092-8674(74)90095-6.
^ Britten RJ, and Davidson EH (1969). "Gene regulation for higher cells: a theory". Science. 165: 349–357. doi:10.1126/science.165.3891.349.
^ Britten R, and Kohne D (1968). "Repeated Sequences in DNA". Science. 161: 529–540. doi:10.1126/science.161.3841.529.
^ Brown, TA (2018). Genomes 4. New York, NY, USA: Garland Science. ISBN 9780815345084.
^ Brown, TA (2018). "Chapter 12: Transcriptomics". Genomes 4. New York, NY, USA: Garland Science. ISBN 9780815345084.
^ Brzović Z, and Šustar P. "Postgenomics function monism". Studies in History and Philosophy of Science Part C: Studies in History and Philosophy of Biological and Biomedical Sciences: 101243. doi:10.1016/j.shpsc.2019.101243.
^ Cavalier-Smith, Thomas (1978). "Nuclear volume control by nucleoskeletal DNA, selection for cell volume and cell growth rate, and the solution of the DNA C-value paradox". Journal of Cell Science. 34: 247–278.
^ Cavalier-Smith, Thomas (1980). "How selfish is DNA?". Nature. 285: 617–618. doi:10.1038/285617a0.
^ Cavalier-Smith T (1991). "Intron phylogeny: a new hypothesis". Trends in Genetics. 7: 145–148. doi:10.1016/0168-9525(91)90377-3.
^ Christmas MJ, Kaplow IM, Genereux DP, Dong MX, Hughes GM, Li X, Sullivan PF, Hindle AG, Andrews G, and Armstrong JC (2023). "Evolutionary constraint and innovation across hundreds of placental mammals". Science. 380: 366. doi:10.1126/science.abn3943.
^ Comings DE (1972). "The structure and function of chromatin". Advances in human genetics. Springer. p. 237-431.
^ Comings, DE (1972). "Review of Evolution of Genetics Systems". American Journal of Human Genetics. 25: 340-342.
^ Coyne, Jerry A. (2009). Why Evolution is True. New York: Viking. ISBN 978-0-670-02053-9. LCCN 2008033973. OCLC 233549529.
^ Crick, Francis (1979). "Split genes and RNA splicing". Science. 204: 264–271. doi:10.1126/science.373120.
^ Dawkins R (1976). The selfish gene. Oxford, UK: Oxford University Press.
^ Dawkins R, and Wong Y (2016). "The Humped Bladderwort's Tale". The Ancestor's Tale 2nd ed. Weidenfeld & Nicolson.
^ De Parseval N, and Heidmann T (2005). "Human endogenous retroviruses: from infectious elements to human genes". Cytogenetic and genome research. 110: 318–332. doi:10.1159/000084964.
^ Doolittle, W.F. (1978). "Genes in pieces: were they ever together?". Nature. 272: 581–582. doi:10.1038/272581a0.
^ Doolittle WF (1991). "The origins of introns". Current Biology. 1: 145–146. doi:10.1016/0960-9822(91)90214-h.
^ Doolittle WF, and Sapienza C (1980). "Selfish genes, the phenotype paradigm and genome evolution". Nature. 284: 601–603. doi:10.1038/284601a0.
^ Glover, G (1980). "Ignorant DNA?". Nature. 285: 618–619.
^ Dover G, and Doolittle WF (1980). "Modes of genome evolution". Nature. 288: 646–647.
^ Dukler N, Mughal MR, Ramani R, Huang YF, and Siepel A (2022). "Extreme purifying selection against point mutations in the human genome". Nature communications. 13: 4312. doi:10.1038/s41467-022-31872-6.
^ Echols H, and Goodman MF (1991). "Fidelity mechanisms in DNA replication". Annual review of biochemistry. 60: 477–511. doi:10.1146/annurev.bi.60.070191.002401.
^ Elliott TA, Linquist S, and Gregory TR (2014). "Conceptual and empirical challenges of ascribing functions to transposable elements". The American Naturalist. 184: 14–24. doi:10.1086/676588.
^ "An integrated encyclopedia of DNA elements in the human genome". Nature. 489: 57–74. 2012. doi:10.1038/nature11247. ((cite journal)): Cite uses deprecated parameter |authors= (help)
^ "The Story of You: ENCODE and the human genome". YouTube. Nature/Illumina. 2012.
^ "ENCODE: Encyclopedia of DNA Elements". YouTube. European Molecular Biology Laboratories. 2012.
^ "What the Encode project tells us about the human genome and 'junk DNA'". YouTube. The Guardian. 2012.
^ Maher, Brendan. "Fighting about ENCODE and junk". Nature News Blog. Nature.
^ "Human assembly and gene annotation". Ensembl. 2022. Retrieved 2023-02-28.
^ Francis WR, Wörheide G (June 2017). "Similar Ratios of Introns to Intergenic Sequence across Animal Genomes". Genome Biology and Evolution. 9 (6): 1582–1598. doi:10.1093/gbe/evx103. PMC 5534336. PMID 28633296.
^ Galeota-Sprung B, Sniegowski P, and Ewens W (2020). "Mutational load and the functional fraction of the human genome". Genome Biology and Evolution. 12: 273–281. doi:10.1093/gbe/evaa040.
^ Gericke NM, Hagberg M (2007). "Definition of historical models of gene function and their relation to students' understanding of genetics". Science & Education. 16: 849–881. doi:10.1007/s11191-006-9064-4.
^ Germain PL, Ratti E, and Boem F (2014). "Junk or functional DNA? ENCODE and the function controversy". Biology & Philosophy. 29: 807–821. doi:10.1007/s10539-014-9441-3.
^ Gil R, and Latorre A (2012). "Factors behind junk DNA in bacteria". Genes. 3: 634–650. doi:10.3390/genes3040634.((cite journal)): CS1 maint: unflagged free DOI (link)
^ Gilbert, Walter (1978). "Why genes in pieces?". Nature. 271: 501–501. doi:10.1038/271501a0.
^ Gilbert, Walter (1985). "Genes-in-pieces revisited". Science. 228: 823–824. doi:10.1126/science.4001923.
^ Gould, Stephen Jay (2002). The Structure of Evolutionary Theory. Cambridge, Massachusetts: Belknap Press of Harvard University Press. ISBN 978-0-674-00613-3. LCCN 2001043556. OCLC 47869352.
^ Graur, Dan (2016). "Eukaryotic Genome Evolution". Molecular and Genome Evolution. Sinauer Associates, Inc.
^ Graur, Dan (2017). "Rubbish DNA: The functionless fraction of the human genome". In Saitou, Naruya (ed.). Evolution of the Human Genome I. Springer. pp. 19–60.
^ Graur D, Zheng Y, Azevedo RB (2015). "An evolutionary classification of genomic function". Genome Biology and Evolution. 7: 642–645. doi:10.1093/gbe/evv021.
^ Gregory, TR (2005). "Genome Size Evolution in Animals". The Evolution of the Genome. Elsevier. p. 3-87.
^ Gymrek M, Willems T, Guilmatre A, Zeng H, Markus B, Georgiev S, Daly MJ, Price AL, Pritchard JK, Sharp AJ, Erlich Y (2016). "Abundant contribution of short tandem repeats to gene expression variation in humans". Nature Genetics. 48: 22–29. doi:10.1038/ng.3461.
^ Halldorsson BV, Eggertsson HP, Moore KH, Hauswedell H, Eiriksson O, Ulfarsson MO, Palsson G, Hardarson MT, Oddsson A, Jensson BO, et al. (2022). "The sequences of 150,119 genomes in the UK biobank". Nature. 607: 732–740. doi:10.1038/s41586-022-04965-x.
^ Häsler J, Samuelsson T, Strub K (2007). "Useful 'junk': Alu RNAs in the human transcriptome". Cellular and Molecular Life Sciences (Submitted manuscript). 64 (14): 1793–1800. doi:10.1007/s00018-007-7084-0. PMID 17514354. S2CID 5938630.
^ Haerty W, and Ponting CP (2014). "No Gene in the Genome Makes Sense Except in the Light of Evolution". Annual Review of Genomics and Human Genetics. 25: 71–92. doi:10.1146/annurev-genom-090413-025621.
^ Hopkin K (2009). "The Evolving Definition of a Gene: With the discovery that nearly all of the genome is transcribed, the definition of a "gene" needs another revision". BioScience. 59: 928–931. doi:10.1525/bio.2009.59.11.3.
^ Hoyt SJ, Storer JM, Hartley GA, Grady PG, Gershman A, de Lima LG, Limouse C, Halabian R, Wojenski L, and Rodriguez M (2022). "From telomere to telomere: the transcriptional and epigenetic state of human repeat elements". Science. 376: 57. doi:10.1126/science.abk3112.
^ Hubé F, and Francastel C (2015). "Mammalian Introns: When the Junk Generates Molecular Diversity". International journal of molecular sciences. 16: 4429–4452. doi:10.3390/ijms16034429.((cite journal)): CS1 maint: unflagged free DOI (link)
^ Jain, HK (1980). "Incidental DNA". Nature. 288: 647–648.
^ Jensen, Roy A (2001). "Orthologs and paralogs - we need to get it right". Genome Biology. 2: interactions1002.1. doi:10.1186/gb-2001-2-8-interactions1002.((cite journal)): CS1 maint: unflagged free DOI (link)
^ Jensen TH, Jacquier A, and Libri D (2013). "Dealing with pervasive transcription". Molecular Cell. 52: 473–484. doi:10.1016/j.molcel.2013.10.032.
^ Johnson WE (2019). "Origins and evolutionary consequences of ancient endogenous retroviruses". Nature Reviews Microbiology. 17: 355–370. doi:10.1038/s41579-019-0189-2.
^ Judson HF (1996). The Eight Day of Creation (Expanded Edition). Plainview, NY (USA): Cold Spring Harbor Laboratory Press.((cite book)): CS1 maint: extra punctuation (link)
^ "Thomas Jukes letter to Francis Crick". The Francis Crick Papers, National Library of Medicine (USA). 1979. Retrieved May 17, 2022.
^ Keightley, PD (2012). "Rates and fitness consequences of new mutations in humans". Genetics. 190: 295–304. doi:10.1534/genetics.111.134668.
^ Kimura M, and Ohta T (1971). "Protein polymorphism as a phase of molecular evolution". Nature. 229: 467–469. doi:10.1038/229467a0.
^ Kirchberger PC, Schmidt ML, and Ochman H (2020). "The ingenuity of bacterial genomes". Annual review of microbiology. 74: 815–834. doi:10.1146/annurev-micro-020518-115822.
^ Kunkel, TA (2009). "Evolving views of DNA replication (in) fidelity". Cold Spring Harbor Symposia on Quantitative Biology. 74: 91–101. doi:10.1101/sqb.2009.74.027.
^ Lander ES, Linton LM, Birren B, Nusbaum C, Zody MC, Baldwin J, Devon K, Dewar K, Doyle M, FitzHugh W, et al. (2001). "Initial sequencing and analysis of the human genome". Nature. 409 (6822): 860–921. Bibcode:2001Natur.409..860L. doi:10.1038/35057062. PMID 11237011.
^ Larsen PA (2018). "Transposable elements and the multidimensional genome". Chromosome Research. 26: 1–3. doi:10.1007/s10577-018-9575-2.
^ Lewin, Benjamin (1974). "Chapter 4: Sequences of Eukaryotic DNA". Gene Expression-2: Eukaryotic Chromosomes. John Wiley & Sons.
^ Lewin, Benjamin (1974). "Chapter 5: Transcription and Processing of RNA". Gene Expression-2: Eukaryotic Chromosomes. John Wiley & Sons.
^ Lewin, Benjamin (1974). "Sequence Organization of Eukaryotic DNA: Defining the Unit of Gene Expression". Cell. 1: 107–111. doi:10.1016/0092-8674(74)90125-1.
^ Leypold NA, and Speicher MR (2021). "Evolutionary conservation in noncoding genomic regions". TRENDS in Genetics. 37: 903–918. doi:10.1016/j.tig.2021.06.007.
^ Linquist, Stefan. "Causal-role myopia and the functional investigation of junk DNA". Biology & Philosophy. 37: 1–23. doi:10.1007/s10539-022-09853-2.
^ Michael, Lynch (2016). "Mutation and Human Exceptionalism: Our Future Genetic Load". Genetics. 202: 869–875. doi:10.1534/genetics.115.180471.
^ Lynch M, Ackerman MS, Gout JF, Long H, Sung W, Thomas WK, and Foster PL (2016). "Genetic drift, selection and the evolution of the mutation rate". Nature Reviews Genetics. 17: 704–714. doi:10.1038/nrg.2016.104.
^ McHughen A (2020). DNA Demystified: Unraveling the Double Helix. New York, New York, USA: Oxford University Press.
^ Moorjani P, Gao Z, and Przeworski M (2016). "Human germline mutation and the erratic evolutionary clock". PLoS Biology. 14: e2000744. doi:10.1371/journal.pbio.2000744.((cite journal)): CS1 maint: unflagged free DOI (link)
^ Morange, Michel (2020). "Chapter 17: Split Genes and Splicing". The Black Box of Biology: A History of the Molecular Revolution. Harvard University Press.
^ Mortola E, Long M (2021). "Turning Junk into Us: How Genes Are Born". American Scientist. 109: 174–182.
^ Nachman, Michael W (2004). "Haldane and the first estimates of the human mutation rate". Journal of Genetics. 83: 231–233.
^ Neil CR, and Fairbrother WG (2019). "Intronic RNA: Ad 'junk'mediator of post-transcriptional gene regulation". Biochimica et Biophysica Acta (BBA)-Gene Regulatory Mechanisms. 1862: 194439. doi:10.1016/j.bbagrm.2019.194439.
^ Nelson PN, Hooley P, Roden D, Davari Ejtehadi H, Rylance P, Warren P, et al. (2004). "Human endogenous retroviruses: transposable elements with potential?". Clinical and Experimental Immunology. 138 (1): 1–9. doi:10.1111/j.1365-2249.2004.02592.x. PMC 1809191. PMID 15373898.
^ Nowak MA, and Waclaw B (2017). "Genes, environment, and "bad luck"". Science. 355: 1266–1267. doi:10.1126/science.aam9746.
^ O'Brian, S.J. (1973). "On estimating functional gene number in eukaryotes". Nature New Biology. 242: 52–54.
^ Ohno S (1972). "So much "junk" DNA in our genome". Brookhaven symposia in biology. 23: 366–370.
^ Ohno, S (1972). "Simplicity of Mammalian Regulatory Systems". Developmental Biology. 27: 131–136. doi:10.1016/0012-1606(72)90117-0.
^ Ohno S, and Yomo T (1991). "The grammatical rule for all DNA: junk and coding sequences". Electrophoresis. 12: 103–108. doi:10.1002/elps.1150120203.
^ Ohta, Tomoko (1973). "Slightly deleterious mutant substitutions in evolution". Nature. 246: 96–98. doi:10.1038/246096a0.
^ Ohta, Tomoko (1998). "Evolution by nearly-neutral mutations". Genetica. 102: 83–90.
^ Ohta T, and Kimura M (1971). "Functional organization of genetic material as a product of molecular evolution". Nature. 233: 118–119.
^ Orgel LE, and Crick FH (1980). "Selfish DNA: the ultimate parasite". Nature. 284: 604–607. doi:10.1038/284604a0.
^ Orgel LE, Crick F, and Sapienza C (1980). "Selfish dna". Nature. 288: 645–647. doi:10.1038/288645a.
^ Palazzo AF, and Kejiou NS (2022). "Non-Darwinian Molecular Biology". Frontiers in genetics. 13: 831068. doi:10.3389/fgene.2022.831068.((cite journal)): CS1 maint: unflagged free DOI (link)
^ Palazzo AF, and Lee ES (2015). "Non-coding RNA: what is functional and what is junk?". Frontiers in genetics. 6: 1–11. doi:10.3389/fgene.2015.00002.((cite journal)): CS1 maint: unflagged free DOI (link)
^ Pearson H (2006). "What Is a Gene?". Nature. 441: 399–401.
^ Pennisi E (2007). "DNA study forces rethink of what it means to be a gene". Science. 316: 1556–1557. doi:10.1126/science.316.5831.1556.
^ Piovesan A, Antonaros F, Vitale L, Strippoli P, Pelleri MC, Caracausi M (2019). "Human protein-coding genes and gene feature statistics in 2019". BMC research notes. 12: 315. doi:10.1186/s13104-019-4343-8.((cite journal)): CS1 maint: unflagged free DOI (link)
^ Piovesan A, Pelleri MC, Antonaros F, Strippoli P, Caracausi M, and Vitale L (2019). "On the length, weight and GC content of the human genome". BMC Research Notes. 12: 106–173. doi:10.1186/s13104-019-4137-z.((cite journal)): CS1 maint: unflagged free DOI (link)
^ Ponicsan SL, Kugel JF, Goodrich JA (2010). "Genomic gems: SINE RNAs regulate mRNA production". Current Opinion in Genetics & Development. 20 (2): 149–155. doi:10.1016/j.gde.2010.01.004. PMC 2859989. PMID 20176473.
^ Ponting CP (2017). "Biological function in the twilight zone of sequence conservation". BMC biology. 15: 1–9. doi:10.1186/s12915-017-0411-5.((cite journal)): CS1 maint: unflagged free DOI (link)
^ Ponting CP, Hardison RC (2011). "What fraction of the human genome is functional?". Genome Research. 21: 1769–1776. doi:10.1101/gr.116814.110.
^ Ponting CP, Haerty W (2022). "Genome-Wide Analysis of Human Long Noncoding RNAs: A Provocative Review". Annual Review of Genomics and Human Genetics. 23: 153–172. doi:10.1146/annurev-genom-112921-123710.
^ Ségurel L, Wyman MJ, and Przeworski M (2014). "Determinants of mutation rate variation in the human germline". Annual review of genomics and human genetics. 15: 47–70. doi:10.1146/annurev-genom-031714-125740.
^ Scally, Aylwyn (2016). "The mutation rate in human evolution and demographic inference". Current opinion in genetics & development. 41: 36–43. doi:10.1016/j.gde.2016.07.008.
^ Scally A, and Durbin R (2012). "Revising the human mutation rate: implications for understanding human evolution". Nature Reviews Genetics. 13: 745–753. doi:10.1038/nrg3295.
^ Sharp PA (1991). ""Five easy pieces."(role of RNA catalysis in cellular processes)". Science. 254: 663–664.
^ Sverdlov, Eugene (2017). "Transcribed Junk Remains Junk If It Does Not Acquire A Selected Function in Evolution". BioEssays. 39: 1700164. doi:10.1002/bies.2017001641.
^ Sweet, Amalia (2022). Requiem for a Gene: The Problem of Junk DNA for the Molecular Paradigm (MA). University of Chicago.
^ Thomas, Charles A. Jr. (1971). "The genetic organization of chromosomes". Annual review of genetics. 5: 237–256. doi:10.1146/annurev.ge.05.120171.001321.
^ Yu J, Yang Z, Kibukawa M, Paddock M, Passey DA, and Wong GK (2002). "Minimal introns are not "junk"". Genome Research. 12: 1185–1189. doi:10.1101/gr.224602.
^ van Bakel H, Nislow C, Blencowe BJ, and Hughes TR (2011). "Response to "the reality of pervasive transcription". PLoS Biology. 9: e1001102. doi:10.1371/journal.pbio.1001102.((cite journal)): CS1 maint: unflagged free DOI (link)
^ Wade JT, and Grainger DC (2018). "Spurious transcription and its impact on cell function". Transcription. 9: 182–189. doi:10.1080/21541264.2017.1381794.
^ Walters RD, Kugel JF, Goodrich JA (2009). "InvAluable junk: the cellular impact and function of Alu and B2 RNAs". IUBMB Life. 61 (8): 831–837. doi:10.1002/iub.227. PMC 4049031. PMID 19621349.
^ Watson JD (1965). Molecular Biology of the Gene. New York, NY, USA: W.A. Benjamin, Inc.
^ Wong GK, Passey DA, Huang YZ, Yang Z, and Yu J (2000). "Is "junk" DNA mostly intron DNA?". Genome Research. 10: 1672–1678. doi:10.1101/gr.148900.
^ Zhou ZX, Lujan SA, Burkholder AB, StCharles J, Dahl J, Farrell CE, Williams JS, and Kunkel TA (2021). "How asymmetric DNA replication achieves symmetrical fidelity". Nature structural & molecular biology. 28: 1020–1028. doi:10.1038/s41594-021-00691-6.
^ Colleen Shaddox (2012). "Junk no more". Yale Shcool of Medicine. Retrieved 2023-04-07. Hopefully, ENCODE will help put an end to the notion of junk DNA.

More

Junk DNA

How much of the human genome is junk?

Arguments against junk DNA

Mutation load

The C-Value Paradox

How much of the human genome is junk?

Mutation load

Notes

Transposon-related sequences

Selfish DNA

Molecular evolution

DRAFT SECTION

Junk DNA stub for Non-Coding DNA

Origin of introns (Feb. 25, 2023)

Remove Prokaryotic Cell Diagram

Are introns mostly junk?

Highly repetitive DNA

Untranslated regions

Defining the genome

Conflicting definitions 'gene'

Repeat sequences, transposons and viral elements

Protein-coding genes

Biochemical activity

Kellis et al. (2014)

References