Last universal common ancestor



The last universal common ancestor (LUCA) is the hypothesized common ancestral cell from which the three domains of life, the Bacteria, the Archaea, and the Eukarya originated. It is suggested to have been a "cellular organism that had a lipid bilayer and used DNA, RNA, and protein". The LUCA has also been defined as "a hypothetical organism ancestral to all three domains". The LUCA probably existed at latest 3.6 billion years ago, and possibly as early as 4.3 billion years ago or earlier. The nature of this point or stage of divergence remains a topic of research.

All earlier forms of life preceding this divergence and all extant organisms are generally thought to share common ancestry. On the basis of a formal statistical test, this theory of a universal common ancestry (UCA) is supported versus competing multiple-ancestry hypotheses. The first universal common ancestor (FUCA) is a hypothetical non-cellular ancestor to LUCA and other now-extinct sister lineages.

Whether the genesis of viruses falls before or after the LUCA–as well as the diversity of extant viruses and their hosts–remains a subject of investigation.

While no fossil evidence of the LUCA exists, the detailed biochemical similarity of all current life (divided into the three domains) makes its existence widely accepted by biochemists. Its characteristics can be inferred from shared features of modern genomes. These genes describe a complex life form with many co-adapted features, including transcription and translation mechanisms to convert information from DNA to mRNA to proteins.

Historical background


A phylogenetic tree directly portrays the idea of evolution by descent from a single ancestor. An early tree of life was sketched by Jean-Baptiste Lamarck in his Philosophie zoologique in 1809. Charles Darwin more famously proposed the theory of universal common descent through an evolutionary process in his book On the Origin of Species in 1859: "Therefore I should infer from analogy that probably all the organic beings which have ever lived on this earth have descended from some one primordial form, into which life was first breathed." The last sentence of the book begins with a restatement of the hypothesis: "There is grandeur in this view of life, with its several powers, having been originally breathed into a few forms or into one ..."

The term "last universal common ancestor" or "LUCA" was first used in the 1990s for such a primordial organism.

An anaerobic thermophile
In 2016, Madeline C. Weiss and colleagues genetically analyzed 6.1 million protein-coding genes and 286,514 protein clusters from sequenced prokaryotic genomes representing many phylogenetic trees, and identified 355 protein clusters that were probably common to the LUCA. The results of their analysis are highly specific, though debated. They depict LUCA as "anaerobic, CO2-fixing, H2-dependent with a Wood–Ljungdahl pathway (the reductive acetyl-coenzyme A pathway), N2-fixing and thermophilic. LUCA's biochemistry was replete with FeS clusters and radical reaction mechanisms." The cofactors also reveal "dependence upon transition metals, flavins, S-adenosyl methionine, coenzyme A, ferredoxin, molybdopterin, corrins and selenium. Its genetic code required nucleoside modifications and S-adenosylmethionine-dependent methylations." They show that methanogenic clostridia was basal, near the root of the phylogenetic tree, in the 355 protein lineages examined, and that the LUCA may therefore have inhabited an anaerobic hydrothermal vent setting in a geochemically active environment rich in H2, CO2, and iron, where ocean water interacted with hot magma beneath the ocean floor. It is even inferred that LUCA also grew from H2 and CO2 via the reverse incomplete Krebs cycle. Other metabolic pathways inferred in LUCA are the pentose phosphate pathway, glycolysis, and gluconeogenesis. Even if phylogenetic evidence may point to a hydrothermal vent environment for a thermophilic LUCA, this does not constitute evidence that the origin of life took place at a hydrothermal vent since mass extinctions may have removed previously existing branches of life.

While the gross anatomy of the LUCA can only be reconstructed with much uncertainty, its biochemical mechanisms can be described in some detail, based on the "universal" properties currently shared by all independently living organisms on Earth.



The LUCA certainly had genes and a genetic code. Its genetic material was most likely DNA, so that it lived after the RNA world. The DNA was kept double-stranded by an enzyme, DNA polymerase, which recognises the structure and directionality of DNA. The integrity of the DNA was maintained by a group of repair enzymes including DNA topoisomerase. If the genetic code was based on dual-stranded DNA, it was expressed by copying the information to single-stranded RNA. The RNA was produced by a DNA-dependent RNA polymerase using nucleotides similar to those of DNA. It had multiple DNA-binding proteins, such as histone-fold proteins. The genetic code was expressed into proteins. These were assembled from 20 free amino acids by translation of a messenger RNA via a mechanism of ribosomes, transfer RNAs, and a group of related proteins.

LUCA was likely capable of sexual interaction in the sense that adaptive gene functions were present that promoted the transfer of DNA between individuals of the population to facilitate genetic recombination. Homologous gene products that promote genetic recombination are present in bacteria, archaea and eukaryota, such as the RecA protein in bacteria, the RadA protein in archaea, and the Rad51 and Dmc1 proteins in eukaryota.

The functionality of LUCA as well as evidence for the early evolution membrane-dependent biological systems together suggest that LUCA had cellularity and cell membranes. As for the cell's gross structure, it contained a water-based cytoplasm effectively enclosed by a lipid bilayer membrane; it was capable of reproducing by cell division. It tended to exclude sodium and concentrate potassium by means of specific ion transporters (or ion pumps). The cell multiplied by duplicating all its contents followed by cellular division. The cell used chemiosmosis to produce energy. It also reduced CO2 and oxidized H2 (methanogenesis or acetogenesis) via acetyl-thioesters.

By phylogenetic bracketing, analysis of the presumed LUCA's offspring groups, LUCA appears to have been a small, single-celled organism. It likely had a ring-shaped coil of DNA floating freely within the cell. Morphologically, it would likely not have stood out within a mixed population of small modern-day bacteria. The originator of the three-domain system, Carl Woese, stated that in its genetic machinery, the LUCA would have been a "simpler, more rudimentary entity than the individual ancestors that spawned the three [domains] (and their descendants)".



An alternative to the search for "universal" traits is to use genome analysis to identify phylogenetically ancient genes. This gives a picture of a LUCA that could live in a geochemically harsh environment and is like modern prokaryotes. Analysis of biochemical pathways implies the same sort of chemistry as does phylogenetic analysis. Weiss and colleagues write that "Experiments ... demonstrate that ... acetyl-CoA pathway [chemicals used in anaerobic respiration] formate, methanol, acetyl moieties, and even pyruvate arise spontaneously ... from CO2, native metals, and water", a combination present in hydrothermal vents.

An experiment shows that Zn2+, Cr3+, and Fe can promote 6 of the 11 reactions of an ancient anabolic pathway called the reverse Krebs cycle in acidic conditions which implies that LUCA might have inhabited either hydrothermal vents or acidic metal-rich hydrothermal fields.

Because both bacteria and archaea have differences in the structure of phospholipids and cell wall, ion pumping, most proteins involved in DNA replication, and glycolysis, it is inferred that LUCA had a permeable membrane without an ion pump. The emergence of Na+/H+ antiporters likely lead to the evolution of impermeable membranes present in eukaryotes, archaea, and bacteria. It is stated that "The late and independent evolution of glycolysis but not gluconeogenesis is entirely consistent with LUCA being powered by natural proton gradients across leaky membranes. Several discordant traits are likely to be linked to the late evolution of cell membranes, notably the cell wall, whose synthesis depends on the membrane and DNA replication". Although LUCA likely had DNA, it is unknown if it could replicate DNA and is suggested to "might just have been a chemically stable repository for RNA-based replication". It is likely that the permeable membrane of LUCA was composed of archaeal lipids (isoprenoids) and bacterial lipids (fatty acids). Isoprenoids would have enhanced stabilization of LUCA's membrane in the surrounding extreme habitat. Nick Lane and coauthors state that "The advantages and disadvantages of incorporating isoprenoids into cell membranes in different microenvironments may have driven membrane divergence, with the later biosynthesis of phospholipids giving rise to the unique G1P and G3P headgroups of archaea and bacteria respectively. If so, the properties conferred by membrane isoprenoids place the lipid divide as early as the origin of life".

UV light between 200-280 nm (at the time, unprotected by the ozone layer) would have been damaging to nucleotides at the surface, as it can cause mutations or transcription errors and ultimately damaging consequences for organisms at the cellular level. However, it is likely that LUCA existed in a UV environment because of the prevalence of photolyase across the tree of life. Photolyase uses UV to drive photoreactivation, a mechanism that works to repair damage from radiation, caused by UV.

Alternative interpretations
Some other researchers have challenged Weiss et al.'s 2016 conclusions. Sarah Berkemer and Shawn McGlynn argue that Weiss et al. undersampled the families of proteins, so that the phylogenetic trees were not complete and failed to describe the evolution of proteins correctly. There are two risks in attempting to attribute LUCA's environment from near-universal gene distribution (as in Weiss et al. 2016). On the one hand, it risks misattributing convergence or horizontal gene transfer events to vertical descent; on the other hand, it risks misattributing potential LUCA gene families as horizontal gene transfer events.

A phylogenomic and geochemical analysis of a set of proteins that probably traced to the LUCA show that it had K+-dependent GTPases and the ionic composition and concentration of its intracellular fluid was seemingly high K+/Na+ ratio,, Fe2+, CO2+, Ni2+, Mg2+, Mn2+, Zn2+, pyrophosphate, and which would imply a terrestrial hot spring habitat. It possibly had a phosphate-based metabolism. Further, these proteins were unrelated to autotrophy (the ability of an organism to create its own organic matter), suggesting that the LUCA had a heterotrophic lifestyle (consuming organic matter) and that its growth was dependent on organic matter produced by the physical environment. Nick Lane argues that Na+/H+ antiporters could readily explain the low concentration of Na+ in the LUCA and its descendants.

The presence of the energy-handling enzymes CODH/acetyl-coenzyme A synthase in LUCA could be compatible not only with being an autotroph but also with life as a mixotroph or heterotroph. Weiss et al. 2018 reply that no enzyme defines a trophic lifestyle, and that heterotrophs evolved from autotrophs.

Evidence that LUCA was mesophilic
Several lines of evidence now suggest that LUCA was non-thermophilic.

The content of G + C nucleotide pairs (compared to the occurrence of A + T pairs) can indicate an organism's thermal optimum as they are more thermally stable due to an additional hydrogen bond. As a result they occur more frequently in the rRNA of thermophiles; however this is not seen in LUCA's reconstructed rRNA.

The identification of thermophilic genes in the LUCA has been criticized, as they may instead represent genes that evolved later in archaea or bacteria, then migrated between these via horizontal gene transfer, as in Woese's 1998 hypothesis. LUCA could have been a mesophile that fixed CO2 and relied on H2, and lived close to hydrothermal vents.

Further evidence that LUCA was mesophilic comes from the amino acid composition of its proteins. The abundance of I, V, Y, W, R, E, and L amino acids (denoted IVYWREL) in an organism's proteins is correlated with its optimal growth temperature. According to phylogentic analysis, the IVYWREL content of LUCA's proteins suggests its ideal temperature was below 50°C.

Finally, evidence that bacteria and archaea both independently underwent phases of increased and subsequently decreased thermo-tolerance suggests a dramatic post-LUCA climate shift that affected both populations and would explain the seeming genetic pervasiveness of thermo-tolerant genetics.

Age
Studies from 2000 to 2018 have suggested an increasingly ancient time for the LUCA. In 2000, estimates of the LUCA's age ranged from 3.5 to 3.8 billion years ago in the Paleoarchean, a few hundred million years before the earliest fossil evidence of life, for which candidates range in age from 3.48 to 4.28 billion years ago. This placed the origin of the first forms of life shortly after the Late Heavy Bombardment which was thought to have repeatedly sterilized Earth's surface. However, a 2018 study by Holly Betts and colleagues applied a molecular clock model to the genomic and fossil record (102 species, 29 common protein-coding genes, mostly ribosomal), concluding that LUCA preceded the Late Heavy Bombardment. They assumed that there was no sterilizing event after the Moon-forming event, which they dated as 4.520 billion years ago, and concluded that the most likely date for LUCA was within 50 million years of that.

Root of the tree of life


In 1990, a novel concept of the tree of life was presented, dividing the living world into three stems, classified as the domains Bacteria, Archaea, Eukarya. It is the first tree founded exclusively on molecular phylogenetics, and which includes the evolution of microorganisms. It has been called a "universal phylogenetic tree in rooted form". This tree and its rooting became the subject of debate.

In the meantime, numerous modifications of this tree, mainly concerning the role and importance of horizontal gene transfer for its rooting and early ramifications have been suggested (e.g. ). Since heredity occurs both vertically and horizontally, the tree of life may have been more weblike or netlike in its early phase and more treelike when it grew three-stemmed. Presumably horizontal gene transfer has decreased with growing cell stability.

A modified version of the tree, based on several molecular studies, has its root between a monophyletic domain Bacteria and a clade formed by Archaea and Eukaryota. A small minority of studies place the root in the domain bacteria, in the phylum Bacillota, or state that the phylum Chloroflexota (formerly Chloroflexi) is basal to a clade with Archaea and Eukaryotes and the rest of bacteria (as proposed by Thomas Cavalier-Smith). Metagenomic analyses recover a two-domain system with the domains Archaea and Bacteria; in this view of the tree of life, Eukaryotes are derived from Archaea. With the later gene pool of LUCA's descendants, sharing a common framework of the AT/GC rule and the standard twenty amino acids, horizontal gene transfer would have become feasible and could have been common.

The nature of LUCA remains disputed. In 1994, on the basis of primordial metabolism (sensu Wächtershäuser), Otto Kandler proposed a successive divergence of the three domains of life from a multiphenotypical population of pre-cells, reached by gradual evolutionary improvements (cellularization). These phenotypically diverse pre-cells were metabolising, self-reproducing entities exhibiting frequent mutual exchange of genetic information. Thus, in this scenario there was no "first cell". It may explain the unity and, at the same time, the partition into three lines (the three domains) of life. Kandler's pre-cell theory is supported by Wächtershäuser. In 1998, Carl Woese, based on the RNA world concept, proposed that no individual organism could be considered a LUCA, and that the genetic heritage of all modern organisms derived through horizontal gene transfer among an ancient community of organisms. Other authors concur that there was a "complex collective genome" at the time of the LUCA, and that horizontal gene transfer was important in the evolution of later groups; Nicolas Glansdorff states that LUCA "was in a metabolically and morphologically heterogeneous community, constantly shuffling around genetic material" and "remained an evolutionary entity, though loosely defined and constantly changing, as long as this promiscuity lasted."

The theory of a universal common ancestry of life is widely accepted. In 2010, based on "the vast array of molecular sequences now available from all domains of life," D. L. Theobald published a "formal test" of universal common ancestry (UCA). This deals with the common descent of all extant terrestrial organisms, each being a genealogical descendant of a single species from the distant past. His formal test favoured the existence of a universal common ancestry over a wide class of alternative hypotheses that included horizontal gene transfer. Basic biochemical principles imply that all organisms do have a common ancestry.

A proposed, earlier, non-cellular ancestor to LUCA is the First universal common ancestor (FUCA). FUCA would therefore be the ancestor to every modern cell as well as ancient, now-extinct cellular lineages not descendant of LUCA. FUCA is assumed to have had other descendants than LUCA, none of which have modern descendants. Some genes of these ancient now-extinct cell lineages are thought to have been horizontally transferred into the genome of early descendants of LUCA.

LUCA and viruses
The origin of viruses remains disputed. Since viruses need host cells for their replication, it is likely that they emerged after the formation of cells. Viruses may even have multiple origins and different types of viruses may have evolved independently over the history of life. There are different hypotheses for the origins of viruses, for instance an early viral origin from the RNA world or a later viral origin from selfish DNA.

Based on how viruses are currently distributed across the bacteria and archaea, the LUCA is suspected of having been prey to multiple viruses, ancestral to those that now have those two domains as their hosts. Furthermore, extensive virus evolution seems to have preceded the LUCA, since the jelly-roll structure of capsid proteins is shared by RNA and DNA viruses across all three domains of life. LUCA's viruses were probably mainly dsDNA viruses in the groups called Duplodnaviria and Varidnaviria. Two other single-stranded DNA virus groups within the Monodnaviria, the Microviridae and the Tubulavirales, likely infected the last bacterial common ancestor. The last archaeal common ancestor was probably host to spindle-shaped viruses. All of these could well have affected the LUCA, in which case each must since have been lost in the host domain where it is no longer extant. By contrast, RNA viruses do not appear to have been important parasites of LUCA, even though straightforward thinking might have envisaged viruses as beginning with RNA viruses directly derived from an RNA world. Instead, by the time the LUCA lived, RNA viruses had probably already been out-competed by DNA viruses.

LUCA might have been the ancestor to some viruses, as it might have had at least two descendants: LUCELLA, the Last Universal Cellular Ancestor, the ancestor to all cells, and the archaic virocell ancestor, the ancestor to large-to-medium-sized DNA viruses. Viruses might have evolved before LUCA but after the First universal common ancestor (FUCA), according to the reduction hypothesis, where giant viruses evolved from primordial cells that became parasitic.