Streamlining theory

Genomic streamlining is a theory in evolutionary biology and microbial ecology that suggests that there is a reproductive benefit to prokaryotes having a smaller genome size with less non-coding DNA and fewer non-essential genes. There is a lot of variation in prokaryotic genome size, with the smallest free-living cell's genome being roughly ten times smaller than the largest prokaryote. Two of the free-living bacterial taxa with the smallest genomes are Prochlorococcus and Pelagibacter ubique, both highly abundant marine bacteria commonly found in oligotrophic regions. Similar reduced genomes have been found in uncultured marine bacteria, suggesting that genomic streamlining is a common feature of bacterioplankton. This theory is typically used with reference to free-living organisms in oligotrophic environments.

Overview
Genome streamlining theory states that certain prokaryotic genomes tend to be small in size in comparison to other prokaryotes, and all eukaryotes, due to selection against the retention of non-coding DNA. The known advantages of small genome size include faster genome replication for cell division, fewer nutrient requirements, and easier co-regulation of multiple related genes, because gene density typically increases with decreased genome size. This means that an organism with a smaller genome is likely to be more successful, or have higher fitness, than one hindered by excessive amounts of unnecessary DNA, leading to selection for smaller genome sizes.

Some mechanisms that are thought to underlie genome streamlining include deletion bias and purifying selection. Deletion bias is the phenomenon in bacterial genomes where the rate of DNA loss is naturally higher than the rate of DNA acquisition. This is a passive process that simply results from the difference in these two rates. Purifying selection is the process by which extraneous genes are selected against, making organisms lacking this genetic material more successful by effectively reducing their genome size. Genes and non-coding DNA segments that are less crucial for an organism survival will be more likely to be lost over time.

This selective pressure is stronger in large marine prokaryotic populations, because intra-species competition favours fast, efficient and inexpensive replication. This is because large population sizes increase competition among members of the same species, and thus increases selective pressure and causes the reduction in genome size to occur more readily among organisms of large population sizes, like bacteria. This may explain why genome streamlining seems to be particularly prevalent in prokaryotic organisms, as they tend to have larger population sizes than eukaryotes.

It has also been proposed that having a smaller genome can help minimize overall cell size, which increases a prokaryotes surface-area to volume ratio. A higher surface-area to volume ratio allows for more nutrient uptake proportional to their size, which allows them to outcompete other larger organisms for nutrients. This phenomenon has been noted particularly in nutrient depleted waters.

Genomic signatures
Genomic analysis of streamlined organisms have shown that low GC content, low percentage of non-coding DNA, and a low fraction of genes encoding for cytoplasmic membrane proteins, periplasmic proteins, transcriptionally related proteins, and signal transduction pathways are all characteristic of free-living streamlined prokaryotic organisms. Oftentimes, highly streamlined organisms are difficult to isolate by culturing in a laboratory (SAR11 being a central example).

Pelagibacter ubique (SAR11)
Pelagibacter ubique are members of the SAR11 clade, a heterotrophic marine group which are found throughout the oceans and are rather common. These microbes have the smallest genome and encode the smallest number of Open Reading Frames of any known non-sessile microorganism. P. ubique has complete biosynthetic pathways and all necessary enzymes for the synthesis of 20 amino acids and only lack a few cofactors despite the genome's small size. The genome size for this microorganism is achieved by lack of, "pseudogenes, introns, transposons, extrachromosomal elements, or inteins". The genome also contains fewer paralogs compared to other members of the same clade and the shortest intergenic spacers for any living cell. In these organisms, unusual nutrient requirements were found due to the streamlining selection and gene loss when selection occurred for more efficient resource utilization in oceans with limited nutrients for uptake. These observations indicate that some microbes may be difficult to grow in a laboratory setting because of unusual nutrient requirements.

Prochlorococcus
Prochlorococcus is one of the dominant cyanobacteria and is a main participant in primary production in oligotrophic waters. It is the smallest and most abundant photosynthetic organism recorded on Earth. As a cyanobacteria, they have an incredible ability to adapt to environments with very poor nutrient availability, as they maintain their energy from light. The nitrogen assimilation pathway in this organism has been significantly modified to adapt to the nutritional limitations of the organisms’ habitats. These adaptations led to the removal of key enzymes from the genome, such as nitrate reductase, nitrite reductase, and often urease. Unlike some cyanobacterial counterparts, Prochlorococcus is not able to fix atmospheric nitrogen (N 2 ). The only nitrogen sources found to be used by this species are ammonia, which is incorporated into glutamate via the enzyme glutamine synthetase and uses less energy compared to nitrate usage, and in certain species, urea. Moreover, metabolic regulation systems of Prochlorococcus were found to be greatly simplified.

Nitrogen-fixing marine cyanobacteria (UCYN-A)
Nitrogen-fixing marine cyanobacteria are known to support oxygen production in oceans by fixing inorganic nitrogen using the enzyme nitrogenase. A special subset of these bacteria, UCYN-A, was found to lack the photosystem II complex usually used in photosynthesis and that it lacks a number of major metabolic pathways but is still capable of using the electron transport chain to generate energy from a light source. Furthermore, anabolic enzymes needed for creating amino acids such as valine, leucine and isoleucine are missing, as well as some which lead to phenylalanine, tyrosine and tryptophan biosynthesis.

This organism seems to be an obligate photoheterotroph that uses carbon substrates for energy production and some biosynthetic materials for biosynthesis. It was discovered that UCYN-A developed a reduced genome of only 1.44 Megabases that is smaller but similar in structure to that of chloroplasts. In comparison with related species such as Crocosphaera watsonii and Cyanothece sp., which employ genomes which range in length from 5.46 to 6.24 megabases, the UCYN-A genome is much smaller. The compacted genome is a single, circular chromosome with “1,214 identified protein-coding regions”. The genome of UCYN-A is also highly conserved ( >97% nucleotide identity) across ocean waters, which is atypical of ocean microbes. The lack of UCYN-A genome diversity, presence of nitrogenase and hydrogenase enzymes for the TCA cycle, reduced genome size and coding efficiency of the DNA suggest that this microorganism may have symbiotic lifestyle and live in close association with a host. However, the true lifestyle of this microbe remains unknown.

Bacterial symbionts, commensals, parasites, and pathogens
Bacterial symbionts, commensals, parasites, and pathogens often have even smaller genomes and fewer genes than free-living organisms, and non-pathogenic bacteria. They reduce their "core" metabolic repertoire, making them more dependent on their host and environment. Their genome reduction occurs by different evolutionary mechanisms than those of streamlined free-living organisms. Pathogenic organisms are thought to undergo genome reduction due to genetic drift, rather than purifying selection. Genetic drift is caused by small and effective populations within a microbial community, rather than large and dominating populations. In this case, DNA mutations happen by chance, and thus often lead to maladaptive genome degradation and lower overall fitness. Rather than losing non-coding DNA regions or extraneous genes to increase fitness during replication, they lose certain "core" metabolic genes that may now be supplemented by their host, symbiont, or environment. Since their genome reduction is less dependent on fitness, pseudogenes are frequent in these organisms. They also typically undergo low rates of horizontal gene transfer (HGT).

Viruses
Viral genomes resemble prokaryotic genomes in that they have very few non-coding regions. They are, however, significantly smaller than prokaryotic genomes. While viruses are obligate intracellular parasites, viral genomes are considered streamlined due to the strong purifying selection that occurs when the virus has successfully infected a host. During the initial phase of an infection, there is a large bottleneck for the virus population which allows for more genetic diversity, but due to the rapid replication of these viruses, the population size is restored quickly and the diversity within the population is reduced.

RNA viruses in particular are known to have exceptionally small genomes. This is at least in part due to the fact that they have overlapping genes. By reducing their genome size, they increase their fitness due to faster replication. The virus will then be able to increase population size more rapidly with faster replication rates.

Eukaryotes - birds
Genomic streamlining has been used to explain certain eukaryotic genome sizes as well, particularly bird genomes. Larger genomes require a larger nucleus, which typically translates to a larger cell size. For this reason, many bird genomes have also been under selective pressure to decrease in size. Flying with a larger mass due to larger cells is more energetically expensive than with a smaller mass.