Genotype

The genotype of an organism is its complete set of genetic material. Genotype can also be used to refer to the alleles or variants an individual carries in a particular gene or genetic location. The number of alleles an individual can have in a specific gene depends on the number of copies of each chromosome found in that species, also referred to as ploidy. In diploid species like humans, two full sets of chromosomes are present, meaning each individual has two alleles for any given gene. If both alleles are the same, the genotype is referred to as homozygous. If the alleles are different, the genotype is referred to as heterozygous.

Genotype contributes to phenotype, the observable traits and characteristics in an individual or organism. The degree to which genotype affects phenotype depends on the trait. For example, the petal color in a pea plant is exclusively determined by genotype. The petals can be purple or white depending on the alleles present in the pea plant. However, other traits are only partially influenced by genotype. These traits are often called complex traits because they are influenced by additional factors, such as environmental and epigenetic factors. Not all individuals with the same genotype look or act the same way because appearance and behavior are modified by environmental and growing conditions. Likewise, not all organisms that look alike necessarily have the same genotype.

The term genotype was coined by the Danish botanist Wilhelm Johannsen in 1903.

Phenotype
Any given gene will usually cause an observable change in an organism, known as the phenotype. The terms genotype and phenotype are distinct for at least two reasons:
 * To distinguish the source of an observer's knowledge (one can know about genotype by observing DNA; one can know about phenotype by observing outward appearance of an organism).
 * Genotype and phenotype are not always directly correlated. Some genes only express a given phenotype in certain environmental conditions.  Conversely, some phenotypes could be the result of multiple genotypes. The genotype is commonly mixed up with the phenotype which describes the end result of both the genetic and the environmental factors giving the observed expression (e.g. blue eyes, hair color, or various hereditary diseases).

A simple example to illustrate genotype as distinct from phenotype is the flower colour in pea plants (see Gregor Mendel). There are three available genotypes, PP (homozygous dominant), Pp (heterozygous), and pp (homozygous recessive). All three have different genotypes but the first two have the same phenotype (purple) as distinct from the third (white).

A more technical example to illustrate genotype is the single-nucleotide polymorphism or SNP. A SNP occurs when corresponding sequences of DNA from different individuals differ at one DNA base, for example where the sequence AAGCCTA changes to AAGCTTA. This contains two alleles : C and T. SNPs typically have three genotypes, denoted generically AA Aa and aa. In the example above, the three genotypes would be CC, CT and TT. Other types of genetic marker, such as microsatellites, can have more than two alleles, and thus many different genotypes.

Penetrance is the proportion of individuals showing a specified genotype in their phenotype under a given set of environmental conditions.

Mendelian inheritance
Traits that are determined exclusively by genotype are typically inherited in a Mendelian pattern. These laws of inheritance were described extensively by Gregor Mendel, who performed experiments with pea plants to determine how traits were passed on from generation to generation. He studied phenotypes that were easily observed, such as plant height, petal color, or seed shape. He was able to observe that if he crossed two true-breeding plants with distinct phenotypes, all the offspring would have the same phenotype. For example, when he crossed a tall plant with a short plant, all the resulting plants would be tall. However, when he self-fertilized the plants that resulted, about 1/4 of the second generation would be short. He concluded that some traits were dominant, such as tall height, and others were recessive, like short height. Though Mendel was not aware at the time, each phenotype he studied was controlled by a single gene with two alleles. In the case of plant height, one allele caused the plants to be tall, and the other caused plants to be short. When the tall allele was present, the plant would be tall, even if the plant was heterozygous. In order for the plant to be short, it had to be homozygous for the recessive allele.

One way this can be illustrated is using a Punnett square. In a Punnett square, the genotypes of the parents are placed on the outside. An uppercase letter is typically used to represent the dominant allele, and a lowercase letter is used to represent the recessive allele. The possible genotypes of the offspring can then be determined by combining the parent genotypes. In the example on the right, both parents are heterozygous, with a genotype of Bb. The offspring can inherit a dominant allele from each parent, making them homozygous with a genotype of BB. The offspring can inherit a dominant allele from one parent and a recessive allele from the other parent, making them heterozygous with a genotype of Bb. Finally, the offspring could inherit a recessive allele from each parent, making them homozygous with a genotype of bb. Plants with the BB and Bb genotypes will look the same, since the B allele is dominant. The plant with the bb genotype will have the recessive trait.

These inheritance patterns can also be applied to hereditary diseases or conditions in humans or animals. Some conditions are inherited in an autosomal dominant pattern, meaning individuals with the condition typically have an affected parent as well. A classic pedigree for an autosomal dominant condition shows affected individuals in every generation. Other conditions are inherited in an autosomal recessive pattern, where affected individuals do not typically have an affected parent. Since each parent must have a copy of the recessive allele in order to have an affected offspring, the parents are referred to as carriers of the condition.

In autosomal conditions, the sex of the offspring does not play a role in their risk of being affected. In sex-linked conditions, the sex of the offspring affects their chances of having the condition. In humans, females inherit two X chromosomes, one from each parent, while males inherit an X chromosome from their mother and a Y chromosome from their father. X-linked dominant conditions can be distinguished from autosomal dominant conditions in pedigrees by the lack of transmission from fathers to sons, since affected fathers only pass their X chromosome to their daughters. In X-linked recessive conditions, males are typically affected more commonly because they are hemizygous, with only one X chromosome. In females, the presence of a second X chromosome will prevent the condition from appearing. Females are therefore carriers of the condition and can pass the trait on to their sons.

Mendelian patterns of inheritance can be complicated by additional factors. Some diseases show incomplete penetrance, meaning not all individuals with the disease-causing allele develop signs or symptoms of the disease. Penetrance can also be age-dependent, meaning signs or symptoms of disease are not visible until later in life. For example, Huntington disease is an autosomal dominant condition, but up to 25% of individuals with the affected genotype will not develop symptoms until after age 50. Another factor that can complicate Mendelian inheritance patterns is variable expressivity, in which individuals with the same genotype show different signs or symptoms of disease. For example, individuals with polydactyly can have a variable number of extra digits.

Non-Mendelian inheritance
Many traits are not inherited in a Mendelian fashion, but have more complex patterns of inheritance.

Incomplete dominance
For some traits, neither allele is completely dominant. Heterozygotes often have an appearance somewhere in between those of homozygotes. For example, a cross between true-breeding red and white Mirabilis jalapa results in pink flowers.

Codominance
Codominance refers to traits in which both alleles are expressed in the offspring in approximately equal amounts. A classic example is the ABO blood group system in humans, where both the A and B alleles are expressed when they are present. Individuals with the AB genotype have both A and B proteins expressed on their red blood cells.

Epistasis
Epistasis is when the phenotype of one gene is affected by one or more other genes. This is often through some sort of masking effect of one gene on the other. For example, the "A" gene codes for hair color, a dominant "A" allele codes for brown hair, and a recessive "a" allele codes for blonde hair, but a separate "B" gene controls hair growth, and a recessive "b" allele causes baldness. If the individual has the BB or Bb genotype, then they produce hair and the hair color phenotype can be observed, but if the individual has a bb genotype, then the person is bald which masks the A gene entirely.

Polygenic traits
A polygenic trait is one whose phenotype is dependent on the additive effects of multiple genes. The contributions of each of these genes are typically small and add up to a final phenotype with a large amount of variation. A well studied example of this is the number of sensory bristles on a fly. These types of additive effects is also the explanation for the amount of variation in human eye color.

Genotyping
Genotyping refers to the method used to determine an individual's genotype. There are a variety of techniques that can be used to assess genotype. The genotyping method typically depends on what information is being sought. Many techniques initially require amplification of the DNA sample, which is commonly done using PCR.

Some techniques are designed to investigate specific SNPs or alleles in a particular gene or set of genes, such as whether an individual is a carrier for a particular condition. This can be done via a variety of techniques, including allele specific oligonucleotide (ASO) probes or DNA sequencing. Tools such as multiplex ligation-dependent probe amplification can also be used to look for duplications or deletions of genes or gene sections. Other techniques are meant to assess a large number of SNPs across the genome, such as SNP arrays. This type of technology is commonly used for genome-wide association studies.

Large-scale techniques to assess the entire genome are also available. This includes karyotyping to determine the number of chromosomes an individual has and chromosomal microarrays to assess for large duplications or deletions in the chromosome. More detailed information can be determined using exome sequencing, which provides the specific sequence of all DNA in the coding region of the genome, or whole genome sequencing, which sequences the entire genome including non-coding regions.

Genotype encoding
In linear models, the genotypes can be encoded in different manners. Let us consider a biallelic locus with two possible alleles, encoded by $A$ and $$a$$. We consider $$a$$ to correspond to the dominant allele to the reference allele $A$. The following table details the different encoding.