Ronald Fisher

Sir Ronald Aylmer Fisher (17 February 1890 – 29 July 1962) was a British polymath who was active as a mathematician, statistician, biologist, geneticist, and academic. For his work in statistics, he has been described as "a genius who almost single-handedly created the foundations for modern statistical science" and "the single most important figure in 20th century statistics". In genetics, Fisher was the one to most comprehensively combine the ideas of Gregor Mendel and Charles Darwin, as his work used mathematics to combine Mendelian genetics and natural selection; this contributed to the revival of Darwinism in the early 20th-century revision of the theory of evolution known as the modern synthesis. For his contributions to biology, Richard Dawkins declared Fisher to be the greatest of Darwin's successors. He is also considered one of the founding fathers of Neo-Darwinism. According to statistician Jeffrey T. Leek, Fisher is the most influential scientist of all time based on the number of citations of his contributions.

From 1919, he worked at the Rothamsted Experimental Station for 14 years; there, he analyzed its immense body of data from crop experiments since the 1840s, and developed the analysis of variance (ANOVA). He established his reputation there in the following years as a biostatistician.

Fisher founded quantitative genetics, and together with J. B. S. Haldane and Sewall Wright, is known as one of the three principal founders of population genetics. Fisher outlined Fisher's principle, the Fisherian runaway and sexy son hypothesis theories of sexual selection, and parental investment, and also pioneered linkage analysis and gene mapping. On the other hand, as the founder of modern statistics, Fisher made countless contributions, including creating the modern method of maximum likelihood and deriving the properties of maximum likelihood estimators, fiducial inference, the derivation of various sampling distributions, founding the principles of the design of experiments, and much more. Fisher's famous 1921 paper alone has been described as "arguably the most influential article" on mathematical statistics in the twentieth century, and equivalent to "Darwin on evolutionary biology, Gauss on number theory, Kolmogorov on probability, and Adam Smith on economics", and is credited with completely revolutionizing statistics. Due to his influence and numerous fundamental contributions, he has been described as the "most original evolutionary biologist of the twentieth century" and as the "greatest statistician of all time". His work is further credited with later initiating the Human Genome Project. Fisher also contributed to the understanding of human blood groups.

Fisher has also been praised as a pioneer of the Information Age. His work on a mathematical theory of information ran parallel to the work of Claude Shannon and Norbert Wiener, though based on statistical theory. A concept to have come out of his work is that of Fisher information.

Fisher held strong views on race and eugenics, insisting on racial differences. Although he was clearly a eugenicist, there is some debate as to whether Fisher supported scientific racism. He was the Galton Professor of Eugenics at University College London and editor of the Annals of Eugenics.

Early life and education


Fisher was born in East Finchley in London, England, into a middle-class household; his father, George, was a successful partner in Robinson & Fisher, auctioneers and fine art dealers. He was one of twins, the other being stillborn, and grew up the youngest, with three sisters and one brother. From 1896 until 1904 they lived at Inverforth House in London, where English Heritage installed a blue plaque in 2002, before moving to Streatham. His mother, Kate, died from acute peritonitis when he was 14, and his father lost his business 18 months later.

Lifelong poor eyesight caused his rejection by the British Army for World War I, but it also developed his ability to visualize problems in geometrical terms rather than by writing out mathematical solutions or proofs. He entered Harrow School at age 14 and won the school's Neeld Medal in mathematics. In 1909, he won a scholarship to study Mathematics at Gonville and Caius College, Cambridge. In 1912, he gained a First in Mathematics. In 1915 he published a paper, The evolution of sexual preference, on sexual selection and mate choice.

Career
During 1913–1919, Fisher worked as a statistician in the City of London and taught physics and maths at a sequence of public schools, at the Thames Nautical Training College, and at Bradfield College. There he settled with his new bride, Eileen Guinness, with whom he had two sons and six daughters.

In 1918 he published "The Correlation Between Relatives on the Supposition of Mendelian Inheritance", in which he introduced the term variance and proposed its formal analysis. He put forward a genetics conceptual model showing that continuous variation amongst phenotypic traits measured by biostatisticians could be produced by the combined action of many discrete genes and thus be the result of Mendelian inheritance. This was the first step towards establishing population genetics and quantitative genetics, which demonstrated that natural selection could change allele frequencies in a population, reconciling its discontinuous nature with gradual evolution. Joan Box, Fisher's biographer and daughter, says that Fisher had resolved this problem already in 1911. Today, Fisher's additive model is still regularly used in genome-wide association studies.

Rothamsted Experimental Station, 1919–1933
In 1919, he began working at the Rothamsted Experimental Station in Hertfordshire, where he would remain for 14 years. He had been offered a position at the Galton Laboratory in University College London led by Karl Pearson, but instead accepted a temporary role at Rothamsted to investigate the possibility of analysing the vast amount of crop data accumulated since 1842 from the "Classical Field Experiments". He analysed the data recorded over many years, and in 1921 published Studies in Crop Variation I, his first application of the analysis of variance (ANOVA). Studies in Crop Variation II, written with his first assistant, Winifred Mackenzie, became the model for later ANOVA work. Later assistants who mastered and propagated Fisher's methods were Joseph Oscar Irwin, John Wishart and Frank Yates. Between 1912 and 1922 Fisher recommended, analyzed (with heuristic proofs) and vastly popularized the maximum likelihood estimation method.



Fisher's 1924 article On a distribution yielding the error functions of several well known statistics presented Pearson's chi-squared test and William Gosset's Student's t-distribution in the same framework as the Gaussian distribution, and is where he developed Fisher's z-distribution, a new statistical method commonly used decades later as the F-distribution. He pioneered the principles of the design of experiments and the statistics of small samples and the analysis of real data.

In 1925 he published Statistical Methods for Research Workers, one of the 20th century's most influential books on statistical methods. Fisher's method is a technique for data fusion or "meta-analysis" (analysis of analyses). Fisher formalized and popularized use of the p-value in statistics, which plays a central role in his approach. Fisher proposes the level p=0.05, or a 1 in 20 chance of being exceeded by chance, as a limit for statistical significance, and applies this to a normal distribution (as a two-tailed test), yielding the rule of two standard deviations (on a normal distribution) for statistical significance. The significance of 1.96, the approximate value of the 97.5 percentile point of the normal distribution used in probability and statistics, also originated in this book. "The value for which P = 0.05, or 1 in 20, is 1.96 or nearly 2; it is convenient to take this point as a limit in judging whether a deviation is to be considered significant or not." In Table 1 of the work, he gave the more precise value 1.959964.
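As a rough check on the figures quoted above, the following sketch recovers the two-tailed critical value for p = 0.05 under a standard normal distribution using Python's standard library. The function names are illustrative and do not come from Fisher's book.

```python
from statistics import NormalDist


def two_tailed_critical_value(p: float) -> float:
    """Return z such that a standard normal deviate exceeds |z| with probability p."""
    # A two-tailed test splits p equally between the two tails,
    # so the critical point is the (1 - p/2) quantile.
    return NormalDist().inv_cdf(1.0 - p / 2.0)


def two_tailed_p_value(z: float) -> float:
    """Return the probability that a standard normal deviate is at least |z| in size."""
    return 2.0 * (1.0 - NormalDist().cdf(abs(z)))


z_crit = two_tailed_critical_value(0.05)
print(round(z_crit, 6))  # 1.959964
```

With p = 0.05 each tail holds 0.025, which is why the critical point is the 97.5 percentile rather than the 95th.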

In 1928, Fisher was the first to use diffusion equations to attempt to calculate the distribution of allele frequencies and the estimation of genetic linkage by maximum likelihood methods among populations.

In 1930, The Genetical Theory of Natural Selection was first published by Clarendon Press and is dedicated to Leonard Darwin. A core work of the neo-Darwinian modern evolutionary synthesis, it helped define population genetics, which Fisher founded alongside Sewall Wright and J. B. S. Haldane, and revived Darwin's neglected idea of sexual selection.

One of Fisher's favourite aphorisms was "Natural selection is a mechanism for generating an exceedingly high degree of improbability."

Fisher's fame grew, and he began to travel and lecture widely. In 1931, he spent six weeks at the Statistical Laboratory at Iowa State College where he gave three lectures per week, and met many American statisticians, including George W. Snedecor. He returned there again in 1936.

University College London, 1933–1943
In 1933, Fisher became the head of the Department of Eugenics at University College London. In 1934, he became editor of the Annals of Eugenics (now called Annals of Human Genetics).

In 1935, he published The Design of Experiments, which was "also fundamental, [and promoted] statistical technique and application... The mathematical justification of the methods was not stressed and proofs were often barely sketched or omitted altogether .... [This] led H.B. Mann to fill the gaps with a rigorous mathematical treatment". In this book Fisher also outlined the Lady tasting tea, now a famous design of a statistical randomized experiment which uses Fisher's exact test and is the original exposition of Fisher's notion of a null hypothesis.
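The logic of the tea-tasting experiment can be sketched numerically. The sketch assumes the classic design of eight cups, four prepared each way, with the taster asked to pick out the four milk-first cups. Under the null hypothesis of no discriminating ability, the number of correct picks follows a hypergeometric distribution. The function names are illustrative.

```python
from math import comb


def prob_exactly_correct(k: int, cups_per_kind: int = 4) -> float:
    """Probability of picking exactly k of the milk-first cups by pure guessing."""
    n = cups_per_kind
    # Choose k of the n true milk-first cups and n - k of the n others,
    # out of all ways to choose n cups from 2n.
    return comb(n, k) * comb(n, n - k) / comb(2 * n, n)


def one_sided_p_value(k_observed: int, cups_per_kind: int = 4) -> float:
    """Probability of doing at least as well as k_observed by guessing."""
    return sum(
        prob_exactly_correct(k, cups_per_kind)
        for k in range(k_observed, cups_per_kind + 1)
    )


print(one_sided_p_value(4))  # 1/70, about 0.0143
print(one_sided_p_value(3))  # 17/70, about 0.243
```

Under these assumptions only a perfect score falls below the 0.05 level; three correct picks out of four would be unremarkable under guessing.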

The same year he also published a paper on fiducial inference and applied it to the Behrens–Fisher problem, the solution to which, proposed first by Walter Behrens and a few years later by Fisher, is the Behrens–Fisher distribution.

In 1936, he introduced the Iris flower data set as an example of discriminant analysis.

In his 1937 paper The wave of advance of advantageous genes he proposed Fisher's equation in the context of population dynamics to describe the spatial spread of an advantageous allele, and explored its travelling wave solutions. Out of this also came the Fisher–Kolmogorov equation. In 1937, he visited the Indian Statistical Institute in Calcutta, and its one part-time employee, P. C. Mahalanobis, often returning to encourage its development. He was the guest of honour at its 25th anniversary in 1957, when it had 2000 employees.
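For reference, Fisher's equation from the 1937 paper mentioned above is usually written in the following one-dimensional form. The symbols are conventional notation assumed here, not taken from the text: u(x, t) is the frequency of the advantageous allele, D a diffusion coefficient and r the selective advantage.

```latex
\frac{\partial u}{\partial t} = D\,\frac{\partial^{2} u}{\partial x^{2}} + r\,u\,(1 - u)

% Travelling-wave solutions take the form u(x, t) = U(x - ct)
% and exist for wave speeds c \ge 2\sqrt{rD}.
```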

In 1938, Fisher and Frank Yates described the Fisher–Yates shuffle in their book Statistical tables for biological, agricultural and medical research. Their description of the algorithm used pencil and paper; a table of random numbers provided the randomness.
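A minimal sketch of the shuffle follows. It uses the modern in-place variant (commonly attributed to Durstenfeld) rather than the original pencil-and-paper procedure, and a pseudorandom generator stands in for the table of random numbers. The function name and the seeded generator are illustrative choices.

```python
import random
from typing import List, Optional, TypeVar

T = TypeVar("T")


def fisher_yates_shuffle(items: List[T], rng: Optional[random.Random] = None) -> List[T]:
    """Return a uniformly shuffled copy of items, leaving the input untouched."""
    rng = rng or random.Random()
    result = list(items)
    # Walk from the last position down to the second; at each step swap the
    # current position with a uniformly chosen position at or before it.
    for i in range(len(result) - 1, 0, -1):
        j = rng.randint(0, i)  # randint is inclusive on both ends
        result[i], result[j] = result[j], result[i]
    return result


deck = list(range(1, 9))
shuffled = fisher_yates_shuffle(deck, random.Random(0))
print(sorted(shuffled) == deck)  # True: a shuffle only rearranges the elements
```

Drawing j from the range 0 to i inclusive, rather than from the whole list, is what makes every permutation equally likely.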

University of Cambridge, 1943–1956
In 1943, along with A.S. Corbet and C.B. Williams he published a paper on relative species abundance where he developed the log series distribution (sometimes called the logarithmic distribution) to fit two different abundance data sets. In the same year he took the Balfour Chair of Genetics where the Italian researcher Luigi Luca Cavalli-Sforza was recruited in 1948, establishing a one-man unit of bacterial genetics.

In 1936, Fisher used a Pearson's chi-squared test to analyze Mendel's data and concluded that Mendel's results were far too perfect, suggesting that adjustments (intentional or unconscious) had been made to the data to make the observations fit the hypothesis. Later authors have claimed Fisher's analysis was flawed, proposing various statistical and botanical explanations for Mendel's numbers. In 1947, Fisher co-founded the journal Heredity with Cyril Darlington and in 1949 he published The Theory of Inbreeding.

In 1950, he published "Gene Frequencies in a Cline Determined by Selection and Diffusion". He developed computational algorithms for analyzing data from his balanced experimental designs; the resulting work went through various editions and translations, becoming a standard reference for scientists in many disciplines. In ecological genetics he and E. B. Ford showed that the force of natural selection was much stronger than had been assumed, with many ecogenetic situations (such as polymorphism) being maintained by the force of selection.

During this time he also worked on mouse chromosome mapping, breeding the mice in laboratories in his own house.

Fisher publicly spoke out against the 1950 study showing that smoking tobacco causes lung cancer, arguing that correlation does not imply causation. To quote his biographers Yates and Mather, "It has been suggested that the fact that Fisher was employed as consultant by the tobacco firms in this controversy casts doubt on the value of his arguments. This is to misjudge the man. He was not above accepting financial reward for his labours, but the reason for his interest was undoubtedly his dislike and mistrust of puritanical tendencies of all kinds; and perhaps also the personal solace he had always found in tobacco." Others have suggested that his analysis was biased by professional conflicts and his own love of smoking; he was a heavy pipe smoker.

He gave the 1953 Croonian lecture on population genetics.

In the winter of 1954–1955 Fisher met Debabrata Basu, the Indian statistician who wrote in 1988, "With his reference set argument, Sir Ronald was trying to find a via media between the two poles of Statistics – Berkeley and Bayes. My efforts to understand this Fisher compromise led me to the likelihood principle".

Adelaide, 1957–1962
In 1957, a retired Fisher emigrated to Australia, where he spent time as a senior research fellow at the Australian Commonwealth Scientific and Industrial Research Organisation (CSIRO) in Adelaide, South Australia. During this time, he continued in his denial of tobacco harm, and enlisted German eugenicist Otmar von Verschuer to his cause.

Following surgery for colon cancer, he died of post-operative complications in Queen Elizabeth Hospital in Adelaide in 1962. His remains are interred in St Peter's Cathedral, Adelaide.

Legacy
Fisher's doctoral students included Walter Bodmer, D. J. Finney, Ebenezer Laing, Mary F. Lyon and C. R. Rao. Although a prominent opponent of Bayesian statistics, Fisher was the first to use the term "Bayesian", in 1950. The 1930 The Genetical Theory of Natural Selection is commonly cited in biology books, and outlines many important concepts, such as:
 * Parental investment, any parental expenditure (time, energy etc.) that benefits one offspring at a cost to parents' ability to invest in other components of fitness,
 * Fisherian runaway, explaining how the desire for a phenotypic trait in one sex combined with the trait in the other sex (for example a peacock's tail) creates a runaway evolutionary extremizing of the trait.
 * Fisher's principle, which explains why the sex ratio is mostly 1:1 in nature.
 * Reproductive value, which measures the contribution of an individual of a given age to the future growth of the population.
 * Fisher's fundamental theorem of natural selection, which states that "the rate of increase in fitness of any organism at any time is equal to its genetic variance in fitness at that time."
 * Fisher's geometric model, an evolutionary model of the effect sizes on fitness of spontaneous mutations proposed by Fisher to explain the distribution of effects of mutations that could contribute to adaptive evolution.
 * Sexy son hypothesis, which hypothesizes that females may choose arbitrarily attractive male mates simply because they are attractive, thus increasing the attractiveness of their sons who attract more mates of their own. This is in contrast to theories of female mate choice based on the assumption that females choose attractive males because the attractive traits are markers of male viability.
 * Mimicry, a similarity of one species to another that protects one or both.
 * The evolution of dominance, a relationship between alleles of one gene, in which the effect on phenotype of one allele masks the contribution of a second allele at the same locus.
 * Heterozygote advantage, which was later found to play a frequent role in genetic polymorphism.
 * Demonstrating that the probability of a mutation increasing the fitness of an organism decreases proportionately with the magnitude of the mutation and that larger populations carry more variation so that they have a greater chance of survival.

Fisher is also known for:


 * Linear discriminant analysis is a generalization of Fisher's linear discriminant
 * Fisher information, see also scoring algorithm also known as Fisher's scoring, and Minimum Fisher information, a variational principle which, when applied with the proper constraints needed to reproduce empirically known expectation values, determines the best probability distribution that characterizes the system.
 * F-distribution, arises frequently as the null distribution of a test statistic, most notably in the analysis of variance
 * Fisher–Tippett–Gnedenko theorem: Fisher's contribution to this was made in 1927
 * Fisher–Tippett distribution
 * Fisher–Yates shuffle algorithm
 * Von Mises–Fisher distribution
 * Inverse probability, a term Fisher used in 1922, referring to "the fundamental paradox of inverse probability" as the source of the confusion between statistical terms which refer to the true value to be estimated, with the actual value arrived at by estimation, which is subject to error.
 * Fisher's permutation test
 * Fisher's inequality
 * Sufficient statistic: a statistic is sufficient with respect to a statistical model and its associated unknown parameter if "no other statistic that can be calculated from the same sample provides any additional information as to the value of the parameter".
 * Fisher's noncentral hypergeometric distribution, a generalization of the hypergeometric distribution, where sampling probabilities are modified by weight factors.
 * Student's t-distribution, widely used in statistics.
 * The concept of an ancillary statistic and the notion (the ancillarity principle) that one should condition on ancillary statistics.

Personal life and beliefs


Fisher married Eileen Guinness, with whom he had two sons and six daughters. His marriage disintegrated during World War II, and his older son George, an aviator, was killed in combat. His daughter Joan, who wrote a biography of her father, married the statistician George E. P. Box.

According to Yates and Mather, "His large family, in particular, reared in conditions of great financial stringency, was a personal expression of his genetic and evolutionary convictions." Fisher was noted for being loyal, and was seen as a patriot, a member of the Church of England, politically conservative, as well as a scientific rationalist. He developed a reputation for carelessness in his dress and was the archetype of the absent-minded professor. H. Allen Orr describes him in the Boston Review as a "deeply devout Anglican who, between founding modern statistics and population genetics, penned articles for church magazines". In a 1955 broadcast on Science and Christianity, he said:

"The custom of making abstract dogmatic assertions is not, certainly, derived from the teaching of Jesus, but has been a widespread weakness among religious teachers in subsequent centuries. I do not think that the word for the Christian virtue of faith should be prostituted to mean the credulous acceptance of all such piously intended assertions. Much self-deception in the young believer is needed to convince himself that he knows that of which in reality he knows himself to be ignorant. That surely is hypocrisy, against which we have been most conspicuously warned."

Fisher was involved with the Society for Psychical Research.

Views on race
Between 1950 and 1951, Fisher, along with other leading geneticists and anthropologists of his time, was asked to comment on a statement that UNESCO was preparing on the nature of race and racial differences, which was published in 1950 as the UNESCO Statement on Race. The statement, along with the comments and criticisms of a large number of scientists including Fisher, is published in "The Race Concept: Results of an Inquiry" (1952).

Fisher was one of four scientists who opposed the statement. In his own words, Fisher's opposition is based on "one fundamental objection to the Statement", which "destroys the very spirit of the whole document." He believes that human groups differ profoundly "in their innate capacity for intellectual and emotional development" and concludes from this that the "practical international problem is that of learning to share the resources of this planet amicably with persons of materially different nature, and that this problem is being obscured by entirely well-intentioned efforts to minimize the real differences that exist."

Fisher's opinions are clarified by his more detailed comments on Section 5 of the statement, which are concerned with psychological and mental differences between the races. Section 5 concludes as follows:

"Scientifically, however, we realized that any common psychological attribute is more likely to be due to a common historical and social background, and that such attributes may obscure the fact that, within different populations consisting of many human types, one will find approximately the same range of temperament and intelligence."

Of the entire statement, Section 5 recorded the most dissenting viewpoints. It was recorded that "Fisher's attitude … is the same as Muller's and Sturtevant's". Muller's criticism was recorded in more detail and was noted to "represent an important trend of ideas":

"I quite agree with the chief intention of the article as a whole, which, I take it, is to bring out the relative unimportance of such genetic mental differences between races as may exist, in contrast to the importance of the mental differences (between individuals as well as between nations) caused by tradition, training and other aspects of the environment. However, in view of the admitted existence of some physically expressed hereditary differences of a conspicuous nature, between the averages or the medians of the races, it would be strange if there were not also some hereditary differences affecting the mental characteristics which develop in a given environment, between these averages or medians. At the same time, these mental differences might usually be unimportant in comparison with those between individuals of the same race…. To the great majority of geneticists it seems absurd to suppose that psychological characteristics are subject to entirely different laws of heredity or development than other biological characteristics. Even though the former characteristics are far more influenced than the latter by environment, in the form of past experiences, they must have a highly complex genetic basis."

Fisher's own words were quoted as follows:

"As you ask for remarks and suggestions, there is one that occurs to me, unfortunately of a somewhat fundamental nature, namely that the Statement as it stands appears to draw a distinction between the body and mind of men, which must, I think, prove untenable. It appears to me unmistakable that gene differences which influence the growth or physiological development of an organism will ordinarily pari passu influence the congenital inclinations and capacities of the mind. In fact, I should say that, to vary conclusion (2) on page 5, 'Available scientific knowledge provides a firm basis for believing that the groups of mankind differ in their innate capacity for intellectual and emotional development,' seeing that such groups do differ undoubtedly in a very large number of their genes." Fisher also ended a 1954 letter to Reginald Ruggles Gates, a Canadian-born geneticist who argued that different racial groups were different species, with the words: "I am sorry that there should be propaganda in favour of miscegenation in North America as I am sure it can do nothing but harm. Is it beyond human endeavour to give and justly administer equal rights to all citizens without fooling ourselves that these are equivalent items?"

Fisher's writings nearly all discuss human populations or humanity as a whole without reference to race or specific racial groups, and none of his work explicitly supports the idea of racial superiority or white supremacy. Fisher had a close personal relationship with Indian statistician P.C. Mahalanobis, and significantly contributed to the development of the Indian Statistical Institute; and Fisher's graduate students included Walter Bodmer, a child of Jewish-German parents who fled from Nazi Germany while he was young, and Ebenezer Laing, an African geneticist from Ghana. Daniel Kevles, an American historian of science, described Fisher as an "anti-racist conservative". However, British historian Richard J. Evans, writing in The New Statesman, argued that Fisher's views on eugenics and his opposition to UNESCO's statement about genetic racial differences were indicative of racism.

Eugenics
In 1911, Fisher became founding Chairman of the University of Cambridge Eugenics Society, whose other founding members included John Maynard Keynes, R. C. Punnett, and Horace Darwin. After members of the Cambridge Society – including Fisher – stewarded the First International Eugenics Congress in London in summer 1912, a link was forged with the Eugenics Society (UK). He saw eugenics as addressing pressing social and scientific issues that encompassed and drove his interest in both genetics and statistics. During World War I Fisher started writing book reviews for The Eugenics Review and volunteered to undertake all such reviews for the journal, being hired for a part-time position.

The last third of The Genetical Theory of Natural Selection focused on eugenics, attributing the fall of civilizations to the fertility of their upper classes being diminished, and used British 1911 census data to show an inverse relationship between fertility and social class, which was partly due, he claimed, to the lower financial costs and hence increasing social status of families with fewer children. He proposed the abolition of extra allowances to large families, with the allowances proportional to the earnings of the father. He served in several official committees to promote eugenics, including the Committee for Legalizing Eugenic Sterilization which drafted legislation aiming to limit the fertility of "feeble minded high-grade defectives ... comprising a tenth of the total population". It was proposed that this policy would allow for voluntary sterilisation, and Fisher was against the idea of forced sterilisation.

Beginning in 1934, Fisher became disillusioned with the Eugenics Society over concerns that its activities were increasingly aimed in a political rather than scientific direction; he formally dissociated himself from the Society in 1941.

Fisher wrote a testimony on behalf of the eugenicist Otmar Freiherr von Verschuer. He wrote that, although the Nazis used Verschuer's work to give scientific support for their ideology, it was "[Verschuer's] misfortune rather than his fault that racial theory was a part of the Nazi ideology." He conducted extensive correspondence with von Verschuer over decades, which is held at the University of Adelaide.

Appraisal of scientific merits
Fisher was elected to the Royal Society in 1929, the American Academy of Arts and Sciences in 1934, the American Philosophical Society in 1941, and the United States National Academy of Sciences in 1948. He was made a Knight Bachelor by Queen Elizabeth II in 1952 and awarded the Linnean Society of London Darwin–Wallace Medal in 1958.

He won the Copley Medal and the Royal Medal. He was an Invited Speaker of the ICM in 1924 in Toronto and in 1928 in Bologna.

In 1950, Maurice Wilkes and David Wheeler used the Electronic Delay Storage Automatic Calculator to solve a differential equation relating to gene frequencies in a paper by Ronald Fisher. This represents the first use of a computer for a problem in the field of biology. The Kent distribution (also known as the Fisher–Bingham distribution) was named after him and Christopher Bingham in 1982, while the Fisher kernel was named after Fisher in 1998.

The R. A. Fisher Lectureship was a North American Committee of Presidents of Statistical Societies (COPSS) annual lecture prize, established in 1963, until the name was changed to COPSS Distinguished Achievement Award and Lectureship in 2020. On 28 April 1998 a minor planet, 21451 Fisher, was named after him.

In 2010, the R.A. Fisher Chair in Statistical Genetics was established in University College London to recognise Fisher's extraordinary contributions to both statistics and genetics.

Anders Hald called Fisher "a genius who almost single-handedly created the foundations for modern statistical science", while Richard Dawkins named him "the greatest biologist since Darwin": "Not only was he the most original and constructive of the architects of the neo-Darwinian synthesis, Fisher also was the father of modern statistics and experimental design. He therefore could be said to have provided researchers in biology and medicine with their most important research tools, as well as with the modern version of biology's central theorem."

Geoffrey Miller said of him: "To biologists, he was an architect of the 'modern synthesis' that used mathematical models to integrate Mendelian genetics with Darwin's selection theories. To psychologists, Fisher was the inventor of various statistical tests that are still supposed to be used whenever possible in psychology journals. To farmers, Fisher was the founder of experimental agricultural research, saving millions from starvation through rational crop breeding programs."

Contentious views on eugenics
Fisher and Sewall Wright both contributed to the development of population genetics, which became part of the modern synthesis. The interpretation of the mathematical theories of population genetics became a bone of contention between Fisher and Wright by the mid-1920s, and the issue became acrimonious. Dispute persisted for the rest of Fisher's life. A 2021 paper, authored by trustees of the "Fisher Memorial Trust", commented that recent criticism of Fisher could mostly be characterised as "reconsideration of the honour given to individuals from preceding times who are felt to have contributed to social injustice in the past, or to have held views that are felt to have promoted social injustice."

In June 2020, during the international protests caused by the murder of George Floyd, Gonville and Caius College announced that a 1989 stained-glass window commemorating Fisher's work would be removed because of his connection with eugenics. An accommodation building, built in 2018 and previously named after him, was subsequently renamed too. University College London also decided to remove his name from its Centre for Computational Biology.

Contentious views on smoking
Fisher rejected the notion of smoking cigarettes being dangerous, calling it "propaganda".