The biologists Tine Tammes and Jan Willem Moll and the astronomer Jacobus Kapteyn tried to apply Pearson’s statistics Ida H. Stamhuis Vrije Universiteit.

Slides:



Advertisements
Similar presentations
Writing Research Papers - A presentation by William Badke
Advertisements

A small taste of inferential statistics
Concepts and Connections © 2013 W. H. Freeman and Company
Chapter 6 Sampling and Sampling Distributions
Mendelian Genetics. Genes- genetic material on a chromosome that codes for a specific trait Genotype- the genetic makeup of the organism Phenotype- the.
The Scientific Method.
Writing Scientific Papers Lecturer: Prof. Nyoman S. Antara, Ph.D. Agroindustrial Technology Department Faculty of Agricultural Technology Udayana University.
Random variable Distribution. 200 trials where I flipped the coin 50 times and counted heads no_of_heads in a trial.
Gregor Johann Mendel “Father of Genetics”. Mendel’s success Came up with an elegant model of experimental design –chose a good “model” organism: Pisum.
Chapter 3 Producing Data 1. During most of this semester we go about statistics as if we already have data to work with. This is okay, but a little misleading.
“The Father of Genetics” Gregor Johann Mendel ( ) Shane Connolly.
QM Spring 2002 Business Statistics Normally Distributed Random Variables.
I. Gregor Mendel “father of genetics”. a. Inheritance Theory Prior to Mendel 1. Traits “blended” Trait: characteristics to be passed from parent to offspring.
Section 9.3 Testing a Proportion p. 2 Focus Points Identify the components needed for testing a proportion. Compute the sample test statistic. Find the.
RSS Centre for Statistical Education Improving Statistical Literacy in School and Society: The UK Experience Peter Holmes RSS Centre for Statistical Education,
LOGIC AND CRITICAL THINKING Jonathan Dolhenty, Ph.D. Logic and Critical Thinking. Available at
Hypothesis Testing II The Two-Sample Case.
Answering questions about life with statistics ! The results of many investigations in biology are collected as numbers known as _____________________.
Writing Research Papers. Research papers are often required of students in high school and in higher education.
Writing Literary Analysis Papers
LeMoyne-Owen College December 16, 2009 Mimi Czarnik, Professor of English and Dean of Humanities Becky Burton, Associate Professor of Biology Alverno College,
Fundamentals of Data Analysis Lecture 4 Testing of statistical hypotheses.
Tutor: Prof. A. Taleb-Bendiab Contact: Telephone: +44 (0) CMPDLLM002 Research Methods Lecture 8: Quantitative.
TEA Science Workshop #3 October 1, 2012 Kim Lott Utah State University.
Environmental Science Bellringers
Introduction to Inferential Statistics. Introduction  Researchers most often have a population that is too large to test, so have to draw a sample from.
Lesson Overview Lesson Overview Science in Context Lesson Overview 1.2 Science in Context.
What Makes an Essay an Essay. Essay is defined as a short piece of composition written from a writer’s point of view that is most commonly linked to an.
Population Genetics is the study of the genetic
@earthscience92. What is Science? Science – The systematic study of natural events and condition. Anything in living or nonliving world Scientific knowledge.
Elementary School Science: Emphasizing the Basics Presented by Frank H. Osborne, Ph. D. © 2015 EMSE 3123 Math and Science in Elem. Ed.
Lesson Overview Lesson Overview Science in Context Lesson Overview 1.2 Science in Context.
Introduction to Earth Science Section 1- What is Earth Science Section 2- Science as a Process.
Inductive Generalizations Induction is the basis for our commonsense beliefs about the world. In the most general sense, inductive reasoning, is that in.
Section 1 What Is Earth Science? Section 2 Science as a Process
Biostatistics Class 1 1/25/2000 Introduction Descriptive Statistics.
Lesson Overview 1.2 Science in Context.
Mendel and his Peas Chapter 5 Lesson 1.
6. Evaluation of measuring tools: validity Psychometrics. 2012/13. Group A (English)
WHAT IS THE NATURE OF SCIENCE?. SCIENTIFIC WORLD VIEW 1.The Universe Is Understandable. 2.The Universe Is a Vast Single System In Which the Basic Rules.
G E N E T I C S A Study of Heredity and Inheritance.
Introduction to Earth Science Section 2 Section 2: Science as a Process Preview Key Ideas Behavior of Natural Systems Scientific Methods Scientific Measurements.
Scientific Methods and Terminology. Scientific methods are The most reliable means to ensure that experiments produce reliable information in response.
Question paper 1997.
The Scientific Method. Objectives Explain how science is different from other forms of human endeavor. Identify the steps that make up scientific methods.
Mendelian Genetics Gregor Mendel (1822 – 1884) performed one of the classic scientific experiments on inheritance during the mid-19 th century. Mendel.
 An article review is written for an audience who is knowledgeable in the subject matter instead of a general audience  When writing an article review,
Scientific Inquiry & Design Scientific Method
The Nature of Science. Objective  COS 12.0 Identify the metric units used for mass, temperature, time and length (distance)
The World of Life Science
Wheeler High School The Center for Advanced Studies in Science, Math & Technology Post-AP DNA/Genetics – Ms. Kelavkar Course Introduction Genetics Lecture.
Scientific Methodology Vodcast 1.1 Unit 1: Introduction to Biology.
A Quantitative Overview to Gene Expression Profiling in Animal Genetics Armidale Animal Breeding Summer Course, UNE, Feb Analysis of (cDNA) Microarray.
Gregor Mendel 1822 – 1884 Born into a German family in, what is now, the Czech Rupublic but what was then called the Austrian Empire. He wasn’t given the.
Introduction to Earth Science Section 1 SECTION 1: WHAT IS EARTH SCIENCE? Preview  Key Ideas Key Ideas  The Scientific Study of Earth The Scientific.
Nature of Science Lesson 1: Understanding Science ESSENTIAL QUESTIONS 1)What is scientific inquiry? 2)What are the results of scientific investigations?
The inference and accuracy We learned how to estimate the probability that the percentage of some subjects in the sample would be in a given interval by.
The accuracy of averages We learned how to make inference from the sample to the population: Counting the percentages. Here we begin to learn how to make.
Tests of Significance We use test to determine whether a “prediction” is “true” or “false”. More precisely, a test of significance gets at the question.
The expected value The value of a variable one would “expect” to get. It is also called the (mathematical) expectation, or the mean.
Fundamentals of Data Analysis Lecture 4 Testing of statistical hypotheses pt.1.
Lesson Overview Lesson Overview Science in Context Lesson Overview 1.2 Science in Context Scientific methodology is the heart of science. But that vital.
1-1 What Is Science? Slide 1 of 21 Copyright Pearson Prentice Hall Thinking Like a Scientist: The Scientific Method is a series of steps scientist follow.
Glossary of Terms Used in Science Papers AS
Section 2: Science as a Process
Chapter 5 Hypothesis Testing
CHAPTER 26: Inference for Regression
The Origin of Modern Genetics
15.1 Goodness-of-Fit Tests
Chapter 5: Sampling Distributions
Presentation transcript:

The biologists Tine Tammes and Jan Willem Moll and the astronomer Jacobus Kapteyn tried to apply Pearson’s statistics Ida H. Stamhuis Vrije Universiteit Amsterdam

From 1983 I became educated as a historian of science 1989 PhD: Statistics in the Netherlands in the 19 th century. See also STAtOR 2002 and 2003 for three short articles About statistics in Dutch medicine and biology: Statistical Mind , Vol. II (Stamhuis, Klep and Maarseveen eds)

Two of the statements of today deal with the past –Statisticians should know the history of their own field; too many wheels are reinvented over and over again –“For evident reasons a theory for the benefit of biologists would be best worked out by a biologist. If he cannot do it, because he is but a poor statistician, the next best thing – still not approximately good – would be to have the work entrusted to a biologist working in close cooperation with an expert statistician. Almost the worst possible thing will be to put the task wholly on a regular statistician.” Kapteyn (1916)

Story about three scientists in Groningen, a case of collaboration between scientists (and statisticians?) with the aim to apply statistics in biology –Johannes Kapteyn ( ) astronomer (statistician?) –Jan Willem Moll ( ) botanist –Tine Tammes ( ) geneticist How can we use this story to give substance to the two statements? For panel discussion

Small community: In 1914, ‘Senaat’,consisted of 43 members, met regularly; Faculty of Science and Mathematics: much smaller number Kapteyn and Moll: Ossenmarkt Tammes: Oranjesingel. Botanical Institute at Rozenstraat

Johannes Kapteyn (1851 – 1922) Professor in Groningen 1878 (27 years old) No astronomical observatory Photographic plates from Cape Town 1900: 345,000 stars recorded in catalogue (3 vol.)

More than a million stars were known: too many to study each of these Kapteyn developed ‘Plan of Selected Area’s’ Agreed with colleagues to concentrate on 206 stellar area’s ‘uniformly demarcated over the sky’ Kapteyn developed statistical methods

Jan Willem Moll (1851 – 1933) Studied botany in Amsterdam Became professor of botany in Groningen in Botanical Laboratory at the Rozenstraat The famous Hugo de Vries was his friend and colleague

From 1894 Hugo de Vries started to apply statistical methods which he learnt from Adolphe Quetelet ( ), Francis Galton ( ) and Walter Weldon ( ) (This resulted in the Rediscovery of the Mendelian laws) He wrote Moll about it. Moll’s reaction “I browsed through Galton, but don't understand it yet. It takes me a surprising effort to imagine these things, especially when something new is introduced”

1890 Tine Tammes entered botanical institute 1892 teacher’s certificate in physics, chemistry, cosmography 1897 in botany, zoology, mineralogy, geology In 1897 became Moll’s assistant 1911 honorary doctor 1919 extraordinary professor: in the Netherlands the first in genetics, the second female professor

1902: Kapteyn was asked by ‘several botanical students’ and other interested persons to explain (in the Botanical Institute) the statistical methods of Quetelet, Galton, and Pearson ‘in a popular way’. Moll will have initiated this and Tammes will have been one of the ‘students’.

What happened? Kapteyn started to study; and discovered Karl Pearson’s: Contributions to the mathematical theory of evolution. II. Skew variation in homogeneous material of 1895 He published in 1903: Skew Frequency Curves in Biology and Statistics Controversy between the two scholars: see paper ISR (by Stamhuis and Seneta)

Kapteyn 1903 – Pearson 1905 – Kapteyn 1906 – Pearson 1906 [– Kapteyn (and Van Uven) 1916] 'mathematical statistics' plus examples about skew frequency curves Accusations: Inappropriate starting points, plagiarism, serious mathematical mistakes

The astronomer David Gill did not encourage his collaborator Kapteyn: “Why a man of special astronomical gifts, like yourself should waste his days in abstract mathematical work which so many men are capable of working at –- whilst there are so few to do what you can so well do -- I don’t know. After all what is the value or interest of a frequency curve compared with the structure of the universe?”

How successful was Kapteyn in his mediating role? Moll's skew curve machine: “those who are familiar with Galton's apparatus will readily understand the present one” “The effect of the various causes is strictly proportional to the absolute dimensions” ripe berries: diameters normal; cubes skew

Quincunx Galton

How successful was Kapteyn in his mediating role? Let us look at Tine Tammes' efforts to use statistics She was known and admired by her colleagues through her inclination towards statistics What did she grasp and apply?

Publications Tammes applying statistics (1904). On the influence of nutrition on the fluctuating variability of some plants, (1907). Der Flachsstengel. Eine statistisch-anatomische Monographie, (1911). Das Verhalten fluktuierend variierender Merkmale bei der Bastardierung (1914). Die statistische Methode bei der Beschreibung von Nahrungsmitteln, insbesondere von Stärke. Onzième Congrès Internationale de Pharmacie, (1915). Die genotypische Zusammensetzung einiger Varietäten derselben Art und ihr genetischer Zusammenhang.

1904; nutrition of plants She expressed herself about the importance of statistics: “The introduction of the statistical method (...) into botany has enabled us to formulate more sharply the formerly vague and insufficiently defined question of the influence of nutrition and also to interpret the results obtained easily and accurately.” She referred to Kapteyn: “300 measurements or countings, a number which, according to the calculations of Prof. Kapteyn, gives in investigations of this kind a sufficient guarantee of accuracy.”

Median M instead of the arithmetical mean, and the variability coefficient Q/M, instead of the standard deviation. Q is the difference between the third and the first quartile. She introduced a so-called sensibility coefficient for the median as well as for the variability. The sensibility coefficient of the median was defined as M 1 – M 2 / M 1 with M 1 representing the median of the results of good nutrition and M 2 the median of the results of bad nutrition.

Moll later explicitly referred to her introduction of this coefficient when he argued that she was worthy to become an honorary doctor and later an extraordinary professor. For botanists the definition of such a concept was apparently something exceptional and admirable.

Der Flachsstengel, eine statistisch-anatomische Monographie, pages She introduced the Pearson correlation coefficient. She needed 10 pages to explain what was for the readership something new: “only recently correlation phenomena have begun to be closely studied” She composed a correlation table. She did not calculate the correlation coefficient directly, but used a formula, again made with the help of Kapteyn, so that the median played a standard role

Das Verhalten fluktuierend variierender Merkmale bei der Bastardierung 1911 Hybridizations of four flax varieties and various quantitative characters: seed length and width, flower colour and others. Especially seed length produced nice results.

Mendel’s Laws for Quantitative Characters: multiple factor hypothesis No dominance and recessiveness AA aa Aa AAx aa becomesAa self pollination AA aa Aa Everything from AA aaAA AAuntil aa for exampleaa (3A 5a) ~ B (8, ½) AA aaaa AA aaaA

This work was ground braking from the viewpoint of genetics: continuous characters in the 'Mendelian domain‘ compared to her colleagues she took more characteristics of the relevant distribution into account: not only the chance of extreme outcomes, but also the shape of the corresponding Bernoulli distribution As a result she could draw well-argued conclusions on the number of factors that contributed to the value of a particular character She was apparently more familiar with this kind of probabilistic reasoning than the others working on the same problem.

Die statistische Methode bei der Beschreibung von Nahrungsmitteln, insbesondere von Stärke (1914) in a volume of an international pharmaceutical congress. “In a statistical investigation it is very important that the necessary material is randomly chosen. When this aspect is not given sufficient attention, the whole investigation is without meaning.’ She explained the choice of the use of M and Q/M. She took on the role of a mediator to make statistical methods better known in the pharmaceutical world.

Die genotypische Zusammensetzung einiger Varietäten derselben Art und ihr genetischer Zusammenhang (1914) Several Mendelian crossings Observed and expected numbers in several classes were compared. Like her colleagues in Mendelian genetics, she did not apply a formal test, like the Chi square test, to conclude if empirical outcomes and theoretical expectations were sufficiently in accordance with each other

Concluding remarks At first sight Kapteyn’s efforts not very successful. Nothing in the three extensive papers on skew frequency curves was used by Tammes. The character of the methods Tammes used was different from Kapteyn’s: the argumentations of Kapteyn were essentially mathematical. The methods Tammes used were, from a mathematical viewpoint, rather elementary. Nevertheless it is obvious that for her the application of statistical concepts was essential, that she was admired by her colleagues and attained important results. It also became clear that Kapteyn played a decisive role. In this sense Kapteyn was succesful in his role of mediator.