Numerical and Graphical Analysis: Finding and understanding patterns in data.

The course so far: Academic use of the web. Publishing on the web. Analysing text. Manipulating textual lists. Tables. Numerical Analysis – why it matters. Graphical Analysis – a better way for humanists.

Lists Ann Simms of Riverhead (Female) left £1560 died at age 89 years Anne Potts of Ide Hill (Female) left £34 died at 17 years Charles Forth of Chevening (Male) left £129 died at age 48 years :::: George Salter of Riverhead (male) left £190, died at age 26 years

Data as a table
Forename  Surname  Village    Gender  Wealth  Age at death
Ann       Simms    Riverhead  Female  £1560   89
Anne      Potts    Ide Hill   Female  £34     17
Charles   Forth    Chevening  Male    £129    48
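The step from the free-text list to the table can be sketched in code. This is a minimal illustration, not the method used in the course: the regular expression, the function name parse_entry and the field names are assumptions made for this example, and the pattern only covers the four entries shown on the "Lists" slide.

```python
import re

# Entries copied from the "Lists" slide; note the small inconsistencies in wording.
ENTRIES = [
    "Ann Simms of Riverhead (Female) left £1560 died at age 89 years",
    "Anne Potts of Ide Hill (Female) left £34 died at 17 years",
    "Charles Forth of Chevening (Male) left £129 died at age 48 years",
    "George Salter of Riverhead (male) left £190, died at age 26 years",
]

# Hypothetical pattern: tolerates an optional comma and an optional "age".
PATTERN = re.compile(
    r"(?P<forename>\w+) (?P<surname>\w+) of (?P<village>.+?) "
    r"\((?P<gender>\w+)\) left £(?P<wealth>\d+),? died at (?:age )?(?P<age>\d+) years"
)


def parse_entry(entry):
    """Turn one free-text list entry into a table row (a dict), or None."""
    match = PATTERN.fullmatch(entry)
    if match is None:
        return None  # the entry does not follow the expected wording
    row = match.groupdict()
    row["gender"] = row["gender"].capitalize()  # normalise "male" to "Male"
    row["wealth"] = int(row["wealth"])
    row["age"] = int(row["age"])
    return row


rows = [parse_entry(entry) for entry in ENTRIES]
for row in rows:
    print(row)
```

Entries that do not match come back as None, so they can be checked by hand rather than silently dropped.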

Tables in a Spreadsheet

Spreadsheet Software Evolved from financial accounting practice. Tabular data. Simple lists. Simple databases. Establish relationships within and between data sets – simple statistics. Apply various functions. Plot graphs and charts.

Applications in the humanities Maintaining and manipulating lists. Studying quantifiable information. Managing budgets and projects. Plotting graphs and charts. Compensating for weaknesses in other software applications. Building utility programs.

What would your tutor's comments be? During the early 19th century the population of London grew rapidly due to mass migration in from the countryside. The overcrowding caused by the rising population placed a strain on the sanitation systems, causing a series of cholera epidemics, each worse than the one before.

Quantify your statements: evidence? During the early 19th century the population of London grew rapidly due to mass migration in from the countryside. The overcrowding caused by the rising population placed a strain on the sanitation systems, causing a series of cholera epidemics, each worse than the one before.

Poetry or Maths? Reproduced from: Burrows J. (2002) Delta: a Measure of Stylistic Difference and a Guide to Likely Authorship, Literary and Linguistic Computing, Vol. 17:3 p. 270.

Reproduced from: Burrows J. (2002) Delta: a Measure of Stylistic Difference and a Guide to Likely Authorship, Literary and Linguistic Computing, Vol. 17:3 p. 280.

Examples Social History - New Poor Law - Effects of the Industrial Revolution - Voting patterns in elections. Textual analysis - word frequencies etc. - George Orwell - Author attribution: Shakespeare or Marlowe?

Why use numerical analysis? Wide variety of techniques – suitable for different types of data and questions. In the humanities it usually means statistics. Three roles - Summarise and compare data sets - Test hypotheses - Determine the significance of findings

Research Process What is your question? What results would prove/disprove it? Write a code book defining - variable names - variable data type - categories, ranges (controlled vocabulary for numeric data). Code data. Analysis. Interpretation.
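A code book can be written down as a small data structure and used to check records as they are coded. The sketch below is one possible layout, not a standard one: the names CODE_BOOK and validate, the field names and the upper age limit of 120 are all assumptions made for illustration.

```python
# Hypothetical code book for the wealth-at-death table: one rule per variable.
CODE_BOOK = {
    "forename": {"type": str},
    "surname": {"type": str},
    "village": {"type": str, "categories": {"Riverhead", "Ide Hill", "Chevening"}},
    "gender": {"type": str, "categories": {"Female", "Male"}},
    "wealth": {"type": int, "range": (0, None)},       # pounds, no upper limit
    "age_at_death": {"type": int, "range": (0, 120)},  # assumed plausible limits
}


def validate(record, code_book=CODE_BOOK):
    """Return a list of problems found in one coded record (empty if none)."""
    problems = []
    for name, rule in code_book.items():
        if name not in record:
            problems.append(f"{name}: missing")
            continue
        value = record[name]
        if not isinstance(value, rule["type"]):
            problems.append(f"{name}: expected {rule['type'].__name__}")
            continue
        if "categories" in rule and value not in rule["categories"]:
            problems.append(f"{name}: {value!r} is not an allowed category")
        if "range" in rule:
            low, high = rule["range"]
            too_low = low is not None and value < low
            too_high = high is not None and value > high
            if too_low or too_high:
                problems.append(f"{name}: {value} is out of range")
    return problems


good = {"forename": "Ann", "surname": "Simms", "village": "Riverhead",
        "gender": "Female", "wealth": 1560, "age_at_death": 89}
bad = dict(good, forename="George", surname="Salter", gender="male",
           wealth=190, age_at_death=26)

print(validate(good))
print(validate(bad))
```

The second record is rejected only because "male" is written in lower case, which is the kind of inconsistency a controlled vocabulary is meant to catch before analysis begins.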

Authorship attribution Analysis of writing style. Consistency of style. Find frequently used words and look at their frequency in different portions of the book. End up with tables of frequencies and various indices – need to interpret them.
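Counting a frequent word in different portions of a text can be sketched as follows. This is only an illustration of the counting step, not of any particular attribution method: the two "portions" are the two sentences of the London paragraph quoted earlier in these slides, and the function names and the simple tokenising pattern are assumptions.

```python
import re
from collections import Counter

# Two portions of text, taken from the London paragraph quoted earlier.
PORTIONS = {
    "first sentence": (
        "During the early 19th century the population of London grew rapidly "
        "due to mass migration in from the countryside."
    ),
    "second sentence": (
        "The overcrowding caused by the rising population placed a strain on "
        "the sanitation systems causing a series of cholera epidemics, each "
        "worse than the one before."
    ),
}


def word_counts(text):
    """Count lower-cased words; letters, digits and apostrophes form a word."""
    return Counter(re.findall(r"[a-z0-9']+", text.lower()))


def rate_per_100(text, word):
    """Occurrences of `word` per 100 words of `text`."""
    counts = word_counts(text)
    total = sum(counts.values())
    return 100 * counts[word] / total if total else 0.0


for label, text in PORTIONS.items():
    count = word_counts(text)["the"]
    print(label, count, round(rate_per_100(text, "the"), 1))
```

Rates per 100 words, rather than raw counts, let portions of different lengths be compared. Real studies use far longer portions and many words at once.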

Simple Statistics To summarise a set of data: - mean: average value - mode: most common value - median: middle value - range: minimum, maximum and the difference between them
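These four summaries can be computed with Python's standard statistics module. The sketch below uses the four wealth values from the "Lists" slide; the function name summarise is an assumption made for this example.

```python
from statistics import mean, median, multimode

# Wealth at death (in pounds) for the four people on the "Lists" slide.
wealth = [1560, 34, 129, 190]


def summarise(values):
    """Return the mean, median, mode(s) and range of a list of numbers."""
    lowest, highest = min(values), max(values)
    return {
        "mean": mean(values),
        "median": median(values),
        "modes": multimode(values),  # every value that ties for most common
        "minimum": lowest,
        "maximum": highest,
        "range": highest - lowest,
    }


summary = summarise(wealth)
for name, value in summary.items():
    print(name, value)
```

Here the mean (478.25) is about three times the median (159.5) because one large estate pulls the average up, and every value counts as a mode because none is repeated. This is a reminder that a single summary figure can mislead.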

Graphical Analysis Allows us to: Summarise data. Explore and identify areas for further study. Communicate the meaning of large volumes of data.

Variance and correlation Are two things related? - ability in one language and ability in another - poverty and disease - smoking and cancer. Most easily done by drawing a graph.
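Alongside a graph, the strength of a linear relationship is often summarised by Pearson's correlation coefficient, which runs from -1 to +1. The slides do not name a specific coefficient, so this choice is an assumption, as is the function name pearson_r. The data are the ages and wealth values of the four people on the "Lists" slide, used purely as an illustration.

```python
from math import sqrt

# Age at death and wealth (pounds) for the four people on the "Lists" slide.
ages = [89, 17, 48, 26]
wealth = [1560, 34, 129, 190]


def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of the same length (at least 2)")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined when one variable is constant")
    return sxy / sqrt(sxx * syy)


r = pearson_r(ages, wealth)
print(round(r, 2))
```

Four records are far too few to support any conclusion. A single large estate left by the oldest person is enough to produce a high coefficient here, which is why the graph should always be looked at as well.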

Lung Cancer and Smoking

With regression line fitted
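A fitted regression line is usually the least-squares line, the straight line that minimises the squared vertical distances to the points. The slides do not say how the line was fitted, so the sketch below assumes ordinary least squares. It does not use the smoking data, which is not reproduced in this transcript; instead it reuses the ages and wealth values from the "Lists" slide, and fit_line is a name chosen for this example.

```python
def fit_line(xs, ys):
    """Least-squares slope and intercept for the line y = intercept + slope * x."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of the same length (at least 2)")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    sxx = sum((x - mean_x) ** 2 for x in xs)
    if sxx == 0:
        raise ValueError("cannot fit a line when every x value is the same")
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x  # the line passes through the two means
    return slope, intercept


# Sanity check on made-up points that lie exactly on y = 1 + 2x.
check_slope, check_intercept = fit_line([0, 1, 2, 3], [1, 3, 5, 7])
print(check_slope, check_intercept)

# Illustration with the four people on the "Lists" slide.
ages = [89, 17, 48, 26]
wealth = [1560, 34, 129, 190]
slope, intercept = fit_line(ages, wealth)
print(round(slope, 2), round(intercept, 2))
```

The slope is the predicted change in y for each extra unit of x. With only four points the fitted values should not be read as a finding.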

Variation in data How much variation is there in the data values? Standard deviation measures the typical deviation of the data values from their mean. A small value means very little spread.
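The calculation behind the standard deviation can be followed step by step. The sketch below uses the four ages at death from the "Lists" slide and treats them as a complete population (dividing by n). That choice is an assumption: for a sample, the usual formula divides by n - 1, as statistics.stdev does.

```python
from math import sqrt
from statistics import pstdev

# Ages at death for the four people on the "Lists" slide.
ages = [89, 17, 48, 26]

mean_age = sum(ages) / len(ages)                   # 45.0
deviations = [age - mean_age for age in ages]      # distance of each value from the mean
squared = [d ** 2 for d in deviations]             # squaring removes the signs
variance = sum(squared) / len(ages)                # population variance: 772.5
std_dev = sqrt(variance)                           # back in the original units (years)

print(mean_age, variance, round(std_dev, 1))

# The standard library gives the same population standard deviation.
print(round(pstdev(ages), 1))
```

A standard deviation of nearly 28 years around a mean of 45 shows that these ages are widely spread. A value close to zero would mean the ages were all close to the mean.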

What does this mean?

Warnings Think about what you are doing. A correlation does not mean there is a link. Even if there is a mathematical relationship it may not be a causal one. Beware of interpolated and extrapolated values.

Correlation? Tufte (2001) p. 15

René Magritte: La Trahison des Images (1928-9) (The Treachery of Images) Los Angeles County Museum of Art