Introduction to Data Analysis and Processing www.cma-science.nl Technology Enhanced Inquiry Based Science Education.

Slides:



Advertisements
Similar presentations
Image Enhancement in the Frequency Domain (2)
Advertisements

Standard Data Analysis - Integration
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Lecture Slides Elementary Statistics Eleventh Edition and the Triola.
The frequency spectrum
Analysis of Research Data
Image Enhancement.
B a c kn e x t h o m e Classification of Variables Discrete Numerical Variable A variable that produces a response that comes from a counting process.
CHAPTER 1: Picturing Distributions with Graphs
Chapter 2 Frequency Distributions and Graphs 1 © McGraw-Hill, Bluman, 5 th ed, Chapter 2.
Descriptive Statistics  Summarizing, Simplifying  Useful for comprehending data, and thus making meaningful interpretations, particularly in medium to.
Descriptive Statistics, Part Two Farrokh Alemi, Ph.D. Kashif Haqqi, M.D.
Calibration & Curve Fitting
CHAPTER 39 Cumulative Frequency. Cumulative Frequency Tables The cumulative frequency is the running total of the frequency up to the end of each class.
2009 Mathematics Standards of Learning Training Institutes Algebra II Virginia Department of Education.
Curve Modeling Bézier Curves
Basic Statistics Standard Scores and the Normal Distribution.
© 2005 The McGraw-Hill Companies, Inc., All Rights Reserved. Chapter 12 Describing Data.
Quantitative Skills: Data Analysis and Graphing.
ELECTRICAL CIRCUIT ET 201 Define and explain characteristics of sinusoidal wave, phase relationships and phase shifting.
Exploratory Data Analysis. Computing Science, University of Aberdeen2 Introduction Applying data mining (InfoVis as well) techniques requires gaining.
09/16/2010© 2010 NTUST Today Course overview and information.
TIME SERIES by H.V.S. DE SILVA DEPARTMENT OF MATHEMATICS
Graphical Analysis. Why Graph Data? Graphical methods Require very little training Easy to use Massive amounts of data can be presented more readily Can.
Chapter Representing Motion 2.
Chapter 13 Statistics © 2008 Pearson Addison-Wesley. All rights reserved.
Dr. Richard Young Optronic Laboratories, Inc..  Uncertainty budgets are a growing requirement of measurements.  Multiple measurements are generally.
Ranjeet Department of Physics & Astrophysics University of Delhi Working with Origin.
Quantitative Skills 1: Graphing
Chapter 1: Functions & Models 1.2 Mathematical Models: A Catalog of Essential Functions.
Copyright © The McGraw-Hill Companies, Inc. Permission required for reproduction or display. 1 Part 4 Curve Fitting.
Worked examples and exercises are in the text STROUD (Prog. 28 in 7 th Ed) PROGRAMME 27 STATISTICS.
Chemical Kinetics CHAPTER 14 Part B
Copyright © The McGraw-Hill Companies, Inc. Permission required for reproduction or display. 1 Chapter 19.
Chapter 11 Descriptive Statistics Gay, Mills, and Airasian
Descriptive Statistics
© 2008 Pearson Addison-Wesley. All rights reserved Chapter 1 Section 13-6 Regression and Correlation.
Worked examples and exercises are in the text STROUD PROGRAMME 27 STATISTICS.
 Frequency Distribution is a statistical technique to explore the underlying patterns of raw data.  Preparing frequency distribution tables, we can.
Workshop 6 : Advanced Charts. Gantt Chart A Gantt chart is a horizontal bar chart often used in project management applications. Excel does not support.
Descriptive Statistics
Curve-Fitting Regression
Chapter 8 – Basic Statistics. 8.1 – Introduction to Basic Statistics.
Unit 1, Chapter 2 Integrated Science. Unit One: Forces and Motion 2.1 Using a Scientific Model to Predict Speed 2.2 Position and Time 2.3 Acceleration.
Dr. Serhat Eren Other Uses for Bar Charts Bar charts are used to display data for different categories where the data are some kind of quantitative.
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
Worked examples and exercises are in the text STROUD PROGRAMME 27 STATISTICS.
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall2(2)-1 Chapter 2: Displaying and Summarizing Data Part 2: Descriptive Statistics.
Histograms, Frequency Polygons, and Ogives. What is a histogram?  A graphic representation of the frequency distribution of a continuous variable. Rectangles.
Histograms, Frequency Polygons, and Ogives
Introduction to statistics I Sophia King Rm. P24 HWB
Statistical analysis and graphical representation In Psychology, the data we have collected (raw data) does not really tell us anything therefore we need.
Measurements and Their Analysis. Introduction Note that in this chapter, we are talking about multiple measurements of the same quantity Numerical analysis.
Worked examples and exercises are in the text STROUD PROGRAMME 27 STATISTICS.
STROUD Worked examples and exercises are in the text Programme 28: Data handling and statistics DATA HANDLING AND STATISTICS PROGRAMME 28.
S D.. In probability and statistics, the standard deviation is the most common measure of statistical dispersion. Simply put, standard deviation measures.
Educational Research Descriptive Statistics Chapter th edition Chapter th edition Gay and Airasian.
(Unit 6) Formulas and Definitions:. Association. A connection between data values.
Analyzing Data Week 1. Types of Graphs Histogram Must be Quantitative Data (measurements) Make “bins”, no overlaps, no gaps. Sort data into the bins.
Week 2 Normal Distributions, Scatter Plots, Regression and Random.
Plotting in Excel KY San Jose State University Engineering 10.
Working with data in Coach
Probability and Statistics
PROGRAMME 27 STATISTICS.
STATS DAY First a few review questions.
Sinusoidal Waveform Phasor Method.
HISTORICAL AND CURRENT PROJECTIONS
CHAPTER 1: Picturing Distributions with Graphs
Statistical Methods For Engineers
2.1 Normal Distributions AP Statistics.
9.5 Least-Squares Digital Filters
Presentation transcript:

Introduction to Data Analysis and Processing Technology Enhanced Inquiry Based Science Education

Data analysis and processing Once data is collected, you need tools to analyse or process the data. Data analysis usually refers to getting information from the data, for instance by looking in more detail (zooming), reading co- ordinates of points in the graph, or determine a slope of a tangent line to the graph. Data processing means that the data are worked in one way or another to produce new data.

Overview of tools Processing Select/Remove data Smooth Derivative Integral Function fit Signal Analysis Other tools Using formulas Analysis Zoom Scan Slope Area Statistical tools Statistics Histogram

Zoom If you want to see the data in more detail, you can use the Zoom function. An area of the diagram will be enlarged for closer inspection.

Scan If you need to read the co-ordinates of points on graphs (or other points in the diagram), use the option Scan. The co-ordinates are displayed in the box (top right corner). Example: scanning a position vs. time graph

Slope Slope gives the slope of the tangent at any point of a displayed graph. This is a measure of the rate with which a quantity changes, e.g. the speed of an object. Example: the rate at which a capacitor discharges at t=0,29s.

Area With Area you can determine the area between a displayed graph, the horizontal axis and two boundary lines. An area below the axis is negative. Example: Area under the graph of an induced EMF by a falling magnet. The area is a measure of the magnetic flux B.

Select/Remove data If your data set has spikes (erroneous measurements) or if part of the data is irrelevant, with Select/Remove Data you can both select single points or a range of data for removal or retention. Examples: Range: discharging a capacitor (cut off first horizontal part) (top). Points: keeping the lower envelope of the pressure variation in a blood pressure measurement (bottom).

Smooth If you have a rough or limited set of measured points you can use Smooth to approximate your data with a smooth curve or with a dataset consisting of more points. There are three smoothing methods: Moving average, Bezier, Spline. The smoothed graph can successively be processed further.

Smooth: Moving average Use Moving average to reduce noise and eliminate fluctuations in the graph. The smoothed graph has the same number of points and each point is replaced by the average of a number of neighbouring points. The Filter width parameter determines this number of points. Moving average is often used to highlight long-term trends and cycles. Example: CO 2 -measurement, Filter width determines the degree of filtering. Original graphFilter width = 1Filter width = 2 Original graphFilter width = 1Filter width = 10

Smooth: Bézier Use Bezier to create a smooth curve with more points then the original data set. The smoothed graph is forced through the first and the last original point. The intermediate points determine the degree of curvature of the smooth graph.

Smooth: Spline …..…. Use Spline to smooth a graph by means of a polynomial approximation of 5 th degree. A smoothing factor controls the trade off between fitting the raw data and minimizing the roughness of the approximation. For a lower value of smoothing factor the spline curve gets closer to the raw data. When its value is 0 the smoothing curve is a natural quintic spline curve through all original points. Spline is a powerful tool to deal with noisy data and for computation of smooth derivatives. Smoothing factor of resp. 0, 0.05, (auto) and 10,000.

Smooth: Derivative Derivatives are very important in science, as they are a measure of the rate of change of a quantity. They are used often to calculate the speed of processes, or the point where change is maximum. Differences method: direct calculation via differences between successive points (often noisy). Smooth method: the derivative is applied to a smooth spline function from the data. Example: determining the equivalence point of an acid-base titration (top: differences; bottom: smooth).

Function fit For verifying data against theory one usually wants to approximate the data with a standard mathematical function. Function fit is a procedure to make such approximation. A large number of standard mathematical functions are available. The coefficients of the fit function are determined using a least-squares method.

Function fit : linear fit Position vs. time data of a motion of a cart. A linear fit on the straight part of the graph gives the speed during this phase of the motion (v=14.7 cm/s).

Function fit : exponential fit A radioactive decay can be described by an exponential function a*exp(b*x)+c. By fitting the data to this function, one gets ‘a’ - the initial number of atoms and ‘b’ the parameter which is related to the speed of the decay process (the half-life time of the source).

Function fit : exponential fit The process of discharging of a capacitor can be described by an exponential function. The coefficient ‘a’ of the fit-function is related to the begin voltage of the capacitor, while ‘b’ is related to the ‘RC-time’, the time interval it takes to reach half of the begin voltage of the capacitor.

Signal Analysis If you have a sound signal consisting of a number of frequencies, e.g. a tone of a musical instrument, or a spoken or sung vowel, Signal Analysis can help you to analyse which frequencies are present or, in case of speech, which formants are present in the signal. There are four methods: Fourier Transform, Linear Prediction, R-ESPRIT, Prony.

Signal Analysis of sound beats R-Esprit Original waveform Prony Fourier transform

Signal Analysis: linear prediction Linear Prediction (LP) is suitable for analysing sound vibrations of the human voice. Example: two spectra from the vowel ‘a’ (as in ‘Cake’) sung one fifth apart (same male voice). The overall shape of the graphs share some characteristics resulting from resonances from the shape and cavities of the singer.

Statistics Use Statistics to display statistical information about your data. Example: statistical data from a data set of the EMF induced in a coil by a falling magnet. From the symmetry in the signal the statistical data confirm that the average value lies near 0.

Histogram With Histogram you can get a graphical representation of the distribution of your data. It indicates the number of data points that lie within a range of values called bins. The height of the bar equals the number of times the data point value fell within the bin. Example: The subject got 100 times a sound signal to react to as fast as possible. The histogram shows how the reaction times are distributed in time (e.g. 19 times between 0.2 and 0.3 s).

Using Formulas It is also possible to do calculations on the data by using formulas. You can use different kinds of arithmetical and mathematical functions. Data ranges with formulas are automatically recalculated. During a new run, the newly calculated values appear in real-time. Example: measuring p and V, and creating a diagram of V vs. 1/p (Boyle’s law).

Centre for Microcomputer Applications