# D1 Planning and collecting data

## Presentation on theme: "D1 Planning and collecting data"— Presentation transcript:

D1 Planning and collecting data
KS3 Mathematics The aim of this unit is to teach pupils to: Discuss a problem that can be addressed by statistical methods, and identify related questions to explore. Decide which data to collect and identify possible sources. Plan how to collect and organise the data and design suitable data collection sheets and tables. Collect and record data from primary and secondary sources. Communicate methods and results. Material in this unit is linked the Framework’s supplement of examples pp 248–255, 272 –274. D1 Planning and collecting data

D1.1 Planning a statistical enquiry
Contents D1 Planning and collecting data D1.1 Planning a statistical enquiry D1.2 Collecting data D1.3 Organizing data D1.4 Writing a statistical report

Specifying the problem
The first step in planning a statistical enquiry is to decide what problem you want to explore. This can be done by asking questions that you want your data to answer and by stating a hypothesis. For example, you may wish to investigate the lengths of words used in newspapers. Do different types of newspaper use different length words? We could ask: Discuss the first steps in planning a statistical enquiry. Define a hypothesis as a statement of something that you believe to be true but do not here any evidence to support. For example, we could hypothesize that tabloid newspapers use shorter words than broadsheet newspapers.

Specifying the problem
Related questions could include: Is there a link between the lengths of the words used and the lengths of the sentences for a particular newspaper? Is there a difference between the use of two- and three-letter words? A possible hypothesis could be: Tabloid newspapers use shorter words to appeal to a wider audience.

Deciding on the data The next step is to decide what data is needed and where it can be collected from. Data can be collected from a primary source or a secondary source. Data from a primary source is data that you have collected yourself, for example: From a survey or questionnaire of a sample of people. From an experiment involving observation, counting or measuring. Discuss posible sourses of secondary data. Data from a secondary source is data that you have collected from somewhere else including the Internet, reference books or newspapers.

Sources of data Decide whether the data shown is data that would come from an experiment, a survey, or from a secondary source.

Choosing the sample Once you have decided where the data will come from you must decide on the size and the type of the sample. How big should a sample be? The sample should be as large as possible. This will depend on the time and resources available. Discuss factors involved in selecting a representative sample group. If the sample size is too small, then the results will be unrepresentative.

Can you suggest a better sample?
Choosing the sample It is important that the sample is representative of the group you are investigating. Suppose, for example, that you wish to investigate the favourite sports of 11 to 15 year-olds. Would it be reasonable to question a sample of people outside a football ground following a game? Can you suggest a better sample? You would have to make sure that you ask equal numbers of girls and boys and that the sample is spread out across all age groups in the range. Discuss ways to ensure that a sample is representative. It would be unreasonable to ask a sample of people leaving a football ground for their favourite sport because most of these people would be football fans. If however we were investigating the eating habits of football fans this would be an ideal sample. When choosing a sample it is sensible to consider all the factors that may have an influence on the outcome. In this example, girls may prefer different sports to boys so either equal numbers of both sexes must be questioned or the data must be collected separately.

Choosing units If your statistical investigation involves measurement then you must decide what units to use and to what degree of accuracy. Suppose, for example, that you wish to investigate the relationship between age and height. How will you measure age? In weeks? In months? In years and months? In years? Discuss the fact that different units may be appropriate to different lines of enquiry. For example, if investigating the relationship of age and weight in newborn babies it would be appropriate to measure age in weeks or even days to takes account of the fact that their height changes very rapidly. How will you measure height? In centimetres? In inches? In metres?

Planning a statistical enquiry
Once you have decided on the purpose of the enquiry, the type of data that will be collected and where it will come from, the sample size and type, you can start the next stage which is to design a data collection sheet or questionnaire.

D1 Planning and collecting data
Contents D1 Planning and collecting data D1.1 Planning a statistical enquiry D1.2 Collecting data D1.3 Organizing data D1.4 Two-way tables

Collecting data Data can be collected using a questionnaire or a data collection sheet. A questionnaire is used when you wish to ask a sample of people a series of structured questions relevant to your line of enquiry. A data collection sheet or observation sheet is used when recoding results involving counting, measuring or observing. It can also be used to collect the answers to a few simple questions. Remind pupils that data that they collect themselves in a questionnaire or collection sheet is called primary data. Data can also be collected from secondary sources such as the Internet, newspapers or reference books.

Designing a questionnaire
It is important to design a questionnaire so that: People will co-operate and answer the questions honestly. The answers to the questions can be analysed and presented. When designing you own questionnaire you should try to follow these rules: 1) Provide an introduction, so that the person filling in the questionnaire knows the purpose of your enquiry. 2) Write questions in a sensible order, putting easier questions first.

Designing a questionnaire
3) Make sure that questions are not embarrassing or personal. For example, you need to think carefully about questions asking about age or income. How old are you? Do not ask : Tick one box for your age group. 15-20 21-25 26-30 31 + A better question is :

Designing a questionnaire
4) If possible, write questions so that they have a specific answer. Did you see the Olympics on TV ? For example : People could answer : Only the best bits No Sometimes Yes Once a day Not much

Designing a questionnaire
A better question would be: How much of the Olympics coverage did you watch? Tick one box only. None Less than 1 hour a day Between 1 to 2 hours a day More than 2 hours a day Every eventuality has been accounted for and the person answering the question cannot give another choice.

Designing a questionnaire
A scale can be used when asking for an opinion. For example, How would you rate the leisure facilities available in your local area? Tick one box only. Excellent Good Satisfactory Poor Unsatisfactory

Designing a questionnaire
5) Do not ask leading questions. For example, this question conveys a particular opinion, Don’t you agree that football is the best sport? A better question is : What one of the following sports do you like the best? football rugby tennis golf cricket boxing Stress that questions should not convey the questioner’s own opinion or indicate that one response would be preferable to another.

Suggest a better question
How much do you weigh? This is too personal, also some people don’t know their weight. A better question would be: Underweight Average weight Overweight Would you consider yourself to be: Add that if you needed to know peoples’ weights exactly then an experiment involving measuring and recording would be more appropriate than a questionnaire.

Suggest a better question
Most people use a deodorant, do you ? This is a leading question and may offend people. A more useful question would be: Which make of deodorant do you use ? Male: Female: Sure Impulse Dove Other None Lynx Adidas Slazenger Please circle any that apply.

Suggest a better question
How many books did you read last month? The intervals given overlap. Also, if a person has read more than 6 books there is nowhere to tick. A better question would be: How many books did you read last month? Tick one box. 0-2 3-5 6-8 8+ Also, tell pupils that the intervals, unless they are open, should be of equal size.

Trialling a questionnaire
Once you have written a questionnaire it is a good idea to try it out on a small sample of people. This is called a pilot survey. Note down their responses and use these to refine any questions that are causing difficulty. Do I use a tick or a cross to show the box I want? There isn’t a box to cover my answer. I don’t want to answer this question because it’s too personal. What does this question mean?

Designing a data collection sheet
A data collection sheet can be used to record data that comes from counting, observing or measuring. It can also be used to record responses to specific questions. For example, to investigate a claim that the amount of TV watched has an impact on weight we can use the following: age sex height (cm) weight (kg) hours of TV watched per week Point out that the headings used in the sheet should incluse units where appropriate.

Using a tally chart When collecting data that involves counting something we often use a tally chart. For example, this tally chart can be used to record peoples’ favourite snacks. favourite snack tally frequency crisps fruit nuts sweets 13 6 3 8 The tally marks are recorded, as responses are collected, and the frequencies are then filled in.

Using a tally chart Use this tally chart to collect data from the class. For example, to find out the size of each person’s family.

D1 Planning and collecting data
Contents D1 Planning and collecting data D1.1 Planning a statistical enquiry D1.2 Collecting data D1.3 Organizing data D1.4 Two-way tables

Using a frequency table
Once data has been collected it is often organized into a frequency table. For example, this frequency table shows the favourite take-away meals of a group of pupils: Favourite take-away Pizza Fish and chips Burgers Indian Frequency 11 7 8 5 Chinese Define frequency as the number of times something occurs. In this example, the frequency represents the number of pupils how prefer that take-away.

Categorical data is data that is non-numerical.
For example, favourite football team, eye colour, birth place. Sometimes categorical data can contain numbers. For example, favourite number, When we say that categorical data is non-numerical we mean that it is not counting anything or a measure of any thing. Ask pupils to suggest other examples. Categorical data can be displayed in a pie chart or bar graph. last digit in your telephone number, most used bus route.

Discrete and continuous data
Numerical data can be discrete or continuous. Discrete data can only take certain values. For example, shoe sizes, the number of children in a class, the number of sweets in a packet. Continuous data comes from measuring and can take any value within a given range. Ask pupils to give further examples. For example, the weight of a banana, the time it takes for pupils to get to school, the height of 13 year-olds.

Discrete or continuous data
Use this activity to review the difference between discrete and continuous data.

Grouping discrete data
A group of 20 people were ask how much change they were carrying in their wallets. These were their responses: 34p £1.72 83p £6.36 £4.07 £2.97 £3.53 6p £9.54 £1.68 50p 82p £7.54 £1.09 £2.81 £2.43 46p £1.70 £1.29 Each amount of money is different and the values cover a large range. This type of data is usually grouped into equal class intervals.

Choosing appropriate class intervals
When choosing class intervals it is important that they include every value without overlapping and are of equal size. For the following data: 34p £1.72 83p £6.36 £4.07 £2.97 £3.53 6p £3.54 £1.68 50p 82p £7.54 £1.09 £2.81 £2.43 46p £1.70 £1.29 We can use class sizes of £1: £ £1.00, £ £2.00, £ £3.00, £ £4.00, £ £5.00, Over £5. This is an open class interval.

Choosing appropriate class intervals
The size of the class intervals depends on the range of the data and the number of intervals required. For the following data: 34p £1.72 83p £6.36 £4.07 £2.97 £3.53 6p £3.54 £1.68 50p 82p £7.54 £1.09 £2.81 £2.43 46p £1.70 £1.29 Explain why class sizes of £5 would be inappropriate. Tell pupils that we should aim to have between five and ten class intervals, depending on the data. Could we use a class size of 20p?

Grouping continuous data
Continuous data is usually grouped into equal class intervals. What is wrong with the class intervals in this grouped frequency table showing lengths? 30 ≤ length 20 ≤ length < 30 10 ≤ length < 20 0 ≤ length < 10 Frequency Length (cm) 30 ≤ length 20 ≤ length ≤ 30 10 ≤ length ≤ 20 0 < length ≤ 10 Frequency Length (cm) The class intervals in this table overlap so that it would be possible to put 10 cm, 20 cm, and 30 cm in either of two groups. Click to reveal the table correctly. An alternative would be to have class intervals of: 0 < length ≤ 10 10 < length ≤ 20 20 < length ≤ 30 30 < length This is an open class interval.

Grouping continuous data
Continuous data is usually grouped into equal class intervals. What is wrong with the class intervals in this grouped frequency table showing weights? Weight (g) Frequency 0 < weight < 10 10 < weight < 20 20 < weight < 30 30 < weight Weight (g) Frequency 0 ≤ weight < 10 10 ≤ weight < 20 20 ≤ weight < 30 30 ≤ weight Note that 10 < weight < 20 means that the weights in this interval are greater than ten and less than 20. This interval does not include 10g or 20g. This time 10g, 20g, and 30g are not included in any of the class intervals. Click to reveal the table correctly. An alternative would be to have class intervals of: 0 < weight ≤ 10 10 < weight ≤ 20 20 < weight ≤ 30 30 < weight

Grouping continuous data
Continuous data is usually grouped into equal class intervals. What is wrong with the class intervals in this grouped frequency table showing times? 5:30 ≤ time < 6:00 4:00 ≤ time < 5:30 3:00 ≤ time < 4:00 2:30 ≤ time < 3:00 2:00 ≤ time < 2:30 Frequency Time (minutes:seconds) 4:00 ≤ time < 4:30 3:30 ≤ time < 4:00 3:00 ≤ time < 3:30 2:30 ≤ time < 3:00 2:00 ≤ time < 2:30 Frequency Time (minutes:seconds) The class intervals in this example are not of equal size. Click to reveal the table correctly.

Using two-way tables A two-way table can be used to organize two sets of data. For example, pupils from Years 7, 8 and 9 were asked what they usually did during their lunch break. This two-way table shows the results: Year 7 Year 8 Year 9 Eat school dinners 35 29 38 Eat a packed lunch 42 34 32 Eat at home 19 22 18 Ask questions to ensure that pupils are able extract information from a two-way table. For example, How many Year 9 pupils eat a packed lunch? How many pupils eat school dinners? How many pupils are there in Year 8? How many more Year 8 pupils go home for lunch than Year 9 pupils? What do most of the pupils asked do at lunch time?

D1.4 Writing a statistical report
Contents D1 Planning and collecting data D1.1 Planning a statistical enquiry D1.2 Collecting data D1.3 Organizing data D1.4 Writing a statistical report

The data collection cycle
The following diagram shows the stages needed to conduct a statistical enquiry. Specify the problem and plan Interpret and discuss the results Collect the data from a variety of sources Discuss the data collection cycle, giving examples for each stage. Discuss the fact that when we examine the result at the end of a survey, new questions that were not anticipated during the planning stage are thrown up. These can be investigated further and so the cycle begins again. Process and display the data

The data collection cycle
Use this activity to discuss the various stages involved in conducting a statistical enquiry.

Writing a statistical report
Once you have planned, collected and processed data relevant to a statistical enquiry you will often have to communicate your findings in the form of a report. A report should contain the following: An introduction stating the purpose of the survey and any initial conjectures which you plan to investigate. A description of what sources you were used including a justification of the type and size of any samples used. Use this and the following slide to summarize the main teaching points before pupils write the reports of their own statistical enquiries. Calculations, such as the mean, median and mode, the give an overall picture of the data.

Writing a statistical report
Tables or graphs of the results, using ICT as appropriate. (Remember to justify you choice of what is presented.) Problems or ambiguities that arose during the course of the investigation and how you dealt with them. A summery of the conclusions shown by the data, not forgetting to refer back to your initial conjecture. Sometimes your data will give results that you did not expect. These will lead to new lines of enquiry which you should investigate if possible.

Writing a statistical report
Collect the relevant data and write a statistical report investigating one of the following: The types of sports young people take part in outside of school hours. How pupils travel to school. The difference in word lengths used in men’s and woman’s magazines. Remind pupils to start by writing down any hypotheses or conjectures for the problem, including any related questions that might affect the results. They should also include a description of their sample. If there any measurements are to be collected, include the units that will be used and the degree of acuracy used. Any graphs that pupils decide to use should include explanations of why they have chosen them. The examples shown do not require the use of secondary data. Modify these examples as appropriate for the group. Use of mobile phones among teenagers. The relationship between handspan and foot length.