Presentation is loading. Please wait.

Presentation is loading. Please wait.

Datahantering Data handling

Similar presentations


Presentation on theme: "Datahantering Data handling"— Presentation transcript:

1 Datahantering Data handling
Eenheid 2: Bl. 456 Unit 2: P. 456

2 Hoofstappe van statistiese ondersoek Main steps of a statistical investigation
Formuleer vraag vir ondersoek Versamel data om die vraag te beantwoord. Prosesseer die data en stel dit voor (organiseer, som op en grafieke) Interpreteer en ontleed data Develop a question for investigation Collect data to answer the question Process and represent the data (organise, summarise and graphs) Interpret and analyse the data.

3 Formuleer ‘n vraag en versamel data: Formulate a question and collect data:
Stel vrae op wat beantwoord kan word deur versameling van data. Develop questions that can be answered by collecting data.

4 Metodes: Methods: Waarneming (Tel, aantekeninge)
Obeservation (Count, make notes) Onderhoude Gestruktureerde (Stel vrae word gebruik) Ongestruktureerde (vra word gevra en respons aangeteken. Interviews Structured (A set of questions is used) Unstructured (questions asked and response noted)

5 Surveys and questionnaires
Opnames en vraelyste ‘n opname behels vra wat gevra word om inligting uit te vind. Opname word ook gedoen om waar te neem of te tel. Surveys and questionnaires A survey involves asking questions to obtain information. Surveys can also be done to observe or to count. Sekondêre databronne Data wat jy nie self versamel nie, gebruik betroubare bronne. Bv. Koerante, tydskrifte, boeke en internet. Secondary data sources Data that you did not collect yourself. Use reliable sources. E.g. Newspaper , magazines, books and the internet.

6 Vraelyste: Questionnaires:
Word gebruik om data te versamel. Is used to collect data. Vrae moet duidelik wees. Questions must be clear. Inligting moet vir jou ondersoek handig wees. Informations must be useful for your investigation. Bewoording moet nie misleidend wees nie. Wording must not be misleading.

7 Huiswerk Homework Oef. 2.1 bl. 459 Exersise 2.1 p.459

8 Populasie & steekproefneming Population & sampling
Populasie: die groep van wie die data versamel word bv. Mense,diere ens. Population: The group from which data is collected. E.g. People, animals etc.

9 Steekproefneming Sampling
Steekproef: die deel van die populasie wat in ‘n ondersoek betrokke is. Sample: the portion of the population that is involved in an investigation. Die steekproef moet groot genoeg wees sodat dit verteenwoordigend van die populasie is. The sample size should be large enough to be a fair representation of the population.

10 Ewekansige steekproef: elke lid van die populasie het ‘n gelyke kans om gekies te word.
Random sampling: every member of the population has an equal chance of being chosen. Sydige steekproef: teenoorgestelde van ewekansige steekproef, verteenwoordig nie die populasie regverdig nie. Biased sampling: opposite of random sampling, not representing the population fairly.

11 Huiswerk Homework Oef. 2.2 bl. 460 Exersise 2.2 p.460

12 Klassifiseer en organiseer data: Classifying and organising data:
Katagoriese data word in katagorieë groepeer bv. Kleure, soorte ens. Categorical data is grouped into catagories e.g. colours, types etc. Numeriese data word volgens getalle geroepeer: Diskreet: Heelgetalle wat getel kan word. Kontinu: Kan enige waarde wees, gewoonlik meetbaar. Numerical data is grouped according to numbers: Discrete: Whole numbers that can be counted. Continuous: Can be any value, usually a measurement.

13 Frekwensietabelle Frequency tables
Frekwensieverdelingstabelle kan gebruik word om data te organiseer (met of sonder telmerkies) Frequency distribution tables can be used to organise data (with or without tally marks

14 Huiswerk Homework Oef. 2.6 bl. 467 Exersise 2.6 p.467

15 Opsomming van data Summarising data
Sentrale waardes/tendens: Gemiddelde, modus en mediaan Kan bereken word vanuit rou data (data wat nog nie georganiseer is nie) Central values/tendency: Mean, mode and median Can be calculated from raw data (data that has not been organised)

16 Gemiddelde Mean (average)
Word die meeste gebruik. Is used the most. Word ook deursnee waarde genoem. Is also called averages. Gemiddelde = 𝑠𝑜𝑚 𝑣𝑎𝑛 𝑡𝑒𝑙𝑙𝑖𝑛𝑔𝑠 𝑎𝑎𝑛𝑡𝑎𝑙 𝑡𝑒𝑙𝑙𝑖𝑛𝑔𝑠 Mean = 𝑠𝑢𝑚 𝑜𝑓 𝑠𝑐𝑜𝑟𝑒𝑠 𝑛𝑢𝑚𝑏𝑒𝑟 𝑜𝑓 𝑠𝑐𝑜𝑟𝑒𝑠

17 Bepaal gemiddeld van frekwensietabel af Calculate mean by using a frequency table
Frekwensieverdelingstabel Gemiddeld = 𝑡𝑜𝑡𝑎𝑎𝑙 𝑣𝑎𝑛 𝑓𝑥−𝑘𝑜𝑙𝑜𝑚 𝑡𝑜𝑡𝑎𝑎𝑙 𝑣𝑎𝑛 𝑓𝑟𝑒𝑘𝑤𝑒𝑛𝑠𝑖𝑒𝑘𝑜𝑙𝑜𝑚 Frequency distribution table Mean = 𝑡𝑜𝑡𝑎𝑙 𝑜𝑓 𝑡ℎ𝑒 𝑓𝑥 𝑐𝑜𝑙𝑢𝑚𝑛 𝑡𝑜𝑡𝑎𝑙 𝑜𝑓 𝑓𝑟𝑒𝑞𝑢𝑒𝑛𝑐𝑦 𝑐𝑜𝑙𝑢𝑚𝑛

18 Mediaan Median Is die middelste waarde in ‘n stel tellings wat in volgorde van grootte gerangskik.(stygende) As ewe aantal data is, deel laaste waardes deur 2. Is the middle value in a set of numbers when they are arranged in size order.(ascending) If it is an even number of data, divide the last values by 2

19 Modus Mode Telling of resultaat wat die meeste in die stel data voorkom. Kan meer as 1 modus hê As daar 2 modusse = bimodaal. Number or result that occurs most often in a set of data. Can have more than 1 mode. If there is 2 modes = bimodal.

20 Variasiewydte (Omvang) Range
Variasiewydte is ‘n waarde (spreidingsmaat) wat vir jou sê hoe die data verdeel is (hoe ver dit versprei is) Variasiewydte = hoogste waarde – laagste waarde The value that tells you how spread out the data is. Range = highest value – lowest value

21 Kies ‘n sentrale waarde Chose a central value
Tabel 2.1 bl. 474 Table 2.1 p. 474

22 Uitskieters Outliers Waarde wat baie hoër of laer is as die ander waardes. Value that is much higher or lower than the other values

23 Huiswerk Homework Oef. 2.7 & 2.8 bl. 475-477
Exersise 2.7 & 2.8 p


Download ppt "Datahantering Data handling"

Similar presentations


Ads by Google