Software for data management: The contribution of Stata

Slides:



Advertisements
Similar presentations
Dr. Engr. Sami ur Rahman Data Analysis Lecture 6: SPSS.
Advertisements

Data Analysis using SPSS By Dr. Shaik Shaffi Ahamed Ph. D
Stata and logit recap. Topics Introduction to Stata – Files / directories – Stata syntax – Useful commands / functions Logistic regression analysis with.
Variables 9/10/2013. Readings Chapter 3 Proposing Explanations, Framing Hypotheses, and Making Comparisons (Pollock) (pp.48-58) Chapter 1 Introduction.
Teaching Statistics Using Stata Software Susan Hailpern BSN MPH MS Department of Epidemiology and Population Health Albert Einstein College of Medicine.
Andrea Meld, PhD Data Analyst, OSPI
Teaching Stata—some reflections after 8 years of training experiences Karen Robson York University.
Introduction to SPSS Allen Risley Academic Technology Services, CSUSM
Software for data management: The contribution of Stata Dr Karen Robson, Senior Research Fellow, The Geary Institute, University College Dublin, Ireland.
Srinivasulu Rajendran Centre for the Study of Regional Development (CSRD) School of Social Sciences (SSS) Jawaharlal Nehru University (JNU) New Delhi -
Ann Arbor ASA ‘Up and Running’ Series: SPSS Prepared by volunteers of the Ann Arbor Chapter of the American Statistical Association, in cooperation with.
Introduction to Statistical Computing in Clinical Research Biostatistics 212 Course director: Mark Pletcher Teaching Assistant: Lee Zane.
1 SPSS Recently it has gone through a name change so your icon on your computer may be under a different name (i.e. PASW- Predictive Analytics SoftWare).
Good Data Management Practices Patty Glynn 10/31/05
Technological Innovation in Teaching Epidemiology and Statistics Supported in part by the National Science Foundation DUE Erika Friedmann & Mark.
Everything I wish I had known about research design and data analysis… Statlab Workshop Fall 2006 Kyle Hood and Frank Farach.
SPSS 1: An Introduction to the Statistical Package SPSS Suzie Cro MRC Clinical Trials Unit.
SPSS Statistical Package for the Social Sciences is a statistical analysis and data management software package. SPSS can take data from almost any type.
RESEARCH HUB AT THE UNIVERSITY LIBRARIES PENN STATE UNIVERSITY TOUR OF STATISTICAL PACKAGES.
Basic Unix Dr Tim Cutts Team Leader Systems Support Group Infrastructure Management Team.
Introduction to SPSS Short Courses Last created (Feb, 2008) Kentaka Aruga.
Managing Your Own Data (…if you have to) Kathryn A. Carson, Sc.M. Senior Research Associate Department of Epidemiology Johns Hopkins Bloomberg School of.
Multilevel Modeling Using HLM and MLwiN Xiao Chen UCLA Academic Technology Services.
© Paradigm Publishing Inc. 4-1 Chapter 4 System Software.
Chapter 4 System Software.
 Overview of SPSS  Interface  Getting Started  Managing Data  Descriptive Statistics  Basic Analysis  Additional Resources.
Introduction to SPSS Edward A. Greenberg, PhD
Standard Grade Computing System Software & Operating Systems.
Irwin/McGraw-Hill © Andrew F. Siegel, 1997 and l Chapter 4 l Introduction to Statistical Software Package 4.1 Data Input 4.2 Data Editor 4.3 Data.
Introduction to STATA for Clinical Researchers Jay Bhattacharya August 2007.
33 CHAPTER General- Purpose APPLICATION SOFTWARE.
© Paradigm Publishing Inc. 4-1 OPERATING SYSTEMS.
Introduction to SPSS. Object of the class About the windows in SPSS The basics of managing data files The basic analysis in SPSS.
Introduction to Statistical Computing in Clinical Research Biostatistics 212.
Dr. Engr. Sami ur Rahman Research Methods in Computer Science Lecture: Data Analysis (Introduction to SPSS)
Analysis Introduction Data files, SPSS, and Survey Statistics.
© Paradigm Publishing, Inc. 4-1 Chapter 4 System Software Chapter 4 System Software.
1.Introduction to SPSS By: MHM. Nafas At HARDY ATI For HNDT Agriculture.
THE WINDOWS OPERATING SYSTEM Computer Basics 1.2.
Computer Operating Systems And Software applications.
IENG-385 Statistical Methods for Engineers SPSS (Statistical package for social science) LAB # 1 (An Introduction to SPSS)
With the support of the LPP programme of the European Union 1 This project has been funded with support from the European Commission. This publication.
Introduction to R Dr. Satish Nargundkar. What is R? R is a free software environment for statistical computing and graphics. It compiles and runs on a.
Understanding SPSS II Workshop Series August 9, 2016.
System SOFTWARE.
A quick guide to other statistical software
Chapter 5 Operating Systems.
An Introduction to Epi Info 6/7
Stata Statistical Data Analysis Application Software
A very brief introduction to R
Introduction to Statistical Software Package
Analyze ICD-10 Diagnosis Codes with Stata
Introduction to SPSS.
By Dr. Madhukar H. Dalvi Nagindas Khandwala college
SPSS Assignment Help. Sage-Fox.com Free PowerPoint Templates SPSS is an abbreviation to Statistical Package for Social Science. It’s a windows based software.
CHAPTER 2 Computer Software.
DEPARTMENT OF COMPUTER SCIENCE
Introduction to R Studio
TexPREP Summer Camp Computer Science
Chapter 3 Software Interfaces.
ECONOMETRICS ii – spring 2018
SPSS Overview COM 631/731.
Technological Innovation in Teaching Epidemiology and Statistics
Today’s Beginner Workshop
Look ahead: Advanced modelling and learning resources
Chapter Overview Operating System Basics
Objectives This is an introduction to the statistical software STATA aiming at: Preparing the participants in STATA basics (interphase and commands) for.
Presentation, data and programs at:
Stata Basic Course.
An Introduction to SPSS
Presentation transcript:

Software for data management: The contribution of Stata Dr Karen Robson, Senior Research Fellow, The Geary Institute, University College Dublin, Ireland

Getting acquainted with Stata StataCorp develops and distributes Stata, software for statistical analysis. Stata is available for Windows, Macintosh, and Unix computers. Stata is used by medical researchers, biostatisticians, epidemiologists, economists, sociologists, political scientists, geographers, psychologists, social scientists, and other research professionals needing to analyze data. Gaining popularity in the social and medical sciences Particularly useful for handling large-scale longitudinal data

Stata SE (for large data sets) can analyze datasets with as many as 32,766 variables, and the only limit on observations is the amount of RAM on your computer can handle string variables with a maximum length of 244 characters can handle matrices up to 11,000 x 11,000. requires at least 512 megabytes of RAM and 80 megabytes of disk space

Stata/Intercooled (the standard one) can analyze datasets with as many as 2,047 variables, and the only limit on observations is the amount of RAM on your computer can handle string variables with a maximum length of 244 characters can handle matrices up to 800 x 800.

Small Stata A smaller, student version of Stata (for educational purchases only)

Stata MP The fastest version of Stata (for dual-core and multicore/multiprocessor computers) Stata/MP is the fastest and largest version of Stata.

Resources StataCorp website (www.stata.com)

Resources StataCorp website (www.stata.com) Timberlake website (www.timberlake.co.uk)

Resources StataCorp website (www.stata.com) Timberlake website (www.timberlake.co.uk) UCLA Stata “portal” (http://www.ats.ucla.edu/stat/)

Resources StataCorp website (www.stata.com) Timberlake website (www.timberlake.co.uk) UCLA Stata “portal” (statcomp.ats.ucla.edu/stata) Statalist (www.hsph.harvard.edu/statalist)

Resources StataCorp website (www.stata.com) Timberlake website (www.timberlake.co.uk) UCLA Stata “portal” (statcomp.ats.ucla.edu/stata) Statalist (www.hsph.harvard.edu/statalist) Stata Journal (www.stata-journal.com)

As well, available Dec 2008

Launching Stata OS contingent Default window preferences Window preferences fully adjustable Auto memory set

Comparing with SPSS Start up differences

Comparing with SPSS Start up differences With data file open

Comparing with SPSS Start up differences With data file open Viewing data data viewer, data editor

Comparing with SPSS Start up differences With data file open Viewing data data viewer, data editor Viewing variables

Comparing with SPSS Start up differences With data file open Viewing data data viewer, data editor Viewing variables Viewing output/commands output window buffer, log files

Comparing with SPSS Start up differences With data file open Viewing data data viewer, data editor Viewing variables Viewing output/commands output window buffer, log files Syntax and “do files”

Variable window INPUT Stata command window Do file Pull-down menu Review window Computation RESULTS Output window Log file

Advantages and disadvantages of Stata User driven Free STBs Dedicated journal Web active Memory requirements Backward compatible Change! SPSS dominance Orientated to writing syntax/code Pull-down windows debate! Now in version 8 forward

Advantages and disadvantages of Stata Easier code Easier data handling Clarity of operations/ feedback Results table function Before version 8, limited graphics Now, complex graphics Variable labelling Editing of output

Advantages and disadvantages of Stata Copy and paste Nested/master do files Flexible terminology Setting types of data Interactive help Switch output (log file) on/off

Overview of analytic techniques Too numerous to mention! Comprehensive manuals A selection: All types of regression Survey package Epidemiological package Multilevel modelling Time series functions Cluster analysis

Data Data files .dta Stat/Transfer software

Stata – using wide and long file formats Wide file formats (everything you add goes to the right of the existing data) Long file formats (everything you add goes underneath the existing data)

MERGE Data 1 Data 2 APPEND

Data 1 (indi) ‘master’ Data 2 (indj) ‘using’ _merge values 1 3 2