Introduction to Statistical Computing in Clinical Research Biostatistics 212 Lecture 1.


Today...
Course overview
–Course objectives
–Course details: grading, homework, etc
–Schedule, lecture overview
Where does Stata fit in?
Basic data analysis with Stata
Stata demos
Lab

Course Objectives
Introduce you to using STATA and Excel for
–Data management
–Basic statistical and epidemiologic analysis
–Turning raw data into presentable tables, figures and other research products
Prepare you for Fall courses
Start analyzing your own data

Course Objectives
NOT Statistical theory
–You’ll get a bit today and later in the course, but we don’t focus on this.
–I’m a clinical epidemiologist, not a statistician

Course details
Biostats 212
–1 Unit Course
–Satisfactory/Unsatisfactory vs. Grades
–7 Sessions – Lecture + Lab
–Online Office Hours
–Online Forum

Course details
Course Teaching Staff
–Brief introductions
Mark Pletcher, MD MPH
Jen Cocohoba, PharmD, MAS
Mohammad Al Komser, MD
Sanoj Punnen, MD
Elizabeth Rogers, MD
Melissa Rosenstein, MD
Mandana Khalili, Barbara Grimes, Nancy Hills

Course details
Lectures
–Tuesdays 1:15-2:45, but most will be shorter
–Simulcast to 6704! (new this year)
  Rationale (why not move to a lecture hall?)
  30 second delay
  “Pop over” to 6702 to ask a question?
–Both didactic and “demos”, time for questions
–Jen gives last lecture – special format?

Course details
Recorded Lectures
–Audio + video of lecturer + video of screen
–Available same day for viewing
–Links posted on website syllabus

Course details
Labs
–Tuesdays 3:00-4:00 officially, but usually starts earlier
–6702 and 6704
–TAs staff from start to 4:00
–Lab instructors staff from 3:00-4:00
–Most important part of the course!

Course details
Online Office Hours:
–Thursdays, 8:00-9:30AM
–Jennifer Cocohoba (Asst Course Director) will lead
–Serves as the lab session for off-site (online only) students
–Will use GoToMeeting
  See instructions posted on the Syllabus
–Drop in if you need help with lab!

Course details
Forum
–Demo
–Post all questions here!
  TA turnaround time
–Before you post, see if it’s already there and answered
–Consider turning ON all your alerts around lab time?
–Quick demo

Course details
Course Requirements
–Hand in all six Labs (even if late)
–Satisfactory Final Project
Not required
–Reading
–Attendance

Course details
Grading (not relevant for all students)
–Letter grades: Standard cutoffs
  % A
  80-89% B
  70-79% C
  60-69% D
  <60% or Course Requirements not met: F
–Satisfactory/Unsatisfactory
  >80% Satisfactory

Overview of lecture topics
1- Introduction to STATA
2- Do files, log files, and workflow in STATA
3- Generating variables and manipulating data with STATA
4- Using Excel
5- Basic epidemiologic analysis with STATA
6- Making tables and figures with STATA
7- Advanced Programming Topics

Overview of labs
Lab 1 – Load a dataset and analyze it
Lab 2 – Learn how to use do and log files
Lab 3* – Import data from Excel, generate new variables and manipulate data, document everything with do and log files
Lab 4 – Using and creating Excel spreadsheets
Lab 5* – Epidemiologic analysis using Stata
Lab 6 – Making a figure with Stata
Last lab session will be dedicated to working on the Final Project
* - Labs 3 and 5 are significantly longer and harder than the others

Overview of labs, cont Official In-Person Lab time is 3:00-4:00 on Tuesday, but we will start right after lecture, and you can leave when you are done.

Overview of labs, cont
Labs are due the following week prior to lecture.
Labs turned in late (less than 1 week) will receive only half credit; after that, no points will be awarded.
However, ALL labs must be turned in to pass the class (even if no points are awarded).
Lab 1 is paper
Labs 2-6 are electronic files, and should be emailed to your section leader’s course address: (Melissa/Sanoj) or (Elizabeth/Mohammed)

Final Project
Create a Table and a Figure using your own data, document analysis using Stata.
Due 1 week after last lab session, 20 points docked for each 1 day late.
See 1-page description in Syllabus
Start looking for data!

Course Materials
Online Syllabus
–Lectures and Labs/Datasets (“just in time”)
–Miscellaneous handouts
–Final Project
–Short demo

Getting started with STATA Session 1

Types of software packages used in clinical research
Statistical analysis packages
Spreadsheets
Database programs
Custom applications
–Cost-effectiveness analysis (TreeAge, etc)
–Survey analysis (SUDAAN, etc)

Software packages for analyzing data
STATA
SAS
S-PLUS, and R
SPSS
SUDAAN
Epi-Info
JMP
MATLAB
StatXact

Why use STATA?
Quick start, user friendly
Immediate results, response
You can look at the data
Menu-driven option
Good graphics
Log and do files
Good manuals, help menu

Why NOT use STATA?
SAS is used more often?
SAS does some things STATA does not?
Programming easier with S-PLUS and R?
R is free
Complicated data structure and manipulation easier with SAS?
Epi-Info is free and even easier than STATA?

STATA – Basic functionality
Holds data for you
–Stata holds 1 “flat” file dataset only (.dta file)
Listens to what you want
–Type a command, press enter
Does stuff
–Statistics, data manipulation, etc
Shows you the results
–Results window
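As a rough illustration of this load, command, result cycle, the sketch below uses `auto`, an example dataset that ships with Stata. It is an assumption made for illustration, not the course data.

```stata
* Load one dataset into memory; "clear" drops whatever was loaded before
sysuse auto, clear

* Type a command, press Enter, and read the output in the Results window
count
summarize mpg
```

Because only one dataset is held at a time, loading a second file replaces the first, which is why `clear` appears on the load command.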

Demo #1
Open the program
Entering vs. loading data
Look at data
Run a command
Orient to windows and buttons

STATA - Windows
Two basic windows
–Command
–Results
Optional windows
–Variable list
–Properties
–History of commands
Other functions
–Data browser/editor
–Variables Manager
–Do file editor
–Viewer (for log, help files, etc)

STATA - Buttons
The usual – open, save, print
Log-file open/suspend/close
Do-file editor
Browse and Edit
Break

STATA - Menus
Almost every command can be accessed via menu → dialog box

Menu vs. Command line
Menu advantages
–Browse for commands you don’t know already
–See the options for each command in dialog boxes
–Good way to learn syntax for complex commands
Command line advantages
–MUCH faster
–ONLY way to write “do” files
  Document and repeat analyses
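To make the "document and repeat analyses" point concrete, here is a minimal sketch of a do file that also writes a log. The file names `lecture1_demo.do` and `lecture1_demo.log` are hypothetical, and the shipped `auto` dataset stands in for real study data.

```stata
* lecture1_demo.do (hypothetical file name, for illustration only)

* Interpret the commands below as Stata 12 would
version 12

* Close any log left open by an earlier run, then start a fresh text log
capture log close
log using lecture1_demo.log, replace text

* Shipped example data (an assumption, not the course dataset)
sysuse auto, clear
summarize price mpg
tabulate foreign

log close
```

Running `do lecture1_demo.do` from the Command window repeats the whole analysis and leaves a plain-text record of the commands and their output.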

Demo #2
Load a STATA dataset
–(intro to CARDIA)
Explore the data
Describe the data
Answer some simple research questions
–Variables: male sex, smoking, binge drinking, BMI and systolic blood pressure

STATA commands
Describing your data
describe [varlist]
–Displays variable names, types, labels
list [varlist]
–Displays the values of all observations
codebook [varlist]
–Displays labels and codes for all variables
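A short sketch of these three commands, assuming the `auto` example dataset that ships with Stata rather than the CARDIA data used in the demo:

```stata
sysuse auto, clear

* Variable names, storage types, and labels
describe make price mpg foreign

* Values for the first five observations only
list make price mpg in 1/5

* Codes, value labels, and missing-value counts
codebook foreign rep78
```

Restricting `list` with `in 1/5` keeps the output short; without it, every observation is printed.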

STATA commands
Descriptive statistics – continuous data
summarize [varlist] [, detail]
–# obs, mean, SD, range
–“, detail” gets you more detail (median, etc)
ci [varlist]
–Mean, standard error of mean, and confidence intervals
–Actually works for dichotomous variables, too.
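The sketch below tries these commands on the shipped `auto` dataset (an assumption for illustration). It sets `version 12` because the `ci varlist` form shown on the slide is the older syntax; to the best of my knowledge, Stata 14.1 and later write it as `ci means varlist` and `ci proportions varlist`.

```stata
* Use the Stata 12 syntax shown in the lecture
version 12
sysuse auto, clear

* Number of observations, mean, SD, min, max
summarize price mpg

* Adds percentiles (including the median), skewness, kurtosis
summarize price, detail

* Mean, standard error, and 95% confidence interval
ci mpg

* For a 0/1 variable, request a binomial confidence interval
ci foreign, binomial
```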

STATA commands
Graphical exploration – continuous data
histogram varname
–Simple histogram of your variable
graph box varlist
–Box plot of your variable
qnorm varname
–Quantile plot of your variable to check normality
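A minimal sketch of the three plots, again assuming the shipped `auto` dataset in place of the course data:

```stata
sysuse auto, clear

* Histogram of miles per gallon
histogram mpg

* Box plot of the same variable, split by a categorical variable
graph box mpg, over(foreign)

* Quantiles of mpg against quantiles of a normal distribution
qnorm mpg
```

Each command replaces the contents of the Graph window, so run them one at a time when exploring.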

STATA commands
Descriptive statistics – categorical data
tabulate [varname]
–Counts and percentages
–(see also, table - this is very different!)
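For instance, a one-way tabulation might look like the following, assuming the shipped `auto` dataset:

```stata
sysuse auto, clear

* Counts, percentages, and cumulative percentages
tabulate rep78

* Include missing values as their own row
tabulate rep78, missing

* Show the underlying numeric codes instead of value labels
tabulate foreign, nolabel
```

By default `tabulate` leaves missing values out of the table, so the `missing` option is worth trying on any new variable.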

STATA commands Analytic statistics – 2 categorical variables

tabulate [var1] [var2]
–“Cross-tab”
–Descriptive options: row (row percentages), col (column percentages)
–Statistics options: chi2 (chi-squared test), exact (Fisher’s exact test)
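A hedged sketch of a two-way table with these options, assuming the shipped `auto` dataset. The variable `highmpg` and its cutoff of 25 are made up here purely to get a second dichotomous variable.

```stata
sysuse auto, clear

* Hypothetical 0/1 variable: 1 if mpg is above 25, missing if mpg is missing
generate byte highmpg = mpg > 25 if !missing(mpg)

* Cross-tab with row and column percentages plus both tests
tabulate foreign highmpg, row col chi2 exact
```

The options are typed as bare words after the comma; the parenthetical notes on the slide describe what each one does and are not part of the syntax.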

Getting help
Try to find the command on the pull-down menus
Help menu
–If you don’t know the command – “Search...”
–If you know the command – “Stata command...”
Try the manuals
–PDF files with more detail, theoretical underpinnings, etc
–Accessed through the help menu
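The same help can also be reached by typing in the Command window. A small sketch, with the search phrase chosen only as an example:

```stata
* You know the command name: open its help file in the Viewer
help summarize
help tabulate twoway

* You do not know the command name: search by keyword
search t test
```

Help files generally link through to the corresponding PDF manual entry, so this is often a quick route to the longer documentation.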

STATA commands Analytic statistics – 1 categorical, 1 continuous

bysort catvar: summarize [contvar]
–mean, SD, range of contvar within each subgroup
ttest [contvar], by(catvar)
–t-test
oneway [contvar] [catvar]
–ANOVA
table [catvar] [, contents(mean [contvar] …)]
–Table of statistics
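A sketch of these four commands on the shipped `auto` dataset (an assumption for illustration). It sets `version 12` because, as far as I know, the `contents()` option of `table` belongs to the syntax used through Stata 16; Stata 17 and later use a different `table` syntax unless version control is applied.

```stata
* Use the Stata 12 syntax shown in the lecture
version 12
sysuse auto, clear

* Mean, SD, and range of mpg within each level of foreign
bysort foreign: summarize mpg

* Two-sample t-test comparing mean mpg between the two groups
ttest mpg, by(foreign)

* One-way ANOVA of mpg across repair-record categories
oneway mpg rep78, tabulate

* Table of chosen statistics by group
table foreign, contents(n mpg mean mpg sd mpg)
```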

STATA commands Analytic statistics – 2 continuous

scatter [var1] [var2]
–Scatterplot of the two variables
pwcorr [varlist] [, sig]
–Pairwise correlations between variables
–“sig” option gives p-values
spearman [varlist] [, stats(rho p)]
lowess yvar xvar
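A brief sketch of these commands, assuming the shipped `auto` dataset with `mpg` as the outcome and `weight` as the predictor:

```stata
sysuse auto, clear

* Scatterplot: first variable on the y axis, second on the x axis
scatter mpg weight

* Pearson correlations with p-values
pwcorr mpg weight price, sig

* Spearman rank correlation, reporting rho and its p-value
spearman mpg weight, stats(rho p)

* Scatterplot with a lowess smoothed curve overlaid
lowess mpg weight
```

Comparing the `pwcorr` and `spearman` results alongside the lowess curve is a quick way to see whether a straight-line summary is reasonable.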

In Lab Today…
Expect some chaos!
–IT will be here to help with wireless, logins, etc
–All ATCR and MAS students need logins for our network
Familiarize yourself with Stata
Load a dataset
Use Stata commands to analyze data and fill in the blanks

Next week
Do files, log files, and workflow in Stata
Start looking for a dataset

Website addresses
Course website –
Computing information –
Download RDP for Macs (for Stata Server) –
Citrix Web Server –
Stata 12 Server –