Universal and Mass Customization of Tables in Stata Roy Wada University of Illinois at Chicago.

Purpose of this presentation
– Strong demand for a universal approach to systematic table-making in Stata
– The strangest advice seems to come from people with no background in empirical research
– Latest vaporware features in outreg2

Recent complaints about table-making
– “With eight dimensions it is difficult to see how any program could cope with line lengths ~ 100 (rather than say ~ 1000) and not produce an awful mess one way or the other.” - Tue, 21 Apr 2009
– “But there is a point where it no longer makes sense for official Stata or ssc contributions to support complicated structures that are only needed every once in a while...” - Wed, 11 Nov 2009
– “Stata has *no* ability to do this (though I assume one could program it, but I doubt it would be easy)” - Thu, 13 Jan 2011

Universal table-making
There is no important difference between the various types of tables
– Regression tables & summary statistics are merged on conditions (which happen to be variable names)
– Cross-tabulation (Stub-and-Banner)
  – Tabulation is merely conditional counting
  – Cross-tabulation is conditional counting merged on conditions
  – Stub-and-Banner happens to be a particular type of conditional counting
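The claim that cross-tabulation is conditional counting merged on conditions can be sketched in a few lines. This is a language-neutral illustration in Python, not Stata code and not how outreg2 is implemented; the toy data and the names `tabulate`, `crosstab`, `stub`, and `banner` are assumptions made up for the example.

```python
from collections import Counter

# Hypothetical toy data: (origin, repair record) for six cars.
rows = [
    ("domestic", 3), ("domestic", 4), ("foreign", 4),
    ("foreign", 5), ("domestic", 3), ("foreign", 4),
]


def tabulate(rows, condition):
    """One-way tabulation: count rows for each value of condition(row)."""
    return Counter(condition(r) for r in rows)


def crosstab(rows, stub, banner):
    """Cross-tabulation: one conditional count per banner value,
    merged row by row on the stub value."""
    banner_values = sorted({banner(r) for r in rows})
    stub_values = sorted({stub(r) for r in rows})
    # Each column is an ordinary tabulation restricted to one banner value.
    columns = {
        b: tabulate([r for r in rows if banner(r) == b], stub)
        for b in banner_values
    }
    # Merge the columns on the stub value; a Counter returns 0 for absent keys.
    return {s: {b: columns[b][s] for b in banner_values} for s in stub_values}


def origin(row):
    return row[0]


def repair(row):
    return row[1]


one_way = tabulate(rows, origin)
table = crosstab(rows, stub=repair, banner=origin)
for stub_value, cells in table.items():
    print(stub_value, cells)
```

The only difference between the one-way and two-way cases is the merge step, which is the sense in which the table types are treated as one problem.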

Mass customization
– “Mass customization … is the use of flexible computer-aided manufacturing systems to produce custom output” (Wikipedia 07/14/2011)
– A solution was to do it column by column (the original outreg by John Gallup), which had been criticized as somehow nontechnical or outdated
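The column-by-column idea can be sketched as follows: each new set of estimates is appended as one column and merged into the existing table on variable names. This is a minimal Python illustration under stated assumptions, not the actual outreg or outreg2 code; `append_column`, `render`, the variable names, and all coefficient values are hypothetical.

```python
def append_column(table, title, estimates):
    """Append one column of estimates, merging on variable names.

    table maps variable name -> {column title: value}.
    """
    for name, value in estimates.items():
        table.setdefault(name, {})[title] = value
    return table


def render(table, titles, missing=""):
    """Render the table as tab-delimited text, one row per variable."""
    lines = ["\t".join(["VARIABLES"] + titles)]
    for name, cells in table.items():
        values = [
            format(cells[t], ".3f") if t in cells else missing
            for t in titles
        ]
        lines.append("\t".join([name] + values))
    return "\n".join(lines)


# Made-up coefficients for two hypothetical regressions.
table = {}
append_column(table, "(1)", {"x1": 0.5, "const": 1.0})
append_column(table, "(2)", {"x1": 0.3, "x2": 0.25, "const": 0.8})

output = render(table, ["(1)", "(2)"])
print(output)
```

A variable that appears only in a later column simply gets a new row with blanks in the earlier columns, so the table can grow one regression at a time without being redesigned. Tab-delimited text is used here because it opens directly in a spreadsheet.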

Some issues with previous efforts
Many good programs exist (outreg, estout, parmest, xml_tab, tabout, etc.)
General commentary:
– Rube Goldberg syndrome: over-engineered non-solutions for performing “simple” tasks
– Mata-based programs are expensive and generally neither extensible nor easily upgraded
– Wrapper-based programs suffer from the existential question, i.e. why didn’t Stata Corporation do it that way in the first place
– Some programs were clearly designed by people with no background in empirical research

Desirable functionality
– What researchers wanted was quantity of tables, not quality (hundreds of regressions per day)
– Tables are rarely made for publication purposes (most never get published)
– Exact formatting often gets destroyed by the journal typesetters anyway
– Excel is a fact of life: virtually every researcher uses it
– LaTeX is not that popular: just look at the working papers floating around, which were clearly produced with something else

Latest vaporware in outreg2
– Already does cross-tabulation
– Extending it to include summary stats is straightforward
– The sideways-transpose operation doubles the number of available table formats
– Some minor tweaks and bug fixes
– Probably by the beginning of August 2011

Terms of use (fair use)
– outreg2 implements tasks previously unknown in Stata or described as nearly impossible.
– It is provided as a professional courtesy.
– I strongly object to re-publication of this work under false pretenses.
– Plagiarism is unethical, unprofessional, and academically dishonest, and egregious cases should be publicized as a deterrent.