The increasing role of data science in undergraduate statistics programs: new guidelines, new opportunities, and new challenges Nicholas Horton,

Presentation transcript:

The increasing role of data science in undergraduate statistics programs: new guidelines, new opportunities, and new challenges Nicholas Horton, American Statistical Association Education Program Webinar February 3, 2015

Guidelines for undergraduate statistics programs
– While we wait, please download the report: http:// delines.cfm
– You are encouraged to submit questions for the discussion to follow the presentation

Thanks to workgroup members
– Beth Chance (Cal Poly San Luis Obispo)
– Stephen H. Cohen (National Science Foundation)
– Scott Grimshaw (Brigham Young University)
– Johanna Hardin (Pomona College)
– Tim Hesterberg (Google)
– Roger Hoerl (Union College)
– Nicholas Horton (Amherst College, chair)
– Chris Malone (Winona State University)
– Rebecca Nichols (American Statistical Association)
– Deborah Nolan (University of California, Berkeley)

Additional thanks
– ASA President Nat Schenker
– Megan Murphy, Val Nirala, and Sara Davidson for their graphic design work
– Steve Pierson and Jeff Myers for their valuable contributions
– Many others who provided critically important feedback and suggestions

Source: NSF IPEDS

Growth and demand
– A McKinsey & Company report stated that “by 2018, the United States alone could face a shortage of 140,000 to 190,000 people with deep analytical skills as well as 1.5 million managers and analysts with the know-how to use the analysis of big data to make effective decisions”
– A large number of those workers will be at the bachelor's level
– How do we ensure that they have appropriate training to be successful?

Math Sciences in 2025 report
– “Two major drivers of increased reach: ubiquity of computational simulations … and exponential increases in the amount of data available” (p. 6)
– “Scientific computing pursued in non-unified way” (p. 9)

Committee on the Undergraduate Program in Mathematics (CUPM) 2015
Cognitive Recommendation 3: Students should learn to use technological tools. Mathematical sciences major programs should teach students to use technology effectively, both as a tool for solving problems and as an aid to exploring mathematical ideas. Use of technology should occur with increasing sophistication throughout a major curriculum.

CUPM 2015
Content Recommendation 3: Mathematical sciences major programs should include concepts and methods from data analysis, computing, and mathematical modeling. Students often face quantitative problems to which analytic methods do not apply. Solutions often require data analysis, complex mathematical models, simulation, and tools from computational science.

Why is computing so important? Motivating example
– Setting: Let A, B, and C be independent random variables, each uniformly distributed on the interval [0,1].
– Question: What is the probability that the roots of the quadratic equation Ax^2 + Bx + C = 0 are real?
– Source: Rice, Mathematical Statistics and Data Analysis, third edition, exercise 3.11 [also in first and second editions]

The analytic solution
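
The worked solution shown on this slide is not reproduced in the transcript. What follows is a sketch of one standard derivation under the stated setting (A, B, C independent and uniform on [0,1]), not necessarily the presenter's own steps. The roots are real when the discriminant B^2 - 4AC is non-negative; the intermediate variable t is introduced here only for illustration.

```latex
% Step 1: distribution of the product AC for 0 < t <= 1
\begin{align*}
P(AC \le t) &= \int_0^1 P\!\left(C \le \tfrac{t}{a}\right) da
             = \int_0^t 1\, da + \int_t^1 \frac{t}{a}\, da
             = t - t \ln t .
\end{align*}

% Step 2: condition on B = b, so that t = b^2/4 lies in [0, 1/4]
\begin{align*}
P(B^2 - 4AC \ge 0)
  &= \int_0^1 P\!\left(AC \le \tfrac{b^2}{4}\right) db
   = \int_0^1 \frac{b^2}{4}\left(1 - \ln\frac{b^2}{4}\right) db \\
  &= \frac{1}{12} + \frac{\ln 4}{12} - \frac{1}{2}\int_0^1 b^2 \ln b \, db
   = \frac{1}{12} + \frac{\ln 2}{6} + \frac{1}{18} \\
  &= \frac{5}{36} + \frac{\ln 2}{6} \approx 0.2544 .
\end{align*}
```

The last step uses \int_0^1 b^2 \ln b \, db = -1/9, which follows from integration by parts.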


Rice example: empirical problem solving
– Straightforward to simulate in R (noting that roots will be real only if the discriminant is non-negative)
– Rice reports the answer as 1/9 (in all three editions!), but simulation gives a probability of approximately 0.254
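
The R code from the slide is not part of the transcript. As a stand-in, here is a minimal Python sketch of the same kind of Monte Carlo check; the function name, trial count, and seed are illustrative assumptions, not taken from the presentation. It draws A, B, and C uniformly on [0,1], counts how often the discriminant B^2 - 4AC is non-negative, and compares the result with the closed-form value (5 + 6 ln 2)/36.

```python
import math
import random


def estimate_real_root_probability(n_trials=200_000, seed=2015):
    """Monte Carlo estimate of P(B^2 - 4AC >= 0) for A, B, C ~ Uniform[0, 1].

    n_trials and seed are arbitrary illustrative choices.
    """
    rng = random.Random(seed)  # seeded generator so the run is reproducible
    hits = 0
    for _ in range(n_trials):
        a, b, c = rng.random(), rng.random(), rng.random()
        # Roots are real exactly when the discriminant is non-negative.
        if b * b - 4 * a * c >= 0:
            hits += 1
    return hits / n_trials


estimate = estimate_real_root_probability()
exact = 5 / 36 + math.log(2) / 6  # closed-form value, about 0.2544

print(f"simulated estimate: {estimate:.4f}")
print(f"closed-form value:  {exact:.4f}")
print(f"textbook answer:    {1 / 9:.4f}")
```

With this many trials the standard error of the estimate is about 0.001, so the simulated value should land close to 0.254 and far from 1/9 (about 0.111).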

Why is computing so important?
– Math Sciences 2025: “The ability to simulate a phenomenon is often regarded as a test of our ability to understand it” (p. 74)
– Implication: it’s hard to get probability problems wrong if you can check them in this manner
– Still useful to be able to get the correct answer (and not just an approximation)
– Goal: develop parallel empirical and analytical problem-solving skills

Undergraduate guidelines (endorsed 2014)

Executive summary: solve real-world problems
– Increased importance of data-related skills in modern practice
– More emphasis on teamwork, communications, and related experiences (e.g., internships, REUs, and capstones)
– Motivation: other disciplines have staked their claim
– As statisticians, we run the risk of becoming irrelevant if we don’t aggressively engage

Key skills
– Effective statisticians at any level display an integrated combination of skills (statistical theory, application, data and computation, mathematics, and communication)
– Students need scaffolded exposure to develop connections between statistical concepts/theory and their application to statistical practice
– Programs should provide their students with sufficient background in each of these areas

Curriculum for statistics majors
– Statistical method and theory
– Data-related topics and computation
– Mathematical foundation
– Statistical practice

Statistical method and theory
– Statistical theory (e.g., distributions of random variables, likelihood theory, point/interval estimation, hypothesis tests, decision theory, Bayesian methods, and resampling)
– Exploratory and graphical data analysis
– Design of studies (e.g., random assignment, random selection, data collection, and efficiency) and issues of bias, causality, confounding
– Statistical models (e.g., variety of linear and non-linear parametric, semi-parametric, and non-parametric regression models)

Key changes: more diverse models/approaches
– The expectations for statistical modeling go far beyond a second course in statistics
– Students need exposure and practice with a variety of predictive and explanatory models
– Need to refine methods for model building and assessment
– Need to understand design, confounding, and bias
– Need to be able to apply their knowledge of theoretical foundations to the sound analysis of data

Mathematical foundation
– The study of mathematics lays the foundation for statistical theory
– Undergraduate statistics majors should have a firm understanding of why and when statistical methods work
– They should be able to communicate in the language of mathematics and explain the interplay between mathematical derivations and statistical applications

Mathematical foundation (cont.)
– Calculus (e.g., integration and differentiation)
– Linear algebra (e.g., matrix manipulations, linear transformations, projections in Euclidean space, eigenvalues/eigenvectors, and matrix decompositions)
– Probability (e.g., properties of univariate and multivariate random variables, discrete and continuous distributions)
– Emphasis on connections between concepts in these mathematical foundation courses and their applications in statistics (e.g., Markov chains)

Key changes: importance of data science
– Working with data requires extensive computing skills far beyond those described in the previous guidelines
– Students need facility with professional statistical analysis software, the ability to access and “wrangle” data in various ways, and the ability to utilize algorithmic problem-solving
– Students need to be fluent in higher-level languages and facile with database systems

Data-related topics
– Use of one or more professional statistical software environments
– Data analysis skills undertaken in a well-documented and reproducible manner
– Basic programming concepts (e.g., breaking a problem down into modular pieces, algorithmic thinking, structured programming, debugging, and efficiency)
– Computationally intensive statistical methods (e.g., iterative methods, optimization, resampling, and simulation/Monte Carlo methods)

Key changes: ability to communicate
– Students need to be able to communicate complex statistical methods in basic terms to managers and other audiences and visualize results in an accessible manner
– They need a clear understanding of ethical standards
– Programs need to provide multiple opportunities to refine these statistical practice skills

Statistical practice
– Effective technical writing, presentation skills, and visualizations
– Practice with teamwork and collaboration
– Ability to interact with and communicate with a variety of clients and collaborators

Recommendations for minors
– Hard to meet all of these guidelines for a major program!
– Key focus for minor programs:
  – General statistical methodology
  – Statistical modeling (e.g., simple and multiple regression, confounding, diagnostics)
  – Facility with professional statistical software, along with data management skills
  – Multiple experiences analyzing data and communicating results

Recommendations at the core of the guidelines
– Students need to be able to “think with data” (Lambert)
– Need multiple opportunities to analyze messy data using modern statistical practices
– Key theoretical concepts (design and confounding!) need to be integrated with data preparation, analysis, and interpretation
– Mathematical techniques play a lesser role (still important for people planning doctoral work in theoretical statistics)

Next steps
– Faculty development
– Engagement with two-year colleges
– Surveys of graduates and employers
– Certification/accreditation pathway
– Multiple pathways for introduction to statistics
– Periodic review
