Parallel Processing in SAS CPUCOUNT A comparison of Proc Means for the Project.

Slides:



Advertisements
Similar presentations
Timothy Forsyth Ashok Viswanathan Debbie McCullough Donn Garvert.
Advertisements

SAS Programming:File Merging and Manipulation. Reading External Files (review) data barf; * create the dataset BARF; infile ’s:\mysas\Table7.1'; * open.
S ORTING WITH SAS L ONG, VERY LONG AND LARGE, VERY LARGE D ATA Aldi Kraja Division of Statistical Genomics SAS seminar series June 02, 2008.
Path Analysis SAS/Calis. Read in the Data options formdlim='-' nodate pagno=min; TITLE 'Path Analysis, Ingram Data' ; data Ingram(type=corr); INPUT _TYPE_.
SAS Programming: Working With Variables. Data Step Manipulations New variables should be created during a Data step Existing variables should be manipulated.
Today: Run SAS programs on Saturn (UNIX tutorial) Runs SAS programs on the PC.
Mean Comparison With More Than Two Groups
Two-Way ANOVA in SAS Multiple regression with two or
Understanding SAS Data Step Processing Alan C. Elliott stattutorials.com.
Data Cleaning 101 Ron Cody, Ed.D Robert Wood Johnson Medical School Piscataway, NJ.
Topics in Data Management SAS Data Step. Combining Data Sets I - SET Statement Data available on common variables from different sources. Multiple datasets.
Data Preparation for Analytics Using SAS Gerhard Svolba, Ph.D. Reviewed by Madera Ebby, Ph.D.
Welcome to SAS…Session..!. What is SAS..! A Complete programming language with report formatting with statistical and mathematical capabilities.
Week 3 Topic - Descriptive Procedures Program 3 in course notes Cody & Smith (Chapter 2)
1 Chapter 3: Macro Definitions 3.1 Defining and Calling a Macro 3.2 Macro Parameters 3.3 Macro Storage (Self-Study)
Math – Getting Information from the Graph of a Function 1.
SAS SQL SAS Seminar Series
SAS Macros ® 101 How I learned to stop worrying and love macros Alex Chaplin BCS USA Section.
SAS PROC REPORT PROC TABULATE
Lecture 5 Sorting, Printing, and Summarizing Your Data.
SAS Workshop Lecture 1 Lecturer: Annie N. Simpson, MSc.
Chapter 9 Producing Descriptive Statistics PROC MEANS; Summarize descriptive statistics for continuous numeric variables. PROC FREQ; Summarize frequency.
Introduction to SAS Essentials Mastering SAS for Data Analytics Alan Elliott and Wayne Woodward SAS ESSENTIALS -- Elliott & Woodward1.
PROC REPORT organizes the output in many ways, from the simple to highly complex… PROC REPORT NOWINDOWS HEADLINE HEADSKIP; COLUMN variable-list; DEFINE.
Introduction to SAS BIO 226 – Spring Outline Windows and common rules Getting the data –The PRINT and CONTENT Procedures Manipulating the data.
1 Experimental Statistics - week 4 Chapter 8: 1-factor ANOVA models Using SAS.
USING SAS PROCEDURES SAS System Options OPTIONS Statement
Niraj J. Pandya, Element Technologies Inc., NJ.  Summarize all possible combinations of class level variables even if few categories are altogether missing.
Chapter 20 Creating Multiple Observations from a Single Record Objectives Create multiple observations from a single record containing repeating blocks.
Tips & Tricks MASUG02/18/2005. Multiple Graphs on One Page.
SAS Macro: Some Tips for Debugging Stat St. Paul’s Hospital April 2, 2007.
SAS Efficiency Techniques and Methods By Kelley Weston Sr. Statistical Programmer Quintiles.
Use the UPDATE statement to: –update a master dataset with new transactions (e.g. a bank account updated regularly with deposits and withdrawals…). Not.
Testing Multiple Means and the Analysis of Variance (§8.1, 8.2, 8.6) Situations where comparing more than two means is important. The approach to testing.
A Brief Introduction to PROC TRANSPOSE prepared by Voytek Grus for
Macro Overview Mihaela Simion. Macro Facility Overview Definition : The SAS Macro Facility is a tool within base SAS software that contains the essential.
Lesson 2 Topic - Reading in data Chapter 2 (Little SAS Book)
1 Filling in the blanks with PROC FREQ Bill Klein Ryerson University.
The Completely Randomized Design (§8.3)
Lesson 6 - Topics Reading SAS datasets Subsetting SAS datasets Merging SAS datasets.
Chapter 5 Reading and Manipulating SAS ® Data Sets and Creating Detailed Reports Xiaogang Su Department of Statistics University of Central Florida.
Lecture 3 Topic - Descriptive Procedures Programs 3-4 LSB 4:1-4.4; 4:9:4:11; 8:1-8:5; 5:1-5.2.
By: Corey T. Williams 03 May Situation Objective.
Kurtosis SAS. g1g2.sas; data EDA; infile 'C:\Users\Vati\Documents\StatData\EDA.d at'; input Y; proc means mean skewness kurtosis N; var Y; run; Analysis.
Lesson 4 - Topics Creating new variables in the data step SAS Functions.
Chapter 4 concerns various SAS procedures (PROCs). Every PROC operates on: –the most recently created dataset –all the observations –all the appropriate.
Chapter 19: Introduction to Efficient SAS Programming 1 STAT 541 ©Spring 2012 Imelda Go, John Grego, Jennifer Lasecki and the University of South Carolina.
Lesson 8 - Topics Creating SAS datasets from procedures Using ODS and data steps to make reports Using PROC RANK Programs in course notes LSB 4:11;5:3.
Lecture 4 Ways to get data into SAS Some practice programming
An Introduction Katherine Nicholas & Liqiong Fan.
Computing with SAS Software A SAS program consists of SAS statements. 1. The DATA step consists of SAS statements that define your data and create a SAS.
SAS procedure for the C.I. of a population average PROC MEANS DATA = data-name N MEAN STD CLM ALPHA=value MAXDEC = number; VAR measurement-variable; RUN;
Lesson 2 Topic - Reading in data Programs 1 and 2 in course notes –Chapter 2 (Little SAS Book)
SAS Programming Training Instructor:Greg Grandits TA: Textbooks:The Little SAS Book, 5th Edition Applied Statistics and the SAS Programming Language, 5.
The Urban Institute - SAS Training6/9/20161 SAS Training This SAS Training Course was designed to introduce users at The Urban Institute to SAS programming.
SAS ® 101 Based on Learning SAS by Example: A Programmer’s Guide Chapters 16 & 17 By Tasha Chapman, Oregon Health Authority.
Test Title Test Content.
Applied Business Forecasting and Regression Analysis
제 5장 기술통계 및 추론 PROC MEANS 절차 PROC MEANS <options> ;
Chapter 19: Introduction to Efficient SAS Programming
Instructor: Raul Cruz-Cano
Assume as previously that we have k samples on as many treatments
Defining and Calling a Macro
Computer Science 2 Review the Bubble Sort
3 Iterative Processing.
Computer Science 2 Getting an unknown # of …. Into an array.
Introduction to DATA Step Programming: SAS Basics II
Producing Descriptive Statistics
Sampling Distribution of the Mean in IML
Wilcoxon Rank-Sum Test
Presentation transcript:

Parallel Processing in SAS CPUCOUNT A comparison of Proc Means for the Project

THREADS and CPUCOUNT In SAS there is an option to let SAS create THREADS=YES, this allows SAS to start different threads that can be run on different CPUs. SAS also has an option to set the number of available CPUs, for example CPUCOUNT = 2

CPUCOUNT = The times to run the following code on the Project problem 2 were recorded for different CPUCOUNTs proc means data=data01 n mean std maxdec=2; class location; title 'Mean scores for each location'; var M1 - M8; run;

Code DM 'LOG;CLEAR;OUT;CLEAR'; OPTIONS THREADS=YES CPUCOUNT=8; options mlogic mprint mtrace symbolgen; data data01; infile 'C:\Documents and Settings\Eric A. Suess\Desktop\Stat6250\data01.txt'; input ID Location M1 M2 M3 M4 M5 M6 M7 M8; run; /* Compute the mean and stdev of the measurement. */ proc means data=data01 n mean std maxdec=2; class location; title 'Mean scores for each location'; var M1 - M8; run; proc sort data=data01; by location; run; proc means data=data01 n mean std maxdec=2; by location; title 'Mean scores for each location'; var M1 - M8; run; /* Create a new data set using the "output" subcommand. */ proc means data=data01 noprint nway; class location; var M1 - M8; output out=locationsum n = n_measurement mean = m_measurement std = s_measurement max = max_measurement min = min_measurement; run; proc print data=locationsum; title 'Listing of data set locationsum'; run; proc contents data=data01 ; run;

For CPUCOUNT=7 419 /* Compute the mean and stdev of the measurement. */ 420 proc means data=data01 n mean std maxdec=2; 421 class location; 422 title 'Mean scores for each location'; 423 var M1 - M8; 424 run; NOTE: There were observations read from the data set WORK.DATA01. NOTE: PROCEDURE MEANS used (Total process time): real time 1.64 seconds cpu time 3.39 seconds

Times CPUsreal timecpu time

Graph

Question For what CPUCOUNT is there the most gain? After what number of CPUs is there no clear added advantage?