Bina Ramamurthy 6/28/2014 CSE651C, B. Ramamurthy1.

Slides:



Advertisements
Similar presentations
MediTract Contract Management Software
Advertisements

Introduction to Graphics in R Department of Statistical Sciences and Operations Research Computation Seminar Series Speaker: Edward Boone
Welcome to the Plant Breeding and Genomics Webinar Series Today’s Presenter: Dr. Heather Merk Presentation & Supplemental Files:
Chapter 12 Information Systems Chapter Goals Define the role of general information systems Explain how spreadsheets are organized Create spreadsheets.
True or false A variable of type char can hold the value 301. ( F )
1 Programming for Engineers in Python Autumn Lecture 5: Object Oriented Programming.
Computing in Statistical Education Pang Du Department of Statistics Virginia Tech.
XP 1 Working with JavaScript Creating a Programmable Web Page for North Pole Novelties Tutorial 10.
Chapter 12 Information Systems Nell Dale John Lewis.
Chapter 4: Database Management. Databases Before the Use of Computers Data kept in books, ledgers, card files, folders, and file cabinets Long response.
Chapter 14 The Second Component: The Database.
XP Tutorial 1 New Perspectives on JavaScript, Comprehensive1 Introducing JavaScript Hiding Addresses from Spammers.
Chapter 12 Information Systems. 2 Chapter Goals Define the role of general information systems Explain how spreadsheets are organized Create spreadsheets.
Computer Software.
Chapter 5 Application Software.
Chapter 12 Information Systems. 2 Chapter Goals Define the role of general information systems Explain how spreadsheets are organized Create spreadsheets.
1 Chapter 6 Understanding Computers, 11 th Edition Software Ownership Rights Software license: agreement, either included in a software package or displayed.
TC2-Computer Literacy Mr. Sencer February 8, 2010.
Basic R Programming for Life Science Undergraduate Students Introductory Workshop (Session 1) 1.
B. RAMAMURTHY EAP#2: Data Mining, Statistical Analysis and Predictive Analytics for Automotive Domain CSE651C, B. Ramamurthy 1 6/28/2014.
© Paradigm Publishing, Inc. 5-1 Chapter 5 Application Software Chapter 5 Application Software.
Introduction to MATLAB Session 1 Prepared By: Dina El Kholy Ahmed Dalal Statistics Course – Biomedical Department -year 3.
Introduction to R By Robert Biddle. About Me Data Professional with over 10 years experience. Hilton Grand Vacations, Orlando Data Architect MCITP Database.
 Overview of SPSS  Interface  Getting Started  Managing Data  Descriptive Statistics  Basic Analysis  Additional Resources.
Fluency with Information Technology INFO100 and CSE100 Katherine Deibel Katherine Deibel, Fluency in Information Technology1.
Introduction to SPSS Edward A. Greenberg, PhD
Using Visual Basic 6.0 to Create Web-Based Database Applications
Lecture 4 MATLAB Windows Arithmetic Operators Maintenance Functions
Objectives Understand what MATLAB is and why it is widely used in engineering and science Start the MATLAB program and solve simple problems in the command.
Intro to R R is a free version of S-plus R is a free version of S-plus Can be used interactively but script or syntax files are commonly used to record.
Computer Science 111 Fundamentals of Programming I Overview of Programming.
Unit Seven Database 1.Passage One. Foundation of Database.
Bina Ramamurthy 6/7/2014 CSE6511.  R is a software package for statistical computing.  R is an interpreted language  It is open source with high level.
Numeric Processing Chapter 6, Exploring the Digital Domain.
1 Computer Programming (ECGD2102 ) Using MATLAB Instructor: Eng. Eman Al.Swaity Lecture (1): Introduction.
SUSE Linux Enterprise Desktop Administration Chapter 6 Manage Software.
Principles of programming languages 5: An operational semantics of a small subset of C Department of Information Science and Engineering Isao Sasano.
© Paradigm Publishing Inc. 5-1 Chapter 5 Application Software.
Data Manipulation Steve Allison
MIS 105 LECTURE 1 INTRODUCTION TO COMPUTER HARDWARE CHAPTER REFERENCE- CHP. 1.
C OMPUTING E SSENTIALS Timothy J. O’Leary Linda I. O’Leary Presentations by: Fred Bounds.
Introduction to R Introductions What is R? RStudio Layout Summary Statistics Your First R Graph 17 September 2014 Sherubtse Training.
1 12/4/1435 h Lecture 2 Programs and Programming Languages.
1 Title: Introduction to Computer Instructor: I LTAF M EHDI.
KING SAUD UNIVERSITY – COLLAGE OF COMPUTER AND INFORMATION SCIENCES CSC 113 JAVA ONLINE DOCUMENTATION.
PROGRAMMING IN R Introduction to R. In this session I will: Introduce you to the R program and windows Show how to install R Write basic programs in R.
Software. Introduction n A computer can’t do anything without a program of instructions. n A program is a set of instructions a computer carries out.
Chris Knight Beginners’ workshop.
PYTHON FOR HIGH PERFORMANCE COMPUTING. OUTLINE  Compiling for performance  Native ways for performance  Generator  Examples.
Lecture 11 Introduction to R and Accessing USGS Data from Web Services Jeffery S. Horsburgh Hydroinformatics Fall 2013 This work was funded by National.
XP Tutorial 10New Perspectives on HTML, XHTML, and DHTML, Comprehensive 1 Working with JavaScript Creating a Programmable Web Page for North Pole Novelties.
Pinellas County Schools
Introduction to Data Manipulation, Analysis, and Visualization with R Patrick Grof-Tisza.
How to Get Started With Python
Python for data analysis Prakhar Amlathe Utah State University
Data Tools: R and RStudio
Matlab.
Introduction to R.
Test Review Computer Science History
Introduction to Data Analytics
Lab 1 Introductions to R Sean Potter.
Introduction to R.
ICPSR: Resources for Instructors Finding and Analyzing Data 9/26/2012
Today’s Beginner Workshop
R Bina Ramamurthy CSE651C, B. Ramamurthy 12/7/2018.
Installing Packages Introduction to R, Part II
Java Online documentation
R Bina Ramamurthy CSE4/587 4/7/2019.
> Introduction to Nelson Rios, Tulane University
Presentation transcript:

Bina Ramamurthy 6/28/2014 CSE651C, B. Ramamurthy1

 R is a software package for statistical computing.  R is an interpreted language  It is open source with high level of contribution from the community  “R is very good at plotting graphics, analyzing data, and fitting statistical models using data that fits in the computer’s memory.”  “It’s not as good at storing data in complicated structures, efficiently querying data, or working with data that doesn’t fit in the computer’s memory.” 6/28/2014 CSE651C, B. Ramamurthy2

 There are many packages available for statistical analysis such as SAS and SPSS but there are expensive (user license based) and are proprietary.  R is open source and it can pretty much what SAS can do but free.  R is considered one of the best statistical tools in the world.  For R people can submit their own packages/libraries, using the latest cutting edge techniques.  To date R has got almost 15,000 packages in the CRAN (Comprehensive R Archive Network – The site which maintains the R project) repository.  R is great for exploratory data analysis (EDA): for understanding the nature of your data before you launch serious analytics. 6/28/2014 CSE651C, B. Ramamurthy3

 An R package is a set of related functions  To use a package you need to load into R  R offers a large number of packages for various vertical and horizontal domains:  Horizontal: display graphics, statistical packages, machine learning  Verticals: wide variety of industries: analyzing microarray data, modeling credit risks, social sciences, automobile data (none so far on sensor data from automobiles!) 6/28/2014 CSE651C, B. Ramamurthy4

 Library  Package  Class   R considers every item as a class/object  Thousands of Online libraries  packages  CRAN: Comprehensive R Archive Network  Look at all the packages available in CRAN  R-Forge is another source for people to collaborate on R projects 6/28/2014 CSE651C, B. Ramamurthy5

 R Basics, fundamentals  The R language  Working with data  Statistics with R language 6/28/2014 CSE651C, B. Ramamurthy6

 Obtaining the R package  Installing it  Install and use packages  Quick overview and tutorial 6/28/2014 CSE651C, B. Ramamurthy7

 See p.98 onwards till p.102 of simpleR: Using R for introductory statistics By J. Verzani SimpleR.pdf  R in a nutshell, by Joseph Adler, O’reilly, 2010 Chapter 3 Basics, Ch.4 packages, (search for this online) Look for these resources online…and try these.  See Rhandout.docx linked to today’s lecture 6/28/2014 CSE651C, B. Ramamurthy8

 A package is a collection of functions and data files bundled together.  In order to use the components of a package it needs to be installed in the local library of the R environment.  Loading packages  Custom packages  Building packages  Activity: explore what R packages are available, if any, for the automotive domain  (Later on, when the need arises) Try to create a custom package for the domain of your interest. 6/28/2014 CSE651C, B. Ramamurthy9

 R syntax  R Control structures  R Objects  R formulas 6/28/2014 CSE651C, B. Ramamurthy10

 R language is composed of series of expression resulting in a value  Examples of expression include assignment statements, conditional statements, and arithmetic expressions > a<- 42 > b <- a% 5 > if (b == 0) " a divisible evenly by 5" else " not evenly divisible by 5" [1] " not evenly divisible by 5"  Variables in R are called symbols 6/28/2014 CSE651C, B. Ramamurthy11

 All items vectors, lists, even functions are considered as objects in R  Example : a vector of integers, and then floats ◦ p<- c(6, 8,4,5,78) ◦ p ◦ q<- c(5.6, 4.5, 7.8, 9.3) ◦ Q  A list can be made of items of any type > r<- list(p, q, "this demo") > r [[1]] [1] [[2]] [1] [[3]] [1] "this demo“ 6/28/2014 CSE651C, B. Ramamurthy12

 > v <- c(1,2,3)  > v  [1]  > length(v) <- 4  > v  [1] NA  NA : not defined or not available  Very Large and very small numbers:  > 2 ^ 1024  [1] Inf  > - 2 ^ 1024  [1] -Inf 6/28/2014 CSE651C, B. Ramamurthy13

 Curly braces are used to group a set of operations in the body of a function: > f <- function() {x <- 1; y <- 2; x + y} > f() [1] 3 6/28/2014 CSE651C, B. Ramamurthy14

 > i <-4  > repeat {if (i > 25) break else {print(i); i <- i + 5;}}  [1] 4  [1] 9  [1] 14  [1] 19  [1] 24 6/28/2014 CSE651C, B. Ramamurthy15

 survey.results <- factor( c("Disagree", "Neutral", "Strongly Disagree", "Neutral", "Agree", "Strongly Agree","Disagree", "Strongly Agree", "Neutral","Strongly Disagree", "Neutral", "Agree"),levels=c("Strongly Disagree", "Disagree", "Neutral", "Agree", "Strongly Agree"),ordered=TRUE)  survey.results  R will automatically compute the numbers in each category!  There are many more functions and operations available in R that are related to data. 6/28/2014 CSE651C, B. Ramamurthy16

6/28/2014 CSE651C, B. Ramamurthy17 Lets explore RStudio

 Look at the tutorial in handout#1  R is good for Exploratory Data Analytics  It is really good for most statistcal computing you will you in your domain.  Explore R and its application in automotive domain. 6/28/2014 CSE651C, B. Ramamurthy18