Presentation is loading. Please wait.

Presentation is loading. Please wait.

Source Code -Tons of Code Package -More Code -Statistical Functions -Datasets Workspace -Fewer Lines of Code -Capability.

Similar presentations


Presentation on theme: "Source Code -Tons of Code Package -More Code -Statistical Functions -Datasets Workspace -Fewer Lines of Code -Capability."— Presentation transcript:

1

2

3 Source Code -Tons of Code Package -More Code -Statistical Functions -Datasets Workspace -Fewer Lines of Code -Capability

4 http://www.statmethods.net/management/functions.html

5 Currently, how many R Packages? At the command line enter:  dim(available.packages())  available.packages()

6 Is there an R App Store?

7

8 Two heavyweights in the statistical software market are SAS and SPSS/IBM

9 R Packages have been created that are equivalent to the functionality of SAS and SPSS

10 XLConnect XML rhbase sas7bdat Rcpp Packages for reading, writing for various file formats RJSONIO Hmisc RODBC / ROracle foreign RMySQL RWeka Comma Separated Variables

11 Oracle R Enterprise (ORE) R Being Integrated Into Other Data-Related Products http://help.sap.com/hana/hana_dev_r_emb_en.pdf https://blogs.oracle.com/R/ http://www-142.ibm.com/software/products/us/en/spss-stats-developer/ “Both R and SAS are here to stay, and finding ways to make them work better with each other is in the best interests of our customers.”` http://support.sas.com/rnd/app/studio/Rinterface2.html

12 R “Machine Learning” Libraries Analytic TechniqueR Package/LibraryAuthorOrganization Support Vector Mach.libsvm (ksvm) Chih-Chung Chang Chih-Jen Lin National Taiwan Univ. + EBay Research Labs Neural NetworksneuralnetFrauke Gunther Stefan Fritsch Epidemiology and Prevention Research nnetBrian RipleyUniversity of Oxford monmlpAlex J. CannonAtmospheric Science Randomized ForestsrandomForestFortran original by Leo Breiman & Adele Cutler, R port by Andy Liaw and Matthew Wiener.Merck Decision TreesrpartTerry M Therneau and Beth Atkinson. R port by Brian Ripley. Mayo Clinic University of Oxford Boosting ModelAdaMark CulpWest Virginia University Maximum EntropymaxentYoshimasha Tsuruoka Timothy Jurka University of Tokyo UC-Davis Bagging, bootstrapadabagEsteban Alfaro-CortesLa Universidad de Castilla-La Mancha Latent DiralectsldaJonathan ChangFacebook Naïve Bayese1071David Meyer Evgenia Dimitriadout Vienna University Bayesian NetworkbnlearnMarco Scutari.UCL Genetics Institute Hidden MarkovhiddenmarkovDavid HarteStatistics Research

13

14 IndustryPct. Research24% Higher Education7% Information Technology9% Computer Software7% Financial Services6% Banking2% Pharmaceuticals4% Biotechnology4% Market Research3% Management Consulting3% Total69% Hadley Wickham Asst. Professor of Statistics at Rice University ggplot2 plyr reshape rggobi profr Industries / Organizations Creating and Using R

15 http://www.r-statistics.com/2013/06/top-100-r-packages-for-2013-jan-may/ Top 100 R packages for 2013 (Jan-May)

16 Specialized “Domain” BeginnerSome Coverage stats graphics (both built-in) Data Management plyr reshape Graphics ggplot2 Bayesian DifferentialEquations Econometrics Environmetrics ExperimentalDesign Finance Genetics HighPerformanceComputing MachineLearning MedicalImaging NaturalLanguageProcessing Pharmacokinetics Phylogenetics Psychometrics SocialSciences Spatial TimeSeries

17 Easy to Use Interactive Standard Visualizations Steep Learning Curve Visualization and Reporting

18

19 The R Graphics Package Graphing Parameters Titles X-Axis Title Y-Axis Title Legend Scales Color Gridlines library(help="graphics") Basic Chart Types

20 In ggplot2 a plot is made up of layers. ggplot2 Plot Grammar of Graphics Layer -Data - Mapping -Geom -Stat -Postiion Scale Coord Facet

21 Correlations Matrix library(car) scatterplotMatrix(h)

22 The Correlation Package was built on top of the Pairs Package

23

24 The next data visual was produced with about 150 lines of R code

25 http://rcharts.io/gallery/

26 https://plot.ly/r/

27 http://shiny.rstudio.com/gallery/movie-explorer.html

28 http://statmethods.net/ good documentation and sample code http://stackoverflow.com/ helpful for trouble-shooting code http://www.r-bloggers.com/ helpful for hearing about new things Additional Resources


Download ppt "Source Code -Tons of Code Package -More Code -Statistical Functions -Datasets Workspace -Fewer Lines of Code -Capability."

Similar presentations


Ads by Google