Presentation on theme: "USER! 8 TH ANNUAL INTERNATIONAL R USERS CONFERENCE Joanne Ellwood."— Presentation transcript:
USER! 8 TH ANNUAL INTERNATIONAL R USERS CONFERENCE Joanne Ellwood
The useR! Conference June 12-15, 2012 Nashville, TN 482 attendees Very open environment Presenters, probably no one rejected
The useR! Attendees Lots of academia Mostly advanced degrees They are really into graphs but call them plots
The useR! Community Unix and Mac Open source most everything else R language constantly in flux
The useR! Key Points Reproducible Results Moving your org to R R and Big Data
The useR! Conference Reproducible Results The cautionary tale Potti et el of Notre Dame 2006 Biogenenics for cancer treatment Kevin Coombes team discovered Off by 1 error Willful data manipulations and fraud National news coverage Fallout – people died, lawsuits, Potti
The useR! Conference Moving your org to R Robert Muenchen Manager, University of Tennessee Office of Information Technology, Research Computing Support In-house Do what users know Stop licensing little used SAS modules like ETS, QC Replace with R packages Freeze new development in SAS
The useR! R and Big Data R and Hadoop RHive Is SQLish but called HQL The architecture
The useR! R and Big Data PL/R in EMC GreenPlum PL/R is a PostgreSQL language extension that allows you to write PostgreSQL functions and aggregate functions in the R statistical computing language. Use PostgreSQL to query Greenplum data. Parallel processing Install R on every GreenPlum node Know how data is split Define and query on the splits
The useR! R and Big Data in-database vendor solutions Oracle Netezza EMC GreenPlum PL/R explicit parallel MADLib implicit parallel GPHD GreenPlum + Hadoop
The useR! Conclusion Knitr /= Sweave HiveR /= Rhive Eclipse/Rstudio PL/R to PostgreSQL PostgreSQL to ECM greenPlum GPHD R/R/R/R/R/R/R/R!
Your consent to our cookies if you continue to use this website.