Python4ML An open-source course for everyone

Slides:



Advertisements
Similar presentations
Centre for the Enhancement of Learning and Teaching Supporting & Enhancing Online Teaching & Learning by Catherine Ogilvie Centre for the Enhancement of.
Advertisements

A Data Mining Course for Computer Science and non Computer Science Students Jamil Saquer Computer Science Department Missouri State University Springfield,
1 CS 446 – Tutorial 6 Frid. Nov. 6 th, 2009 Implementation Tutorial.
Trnsport Test Suite Project Tony Compton, Texas DOT Charles Engelke, Info Tech.
Creating a Program In today’s lesson we will look at: what programming is different types of programs how we create a program installing an IDE to get.
Adaptive Book: A Platform for teaching, learning and student modeling Ananda Gunawardena School of Computer Science Carnegie Mellon University.
Geo 118/318 – Introduction to GIS Programming Dr. Jim Graham.
Machine Learning Usman Roshan Dept. of Computer Science NJIT.
1 Networks and the Internet A network is a structure linking computers together for the purpose of sharing resources such as printers and files Users typically.
New Advanced Higher Subject Implementation Events
Today’s Topic Language of web page - HTML (Hypertext Markup Language)
With your friendly Web Developer, Chris.. Terminology  HTML - > Hypertext Markup Language  CSS -> Cascading Style Sheet  open tag  close tag  HTTP->Hypertext.
Sumedha Rubasinghe October,2009 Introduction to Programming Tools.
K-12 Web Content Development Process
Hello World! CSE442. Course Summary A semester long group project – You will develop software from idea to implementation You have full freedom to choose.
New Advanced Higher Subject Implementation Events Computing Science Advanced Higher Course Assessment.
Social Authoring Raising Quality and Access to Online Course Content Ruth Rominger Director of Learning Design National Repository of Online Courses Module.
Creating a Digital Classroom. * Introduction * The Student Experience * Schoology’s Features * Create a Course & Experiment.
Tweets Metadata May 4, 2015 CS Multimedia, Hypertext and Information Access Department of Computer Science Virginia Polytechnic Institute and State.
Module 5: Creating and Configuring Group Policies.
University of Macau Faculty of Science and Technology Computer and Information Science SFTW 241 Programming Languages Architecture 1 Group B5.
Adaptive Book: Teaching and Learning Environment for Programming Education Ananda Gunawardena & Victor Adamchik.
Autograder Better management and evaluation of lab assignments.
Collegiate Times Grades James O’Hara Hang Lin Client: Alex Koma, managing editor, Collegiate Times Blacksburg March 4, 2014 Virginia Tech CS 4624.
Portfolios for TAs. PI34 “Students must complete a portfolio of evidence for the teaching standards” to be certified Review.
Problem Based Learning To Build And Search Tweet And Web Archives Richard Gruss Edward A. Fox Digital Library Research Laboratory Dept. of Computer Science.
Welcome to Snap! Below the Line Decal Facilitated By: Zachary McPherson and Bao Xie.
Geo 118/318 – Introduction to GIS Programming Dr. Jim Graham.
By: Jamie Morgan  A wiki is a web page or collection of web pages which you and your students can access to contribute or modify content without having.
VTCAR Final Presentation CS Dr. Fox Virginia Tech - Blacksburg, VA /3/16 Kyle Simmons, Steven Whitehead, Sebastian Welsh Client: Dr. Julee.
Big Data Processing of School Shooting Archives
Machine Learning Models
CS 445/545 Machine Learning Winter, 2017
Michael Liu, Andrew Chuba, Divya Sengar, James Wong, Alan Kai
SPE 578 STUDY perfect education/spe578study.com
It’s called “wifi”! Source: Somewhere on the Internet!
CS 499 PROJECT DOCUMENTATION
Rdoc2vec Jake Clark, Austin Cooke, Steven Rolph, Stephen Sherrard
You’ve created an e-Learning resource...
Background Check Website for R4 OpSec, LLC
Zenodo Data Archive Irtiza Delwar, Michael Culhane, John Sizemore, Gil Turner Client: Dr. Seungwon Yang Instructor: Dr. Edward A. Fox CS 4624 Multimedia,
GCE Applied ICT G053: Lesson 01 Introduction To The Unit
IS1500: Introduction to Web Development
Virginia Tech Center for Drug Discovery Website Migration and Redesign
Wikispaces for Teachers A Guide to Using Them in Your Classroom
Trail Study Kevin Cianfarini, Shane Davies, Marshall Hansen, Andrew Eason … CS4624: Multimedia, Hypertext, and Information Access Instructor: Dr. Edward.
The Team Ernesto Cortes Kipp Dunn Sar Gregorczyk Alex Schmidt
Hey everyone, I’m Sunny …harsh caroline xavier
Multimedia Database Virginia Polytechnic Institute and State University Blacksburg, VA CS 4624 Multimedia, Hypertext and Information Access Client.
Event Focused URL Extraction from Tweets
Lecturer: Geoff Hulten TAs: Kousuke Ariga & Angli Liu
Collection Management Webpages Final Presentation
Stream Field Final Project Presentation
Event Trend Detector Ryan Ward, Skylar Edwards, Jun Lee, Stuart Beard, Spencer Su CS 4624 Multimedia, Hypertext, and Information Access Instructor: Edward.
Final Presentation: Neural Network Doc Summarization
Twitter Equity Firm Value
Validation of Ebola LOD
LucidWorks: Vectorize Workflow Module
News Event Detection Website Joe Acanfora, Briana Crabb, Jeff Morris
Paleontology Topic Trends
Tweet URL Analysis Guoxin Sun, Kehan Lyu, Liyan Li
IDRgeneralization: Music Appreciation
Writing for Communication on the Web
Autism Support Portal Members: Sib Quayum, Ryan Galliher, Ayumi Ritchie, Kenneth Nagies Course: Multimedia, Hypertext, and Information Access (CS 4624)
Good Evening! Please take a moment to peruse the sample portfolio (Welcome and three Narrative pages). A link to this site can be found at the bottom of.
Lecture 1a- Introduction
Neal Kurande, WinaGodwin Anyanwu Jr., Adam Chau
UQ Course Site Design Guidelines
Machine Learning for Cyber
Lecturer: Geoff Hulten TAs: Alon Milchgrub, Andrew Wei
Presentation transcript:

Python4ML An open-source course for everyone Team: James Hopkins | Brendan Sherman | Zachery Smith | Eric Wynn Client: Amirsina Torfi Instructor: Dr. Edward Fox CS 4624: Multimedia, Hypertext, and Information Access Virginia Tech, Blacksburg VA 24061 5/12/2019

Outline Summary Deliverables Documentation Testing Plans Post-Semester Work Lessons Learned

Summary Creating an open-source guide to Machine Learning using Python

Deliverables Course website Contains all module write-ups Entire site can be downloaded as PDF or other formats Links to code examples on GitHub

Deliverables GitHub repository All course content is open-source Anyone can contribute and suggest changes Highly structured system for files

Documentation Documentation written in reStructuredText (rST), a form of markup Seamless integration with Sphinx Annotated, linked figures Clean tables Embeddable code

Demo https://machine-learning-course.readthedocs.io/en/latest/ Project Stats 6500+ lines of content 100+ pages of course documentation (another 60+ in the final report) 70+ sources 25+ unique python examples

Testing Plans We have been testing scripts as they are added, but the whole team will come together to re-test each one for the final deliverable. Every navigation link in the site needs to be tested. We had sample users go through the modules and provide us with feedback.

End of Semester Plans Complete the last two topics on Neural Networks Build final site, test navigation, and test every script Get feedback from user testing

Python4ML A Machine Learning Course for Everyone Live Course Site Topics A Machine Learning Course for Everyone WHAT? Python4ML is an open-source course for machine learning using the Python programming language. WHO? This course is being developed with Virginia Tech’s Open Source for Science organization, led by our client Amirsina Torfi. HOW? The course is made up of reStructuredText documents and example programs written in Python, using libraries such as scikit learn. WHERE? The course is available on GitHub. The code and live course site can be found by scanning the QR code on this poster. Overview Technologies Linear Regression Overfitting / Underfitting WHY? The course is aimed at those with little knowledge of machine learning. We want to facilitate education in an open-source context, bringing important topics together in a high-level overview of ML. Cross Validation Regularization Supervised Learning Decision Trees Logistic Regression Support Vector Machines Unsupervised Learning Clustering Principal Component Analysis Learn More K-Nearest Neighbors Naive Bayes Team: James Hopkins, Brendan Sherman, Zachery Smith, and Eric Wynn Client: Amirsina Torfi, Head of Open Source for Science @ VT 4/30/2019 Instructor: Edward A. Fox

Post-Semester Work This course can be expanded outside this capstone or in future semesters by: Writing more example scripts and content Integrating the project with Docker: A containerized system, similar to virtual machines Can pre-package dependencies together for each section and provide users with a single command to run scripts Users won’t need python OR dependencies installed

Lessons Learned / Takeaways Importance of high quality documentation As important as or more important than actual code Makes reviews faster and documents easier to understand Teaching about a subject requires deep understanding Had to know why specific decisions were made at every step Required lots of research on the topics

Lessons Learned / Takeaways Importance of finishing personal assignments in a timely manner Several reviewing stages that required individual approval Delayed when someone doesn't respond or approve changes

Acknowledgements Client: Amirsina Torfi PhD student in Computer Science Focused on deep learning, neural networks Scikit-Learn Open-source Python Machine Learning library