My BITE Fellowship Edward Challis. This is a picture of me:

Slides:



Advertisements
Similar presentations
E-Science Data Information and Knowledge Transformation Thoughts on Education and Training for E-Science Based on edikt project experience Dr. Denise Ecklund.
Advertisements

Modelling with expert systems. Expert systems Modelling with expert systems Coaching modelling with expert systems Advantages and limitations of modelling.
Virginia Bourdeau, Professor Oregon 4-H Science Rich Inquiry Activity.
Internship Seminar What will be covered: The internship context
BITE Scheme Report Shuai Yuan Department of Computer Science, UCL MediaGamma
National Association of Chronic Disease Directors January 2013 Dr. Jewlya Lynn Jason Vahling.
Teaching Secondary Mathematics Overview of the Mathematics Developmental Continuum P-10 Module 2: 2.
CIS 678 Artificial Intelligence problems deduction, reasoning knowledge representation planning learning natural language processing motion and manipulation.
1 ACCTG 6910 Building Enterprise & Business Intelligence Systems (e.bis) Introduction to Data Mining Olivia R. Liu Sheng, Ph.D. Emma Eccles Jones Presidential.
Sensemaking and Ground Truth Ontology Development Chinua Umoja William M. Pottenger Jason Perry Christopher Janneck.
© Franz Kurfess Project Topics 1 Topics for Master’s Projects and Theses -- Winter Franz J. Kurfess Computer Science Department Cal Poly.
Machine Learning (Extended) Dr. Ata Kaban
Detecting Near Duplicates for Web Crawling Authors : Gurmeet Singh Mank Arvind Jain Anish Das Sarma Presented by Chintan Udeshi 6/28/ Udeshi-CS572.
College Algebra for Teachers Laurie Burton Western Oregon University MAA PREP Active Learning Workshop July 7, 2003 “Algebra Theme” Day.
Welcome and Overview Richard Anderson University of Washington June 30, 20081IUCEE: Welcome and Overview.
Manchester, 2007 Adaptive learning paths for improving Lifelong Learning experiences Ana Elena Guerrero Roldán, Julià Minguillón Universitat Oberta de.
CS Machine Learning. What is Machine Learning? Adapt to / learn from data  To optimize a performance function Can be used to:  Extract knowledge.
Module 1: A Closer Look at the Common Core State Standards for Mathematics High School Session 2: Matching Clusters of Standards to Critical Areas in one.
Tyson Condie.
Challenges in Information Retrieval and Language Modeling Michael Shepherd Dalhousie University Halifax, NS Canada.
Tennessee Technological University1 The Scientific Importance of Big Data Xia Li Tennessee Technological University.
Learning Law Orientation: August 16, Synthesis Judgment 4. Problem Solving 3. Spotting Issues 2. Understanding 1. Knowledge 1. Recognition vs.
J. Leskovec, A. Rajaraman, J. Ullman: Mining of Massive Datasets, 2.
How to get the most out of the survey task + suggested survey topics for CS512 Presented by Nikita Spirin.
CSE 548 Advanced Computer Network Security Document Search in MobiCloud using Hadoop Framework Sayan Cole Jaya Chakladar Group No: 1.
Medical Data Classifier undergraduate project By: Avikam Agur and Maayan Zehavi Advisors: Prof. Michael Elhadad and Mr. Tal Baumel.
1 Machine Learning (Extended) Dr. Ata Kaban Algorithms to enable computers to learn –Learning = ability to improve performance automatically through experience.
Artificial Intelligence By Michelle Witcofsky And Evan Flanagan.
2009 ML Project: Goal: Do some real machine learning… A project you are interested in works better Data is often the hard part (get it in plenty of time)
Archived File The file below has been archived for historical reference purposes only. The content and links are no longer maintained and may be outdated.
Human Interaction with Data “Meaningful Interpretations” “The Power of Crowdsourcing” &
Most of contents are provided by the website Introduction TJTSD66: Advanced Topics in Social Media Dr.
WebQuest: A Brief Introduction. Life-long Learning Authentic Situation Subject knowledge Generic skills Scaffolding Rubrics.
Advanced Analytics on Hadoop Spring 2014 WPI, Mohamed Eltabakh 1.
Top 10 Teacher Apps
Theme 2: Data & Models One of the central processes of science is the interplay between models and data Data informs model generation and selection Models.
1 Knowledge Acquisition and Learning by Experience – The Role of Case-Specific Knowledge Knowledge modeling and acquisition Learning by experience Framework.
1 1 COMP5331: Knowledge Discovery and Data Mining Acknowledgement: Slides modified based on the slides provided by Lawrence Page, Sergey Brin, Rajeev Motwani.
Mathematics and the Common Core Principals’ Conference November 18, 2011.
User Interface Design for a Large-Scale Computer Science Research Digital Library Min-Yen Kan Department of Computer Science National University of Singapore.
1 Some Guidelines for Good Research Dr Leow Wee Kheng Dept. of Computer Science.
Music in Artificial Intelligence Victoria Tran. Why Music in Artificial Intelligence? Technology is improving every day, so music is beginning to depend.
Low Power Communications 4 month secondment at ARM Limited – Cambridge, UK. George Smart Dr Paul Whatmough Dr Yiannis Andreopoulos.
Ellis Paul Technical Solution Specialist – System Center Microsoft UK Operations Manager Overview.
#SQLSAT454 Using Power BI in Enterprise Andrea
Biological data representation and data mining Xin Chen
FNA/Spring CENG 562 – Machine Learning. FNA/Spring Contact information Instructor: Dr. Ferda N. Alpaslan
Dato Confidential 1 Danny Bickson Co-Founder. Dato Confidential 2 Successful apps in 2015 must be intelligent Machine learning key to next-gen apps Recommenders.
Kathleen O’Connor Supervisor of Mathematics and Science K-4.
Leverage Big Data With Hadoop Analytics Presentation by Ravi Namboori Visit
Brief Intro to Machine Learning CS539
Thesis presentation Kaisa Pihlaja
CPS : Information Management and Mining
I’m doing good! The computer helps me to learn Mental maths skills
Machine Learning Library for Apache Ignite
What does it mean for Hawridge and Cholesbury? Angela Hughes
Distributed Programming in “Big Data” Systems Pramod Bhatotia wp
Eick: Introduction Machine Learning
Who is the Expert? Combining Intention and Knowledge of Online Discussants in Collaborative RE Tasks Itzel Morales-Ramirez1,2, Matthieu Vergne1,2, Mirko.
Map Reduce.
Data Mining, Distributed Computing and Event Detection at BPA
Developing our skills & strengths
Developing our skills & strengths
CS110: Discussion about Spark
Overview of Machine Learning
TIM TAYLOR AND JOSH NEEDHAM
Deep Into the Cosmos(DB)
Data Mining, Distributed Computing and Event Detection at BPA
Live Event resources Pre- event checklist Planning template
Cloud Forensics Logging
Presentation transcript:

My BITE Fellowship Edward Challis

This is a picture of me:

This is my background: Postdoc in machine learning with David Barber at UCL. Short 6 month postdoc in using machine learning methods to detect disease in fMRI scans. PhD in scalable approximate inference for Bayesian linear models with David Barber at CSML. 2 years working in finance. MSc in Artificial Intelligence. Maths degree. School and childhood.

Machine Learning? Machine Learning is the study of algorithms that use data to improve their ability to perform some task. Iteration 1Iteration 2Iteration 3

BITE Fellowship Received an , liked the idea, started looking for good places to do the internship.

Martin Goodson

Skimlinks

Skimlinks’ Data Skimlinks collects loads of data: Links to products on publisher pages Clicks on links in pages Purchases of products from product links. But in its raw form this data looks like:

Data processing and machine learning How do we convert raw log files into understandable concepts such as products, topics, themes, intents. The primary problem is that the datasets are extremely large. 1TB + for most sub-problems.

What I worked on: Because the datasets are so large you can only work on them using cluster computing. Skimlinks is a leader in the UK in the adoption of the Apache Spark cluster computing framework. My seccondment at Skimlinks focused on implementing and applying machine learning algorithms on large clusters of large machines running Apache Spark to extract meaningful information from log files.

Why this was great for me: In academia its hard to get your hands on such large and interesting datasets. Skimlinks works on the largest datasets of any startup in London I know of. Distributed computing is the future. Adapting my ML knowledge into this domain is fascinating and challenging. Real problems are hard and subtle. Experience is required to solve them.

Why this was great for them: Skimlinks is always looking for ways to improve the ‘intelligence’ of its products. My ML knowledge helped the team approach and solve some of their hard problems. During the secondment we built effectively systems the processed TBs of data into useful and interpretable knowledge about products and content.

Future plans.. My relationship with Skimlinks will continue. I’m now working with them part-time as a Data Scientist. This year we plan to publish papers on the methods we’ve developed. Skimlinks are UK experts in Apache Spark – I want to increase my expertise in this domain and do further research into ML on Spark. Skimlinks have deepened their relationship with CSML – new masters and postdoc projects are in the pipline.

Thank you BITE!! Specially: Ryan for being so helpful throughout, Prof Izzat Darwazeh for making all this happen and Skimlinks + Martin Goodson for being great hosts.