Data Mining (Don’t worry, I am not presenting these slides; just for your reading pleasure)

Slides:



Advertisements
Similar presentations
The NVO Data Discovery Portal Tom McGlynn NASA/GSFC.
Advertisements

Pseudo-Relevance Feedback For Multimedia Retrieval By Rong Yan, Alexander G. and Rong Jin Mwangi S. Kariuki
Indexing DNA Sequences Using q-Grams
Feature Selection as Relevant Information Encoding Naftali Tishby School of Computer Science and Engineering The Hebrew University, Jerusalem, Israel NIPS.
Introduction to Information Retrieval
MAST-VizieR/NED cross correlation tutorial 1. Introduction Figure 1: Screenshot of the MAST VizieR Catalog Search Form. or enter here as object class:
MOST - Moving Object Search Tool for NEOWISE and IRSA Kevin Yau 6/11/2010.
Processing raw electrophysiological signals in CARMEN:detecting and sorting spikes Leslie Smith University of Stirling.
Brian Merrick CS498 Seminar.  Introduction to Neural Networks  Types of Neural Networks  Neural Networks with Pattern Recognition  Applications.
Learning from Observations Chapter 18 Section 1 – 4.
Pattern Classification All materials in these slides were taken from Pattern Classification (2nd ed) by R. O. Duda, P. E. Hart and D. G. Stork, John.
LYU0101 Wireless Digital Information System Lam Yee Gordon Yeung Kam Wah Supervisor Prof. Michael Lyu Second semester FYP Presentation 2001~2002.
Scaling Content Based Image Retrieval Systems Christine Lo, Sushant Shankar, Arun Vijayvergiya CS 267.
CS335 Principles of Multimedia Systems Content Based Media Retrieval Hao Jiang Computer Science Department Boston College Dec. 4, 2007.
 Image Search Engine Results now  Focus on GIS image registration  The Technique and its advantages  Internal working  Sample Results  Applicable.
Recommender systems Ram Akella February 23, 2011 Lecture 6b, i290 & 280I University of California at Berkeley Silicon Valley Center/SC.
Supervised Learning & Classification, part I Reading: DH&S, Ch 1.
Recommender systems Ram Akella November 26 th 2008.
The Chinese University of Hong Kong Department of Computer Science and Engineering Lyu0202 Advanced Audio Information Retrieval System.
A Customisable Question and Answer Database Kate Lindsay.
Feature extraction Feature extraction involves finding features of the segmented image. Usually performed on a binary image produced from.
Brief overview of ideas In this introductory lecture I will show short explanations of basic image processing methods In next lectures we will go into.
Face Detection and Neural Networks Todd Wittman Math 8600: Image Analysis Prof. Jackie Shen December 2001.
Machine Learning in Simulation-Based Analysis 1 Li-C. Wang, Malgorzata Marek-Sadowska University of California, Santa Barbara.
CS Machine Learning. What is Machine Learning? Adapt to / learn from data  To optimize a performance function Can be used to:  Extract knowledge.
Numerical Grid Computations with the OPeNDAP Back End Server (BES)
NUITS: A Novel User Interface for Efficient Keyword Search over Databases The integration of DB and IR provides users with a wide range of high quality.
Carnegie Mellon Life in the Atacama, Design Review, December 19, 2003 Science Planner Science Observer Life in the Atacama Design Review December 19, 2003.
Data Management Subsystem: Data Processing, Calibration and Archive Systems for JWST with implications for HST Gretchen Greene & Perry Greenfield.
Multimedia Databases (MMDB)
Jan Storage Resource Broker Managing Distributed Data in a Grid A discussion of a paper published by a group of researchers at the San Diego Supercomputer.
Virtual Ice Charting System Archive Browser Interface Distribution IngestProduction Ice Analyst Application Database Click on the boxes for more information.
Bayesian networks Classification, segmentation, time series prediction and more. Website: Twitter:
Data Mining Chapter 1 Introduction -- Basic Data Mining Tasks -- Related Concepts -- Data Mining Techniques.
AI Week 14 Machine Learning: Introduction to Data Mining Lee McCluskey, room 3/10
Lecture 10: 8/6/1435 Machine Learning Lecturer/ Kawther Abas 363CS – Artificial Intelligence.
Graphic material for HSC FAQ – March 23, Five things you should know about the Hubble Source Catalog (HSC) 1. Coverage can be very non-uniform,
26 October 2005HST Calibration Workshop1 The National Virtual Observatory and HST T HE US N ATIONAL V IRTUAL O BSERVATORY Robert Hanisch US National Virtual.
Image Comparison Tool Product Proposal Tim La Fond and Peter Beckfield.
A radiologist analyzes an X-ray image, and writes his observations on papers  Image Tagging improves the quality, consistency.  Usefulness of the data.
Sea Ice Mapping Systems Archive Browser Interface Distribution IngestProduction Ice Analyst Application Database Henrik Steen Anderson DMI Virtual System.
12 Oct 2003VO Tutorial, ADASS Strasbourg, Data Access Layer (DAL) Tutorial Doug Tody, National Radio Astronomy Observatory T HE US N ATIONAL V IRTUAL.
Event and Feature Catalogs in the Virtual Solar Observatory Joseph A. Hourclé and the VSO Team SP54A-07 : 2008 May 30.
Data Mining and Decision Support
Copyright Paula Matuszek Kinds of Machine Learning.
Cs Future Direction : Collaborative Filtering Motivating Observations:  Relevance Feedback is useful, but expensive a)Humans don’t often have time.
ECE172A Project Report Image Search and Classification Isaac Caldwell.
DATA MINING TECHNIQUES (DECISION TREES ) Presented by: Shweta Ghate MIT College OF Engineering.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Mining Job Monitoring Data Automatic Error.
Sparse Coding: A Deep Learning using Unlabeled Data for High - Level Representation Dr.G.M.Nasira R. Vidya R. P. Jaia Priyankka.
Data Mining, Machine Learning, Data Analysis, etc. scikit-learn
Hadoop and Spark Dynamic Data Models Amila Kottege Software Developer
What Is Cluster Analysis?
Information Organization: Overview
Object-based Classification
Data Mining 101 with Scikit-Learn
CH. 1: Introduction 1.1 What is Machine Learning Example:
OUTLINE Basic ideas of traditional retrieval systems
Associative Query Answering via Query Feature Similarity
Self organizing networks
Unsupervised Learning and Autoencoders
CSE 403 Project SDS Presentation
What is Pattern Recognition?
Robotic Search Engines for the Physical World
Virtual Ice Charting System
Clustering John Owen Sarah Smith.
Data Mining, Machine Learning, Data Analysis, etc. scikit-learn
Data Mining, Machine Learning, Data Analysis, etc. scikit-learn
Grouping/Segmentation
Information Organization: Overview
Presentation transcript:

Data Mining (Don’t worry, I am not presenting these slides; just for your reading pleasure)

There has been an evolution in how we look the relationship between the user and the archive. Shortly after HST launch, the user gave simple requests like “give me the output of my observations” and the archive provided data that was relatively unprocessed. As time progressed more users looked at the archive as a whole, and were able to query more complex metadata (e.g. give me F218W data of M33 taken between these dates) and the archive provided much more completely reduced data, along with access to the raw data. Hubble Source Catalog has extended this trend, allowing users to make queries on the characteristics of objects (e.g. SExtractor outputs) rather than metadata.

What does the future of this trend look like? One could imagine more advanced queries like: “I have this SED; give me all the HSC sources that are consistent with it that have observations in these filters. This requires a better understanding of, e.g., non-detections. One could extend these advanced queries to the spectral database: “I have a spectrum, find me more that that are similar to it”. We could enable this by applying algorithms on our end (clustering, unsupervised learning) to the existing spectral database.

What does the future of this trend look like? continued We could provide a richer set of qualities (feature vectors) for objects larger than the PSF that would allow the users to better sort for the kinds of extended objects they are interested in. We could provide server-side classification services: users provide input to train our algorithms to find certain kinds of sources within the data (supervised learning). We could provided trainable/teachable image search (segmentation) algorithms to find complex sources that are not found by SExtractor, e.g. shocks, shells, filaments, arcs, etc.

What does the future of this trend look like? continued Your idea here!