CS 257 Database Systems Dr. T Y Lin Ultimate Goal Data Science (Big Data)

Slides:



Advertisements
Similar presentations
Lecture 6: Creating a simplicial complex from data. in a series of preparatory lectures for the Fall 2013 online course MATH:7450 (22M:305) Topics in Topology:
Advertisements

Tyler White MATH 493 Dr. Wanner
Face Recognition Face Recognition Using Eigenfaces K.RAMNATH BITS - PILANI.
Lecture 5: Triangulations & simplicial complexes (and cell complexes). in a series of preparatory lectures for the Fall 2013 online course MATH:7450 (22M:305)
Big Data and Predictive Analytics in Health Care Presented by: Mehadi Sayed President and CEO, Clinisys EMR Inc.
Department of Mathematics and Computer Science
1 Latent Semantic Indexing Jieping Ye Department of Computer Science & Engineering Arizona State University
Kyle Heath, Natasha Gelfand, Maks Ovsjanikov, Mridul Aanjaneya, Leo Guibas Image Webs Computing and Exploiting Connectivity in Image Collections.
Intelligent Systems Group Emmanuel Fernandez Larry Mazlack Ali Minai (coordinator) Carla Purdy William Wee.
E.G.M. PetrakisDimensionality Reduction1  Given N vectors in n dims, find the k most important axes to project them  k is user defined (k < n)  Applications:
Overview of Web Data Mining and Applications Part I
Cyberinfrastructure Supporting Social Science Cyberinfrastructure Workshop October Chicago Geoffrey Fox
Managing & Integrating Enterprise Data with Semantic Technologies Susie Stephens Principal Product Manager, Oracle
Final Search Terms: Archiving (digital or data) Authentication (data) Conservation (digital or data) Curation (digital or data) Cyberinfrastructure Data.
Data and Applications Security Developments and Directions Dr. Bhavani Thuraisingham The University of Texas at Dallas Introduction to the Course January.
Structure of Study Programmes
CS523 INFORMATION RETRIEVAL COURSE INTRODUCTION YÜCEL SAYGIN SABANCI UNIVERSITY.
X-Informatics Web Search; Text Mining B 2013 Geoffrey Fox Associate Dean for.
Digital Libraries: Background and Overview NAWeb 2003 Jeremy Rowe Arizona State University Partnership for Research In Spatial Modeling.
CS6700 Advanced AI Bart Selman. Admin Project oriented course Projects --- research style or implementation style with experimental component. 1 or 2.
Push Singh & Tim Chklovski. AI systems need data – lots of it! Natural language processing: Parsed & sense-tagged corpora, paraphrases, translations Commonsense.
Structure of Study Programmes Bachelor of Computer Science Bachelor of Information Technology Master of Computer Science Master of Information Technology.
Lecture 4: Addition (and free vector spaces) of a series of preparatory lectures for the Fall 2013 online course MATH:7450 (22M:305) Topics in Topology:
1 Granular Computing: Formal Theory & Applications Tsau Young (‘T. Y.’) Lin Computer Science Department, San Jose State University San Jose, CA 95192,
Data Science for DTIC Data Ecosystem Dr. Brand Niemann Director and Senior Data Scientist/Data Journalist Semantic Community
Thanks to Bill Arms, Marti Hearst Documents. Last time Size of information –Continues to grow IR an old field, goes back to the ‘40s IR iterative process.
PLoS ONE Application Journal Publishing System (JPS) First application built on Topaz application framework Web 2.0 –Uses a template engine to display.
Overviews of ITCS 6161/8161: Advanced Topics on Database Systems Dr. Jianping Fan Department of Computer Science UNC-Charlotte
Research Interests of Dr. Dennis J Bouvier Fall 2007.
Lecture 2: Addition (and free abelian groups) of a series of preparatory lectures for the Fall 2013 online course MATH:7450 (22M:305) Topics in Topology:
© copyright 2014 Semantic Insights™ “A New Natural Language Understanding Technology for Research of Large Information Corpora." By Chuck Rehberg, CTO.
Optional Lecture: A terse introduction to simplicial complexes in a series of preparatory lectures for the Fall 2013 online course MATH:7450 (22M:305)
1 Granular Computing: Formal Theory & Applications Tsau Young (‘T. Y.’) Lin GrC Society and Computer Science Department, San Jose State University San.
COMPUTER SCIENCE Data Representation and Machine Concepts Section 2.2 Instructor: Lin Chen Sept 2013.
Master’s Degree in Computer Science. Why? Acquire Credentials Learn Skills –Existing software: Unix, languages,... –General software development techniques.
Data and Applications Security Developments and Directions Dr. Bhavani Thuraisingham The University of Texas at Dallas Lecture #15 Secure Multimedia Data.
9/03 Data Mining – Introduction G Dong (WSU)1 CS499/ Data Mining Fall 2003 Professor Guozhu Dong Computer Science & Engineering WSU.
Digital Libraries1 David Rashty. Digital Libraries2 “A library is an arsenal of liberty” Anonymous.
National Technical University of Ukraine “Kiev Polytechnic Institute” Heat and energy design faculty Department of automation design of energy processes.
MATH:7450 (22M:305) Topics in Topology: Scientific and Engineering Applications of Algebraic Topology Sept 9, 2013: Create your own homology. Fall 2013.
FastMap : Algorithm for Indexing, Data- Mining and Visualization of Traditional and Multimedia Datasets.
Computer Troubleshooting Intelligent System (CTIS) is being developed for Computer Services Department of the Student Health Center at UCF EEL 5874 EXPERT.
67 x 89 = ? 67 x
Data Mining in Germany IIM Conference, Oct. 24, 2012 Gottfried Schwarz, DLR > Lecture > Author Document > Datewww.DLR.de Chart 1.
1 ALGEBRAIC TOPOLOGY SIMPLICAL COMPLEX ALGEBRAIC TOPOLOGY SIMPLICAL COMPLEX Tsau Young (‘T. Y.’) Lin Institute of Data Science and Computing and Computer.
1 Algebraic Topology in Data Science Algebraic Topology in Data Science GrC in Big Data Tsau Young (‘T. Y.’) Lin Institute of Data Science and Computing.
1 CS 430: Information Discovery Lecture 28 (a) Two Examples of Cluster Analysis (b) Conclusion.
Why Should You Apply to Graduate School? Masters Degree
Brief Intro to Machine Learning CS539
Faculty of Computer and Information Science
Independent Study of Ontologies
Data and Applications Security Developments and Directions
INFORMATION COMPRESSION, MULTIPLE ALIGNMENT, AND INTELLIGENCE
Data and Applications Security Developments and Directions
Dr. T Y Lin Ultimate Goal Data Science (Big Data)
Future Technologies FTC 2016 Future Technologies Conference December 2016 San Francisco, United States.
What is Pattern Recognition?
Basic Intro Tutorial on Machine Learning and Data Mining
机器感知与智能教育部重点实验室学术报告 Key Laboratory of Machine Perception (Minister of Education) Peking University Scalable, Robust and Integrative Algorithms for Analyzing.
Clouds & Containers: Case Studies for Big Data
Frontiers of Computer Science, 2015, 9(6):980–989
Multimedia Information Retrieval
CS6700 Advanced AI Prof. Carla Gomes Prof. Bart Selman
Algebraic Topology Simplical Complex
Create PT Applications from Ideas
Search Engine Architecture
Data and Applications Security Developments and Directions
Chapter 3: Simplicial Homology Instructor: Yusu Wang
AI Application Session 12
Presentation transcript:

CS 257 Database Systems Dr. T Y Lin Ultimate Goal Data Science (Big Data)

CS 257- OverView CS257 and Big Data +: VLDB (Very Large Database ) +: Unstructured Data, i.e. Text/Web Image, Multimedia, Video, Vision Bio, Scientific Data Processing Light: Cloud Computing Light: Data Science /Knowledge Engineering etc

CS 257- OverView Major Applications in Big Data Medical Informatic VLDB + Image +Cloud + Security (CS286) Financial Informatic VLDB + BI + Cloud + Security (CS286) Web Engineering Business Intelligence(BI) Data Science (Knowledge Engineering in Web/Image/Bio/etc Data)

CS 257- OverView Instructor: IEEE Best Contribution Award in Data Mining (ICDM 2001) ACM/IEEE Best Service Award Web Intelligent (WI-2007) Best Contribution Award Rough Set (2005) Pioneer Award in Granular Computing (2008)

CS 257- OverView

6 Project Overview Verification and Validation of the Core Engine of a Concept Based Semantic Search Engine

7 Main Idea A set of documents is associated with a Matrix, called 1) Latent Semantic Index(LSI), by treating the row vectors as points in Euclidean space (point=TFIDF), - Google’s approach

8 Main Idea 2) Topological approach : A polyhedron (combinatorially, = a Simplicial Complex) is built to capture and structure the concepts

9 An open segment is a 1-simplex, an open triangle (faces) is a 2-simplex and an open tetrahedron is a 3-simplex, and... n-simplex.segmentfaces A collection of simlexes (satisfies closed condition) is called simplicial complex that is a combinatorial representation of a polyhedron that led to a “new” subject called algebraic topology. The project is algebraic topology based search engine.polyhedron