Laboratory for Internet Computing Harnessing Distributed, Heterogeneous Information Sources –Data integration with different formats –Extraction of information.


1 Laboratory for Internet Computing
Harnessing Distributed, Heterogeneous Information Sources
–Data integration with different formats
–Extraction of information and attributes
–Data mining
–Presentation of knowledge in intuitive, user-oriented format
Center for Advanced Computer Studies

2 Mission Statement
The UL/CACS Initiative for Commercialization and Transfer of Distributed Heterogeneous Information Systems exists
–To research, develop, and apply innovative information- and knowledge-based solutions to business problems by developing industrial applications, computer-based tool kits, web services, and commercial tools that implement technologies for searching and mining distributed multimedia information and knowledge resources, corporate data, and normal and deep Web sources, and
–To deliver viable, valuable, reproducible business solutions that provide a high return on investment for our industrial, entrepreneurial, and corporate partners, UL/CACS, and the communities we serve.

3 Laboratory for Internet Computing
Personnel
–Vijay Raghavan
–Henry Chu
–Ryan Benton
–Biren Shah
–Zonghuan Wu
–Michael Pratt
Research topics
–Data mining
–Data visualization
–Digital library
–Machine learning
–Information retrieval
–Web resource integration
–Image processing

4 Laboratory for Internet Computing
Software development environment for multi-member projects
–CVS: Version control system
–Plone: Content management system
–SCons: Build tool for cross-platform compilation
–Bugzilla: Bug-tracking system

5 Laboratory for Internet Computing
Machine learning
–SNNS neural network simulator
–WEKA
Visualization
–Vtk
Terrain visualization
–TerraVista

6 Laboratory for Internet Computing
Technology transfer
–Star Software: Data-driven prognostics and automatic satellite image annotation
–Evidence Management: Web-based document management system for experts in the legal domain
–Fenstermaker and Associates: Perceptual data visualization
–ARAICOM: Conceptual biology
–Webscalers: Large-scale and customized meta-search
–Wisesoft: Creation of a 3D airport model for an air traffic controller simulator

7 Laboratory for Internet Computing
Current R&D projects
–Digital photogrammetry for visualization content creation
–LITE application support
–Knowledge discovery from texts
–Image steganalysis
–Information extraction
–Search and metasearch
–Discovery-driven online analytical processing
–Semi-structured data management

8 Laboratory for Internet Computing
Strategic Direction: Exabyte-scale Data Engineering
–A terabyte is 1,000 billion (10^12) bytes of data
–An exabyte is a billion billion (10^18) bytes of data

9 Exabyte-scale Data Engineering
–10^14 bytes: National Climatic Data Center
–10^13 bytes: Print collection at the U.S. Library of Congress
–10^11 bytes: Biological sequence databases
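
To put the sizes above in perspective, the short sketch below compares each quoted collection with one exabyte. It uses only the byte counts given on the slides; the names `COLLECTIONS` and `copies_per_exabyte` are illustrative and do not come from the presentation.

```python
# Decimal (SI-style) sizes as defined on the slides.
TERABYTE = 10**12  # 1,000 billion bytes
EXABYTE = 10**18   # a billion billion bytes

# Approximate collection sizes quoted on the slide, in bytes.
COLLECTIONS = {
    "National Climatic Data Center": 10**14,
    "Print collection at the U.S. Library of Congress": 10**13,
    "Biological sequence databases": 10**11,
}


def copies_per_exabyte(size_bytes: int) -> int:
    """Return how many collections of the given size fit in one exabyte."""
    return EXABYTE // size_bytes


for name, size in COLLECTIONS.items():
    # Show each size in terabytes and how many such collections make an exabyte.
    print(f"{name}: {size / TERABYTE:g} TB, "
          f"{copies_per_exabyte(size):,} copies per exabyte")
```

Even the largest collection listed is four orders of magnitude below an exabyte, which is the gap the strategic direction is aimed at.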

10 Exabyte-scale Data Engineering
Look at the rate at which data are collected and generated
Amount of new information generated
–In 1999 = 2 exabytes
–In 2002 = 5 exabytes
92% stored as digital data
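
As a rough illustration of that rate, the sketch below computes the annual growth implied by the two figures on the slide. It assumes smooth compound growth between 1999 and 2002, which the slide itself does not claim; the function name `compound_annual_growth` is illustrative.

```python
def compound_annual_growth(start: float, end: float, years: int) -> float:
    """Return the constant yearly growth rate that turns start into end."""
    return (end / start) ** (1 / years) - 1


# New information generated per year, in exabytes, as quoted on the slide.
NEW_INFO_EXABYTES = {1999: 2.0, 2002: 5.0}

rate = compound_annual_growth(
    NEW_INFO_EXABYTES[1999], NEW_INFO_EXABYTES[2002], 2002 - 1999
)
print(f"Implied growth in new information: {rate:.1%} per year")
```

Under that assumption the yearly output grows by roughly a third each year, so it would double in a little over two years.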

11 Exabyte-scale Data Engineering
Test beds
–Geospatial information management
–Health informatics
–Bioinformatics
–Metasearch engines for the Internet
–In-memory databases

