Thoughts about Computer Science Research in Information-rich Applications Areas. William Y. Arms, Cornell University, March 14, 2000.

Presentation transcript:

1 Thoughts about Computer Science Research in Information-rich Applications Areas William Y. Arms Cornell University March 14, 2000

2 Changes in Computer Science
Over 25 years, computer science has broadened.
From: a narrow range of academic topics
To include:
  systems
  human-computer interaction
  economic, legal, and social aspects

3 Computer Science Today Past achievements in computer science are a powerful force in the national prosperity. Universities have excellent students who have tremendous opportunities. An extensive body of theoretical and practical knowledge has accumulated. Exciting research can be found in every direction.

4 Approaches to Computer Science Research Applications Theory Experimentation

5 Computing and Information Science (Cornell)
Interdisciplinary partnerships:
  Computational biology, genomics, protein folding, etc.
  Computational science
  Computer graphics, architecture, design, film-making
  Digital libraries, information management
  Computational finance, economics
Computer science can contribute to each of these fields. Each field can stimulate new research in computer science.

6 The University as a Test Bed
University tradition of innovation in computing:
  Time sharing (MIT, Dartmouth)
  Networks and distributed computing (Carnegie Mellon, MIT)
  Online information (Illinois, etc.)
  Wireless and nomadic computing (???)
Advantages:
  Tight feedback loop between researcher and user
  Innovation valued for its own sake
  Access to resources (equipment, people, money)

7 Research Partners Academic research Industrial R&D Entrepreneurs

8 Example: Digital Libraries
In 1990, there were many experiments in building digital libraries:
  CORE (Bellcore, Cornell, OCLC)     Lesk, et al.
  Gopher (Minnesota)                 Gopher team
  Mercury (Carnegie Mellon)          Arms, et al.
  WAIS (Thinking Machines)           Kahle, et al.
  World Wide Web (CERN)              Berners-Lee, et al.
  Z (Major libraries)                Lynch, et al.
The leaders of all projects were either computer scientists or had spent most of their working life in state-of-the-art computing.

9 Foundations of the Web
  Technology        Ancestors
  Internet          ARPAnet/NSFnet, X.25, ISO
  URL               Domain Name System
  HTML              SGML, TeX, PostScript
  HTTP              TCP / FTP / Gopher, Z 39.50, SQL
  MIME              , ODA
  Security          None, SNA, Kerberos
  Business model    None, pay-by-use, subscription

10 Example: Web Search Engines
Lycos (Mauldin, Carnegie Mellon)
Technical basis:
  Research in text-skimming (Ph.D. thesis)
  Pursuit free text retrieval engine (TREC)
  Robot exclusion research (private interest)
Organizational basis:
  Center for Machine Translation
  Grant flexibility (DARPA)

11 Example: Web Search Engines
Google (Page and Brin, Stanford)
Technical basis:
  Research in ranking hyperlinks (Ph.D. thesis)
Organizational basis:
  Grant flexibility (NSF Digital Libraries Initiative)
  Equipment grant (Hewlett Packard)
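The slide names "ranking hyperlinks" without describing a method. As an illustration only, here is a minimal sketch of a power-iteration link ranking in the style of the published PageRank idea. The function name `rank_pages`, the `damping` value of 0.85, the iteration count, and the toy graph are all assumptions made for this example, not details from the talk or from any production system.

```python
def rank_pages(links, damping=0.85, iterations=100):
    """Rank pages by incoming links using simple power iteration.

    links: dict mapping each page to the list of pages it links to.
    All names and parameter values here are illustrative assumptions.
    """
    pages = set(links) | {t for targets in links.values() for t in targets}
    n = len(pages)
    rank = {p: 1.0 / n for p in pages}
    for _ in range(iterations):
        # Every page receives a small baseline share.
        new_rank = {p: (1.0 - damping) / n for p in pages}
        for page in pages:
            targets = links.get(page, [])
            if targets:
                # A page divides its damped rank among the pages it links to.
                share = damping * rank[page] / len(targets)
                for t in targets:
                    new_rank[t] += share
            else:
                # A page with no outgoing links spreads its rank evenly.
                share = damping * rank[page] / n
                for p in pages:
                    new_rank[p] += share
        rank = new_rank
    return rank


if __name__ == "__main__":
    toy_web = {"a": ["b", "c"], "b": ["c"], "c": ["a"]}
    for page, score in sorted(rank_pages(toy_web).items()):
        print(page, round(score, 3))
```

In the toy graph, page "c" is linked from both other pages and so ends up with the largest score; the scores always sum to 1.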

12 The Internet Graph
Theoretical research in graph theory:
  Six degrees of separation
  Pareto distributions
Algorithms:
  Hubs and authorities (Kleinberg, Cornell)
Empirical data:
  Commercial (Yahoo!, Google, Alexa, AltaVista, Lycos)
  Not-for-profit (Internet Archive)
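The hubs-and-authorities idea can be sketched briefly: a good authority is a page linked from good hubs, and a good hub is a page that links to good authorities. The code below is a minimal sketch of that mutual reinforcement, assuming a small in-memory link graph. The function name, the iteration count, and the sample graph are hypothetical choices for this example.

```python
import math


def hubs_and_authorities(links, iterations=50):
    """Iteratively compute hub and authority scores for a link graph.

    links: dict mapping each page to the list of pages it links to.
    Names and the iteration count are illustrative assumptions.
    """
    pages = sorted(set(links) | {t for ts in links.values() for t in ts})
    hub = {p: 1.0 for p in pages}
    auth = {p: 1.0 for p in pages}
    for _ in range(iterations):
        # Authority score: sum of the hub scores of pages linking in.
        auth = {p: 0.0 for p in pages}
        for src, targets in links.items():
            for t in targets:
                auth[t] += hub[src]
        # Hub score: sum of the authority scores of pages linked to.
        hub = {p: sum(auth[t] for t in links.get(p, [])) for p in pages}
        # Normalize both vectors so the scores stay bounded.
        a_norm = math.sqrt(sum(v * v for v in auth.values())) or 1.0
        h_norm = math.sqrt(sum(v * v for v in hub.values())) or 1.0
        auth = {p: v / a_norm for p, v in auth.items()}
        hub = {p: v / h_norm for p, v in hub.items()}
    return hub, auth


if __name__ == "__main__":
    graph = {"h1": ["x", "y"], "h2": ["x", "y"], "h3": ["x"]}
    hub, auth = hubs_and_authorities(graph)
    print("best authority:", max(auth, key=auth.get))
    print("best hub:", max(hub, key=hub.get))
```

In the sample graph, "x" is linked from all three hub pages and receives the highest authority score, while "h3", which links to only one page, is a weaker hub than "h1" or "h2".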

13 The Limits of the Web The web has grown upon existing computer science knowledge. The strengths of that knowledge have enabled enormous growth. The limits of that knowledge have constrained the growth. Al Demers

14 The Web: Limits to Growth -- Databases
Transaction processing databases, e.g., Amazon.com:
  The biggest online systems ever built, with many computers around the world.
Desirable features:
  No interruptions
  No transactions ever lost
  Secure from all intruders
In practice some transactions are lost; data is sometimes inconsistent. This is acceptable for selling books, but what about banking?

15 The Web: Limits to Growth -- Security
Why is security on the Internet so difficult?
1. Public key encryption was invented in the mid-1970s, yet widespread deployment remains elusive.
2. System security is riddled with loopholes:
   operating system security was developed when operating systems were simple monitors
   operating systems are now very complex and hence vulnerable
   language-based security seeks simpler interfaces to which security can be attached
Fred Schneider

16 The Web: Limits to Growth -- Security
The Internet is based on stateless protocols:
  routing
  HTTP
Stateless protocols have allowed flexible growth, but inhibit certain controls:
  junk
  denial of service attacks
Can we quantify the trade-off?

17 Priorities Function Schedule Cost academic research industry

18 Priorities: Andrew File System Carnegie Mellon Industry Microsoft (2000) IBM (1989) Campus file system (1985) Coda research

19 Two Fears
Two fears for digital libraries:
  Librarians will ignore the expertise of computer science.
  Computer scientists will ignore the insights of librarians.
Two fears for X:
  Specialists in X will ignore the expertise of computer science.
  Computer scientists will ignore the insights of specialists in X.

20 Thoughts for the NSF
  Applications and computer science need to be side by side.
  Big projects appear to be more productive than small ones.
  Inter-disciplinary collaboration cannot be forced.

21 Thoughts about Computer Science Research in Information-rich Applications Areas William Y. Arms Cornell University March 14, 2000