The Management of a Website’s Historical Resources David Chao College of Business San Francisco State University.

Slides:



Advertisements
Similar presentations
Tracking a Web Site’s Historical Links with Temporal URLs David Chao College of Business San Francisco State University.
Advertisements

Languages for Dynamic Web Documents
Project 1 Introduction to HTML.
Server-Side vs. Client-Side Scripting Languages
1 Chapter 12 Working With Access 2000 on the Internet.
The State of the Art in VoiceXML Chetan Sharma, MS Graduate Student School of CSIS, Pace University.
Servlets and a little bit of Web Services Russell Beale.
Macromedia Dreamweaver 4 Advanced Level Course. Add Rollovers Rollovers or mouseovers are possibly the most popular effects used in designing Web pages.
ASP.NET 2.0 Chapter 6 Securing the ASP.NET Application.
The Designing of Web Services to Deliver Web Documents Associated with Historical Links David Chao College of Business San Francisco State University.
Web Page Introduction. What is a web page? A web page is a text file containing markup language tags. –A markup language combines text and extra information.
The Design Of A Web Document Snapshots Delivery System David Chao College of Business San Francisco State University.
1st Project Introduction to HTML.
Part or all of this lesson was adapted from the University of Washington’s “Web Design & Development I” Course materials.
HTML 1 Introduction to HTML. 2 Objectives Describe the Internet and its associated key terms Describe the World Wide Web and its associated key terms.
Chapter ONE Introduction to HTML.
Chapter 9 Collecting Data with Forms. A form on a web page consists of form objects such as text boxes or radio buttons into which users type information.
Section 13.1 Add a hit counter to a Web page Identify the limitations of hit counters Describe the information gathered by tracking systems Create a guest.
Sys Prog & Scripting - HW Univ1 Systems Programming & Scripting Lecture 15: PHP Introduction.
Web 2.0: Concepts and Applications 2 Publishing Online.
8 Chapter Eight Server-side Scripts. 8 Chapter Objectives Create dynamic Web pages that retrieve and display database data using Active Server Pages Process.
INTRODUCTION TO DHTML. TOPICS TO BE DISCUSSED……….  Introduction Introduction  UsesUses  ComponentsComponents  Difference between HTML and DHTMLDifference.
Copyright © cs-tutorial.com. Introduction to Web Development In 1990 and 1991,Tim Berners-Lee created the World Wide Web at the European Laboratory for.
CHAPTER 12 COOKIES AND SESSIONS. INTRO HTTP is a stateless technology Each page rendered by a browser is unrelated to other pages – even if they are from.
Server-side Scripting Powering the webs favourite services.
© Cheltenham Computer Training 2001 Macromedia Dreamweaver 4 - Slide No 1 Macromedia Dreamweaver 4 Advanced Level Course.
Tutorial 10 Adding Spry Elements and Database Functionality Dreamweaver CS3 Tutorial 101.
5 Chapter Five Web Servers. 5 Chapter Objectives Learn about the Microsoft Personal Web Server Software Learn how to improve Web site performance Learn.
CHAPTER 3 Working with Templates and Styles. CHAPTER OBJECTIVES  Create an Expression Web site from a template  Rename a page  Rename a folder  Add.
10 Adding Interactivity to a Web Site Section 10.1 Define scripting Summarize interactivity design guidelines Identify scripting languages Compare common.
PUBLISHING ONLINE Chapter 2. Overview Blogs and wikis are two Web 2.0 tools that allow users to publish content online Blogs function as online journals.
HTML, XHTML, and CSS Sixth Edition Chapter 1 Introduction to HTML, XHTML, and CSS.
Section 17.1 Add an audio file using HTML Create a form using HTML Add text boxes using HTML Add radio buttons and check boxes using HTML Add a pull-down.
NMED 3850 A Advanced Online Design January 12, 2010 V. Mahadevan.
Web Applications BIS4430 – unit 8. Learning Objectives Explain the uses of web application frameworks Relate the client-side, server-side architecture.
Active Server Pages  In this chapter, you will learn:  How browsers and servers interacted on the Internet when the Internet first became popular 
End-User Training Editing Content with JustEdit Brandon Scheirman Instructional Designer, OmniUpdate.
Copyright © 2002 ProsoftTraining. All rights reserved. Java Servlets.
Microsoft FrontPage 2003 Illustrated Complete Integrating a Database with a Web Site.
Web Design and Development. World Wide Web  World Wide Web (WWW or W3), collection of globally distributed text and multimedia documents and files 
Module: Software Engineering of Web Applications Chapter 2: Technologies 1.
Hubnet Training One Health Network South East Asia Network Overview | Public and Members-only Pages; Communicating and Publishing using Blogs and News.
Introduction to HTML. _______________________________________________________________________________________________________________ 2 Outline Key issues.
8 Chapter Eight Server-side Scripts. 8 Chapter Objectives Create dynamic Web pages that retrieve and display database data using Active Server Pages Process.
Exploring Microsoft Microsoft FrontPage Chapter 21 Exploring Microsoft FrontPage 2002 Chapter 1 Creating a Home Page: Introduction to MS FrontPage.
Web Page Introduction. What is a web page? A hypertext is a document contains clickable links. A web page is a text file containing Hyper Text MarkUp.
 Web pages originally static  Page is delivered exactly as stored on server  Same information displayed for all users, from all contexts  Dynamic.
Fundamentals of Web DevelopmentRandy Connolly and Ricardo HoarFundamentals of Web DevelopmentRandy Connolly and Ricardo Hoar Fundamentals of Web DevelopmentRandy.
External Study Credit Granting with SurveyGizmo Shepherd University Department of Psychology.
HTML Concepts and Techniques Fifth Edition Chapter 1 Introduction to HTML.
CSC 2720 Building Web Applications Basic Frameworks for Building Dynamic Web Sites / Web Applications.
Chapter 1 Introduction to HTML, XHTML, and CSS HTML5 & CSS 7 th Edition.
Web Page Introduction. What is a web page? A hypertext that contains clickable links. A web page is a text file containing Hyper Text MarkUp Language.
CHAPTER 7 LESSON C Creating Database Reports. Lesson C Objectives  Display image data in a report  Manually create queries and data links  Create summary.
Understanding Web-Based Digital Media Production Methods, Software, and Hardware Objective
Web based Documentation Distribution Tools: MSAccess database (DSN) DreamWeaver Ultradev Microsoft Image Composer Clicking on the document will open an.
Content Management System (CMS) Introduction for the Prospective Students site.
COMP 143 Web Development with Adobe Dreamweaver CC.
Preface IIntroduction Course Objectives I-2 Course Content I-3 1Introduction to Oracle Reports Developer Objectives 1-2 Business Intelligence 1-3 Enterprise.
HTML PROJECT #1 Project 1 Introduction to HTML. HTML Project 1: Introduction to HTML 2 Project Objectives 1.Describe the Internet and its associated key.
Dreamweaver – Setting up a Site and Page Layouts
Project 1 Introduction to HTML.
Getting Started with CSS
Overview Blogs and wikis are two Web 2.0 tools that allow users to publish content online Blogs function as online journals Wikis are collections of searchable,
Chapter 1 Introduction to HTML.
Project 1 Introduction to HTML.
Website Testing Checklist
Dreamweaver.
Hypertext Preprocessor
INTRODUCTION TO OU CAMPUS
Presentation transcript:

The Management of a Website’s Historical Resources David Chao College of Business San Francisco State University

Introduction An organization’s websites change constantly to reflect the dynamic nature of its environment causing changes in website structure, contents and the supporting technologies.

Types of Change Website structure: –Causing web pages’ URL to change Website content: –Changes to web pages: Insertions, deletions, modifications –Changes to content databases Technology

What are a website’s historical resources? Outdated URLs Outdated web pages: –Web page snapshots –Content database snapshots –Deleted web pages Replaced technologies

The Objective of Managing Historical Resources The major objective of the management of historical resources is to satisfy users’ needs for historical information by enabling the website to recreate or retrieve web page snapshots. –Web page snapshot is the state of a web page at a specific point in time.

Factors Affecting The State Of A Web Page Content factors: –Web page code –The state of internal resources it references: Images, style sheet, components, script files, databases, etc. –The state of external resources it references: External resources are files not managed by the web site but can be referenced in creating the web site’s contents. Environment factors: –Web site host environment variables: System clock –Web technologies implemented on the server-side as well as on the client-side

Levels of Web Page Snapshot Level 1 snapshot: A web document snapshot is the state of web document code at snaptime. –Creating level 1 snapshot enables a web site to trace the changes to the web document code over time. Level 2 snapshot: A level 2 snapshot is a level 1 snapshot with the additional requirement that all the internal resources it references are at least level 1 snapshots at the same snaptime. –Referencing database snapshots Level 3 snapshot: A level 3 snapshot is a level 2 snapshot with the additional requirement that all the external resources it references are at least level 2 snapshots at the snaptime.

Enforcing Environment Factors Page (1) Plus 0: If both environment factors are not enforced. (2) Plus 1: If the host variables are reset to the snapshot time. (3) Plus 2: If web technologies are compatible with the technologies at the snapshot time. (4) Plus 3: If both factors are enforced.

Possible Levels of Snapshot States

Schemes for Tracking Changes Scheme for tracking website structure changes and web page code changes –A logging and archiving scheme Scheme for tracking content database changes.

Design of a Logging and Archiving Scheme for Tracking Website Changes The log, named TemporalURLLog, has five fields: URL, PublishDate, DocExpireDate, URLExpireDate, and NewURL. Those archived documents are saved in the Archive using URL + PublishDate as file name.

Impacts of Website Changes to Historical Links and Archive TimeWebsite ChangesCurrent Web Pages Historical Links Generated Snapshots in Archive T0T0 P1, P2, P3None T1T1 P1 renamed to P4 P5 is added P2, P3, P4, P5P1+ T 0 T2T2 P2 is deleted P3 is modified P3, P4, P5P2+ T 0, P3+ T 0 P2+ T 0, P3+ T 0 T3T3 P3, P4, P5 is modified P1, P6 are added P1, P3, P4, P5, P6P3+ T 2, P4+ T 1 P5+ T 1 P3+ T 2, P4+ T 1, P5+ T 1 T4T4 P3 is deleted P4 is renamed to P8 P5 is renamed to P7 A new page P3 is added P1, P3, P6, P7, P8P3+ T 3, P4+ T 3 P5+T 3 P3+ T 3

The contents of TemporalURLLog URLPublishDateDocExpireDateURLExpireDateNewURL P1T0T0 NullT1P4 P2T0T0 T2T2 T2Null P3T0T0 T2T2 Null P4T1T1 T3T3 Null P4T3NullT4P8 P5T1T1 T3T3 Null P3T2T2 T3T3 Null P3T3T3 T4T4 T4Null P5T3T3 NullT4P7 P6T3T3 Null P1T3T3 Null P8T4T4 Null P7T4T4 Null P3T4T4 Null

Examples of Using the Log Retrieve a snapshot of a current web page: Retrieve a deleted page: Retrieve the snapshot of a deleted web page: –The snapshot of P3 at T2 is in the Archive: P3+ T2. Retrieve the current web page of an out-dated URL: –An old URL P5 is now renamed to P7. If users submit a request for P5, it can be traced to P7. Retrieve the web page previously associated with a current link: –A historical link P1 is now renamed to P8, and a current link P1 points to a new web page. If the current web page associated with P1 is not what the users need, it can be redirected to P8. Determine if an invalid URL ever exists: – A URL P12 has never existed in the web site.

Tracking Changes to Content Databases A web page may use content databases: –(1) as a source for querying. –(2) as storage for contents of placeholders on a web page.

Database Snapshot Management Defining snapshots: CREATE SNAPSHOT snapshotname AS query AS OF snaptime Refreshing snapshots: REFRESH SNAPSHOT snapshotname AS OF new snaptime

Issues in Tracking Changes to Content Databases The content data databases may exist in many formats: –XML, delimited text files, Etc. –Not all content databases are supported by a snapshot management system. The website may not have the authority in the management of the content databases. A web page may retrieve data from many databases. There is no single way in designing content databases.

Tracking Content Database Changes Using Log – An Example Assuming: –One content database supports many web pages. –Each page contains many placeholders. Log design: –PageID + PlaceHolderID + Content + Update Flag + Time Stamp PageID is (URL + Page publish time)

Working with the TemporalLog Because a web page’s URL may change, the content database log needs the support of the TemporalURL log to track the changes of URL. Example:

Delivering Historical Resources to Users A website consists of: –(1) a current website where current web pages are published. –(2) a historical website where historical resources are stored and accessed. A typical web server serves requests for current web pages only and is inadequate to serve a request for historical information.

The Design of a Web Page Snapshot Management System

Summary We developed a scheme to track changes to website structure, web pages and files referenced by web pages, and a second scheme to track changes to content databases so that the website is capable of creating Level 2 snapshots.