Database Applications: Web-Enabled Databases and Search Engines

Slides:



Advertisements
Similar presentations
IIS Technologies.
Advertisements

DT211/3 Internet Application Development Active Server Pages & IIS Web server.
10/25/2001Database Management -- R. Larson Data Administration and Database Administration University of California, Berkeley School of Information Management.
SLIDE 1IS 257 – Fall 2009 More on MySQL and SQL University of California, Berkeley School of Information IS 257: Database Management.
Oct. 12, 2000Database Management -- R. Larson Web-Enabled Databases and Search Engines University of California, Berkeley School of Information Management.
15 Chapter 15 Web Database Development Database Systems: Design, Implementation, and Management, Fifth Edition, Rob and Coronel.
B.Sc. Multimedia ComputingMedia Technologies Database Technologies.
SLIDE 1IS 257 – Fall 2009 Database Applications and Web-Enabled Databases University of California, Berkeley School of Information IS 257:
SLIDE 1IS 257 – Fall 2006 Coldfusion and PHP introduction University of California, Berkeley School of Information IS 257: Database Management.
SLIDE 1IS Fall 2002 Database Applications: Web-Enabled Databases and Search Engines University of California, Berkeley School of Information.
SLIDE 1IS Fall 2004 Database Administration: Security and Integrity University of California, Berkeley School of Information Management.
Introduction to Web Database Processing
Oct. 11, 2001Database Management -- R. Larson Database Applications: Web-Enabled Databases and Search Engines University of California, Berkeley School.
SLIDE 1IS 257 – Fall 2010 PHP introduction University of California, Berkeley School of Information IS 257: Database Management.
Outline IS400: Development of Business Applications on the Internet Fall 2004 Instructor: Dr. Boris Jukic Server Side Web Technologies: Part 2.
10/17/2000Database Management -- R. Larson Data Administration and Database Administration University of California, Berkeley School of Information Management.
SLIDE 1IS 202 – FALL 2002 Prof. Ray Larson & Prof. Marc Davis UC Berkeley SIMS Tuesday and Thursday 10:30 am - 12:00 pm Fall 2002
Week 2 IBS 685. Static Page Architecture The user requests the page by typing a URL in a browser The Browser requests the page from the Web Server The.
Introduction to Web Interface Technology (CSE2030)
SLIDE 1IS 257 – Fall 2010 Database Applications and Web-Enabled Databases University of California, Berkeley School of Information IS 257:
SLIDE 1IS 257 – Spring 2005 Database Applications and Web-Enabled Databases University of California, Berkeley School of Information Management.
1 CS6320 – Why Servlets? L. Grewe 2 What is a Servlet? Servlets are Java programs that can be run dynamically from a Web Server Servlets are Java programs.
SLIDE 1IS 257 – Spring 2004 Database Applications and Introduction to ColdFusion and PHP University of California, Berkeley School of Information.
Introduction to Web Interface Technology (CSE2030)
How Clients and Servers Work Together. Objectives Learn about the interaction of clients and servers Explore the features and functions of Web servers.
Oct. 16, 2001Database Management -- R. Larson Database Applications: Web-Enabled Databases and Search Engines: Cont. University of California, Berkeley.
Advanced Distributed Software Architectures and Technology group ADSaT 1 Application Architectures Ian Gorton, Paul Greenfield.
SLIDE 1IS Fall 2002 Database Applications: Using ColdFusion University of California, Berkeley School of Information Management and Systems.
Electronic Commerce Last Week Internet utility programs
Database Management Systems (DBMS)
SLIDE 1IS 257 – Fall 2014 Database Applications and Web-Enabled Databases University of California, Berkeley School of Information IS 257:
2440: 141 Web Site Administration Web Server-Side Programming Professor: Enoch E. Damson.
10/5/1999Database Management -- R. Larson Data Administration and Database Administration University of California, Berkeley School of Information Management.
INTRODUCTION TO WEB DATABASE PROGRAMMING
Server- Side technologies Client-side vs. Server-side scripts PHP basic ASP.NET basic ColdFusion.
FALL 2005CSI 4118 – UNIVERSITY OF OTTAWA1 Part 4 Web technologies: HTTP, CGI, PHP,Java applets)
CSCI 6962: Server-side Design and Programming Course Introduction and Overview.
Architecture Of ASP.NET. What is ASP?  Server-side scripting technology.  Files containing HTML and scripting code.  Access via HTTP requests.  Scripting.
Week 7 Lecture Web Database Development Samuel Conn, Asst. Professor
About Dynamic Sites (Front End / Back End Implementations) by Janssen & Associates Affordable Website Solutions for Individuals and Small Businesses.
Introduction to ColdFusion Penn State Web 2001 Conference Brian Panulla Elmwood Media Group, LLC.
COLD FUSION Deepak Sethi. What is it…. Cold fusion is a complete web application server mainly used for developing e-business applications. It allows.
Fundamentals of Database Chapter 7 Database Technologies.
Web Server Administration Chapter 7 Installing and Testing a Programming Environment.
Introduction to ColdFusion Yu Fu 2003 MEC Candidate.
Putting it all together Dynamic Data Base Access Norman White Stern School of Business.
Chapter 6 Server-side Programming: Java Servlets
1 CS122B: Projects in Databases and Web Applications Spring 2015 Notes 03: Web-App Architectures Professor Chen Li Department of Computer Science CS122B.
Web Server Administration Chapter 7 Installing and Testing a Programming Environment.
MBA 664 Database Management Dave Salisbury ( )
CITA 310 Section 7 Installing and Testing a Programming Environment (Textbook Chapter 7)
WEB SERVER SOFTWARE FEATURE SETS
Database Connectivity and Server-Side Scripting Chapter 12.
ASP-2-1 SERVER AND CLIENT SIDE SCRITPING Colorado Technical University IT420 Tim Peterson.
Unit 1 – Web Concepts Instructor: Brent Presley.
1 Connecting Databases to the Web January 31 th, 2000 Seree Chinodom.
1 CSC160 Chapter 1: Introduction to JavaScript Chapter 2: Placing JavaScript in an HTML File.
A S P. Outline  The introduction of ASP  Why we choose ASP  How ASP works  Basic syntax rule of ASP  ASP’S object model  Limitations of ASP  Summary.
Database Applications and Web-Enabled Databases
Introduction to Dynamic Web Programming
Connecting Databases to the Web
Connecting Databases to the Web
Introduction and Principles
PHP / MySQL Introduction
Database Driven Websites
Coldfusion and PHP introduction
Introduction of Week 11 Return assignment 9-1 Collect assignment 10-1
Introduction of Week 13 Return assignment 11-1 and 3-1-5
IntroductionToPHP Static vs. Dynamic websites
Presentation transcript:

Database Applications: Web-Enabled Databases and Search Engines University of California, Berkeley School of Information Management and Systems SIMS 257: Database Management IS 257 – Spring 2004

Lecture Outline Review Databases for Web Applications – Overview Introduction to SQL Application Development in Access Databases for Web Applications – Overview IS 257 – Spring 2004

Lecture Outline Review Databases for Web Applications – Overview Introduction to SQL Application Development in Access Databases for Web Applications – Overview IS 257 – Spring 2004

Access Usability Hierarchy API VBA MACROS Functions/Expressions Objects – Tables, queries Forms, Reports From McFadden Chap. 10 IS 257 – Spring 2004

The MS JET Database Engine Database app Visual Basic Access Excel Word Visual Basic for Applications (VBA) Host Languages for the Jet DBMS Data Access Objects (DAO) Includes DDL and DML Jet Database Engine (Jet DBMS) Jet Query Engine Internal ISAM Replication Database Adapted from Roman, “Access Database Design and Programming” IS 257 – Spring 2004

Using Access for Applications Forms Reports Macros VBA programming Application framework HTML Pages IS 257 – Spring 2004

Lecture Outline Review Databases for Web Applications – Overview Introduction to SQL Application Development in Access Databases for Web Applications – Overview IS 257 – Spring 2004

Overview Why use a database system for Web design and e-commerce? What systems are available? Pros and Cons of different web database systems? Text retrieval in database systems Search Engines for Intranet and Intrasite searching IS 257 – Spring 2004

Why Use a Database System? Simple Web sites with only a few pages don’t need much more than static HTML files IS 257 – Spring 2004

Simple Web Applications Server Web Internet Files Clients IS 257 – Spring 2004

Adding Dynamic Content to the Site Small sites can often use simple HTML and CGI scripts accessing data files to create dynamic content for small sites. IS 257 – Spring 2004

Dynamic Web Applications 1 Server CGI Web Internet Files Clients IS 257 – Spring 2004

Issues For Scaling Up Web Applications Performance Scalability Maintenance Data Integrity Transaction support IS 257 – Spring 2004

Performance Issues Problems arise as both the data to be managed and usage of the site grows. Interpreted CGI scripts are inherently slower than compiled native programs Starting CGI applications takes time for each connection Load on the system compounds the problem Tied to other scalability issues IS 257 – Spring 2004

Scalability Issues Well-designed database systems will permit the applications to scale to accommodate very large databases A script that works fine scanning a small data file may become unusable when the file becomes large. Issues of transaction workload on the site Starting a separate copy of a CGI program for each user is NOT a scalable solution as the workload grows IS 257 – Spring 2004

Maintenance Issues Dealing with multiple data files (customer list, product list, customer orders, etc.) using CGI means: If any data element in one of the files changes, all scripts that access that file must be rewritten If files are linked, the programs must insure that data in all the files remains synchronized A large part of maintenance will involve dealing with data integrity issues Unanticipated requirements may require rewriting scripts IS 257 – Spring 2004

Data Integrity Constraint Issues These are constraints we wish to impose in order to protect the database from becoming inconsistent. Five basic types Required data attribute domain constraints entity integrity referential integrity enterprise constraints IS 257 – Spring 2004

Transaction support Concurrency control (ensuring the validity of database updates in a shared multiuser environment). IS 257 – Spring 2004

No Concurrency Control: Lost updates John Marsha Read account balance (balance = $1000) Withdraw $200 (balance = $800) Write account balance (balance = $800) Read account balance (balance = $1000) Withdraw $300 (balance = $700) Write account balance (balance = $700) ERROR! IS 257 – Spring 2004

Concurrency Control: Locking Locking levels Database Table Block or page Record Field Types Shared (S locks) Exclusive (X locks) IS 257 – Spring 2004

Concurrency Control: Updates with X locking John Marsha Lock account balance Read account balance (balance = $1000) Withdraw $200 (balance = $800) Write account balance (balance = $800) Unlock account balance Read account balance (DENIED) Lock account balance Read account balance (balance = $800) etc... IS 257 – Spring 2004

Concurrency Control: Deadlocks John Marsha Place S lock Read account balance (balance = $1000) Request X lock (denied) wait ... Place S lock Read account balance (balance = $1000) Request X lock (denied) wait... Deadlock! IS 257 – Spring 2004

Transaction Processing Transactions should be ACID: Atomic – Results of transaction are either all committed or all rolled back Consistent – Data is transformed from one consistent state to another Isolated – The results of a transaction are invisible to other transactions Durable – Once committed the results of a transaction are permanent and survive system or media failures IS 257 – Spring 2004

Why Use a Database System? Database systems have concentrated on providing solutions for all of these issues for scaling up Web applications Performance Scalability Maintenance Data Integrity Transaction support While systems differ in their support, most offer some support for all of these. IS 257 – Spring 2004

Dynamic Web Applications 2 Server database CGI DBMS Web Internet Files Clients IS 257 – Spring 2004

Server Interfaces Database Web Server Web Application Server Web DB HTML JavaScript DHTML CGI API’s ColdFusion PhP Perl Java ASP SQL ODBC Native DB interfaces JDBC Native DB Interfaces Adapted from John P Ashenfelter, Choosing a Database for Your Web Site IS 257 – Spring 2004

What Database systems are available? Choices depend on: Size (current and projected) of the application Hardware and OS Platforms to be used in the application Features required E.g.: SQL? Upgrade path? Full-text indexing? Attribute size limitations? Locking protocols? Direct Web Server access? Security? Staff support for DBA, etc. Programming support (or lack thereof) Cost/complexity of administration Budget IS 257 – Spring 2004

Desktop Database Systems Individuals or very small enterprises can create DBMS-enabled Web applications relatively inexpensively Some systems will require an application server (such as ColdFusion) to provide the access path between the Web server and the DBMS IS 257 – Spring 2004

Pros and Cons of Database Options Desktop databases usually simple to set up and administer inexpensive often will not scale to a very large number of users or very large database size May lack locking management appropriate for multiuser access Poor handling for full-text search Well supported by application software (Coldfusion, PHP, etc.) IS 257 – Spring 2004

Enterprise Database Systems Enterprise servers are powerful and available in many different configurations They also tend to be VERY expensive Pricing is usually based on users, or CPU’s IS 257 – Spring 2004

Pros and Cons of Database Options Enterprise databases Can be very complex to set up and administer Oracle, for example recommends RAID-1 with 7x2 disk configuration as a bare minimum, more recommended Expensive Will scale to a very large number of users Will scale to very large databases Incorporate good transaction control and lock management Native handling of Text search is poor, but most DBMS have add-on text search options Support for applications software (ColdFusion, PHP, etc.) IS 257 – Spring 2004

Free Database Servers System is free, but there is also no help line. Include many of the features of Enterprise systems, but tend to be lighter weight Versions may vary in support for different systems Open Source -- So programmers can add features IS 257 – Spring 2004

Pros and Cons of Database Options Free databases Can be complex to set up and administer Inexpensive (FREE!) usually will scale to a large number of users Incorporate good transaction control and lock management Native handling of Text search is poor Support for applications software (ColdFusion, PHP, etc.) IS 257 – Spring 2004

Embedded Database Servers May require programming experience to install Tend to be fast and economical in space requirements IS 257 – Spring 2004

Pros and Cons of Database Options Embedded databases Must be embedded in a program Can be incorporated in a scripting language inexpensive (for non-commercial application) May not scale to a very large number of users (depends on how it is used) Incorporate good transaction control and lock management Text search support is minimal May not support SQL IS 257 – Spring 2004

Database Security Different systems vary in security support: Views or restricted subschemas Authorization rules to identify users and the actions they can perform User-defined procedures (and rule systems) to define additional constraints or limitations in using the database Encryption to encode sensitive data Authentication schemes to positively identify a person attempting to gain access to the database IS 257 – Spring 2004

Views A subset of the database presented to some set of users. SQL: CREATE VIEW viewname AS SELECT field1, field2, field3,…, FROM table1, table2 WHERE <where clause>; Note: “queries” in Access function as views. IS 257 – Spring 2004

Authorization Rules Most current DBMS permit the DBA to define “access permissions” on a table by table basis (at least) using the GRANT and REVOKE SQL commands. Some systems permit finer grained authorization (most use GRANT and REVOKE on variant views. Some desktop systems have poor authorization support. IS 257 – Spring 2004

Database Backup and Recovery Journaling (audit trail) Checkpoint facility Recovery manager IS 257 – Spring 2004

Web Application Server Software ColdFusion PHP ASP All of the are server-side scripting languages that embed code in HTML pages IS 257 – Spring 2004

ColdFusion Developing WWW sites typically involved a lot of programming to build dynamic sites e.g. Pages generated as a result of catalog searches, etc. ColdFusion was designed to permit the construction of dynamic web sites with only minor extensions to HTML through a DBMS interface IS 257 – Spring 2004

ColdFusion Started as CGI Split into cooperating components Drawback, as noted above, is that the entire system is run for each cgi invocation Split into cooperating components NT service -- runs constantly Server modules for 4 main Web Server API (glue that binds web server to ColdFusion service) {Apache, ISAPI, NSAPI, WSAPI} Special CGI scripts for other servers IS 257 – Spring 2004

What ColdFusion is Good for Putting up databases onto the Web Handling dynamic databases (Frequent updates, etc) Making databases searchable and updateable by users. IS 257 – Spring 2004

Requirements Unix or NT systems Install as SuperUser Databases must be defined via “data source names (DSNs) by administrator IS 257 – Spring 2004

Requirements and Set Up Field names should be devoid of spaces. Use the underscore character, like new_items instead of "new items." Use key fields. Greatly reduces search time. Check permissions on the individual tables in your database and make sure that they have read-access for the username your Web server uses to log in. If your fields include large blocks of text, you'll want to include basic HTML coding within the text itself, including boldface, italics, and paragraph markers. IS 257 – Spring 2004

Templates Assume we have a database named contents_of_my_shopping_cart.mdb -- single table called contents... Create an HTML page (uses extension .cfm), before <HEAD>... <CFQUERY NAME= ”cart" DATASOURCE=“contents_of_my_shopping_cart"> SELECT * FROM contents ; </CFQUERY> IS 257 – Spring 2004

Templates cont. <HEAD> <TITLE>Contents of My Shopping Cart</TITLE> </HEAD> <BODY> <H1>Contents of My Shopping Cart</H1> <CFOUTPUT QUERY= ”cart"> <B>#Item#</B> <BR> #Date_of_item# <BR> $#Price# <P> </CFOUTPUT> </BODY> </HTML> IS 257 – Spring 2004

Templates cont. Contents of My Shopping Cart Bouncy Ball with Psychedelic Markings 12 December 1998 $0.25 Shiny Blue Widget 14 December 1998 $2.53 Large Orange Widget $3.75 IS 257 – Spring 2004

CFIF and CFELSE <CFOUTPUT QUERY= ”cart"> Item: #Item# <BR> <CFIF #Picture# EQ""> <IMG SRC=“generic_picture.jpg"> <BR> <CFELSE> <IMG SRC="#Picture#"> <BR> </CFIF> </CFOUTPUT> IS 257 – Spring 2004

More Templates <CFQUERY DATASOURCE = “AZ2”> INSERT INTO Employees(firstname, lastname, phoneext) VALUES(‘#firstname#’, ‘#lastname#’, ‘#phoneext#’) </CFQUERY> <HTML><HEAD><TITLE>Employee Added</TITLE> <BODY><H1>Employee Added</H1> <CFOUTPUT> Employee <B>#firstname# #lastname#</B> added. </CFOUTPUT></BODY> </HTML> IS 257 – Spring 2004

CFML ColdFusion Markup Language Read data from and update data to databases and tables Create dynamic data-driven pages Perform conditional processing Populate forms with live data Process form submissions Generate and retrieve email messages Perform HTTP and FTP function Perform credit card verification and authorization Read and write client-side cookies IS 257 – Spring 2004

PHP PHP is an Open Source Software project with many programmers working on the code. Commonly paired with MySQL, another OSS project Free Both Windows and Unix support Estimated that more than 250,000 web sites use PHP as an Apache Module. IS 257 – Spring 2004

PHP Syntax Similar to ASP Includes most programming structures (Loops, functions, Arrays, etc.) Loads HTML form variables so that they are addressable by name <HTML><BODY> <?php $myvar = “Hello World”; echo $myvar ; ?> </BODY></HTML> IS 257 – Spring 2004

Combined with MySQL DBMS interface appears as a set of functions: <HTML><BODY> <?php $db = mysql_connect(“localhost”, “root”); mysql_select_db(“mydb”,$db); $result = mysql_query(“SELECT * FROM employees”, $db); Printf(“First Name: %s <br>\n”, mysql_result($result, 0 “first”); Printf(“Last Name: %s <br>\n”, mysql_result($result, 0 “last”); ?></BODY></HTML> IS 257 – Spring 2004

ASP – Active Server Pages Another server-side scripting language From Microsoft using Visual Basic as the Language model (VBScript), though Javascript (actually MS Jscript) is also supported Works with Microsoft IIS and gives access to ODBC databases IS 257 – Spring 2004

ASP Syntax <% SQL="SELECT last, first FROM employees ORDER BY last" set conn = server.createobject("ADODB.Connection") conn.open “employee" set people=conn.execute(SQL) %> <% do while not people.eof set resultline=people(0) & “, “ & people(1) & “<BR>” Response.Write(resultline) people.movenext loop%> <% people.close %> IS 257 – Spring 2004

Conclusions Database technology is a required component for large-scale dynamic Web sites, especially E-Commerce sites Web databases cover most of the needs of dynamic sites (except for text search) Many solutions and systems are available for web-enabled databases IS 257 – Spring 2004

Next week More on application development and Web DBs including more detail on Cold Fusion and PHP More on SQL, including introduction to ORACLE ORACLE Account information ORACLE Documentation IS 257 – Spring 2004