GROUP 00000011 PresentsPresents. WEB CRAWLER A visualization of links in the World Wide Web Software Engineering C Semester Two Massey University - Palmerston.

Slides:



Advertisements
Similar presentations
USING WORDPRESS. WEEK 1 1.Why WP? 2.Setting Up WP 3.Exploring the Admin screen 4.Page Organization 5.Posting 6.Polls.
Advertisements

Usage of the memoQ web service API by LSP – a case study
WSUS Presented by: Nada Abdullah Ahmed.
Web Visualization Technology Horner APG Ver 1.0.
T-FLEX DOCs PLM, Document and Workflow Management.
Network Management Overview IACT 918 July 2004 Gene Awyzio SITACS University of Wollongong.
Quality Assurance CS 615. Mission Statement The Quality Assurance team will provide assurance to stakeholders in CS-615/616 projects that their projects.
Kerim KORKMAZ A. Tolga KILINÇ H. Özgür BATUR Berkan KURTOĞLU.
Copyright 2003 The McGraw-Hill Companies, Inc CHAPTER Application Software computing ESSENTIALS    
Presented by Mina Haratiannezhadi 1.  publishing, editing and modifying content  maintenance  central interface  manage workflows 2.
© InLoox ® InLoox PM Web App product presentation The Online Project Software.
Web Programming Language Dr. Ken Cosh Week 1 (Introduction)
A project management tool in Polytechnic University of Tirana (Bachelor thesis) Iva Tarelli Prof. Assoc. Elinda Kajo 14 th Workshop “SEE&RE” Sinaia, Romania,
Louisa Lambregts, What Makes a Web Site Successful and Effective? Bottom Line... Site are successful if they meet goals/expectations.
11 SUPPORTING INTERNET EXPLORER IN WINDOWS XP Chapter 11.
Linux Operations and Administration
Students: Nadia Goshmir, Yulia Koretsky Supervisor: Shai Rozenrauch Industrial Project Advanced Tool for Automatic Testing Final Presentation.
Object Oriented Databases by Adam Stevenson. Object Databases Became commercially popular in mid 1990’s Became commercially popular in mid 1990’s You.
HTML Comprehensive Concepts and Techniques Intro Project Introduction to HTML.
Project Proposal: Academic Job Market and Application Tracker Website Project designed by: Cengiz Gunay Client: Cengiz Gunay Audience: PhD candidates and.
Crystal Hoyer Program Manager IIS Team Preview of features that will be announced at MIX09 Please do not blog, take pictures or video of session.
VeribisCRM CUSTOMER RELATIONSHIP MANAGEMENT Engin Duran Experience is our know how.
Chapter 16 The World Wide Web Chapter Goals Compare and contrast the Internet and the World Wide Web Describe general Web processing Describe several.
SIUE Injury Tracking System Project Plan. Team Members: Robbie Marsh Robbie Marsh –Project Manager/Webmaster Ken Metcalf Ken Metcalf –Lead Programmer.
London April 2005 London April 2005 Creating Eyeblaster Ads The Rich Media Platform The Rich Media Platform Eyeblaster.
London April 2005 London April 2005 Creating Eyeblaster Ads The Rich Media Platform The Rich Media Platform Eyeblaster.
MIS 385/MBA 664 Systems Implementation with DBMS/ Database Management Dave Salisbury ( )
HTML, XHTML, and CSS Sixth Edition Chapter 1 Introduction to HTML, XHTML, and CSS.
 Chapter 6 Architecture 1. What is Architecture?  Overall Structure of system  First Stage in Design process 2.
Web Categorization Crawler Mohammed Agabaria Adam Shobash Supervisor: Victor Kulikov Winter 2009/10 Design & Architecture Dec
Objective Understand concepts used to web-based digital media. Course Weight : 5%
LloydBank ATM Project Engineering Project Design Team: Saud Malik Brittany Johnson Joseph Garrison.
CHAPTER TEN AUTHORING.
Contents 1.Introduction, architecture 2.Live demonstration 3.Extensibility.
Project Plan for nSite Central Michael Dunn Ryan Sessions Kyle Kerrigan.
CSE 219 Computer Science III Program Design Principles.
Introduction to Version Control SE-2030 Dr. Rob Hasker 1 Based on material at and slides written.
Problem Statement: Users can get too busy at work or at home to check the current weather condition for sever weather. Many of the free weather software.
What’s new in Kentico CMS 5.0 Michal Neuwirth Product Manager Kentico Software.
Lecture 6 Title: Web Planning, Designing, Developing for E-Marketing By: Mr Hashem Alaidaros MKT 445.
IBISAdmin Utah’s Web-based Public Health Indicator Content Management System.
NETWORK HARDWARE AND SOFTWARE MR ROSS UNIT 3 IT APPLICATIONS.
United Nations Economic Commission for Europe Statistical Division The Importance of Databases in the Dissemination Process Steven Vale, UNECE.
WEP Presentation for non-IT Steps and roles in software development 2. Skills developed in 1 st year 3. What can do a student in 1 st internship.
Ad Hoc Graphical Reports Ad Hoc Graphical Reports Copyright © Team #4 CSCI 6838 Spring CSCI Research Project and Seminar Team# 4 (
Sample School Website. What is wrong with the existing School Webspace Site? Can only host static pages – no dynamic content possible. Can not be edited.
 Network  A _____ of computers that can _________ w/ each other  Examples of hardware  ______________ & communication lines  Internet  Hardware.
Copyright (c) 2003 by Prentice Hall Chapter 2 Applications Software: Getting the Work Done Computers: Tools for an Information Age BSM025 Computers.
D R A T D R A T ABSTRACT Every semester each department at Iowa State University has to assign its faculty members and teaching assistants (TAs) to the.
Web Browsing *TAKE NOTES*. Millions of people browse the Web every day for research, shopping, job duties and entertainment. Installing a web browser.
Oman College of Management and Technology Course – MM Topic 7 Production and Distribution of Multimedia Titles CS/MIS Department.
8 th Semester, Batch 2009 Department Of Computer Science SSUET.
+ Publishing Your First Post USING WORDPRESS. + A CMS (content management system) is an application that allows you to publish, edit, modify, organize,
Thrdplace Social Networking Team #7 1. TRR Outline Operational Concept Overview System benefits to Customer 1.Introduction Demo of System Operational.
Internet Searching the World Wide Web. The Internet and the World Wide Web The Internet is a worldwide collection of networks that allows people to communicate.
E-commerce Architecture Ayşe Başar Bener. Client Server Architecture E-commerce is based on client/ server architecture –Client processes requesting service.
In order to survive in the era of competition a business firm needs market research. Researching market involves thorough analysis and gathering of data.
Expertsfromindia for Joomla Development. Introduction Joomla is an open source and free content management system (CMS) for publishing content on the.
How to use Drupal Awdhesh Kumar (Team Leader) Presentation Topic.
CS 501: Software Engineering Fall 1999 Lecture 23 Design for Usability I.
HedEx Lite Obtaining and Using Huawei Documentation Easily
Architecture Review 10/11/2004
Web Programming Language
Chapter 1 Introduction to HTML
Personnel Effort Budget
InLoox PM Web App product presentation
Objective Understand web-based digital media production methods, software, and hardware. Course Weight : 10%
An ecosystem of contributions
Slides prepared by Sarah Benis Scheier-Dolberg
Intro Project Introduction to HTML.
Presentation transcript:

GROUP PresentsPresents

WEB CRAWLER A visualization of links in the World Wide Web Software Engineering C Semester Two Massey University - Palmerston North 2009 Software Engineering C Semester Two Massey University - Palmerston North 2009

About the team Amir Hoshang Kioumars Amir Hoshang Kioumars Major: Software Engineering Major: Software Engineering Position: Team Leader Position: Team Leader Chagitha Ranhotigamage Chagitha Ranhotigamage Major: Computer System Engineering Major: Computer System Engineering Position: Team Member Position: Team Member Jeffrey Hamilton Jeffrey Hamilton Major: Computer Science Major: Computer Science Position: Team Member Position: Team Member Reference: Wikipedia

The purpose of the project Design and implement a network analysis tool. The end product would be a desktop application that can be used to display and visualise the relationships of web sites to other web sites and display as a visualize graph. Reference: Wikipedia

Project Requirements Graph must be animated. It should be redrawn if more sites and links are discovered. Graph must be animated. It should be redrawn if more sites and links are discovered. The application should run multi-threaded. The application should run multi-threaded. The graph can be stored in a database, and restored for analysis. The graph can be stored in a database, and restored for analysis. The customer should be able to control look and feel. The customer should be able to control look and feel. The design should be modular, and easy to extend. For example: export/import of graphs, plug-in other views.” The design should be modular, and easy to extend. For example: export/import of graphs, plug-in other views.”

What was our plan ? Understanding the needs and establishing the requirements Understanding the needs and establishing the requirements Select a suitable language which all the team members were familiar with Select a suitable language which all the team members were familiar with Select a graph package that suits our needs Select a graph package that suits our needs Prioritise the tasks Prioritise the tasks Design a project plan in terms of our time and resources Design a project plan in terms of our time and resources Release a version of software after finishing each milestone Release a version of software after finishing each milestone To keep everything as simple as possible To keep everything as simple as possible

What was the plan result ? An accurate project plan An accurate project plan The dead lines we set were achieved The dead lines we set were achieved We released a new version at the conclusion of each milestone We released a new version at the conclusion of each milestone We update the project plan by highlighting the tasks for better monitoring the process We update the project plan by highlighting the tasks for better monitoring the process

What is the Web Crawler ? Given a URL address, it searches through the page and finds all the links on the page. It then follows all the links it found, and continues to do this until it reaches a pre- defined depth. Given a URL address, it searches through the page and finds all the links on the page. It then follows all the links it found, and continues to do this until it reaches a pre- defined depth. As it searches it will display all the links as nodes on a graph. As it searches it will display all the links as nodes on a graph.

Why is the Web Crawler useful ? Can be used to analyse the changes that have occurred to the links from a page over time. Can be used to analyse the changes that have occurred to the links from a page over time. It would also be useful to investigate how many links there are between two separate pages. It would also be useful to investigate how many links there are between two separate pages. Could be used for working out degrees of freedom (on social sites). Could be used for working out degrees of freedom (on social sites).

Animating the graph

Optional Extras Nodes can be labeled in Five different ways Nodes can be labeled in Five different ways Links can be filtered according to their type Links can be filtered according to their type Look and Feel of the User Interface can be changed Look and Feel of the User Interface can be changed Graph can be exported as images Graph can be exported as images Depth of the search is user definable Depth of the search is user definable

Multi-Threading Program is threaded, preventing the whole application becoming frozen when a single part fails. For Example, when a link is invalid, or the internet connection is lost. Program is threaded, preventing the whole application becoming frozen when a single part fails. For Example, when a link is invalid, or the internet connection is lost.

Storing in a database Program stores the links into a database Program stores the links into a database Database has the same name as project Database has the same name as project Database uses Apache Derby Database uses Apache Derby Database is in a SQL standard format which can be queried by any SQL support program (mySQL, Access and…) Database is in a SQL standard format which can be queried by any SQL support program (mySQL, Access and…) Database contents can be read as a static graph Database contents can be read as a static graph

Why our product is special and what is the features of our program ?

Simpler is better Easy to use Easy to use Open source Open source Free Free Platform independent Platform independent No need for installation No need for installation Stable Stable Standard user interface Standard user interface Easy to maintain Easy to maintain Uses simple algorithms Uses simple algorithms Works with proxy Works with proxy Uses an SQL database Uses an SQL database We like it We like it

How we managed the project ?

By breaking down the complexities and defining the tasks for each milestone or version or release By breaking down the complexities and defining the tasks for each milestone or version or release

Functionality at each milestone: Version 1.0 Version 1.0 Version 2.0 Version 2.0 Version 3.0 Version 3.0 Version 4.0 Version 4.0 Final Release Final Release

Our issues policy was: Our issues policy was: Open a new issue Open a new issue Accept issue: max 2 days Accept issue: max 2 days Work on Issue: max 5 days Work on Issue: max 5 days Assign Issue for verification: 1 day Assign Issue for verification: 1 day Verify and close issue: max 2 days Verify and close issue: max 2 days Issues priorities are in terms of its dependencies. For example, if issue is on a important function which share between classes, it would be critical and some issues like GUI and usability will be cosmetic and can be done at any stage. Also before deployment, all the issues should be done. Issue tracking policy

Deployment The application can be run as a standalone jar file, and is available to download as a zipped file with the user manual from our website: The application can be run as a standalone jar file, and is available to download as a zipped file with the user manual from our website: Any updates and maintenance that are made to the program will be documented and logged on the page above. Any updates and maintenance that are made to the program will be documented and logged on the page above. As part of the website, there is a step by step example video of a simple search taking place, and how to use the program (this is also linked to directly from the help menu of the application. As part of the website, there is a step by step example video of a simple search taking place, and how to use the program (this is also linked to directly from the help menu of the application.

What did we learn ?

Always have a plan B (at least) Always have a plan B (at least) Communication is the key Communication is the key Research the problem thoroughly Research the problem thoroughly Better use of issue tracking system Better use of issue tracking system It’s a good reminder to avoid any clashes

Dr. Jens Dietrich for his incredible support, help and advice for the duration of this project Dr. Jens Dietrich for his incredible support, help and advice for the duration of this project Dr. Russell Johnson for his friendly help Dr. Russell Johnson for his friendly help And all the guests and students for your attention And all the guests and students for your attention Special thanks to:

The End Good luck in your final exams & have a great long holiday