Restricted Search Engine Laurent Balat Christophe Decis Thomas Forey Sebastien Leclercq ESSI2 Project Supervisor: Johny BOND June 2002.


1 Restricted Search Engine Laurent Balat Christophe Decis Thomas Forey Sebastien Leclercq ESSI2 Project Supervisor: Johny BOND June 2002

2 Introduction (1) What is a search engine? Three types: disciplinary, global, thematic. Internet users spend more than 50% of their time searching!

3 Introduction (2) Lots of pages can’t be reached. [Figure: WEB, Indexable WEB, Google]

4 How does it work? The search engine is composed of two parts. First part: the web site spider. [Figure: WEB, Spider, Processing, Indexing, PDF unit, DOC unit, HTML processing unit, DATABASE, Constraint]

5 How does it work? User part architecture. [Figure: User, Query Interface, Query engine, DATABASE]

6 Constraints Domain Restriction. Search depth. Theme: words accepted or not. Document type. Time delay.
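The constraints listed on this slide could be grouped into a single configuration object. The following is only a sketch: the class name `CrawlConstraints`, its field names, the use of MIME types for document types, and the reading of "time delay" as a pause between requests are all assumptions, not details taken from the project.

```python
from dataclasses import dataclass
from urllib.parse import urlparse


@dataclass
class CrawlConstraints:
    """Hypothetical container for the five constraints on the slide."""
    allowed_domain: str                                   # domain restriction
    max_depth: int = 3                                    # search depth
    accepted_words: frozenset = frozenset()               # theme: words accepted
    rejected_words: frozenset = frozenset()               # theme: words rejected
    document_types: frozenset = frozenset({"text/html"})  # document type
    delay_seconds: float = 1.0                            # time delay (assumed: pause between requests)

    def allows(self, url, depth, content_type):
        """Return True if a link passes the domain, depth and type checks."""
        host = urlparse(url).hostname or ""
        in_domain = (host == self.allowed_domain
                     or host.endswith("." + self.allowed_domain))
        return (in_domain
                and depth <= self.max_depth
                and content_type in self.document_types)
```

A spider could call `allows` on each candidate link before queuing it; the word lists would be applied later, at indexing time.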

7 The Spider Part Check whether the link has already been visited. Check that the data type satisfies the constraints. [Figure: link priority queue, HTTP HEAD, Download, Error, Push page, Stack data page]
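The spider loop on this slide can be sketched roughly as follows. This is an illustration under assumptions, not the project's code: the function `crawl` and its parameters are hypothetical, network access is injected through the `head` and `download` callables so the sketch stays runnable offline, priority is assumed to mean "shallowest link first", and a failed request is modeled as an `OSError`.

```python
import heapq


def crawl(start_url, head, download, extract_links, allowed_types, max_depth):
    """Visit links in priority order and return the downloaded pages.

    head(url)           -> content type of the resource (stands in for HTTP HEAD)
    download(url)       -> body of the page
    extract_links(body) -> iterable of URLs found in the body
    """
    visited = set()
    queue = [(0, start_url)]      # link priority queue of (depth, url)
    pages = []                    # downloaded pages, pushed as they arrive

    while queue:
        depth, url = heapq.heappop(queue)
        if url in visited:        # check if link already visited
            continue
        visited.add(url)
        try:
            content_type = head(url)
            if content_type not in allowed_types:   # check type against constraints
                continue
            body = download(url)
        except OSError:           # error branch: drop the link and carry on
            continue
        pages.append((url, body))                   # push page
        if depth < max_depth:
            for link in extract_links(body):
                if link not in visited:
                    heapq.heappush(queue, (depth + 1, link))
    return pages
```

Issuing the cheap `head` call before `download` lets the spider reject unwanted document types without fetching their bodies.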

8 Document Processing Analyse the document type. Send the document to the appropriate unit. Extract words and links. Try to resolve bad links.
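As an illustration of these steps, the sketch below dispatches a document to a processing unit by type, then extracts words and links from HTML. The names `process_document`, `process_html` and `UNITS` are hypothetical, only an HTML unit is shown, and "resolving bad links" is interpreted here, as an assumption, as turning relative links into absolute ones.

```python
import re
from html.parser import HTMLParser
from urllib.parse import urljoin


class _HtmlUnit(HTMLParser):
    """Collects lowercase words and absolute links from one HTML page."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.words = []
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            href = dict(attrs).get("href")
            if href:
                # Resolve relative links against the URL of the page.
                self.links.append(urljoin(self.base_url, href))

    def handle_data(self, data):
        self.words.extend(w.lower() for w in re.findall(r"[A-Za-z]+", data))


def process_html(base_url, body):
    unit = _HtmlUnit(base_url)
    unit.feed(body)
    unit.close()
    return unit.words, unit.links


# One entry per processing unit; PDF or DOC units would be registered here.
UNITS = {"text/html": process_html}


def process_document(url, content_type, body):
    """Analyse the type and send the document to the appropriate unit."""
    unit = UNITS.get(content_type)
    if unit is None:
        raise ValueError("no processing unit for " + content_type)
    return unit(url, body)
```

Keeping the units in a dictionary means a new file format only needs a new entry, not a change to the dispatch code.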

9 Indexing Binary search tree: quick to build, efficient to use. Check constraints: start list and stop list.
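A binary search tree index with start-list and stop-list filtering might look like the sketch below. The class names `Node` and `WordIndex` are hypothetical, the tree is a plain unbalanced one, and the start list is assumed to mean "if given, only these words are indexed"; the slides do not specify any of this.

```python
class Node:
    def __init__(self, word):
        self.word = word
        self.urls = set()     # pages on which the word was seen
        self.left = None
        self.right = None


class WordIndex:
    """Unbalanced binary search tree keyed by word."""

    def __init__(self, start_list=None, stop_list=()):
        self.root = None
        self.start_list = set(start_list) if start_list is not None else None
        self.stop_list = set(stop_list)

    def _accepted(self, word):
        if word in self.stop_list:
            return False
        return self.start_list is None or word in self.start_list

    def add(self, word, url):
        if not self._accepted(word):      # check constraints first
            return
        if self.root is None:
            self.root = Node(word)
        node = self.root
        while node.word != word:          # walk down, creating the node if needed
            side = "left" if word < node.word else "right"
            child = getattr(node, side)
            if child is None:
                child = Node(word)
                setattr(node, side, child)
            node = child
        node.urls.add(url)

    def lookup(self, word):
        node = self.root
        while node is not None and node.word != word:
            node = node.left if word < node.word else node.right
        return set(node.urls) if node is not None else set()
```

Insertion and lookup cost is proportional to the depth of the tree, so the structure is only efficient while the tree stays reasonably balanced; words arriving in sorted order would degrade it to a list.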

10 Database MySQL database. General structure: keywords; web links; correspondence between keywords and links.
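The three-part structure above maps naturally onto two entity tables and one association table. The sketch below uses Python's built-in `sqlite3` module as a stand-in for MySQL so that it runs without a server; the table names, column names and the helpers `open_database` and `record` are assumptions, not the project's actual schema.

```python
import sqlite3

# Hypothetical schema: keywords, web links, and the correspondence between them.
SCHEMA = """
CREATE TABLE keywords (
    id   INTEGER PRIMARY KEY,
    word TEXT UNIQUE NOT NULL
);
CREATE TABLE links (
    id  INTEGER PRIMARY KEY,
    url TEXT UNIQUE NOT NULL
);
CREATE TABLE keyword_link (
    keyword_id INTEGER NOT NULL REFERENCES keywords(id),
    link_id    INTEGER NOT NULL REFERENCES links(id),
    PRIMARY KEY (keyword_id, link_id)
);
"""


def open_database(path=":memory:"):
    conn = sqlite3.connect(path)
    conn.executescript(SCHEMA)
    return conn


def record(conn, word, url):
    """Store one (keyword, link) pair, ignoring rows that already exist."""
    conn.execute("INSERT OR IGNORE INTO keywords (word) VALUES (?)", (word,))
    conn.execute("INSERT OR IGNORE INTO links (url) VALUES (?)", (url,))
    conn.execute(
        "INSERT OR IGNORE INTO keyword_link (keyword_id, link_id) "
        "SELECT k.id, l.id FROM keywords k, links l "
        "WHERE k.word = ? AND l.url = ?",
        (word, url),
    )
```

Note that `INSERT OR IGNORE` is SQLite syntax; MySQL spells the same idea `INSERT IGNORE`.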

11 User interface and query engine The web page is generated by a CGI script. The query engine queries the database, then formats the results.
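The query step can be sketched as one function that looks up links in the database and another that formats them as HTML for the generated page. Everything named here is an assumption: the functions `search` and `format_results`, the tables `keywords`, `links` and `keyword_link`, the rule that a page must match every query word, and the use of `sqlite3` in place of MySQL.

```python
import sqlite3
from html import escape


def search(conn, query):
    """Return the URLs of pages that contain every word of the query."""
    words = sorted(set(query.lower().split()))
    if not words:
        return []
    marks = ",".join("?" * len(words))    # one placeholder per query word
    rows = conn.execute(
        "SELECT l.url FROM links l "
        "JOIN keyword_link kl ON kl.link_id = l.id "
        "JOIN keywords k ON k.id = kl.keyword_id "
        f"WHERE k.word IN ({marks}) "
        "GROUP BY l.id HAVING COUNT(DISTINCT k.id) = ? "
        "ORDER BY l.url",
        (*words, len(words)),
    ).fetchall()
    return [url for (url,) in rows]


def format_results(urls):
    """Render the result list as an HTML fragment for the results page."""
    if not urls:
        return "<p>No results.</p>"
    items = "".join(
        f'<li><a href="{escape(u, quote=True)}">{escape(u)}</a></li>'
        for u in urls
    )
    return f"<ul>{items}</ul>"
```

The query words are passed as bound parameters rather than pasted into the SQL text, and the URLs are escaped before being written into the page, so user input cannot alter either the query or the markup.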

12 Demonstration (1) Fill the Database

13 Demonstration (2) How to search pages?

14 Conclusion Results and perspectives: an original search engine; easy to improve by adding units to process different file formats (ps, doc, xls,…). Teamwork and division of tasks. This project showed us how to use the different tools seen this year.

15 References http://www.w3c.org http://www.mysql.com http://www.sgi.com/tech/stl http://www.searchengineshowdown.com

