Presentation is loading. Please wait.

Presentation is loading. Please wait.

Unicode Normalize Engine Submitted by: Jose Yallouz Shlomi Ben-Shabat Supervisor: Maxim Gurevich.

Similar presentations


Presentation on theme: "Unicode Normalize Engine Submitted by: Jose Yallouz Shlomi Ben-Shabat Supervisor: Maxim Gurevich."— Presentation transcript:

1 Unicode Normalize Engine Submitted by: Jose Yallouz Shlomi Ben-Shabat Supervisor: Maxim Gurevich

2 Project Goals Recognition of web pages’ encoding. Translation of web page to Utf-8. Normalize the web into a single encoding standard- Utf-8.

3 Translation Decision HTML HTTP Header URL Bom tag Auto Detection METAHTTP Unicode Output

4 Class Diagram

5 Heuristic For Encoding Detection

6 ODP analysis Average detection of 92.615685 percent.

7 Application Usage Client usage – client browser can use this system to show the different web page in one encoding format – utf8. Server usage – web server can use this system to translate the different storage pages into utf8. Processing usage – different web page processing systems, like search engines, can use our system to convert different pages into the standard Unicode encoding.


Download ppt "Unicode Normalize Engine Submitted by: Jose Yallouz Shlomi Ben-Shabat Supervisor: Maxim Gurevich."

Similar presentations


Ads by Google