Presentation is loading. Please wait.

Presentation is loading. Please wait.

NTU Natural Language Processing Lab. 1 An Analysis of Effectiveness of Tagging in Blogs Christopher H. Brooks and Nancy Montanez University of San Francisco.

Similar presentations


Presentation on theme: "NTU Natural Language Processing Lab. 1 An Analysis of Effectiveness of Tagging in Blogs Christopher H. Brooks and Nancy Montanez University of San Francisco."— Presentation transcript:

1 NTU Natural Language Processing Lab. 1 An Analysis of Effectiveness of Tagging in Blogs Christopher H. Brooks and Nancy Montanez University of San Francisco Advisor: Hsin-Hsi Chen Speaker: Sheng-Chung Yen ( 嚴聖筌 )

2 2 NTU Natural Language Processing Lab. Agenda Introduction Technorati Uses of Tags Experiments Conclusion & Future Work References

3 3 NTU Natural Language Processing Lab. Introduction Tagging Tag are collections of keywords that are attached to blog entries, obviously to help describe the entry. Folksonomy A labeling system that enables Internet users to categorize contents. del.icio.us http://del.icio.us/ flickr http://www.flickr.com/

4 4 NTU Natural Language Processing Lab. Technorati Technorati is a search engine and aggregation site that focuses on indexing and collecting all of the information in blogosphere. http://www.technorati.com/

5 5 NTU Natural Language Processing Lab. Uses of Tags (1/2) The 250 most popular tags on Technorati, as of October 6, 2005.

6 6 NTU Natural Language Processing Lab. Uses of Tags (2/2) The greatest potential uses of tags is as a means for annotating particular articles and indicating their content. It may be the case that less popular tags are better at describing the subject of specific articles.

7 7 NTU Natural Language Processing Lab. Experiments Hypothesis A cluster of documents that shared a tag should be more similar than a randomly constructed set of documents. 350 most popular tags from Technorati. For each tag, 250 most resent articles.

8 8 NTU Natural Language Processing Lab. Experiments

9 9 NTU Natural Language Processing Lab. Experiments

10 10 NTU Natural Language Processing Lab. Experiments

11 11 NTU Natural Language Processing Lab. Experiments

12 12 NTU Natural Language Processing Lab. Experiments Automated tagging Extracting the three words with the top TFIDF score. These words were then treated as the article’s “autotags.”

13 13 NTU Natural Language Processing Lab. Experiments

14 14 NTU Natural Language Processing Lab. Conclusion & Future Work Tags help users group their blog entries into broad categories.

15 15 NTU Natural Language Processing Lab. References [1] Christopher H Brooks and Nancy Montanez, An Analysis of the Effectiveness of Tagging in Blogs, AAAI 2006. [2] Wikipedia, http://en.wikipedia.org/wiki/Main_Page. http://en.wikipedia.org/wiki/Main_Page


Download ppt "NTU Natural Language Processing Lab. 1 An Analysis of Effectiveness of Tagging in Blogs Christopher H. Brooks and Nancy Montanez University of San Francisco."

Similar presentations


Ads by Google