Presentation is loading. Please wait.

Presentation is loading. Please wait.

Adaptive News Access Daniel Billsus Presented by Chirayu Wongchokprasitti.

Similar presentations


Presentation on theme: "Adaptive News Access Daniel Billsus Presented by Chirayu Wongchokprasitti."— Presentation transcript:

1 Adaptive News Access Daniel Billsus billsus@fxpal.com Presented by Chirayu Wongchokprasitti

2 Introduction WWW is a common source for news access anywhere and anytime. WWW is a common source for news access anywhere and anytime. The availability of updated news content recently overloads us The availability of updated news content recently overloads us Adaptive web technology can help discovering relevant content from thousands of sources. Adaptive web technology can help discovering relevant content from thousands of sources.

3 Types of Adaptive News Access News Personalization News Personalization Adaptive News Navigation Adaptive News Navigation Contextual Recommendations Contextual Recommendations News Aggregation News Aggregation

4 News Personalization Dynamic content Dynamic content News stories are released and updated continuously News stories are released and updated continuously Content-based methods fit to news personalization Content-based methods fit to news personalization Content-based methods predict user’s interest based on text alone Content-based methods predict user’s interest based on text alone Changing interests Changing interests User’s interests tend to change frequently. User’s interests tend to change frequently. A user model can adjust its interests quickly. A user model can adjust its interests quickly. The techniques of changing target concepts is known as concept drift. The techniques of changing target concepts is known as concept drift.

5 News Personalization (cont.) Multiple interests Multiple interests Users are interested in different news topics Users are interested in different news topics A user model must be capable of representing multiple interests A user model must be capable of representing multiple interests The k-nearest-neighbor methods (kNN) are good choices. The k-nearest-neighbor methods (kNN) are good choices. Novelty Novelty A new “unknown” story is considered most interesting. A new “unknown” story is considered most interesting. A new story too close to what user previously accessed is classified as a “known” story. A new story too close to what user previously accessed is classified as a “known” story.

6 News Personalization (cont.) Avoiding tunnel vision Avoiding tunnel vision Personalization should not get in the way of finding important novel information. Personalization should not get in the way of finding important novel information. Editorial input Editorial input A user model ranks stories by a prediction function. A user model ranks stories by a prediction function. Retaining editorial input is an important feature for news organizations. Retaining editorial input is an important feature for news organizations. To ensure users will get to see the top n stories. To ensure users will get to see the top n stories.

7 News Personalization (cont.) Brittleness Brittleness A single action, with or without intention, should not have a radical effect on a user model. A single action, with or without intention, should not have a radical effect on a user model. Availability of meta-tags Availability of meta-tags News personalization algorithms can usually not rely on the availability of meta-tags. News personalization algorithms can usually not rely on the availability of meta-tags.

8 News Personalization (cont.)

9 Adaptive News Navigation The objective is to simplify access to relevant content. The objective is to simplify access to relevant content. This technique focuses on analyzing user’s access patterns to determine the position of menu items within a menu hierarchy. This technique focuses on analyzing user’s access patterns to determine the position of menu items within a menu hierarchy. This approach is suitable to mobile applications due to limited screen space. This approach is suitable to mobile applications due to limited screen space.

10 Adaptive News Navigation (cont.) On average, the number of selected menu and scroll operations was reduced by over 50%. On average, the number of selected menu and scroll operations was reduced by over 50%. However, this approach does not provide any news recommendations. However, this approach does not provide any news recommendations.

11 Contextual Recommendations An approach draws on currently displayed information on the screen as an expression of the user’s current interests. An approach draws on currently displayed information on the screen as an expression of the user’s current interests. The system extracts textual information on the user’s screen and the extracted text is used to retrieve related content. The system extracts textual information on the user’s screen and the extracted text is used to retrieve related content. Statistical term-weighting techniques are used to identify informative terms. Statistical term-weighting techniques are used to identify informative terms. Blinkx is a publicly available contextual recommender (http://www.blinkx.com). Blinkx is a publicly available contextual recommender (http://www.blinkx.com).http://www.blinkx.com

12 Contextual Recommendations (cont.)

13 News Aggregation News aggregators are services that aggregate content from many news sources, and then adapt to the current news landscape as a whole. News aggregators are services that aggregate content from many news sources, and then adapt to the current news landscape as a whole. The services use RSS (Rich Site Summary) feeds to provide links to available content. The services use RSS (Rich Site Summary) feeds to provide links to available content. A news aggregation implementation can use statistical term-weighting and text similarity techniques. A news aggregation implementation can use statistical term-weighting and text similarity techniques. Google News (http://news.google.com) is one of these services. Google News (http://news.google.com) is one of these services.http://news.google.com

14 News Aggregation (cont.)

15 Case Study Adaptive News Personalization for Mobile Content Access Adaptive News Personalization for Mobile Content Access Learning User Models for News Access Learning User Models for News Access Evaluation Evaluation

16 Adaptive News Personalization for Mobile Content Access The constraints of mobile information access make personalization important to produce usable applications. The constraints of mobile information access make personalization important to produce usable applications. A news system in mobile personalizes the orders of news sections the most relevant stories are displayed on the topmost A news system in mobile personalizes the orders of news sections the most relevant stories are displayed on the topmost

17 Adaptive News Personalization for Mobile Content Access (cont.)

18 Learning User Models for News Access The system uses a machine learning approach to build a simple model of each user’s interests. The system uses a machine learning approach to build a simple model of each user’s interests. A combination of similarity-based methods and Bayesian methods achieves the balance of learning and adapting quickly to change interests while avoiding brittleness. A combination of similarity-based methods and Bayesian methods achieves the balance of learning and adapting quickly to change interests while avoiding brittleness.

19 Learning User Models for News Access (cont.) These two algorithms form a multi- strategy learning approach to learn two separate user models. These two algorithms form a multi- strategy learning approach to learn two separate user models. Short-term interests user model Short-term interests user model Long-term interests user model Long-term interests user model

20 Learning User Models for News Access (cont.) The purpose of the short-term model The purpose of the short-term model First, it should contain information about recently read events, so that stories which belong to the same thread can be identified. First, it should contain information about recently read events, so that stories which belong to the same thread can be identified. To allow for identification of stories that user already knows. To allow for identification of stories that user already knows. The k-nearest-neighbor algorithm (kNN) is used to achieve the desired functionality. The k-nearest-neighbor algorithm (kNN) is used to achieve the desired functionality. Convert news stories to tf-idf vectors (term- frequency/inverse-document-frequency). Convert news stories to tf-idf vectors (term- frequency/inverse-document-frequency). Use the cosine similarity measure to quantify the similarity of two vectors. Use the cosine similarity measure to quantify the similarity of two vectors.

21 Learning User Models for News Access (cont.) The purpose of the long-term model is to model a user’s general preferences. The purpose of the long-term model is to model a user’s general preferences. The system periodically selects informative words for each news category from a large sample of stories. The system periodically selects informative words for each news category from a large sample of stories. The goal of the feature selection process is to select informative words that reoccur over a long period of time. The goal of the feature selection process is to select informative words that reoccur over a long period of time. A naïve Bayesian classifier is used to assess the probability of stories being interesting. A naïve Bayesian classifier is used to assess the probability of stories being interesting.

22 Learning User Models for News Access (cont.)

23 Evaluation They summarize the results from two studies that compare personalization information access to static one. They summarize the results from two studies that compare personalization information access to static one. First, the “alternating sessions” experiment quantifies the difference between static and adaptive information access First, the “alternating sessions” experiment quantifies the difference between static and adaptive information access A half of users used its user modeling approach. A half of users used its user modeling approach. The other half received news in static order from the source. The other half received news in static order from the source.

24 Evaluation (cont.) The average display rank of selected stories was 6.7 in the static mode and 4.2 in the adaptive mode (based on 50 users that selected 340 stories out of 1882 headlines). The average display rank of selected stories was 6.7 in the static mode and 4.2 in the adaptive mode (based on 50 users that selected 340 stories out of 1882 headlines). The analysis of the distribution of selected stories. The analysis of the distribution of selected stories. In the static mode, 68.7% of the selected stories on the top two headline screens In the static mode, 68.7% of the selected stories on the top two headline screens In the adaptive mode, 86.7% on the top two In the adaptive mode, 86.7% on the top two

25 Evaluation (cont.)

26 Second, the “alternating stories” experiment displays stories selected with respect to both the adaptive and static modes on the same screen. Second, the “alternating stories” experiment displays stories selected with respect to both the adaptive and static modes on the same screen. Advantages: Advantages: The system still adapts to user’s interests. The system still adapts to user’s interests. Allow a direct comparison between the two selection strategies. Allow a direct comparison between the two selection strategies.

27 Evaluation (cont.) The difference was not as pronounced as in the “alternating sessions” experiment. The difference was not as pronounced as in the “alternating sessions” experiment. The average display rank of selected stories was 5.8 in the static mode and 5.27 in the adaptive mode. The average display rank of selected stories was 5.8 in the static mode and 5.27 in the adaptive mode. The analysis of the distribution of selected stories. The analysis of the distribution of selected stories. In the static mode, 75.57% In the static mode, 75.57% In the adaptive mode, 80.44% In the adaptive mode, 80.44% Users are more likely to select adaptive stories (19.02%) than static ones (13.26%) which amounts to a 43.44% increase in selected content. Users are more likely to select adaptive stories (19.02%) than static ones (13.26%) which amounts to a 43.44% increase in selected content.

28 Evaluation (cont.)

29 In summary, the “alternating sessions” and “alternating stories” experiments show adaptive information access is higher than static access. In summary, the “alternating sessions” and “alternating stories” experiments show adaptive information access is higher than static access. The “alternating sessions” experiment showed adaptive order helps shifting interesting stories towards the beginning of personalized lists. The “alternating sessions” experiment showed adaptive order helps shifting interesting stories towards the beginning of personalized lists. The “alternating stories” experiment showed the system is capable of ordering content that the top- ranked items have a significantly higher chance to be selected that the ranked static ones. The “alternating stories” experiment showed the system is capable of ordering content that the top- ranked items have a significantly higher chance to be selected that the ranked static ones.

30 Recent Trends and Systems Podcasting Podcasting Online audio distribution of news content. Online audio distribution of news content. Collaborative filtering techniques is applicable to podcast recommendation. Collaborative filtering techniques is applicable to podcast recommendation. Personalization and the Blogosphere Personalization and the Blogosphere Blogosphere refers to the set of all webblogs. Blogosphere refers to the set of all webblogs. Some systems support personalized blog access such as Findory.com, NewsGator.com. Some systems support personalized blog access such as Findory.com, NewsGator.com. News Zeitgeist News Zeitgeist Zeitgeist is a German word that means “the spirit (Geist) of the time (Zeit)”. Zeitgeist is a German word that means “the spirit (Geist) of the time (Zeit)”. The goal is to automatically identify the most popular topics of the current Blogosphere. The goal is to automatically identify the most popular topics of the current Blogosphere.

31 Conclusions & References We need new technology to help leverage the full potential web-based news distribution. We need new technology to help leverage the full potential web-based news distribution. [1] Billsus, D. (2005). Adaptive News Access [2] Billsus, D., & Pazzani, M. (2000). User Modeling for Adaptive News Access. User Modeling and User-Adapted Interaction, 10(2/3): 147-180. [3] Chiu, B. & Webb, G. (1998) Using decision trees for agent modeling: improving prediction performance. User Modeling and User-Adapted Interaction, 8, 131-152.

32 Questions or Comments?


Download ppt "Adaptive News Access Daniel Billsus Presented by Chirayu Wongchokprasitti."

Similar presentations


Ads by Google