Example: Academic Search

Slides:



Advertisements
Similar presentations
Suleyman Cetintas 1, Monica Rogati 2, Luo Si 1, Yi Fang 1 Identifying Similar People in Professional Social Networks with Discriminative Probabilistic.
Advertisements

+ Multi-label Classification using Adaptive Neighborhoods Tanwistha Saha, Huzefa Rangwala and Carlotta Domeniconi Department of Computer Science George.
Principal Component Analysis Based on L1-Norm Maximization Nojun Kwak IEEE Transactions on Pattern Analysis and Machine Intelligence, 2008.
LEARNING INFLUENCE PROBABILITIES IN SOCIAL NETWORKS Amit Goyal Francesco Bonchi Laks V. S. Lakshmanan University of British Columbia Yahoo! Research University.
Frequent Closed Pattern Search By Row and Feature Enumeration
Active Learning for Streaming Networked Data Zhilin Yang, Jie Tang, Yutao Zhang Computer Science Department, Tsinghua University.
1 Social Influence Analysis in Large-scale Networks Jie Tang 1, Jimeng Sun 2, Chi Wang 1, and Zi Yang 1 1 Dept. of Computer Science and Technology Tsinghua.
Graph Data Management Lab School of Computer Science , Bristol, UK.
Optimizing Estimated Loss Reduction for Active Sampling in Rank Learning Presented by Pinar Donmez joint work with Jaime G. Carbonell Language Technologies.
1 1 Chenhao Tan, 1 Jie Tang, 2 Jimeng Sun, 3 Quan Lin, 4 Fengjiao Wang 1 Department of Computer Science and Technology, Tsinghua University, China 2 IBM.
Carnegie Mellon 1 Maximum Likelihood Estimation for Information Thresholding Yi Zhang & Jamie Callan Carnegie Mellon University
Presented by Li-Tal Mashiach Learning to Rank: A Machine Learning Approach to Static Ranking Algorithms for Large Data Sets Student Symposium.
Authoritative Sources in a Hyperlinked Environment Jon M. Kleinberg Presented By: Talin Kevorkian Summer June
Margin Based Sample Weighting for Stable Feature Selection Yue Han, Lei Yu State University of New York at Binghamton.
1 Heterogeneous Cross Domain Ranking in Latent Space Bo Wang 1, Jie Tang 2, Wei Fan 3, Songcan Chen 1, Zi Yang 2, Yanzhu Liu 4 1 Nanjing University of.
Presented by Zeehasham Rasheed
1 Bo Wang 1, Jie Tang 2, Wei Fan 3, Songcan Chen 1, Zi Yang 2, Yanzhu Liu 4 1 Nanjing University of Aeronautics and Astronautics 2 Tsinghua University.
1 A Topic Modeling Approach and its Integration into the Random Walk Framework for Academic Search 1 Jie Tang, 2 Ruoming Jin, and 1 Jing Zhang 1 Knowledge.
COMMUNITIES IN MULTI-MODE NETWORKS 1. Heterogeneous Network Heterogeneous kinds of objects in social media – YouTube Users, tags, videos, ads – Del.icio.us.
1 1 Chenhao Tan, 1 Jie Tang, 2 Jimeng Sun, 3 Quan Lin, 4 Fengjiao Wang 1 Department of Computer Science and Technology, Tsinghua University, China 2 IBM.
Active Learning for Networked Data Based on Non-progressive Diffusion Model Zhilin Yang, Jie Tang, Bin Xu, Chunxiao Xing Dept. of Computer Science and.
Modeling Relationship Strength in Online Social Networks Rongjing Xiang: Purdue University Jennifer Neville: Purdue University Monica Rogati: LinkedIn.
Focused Matrix Factorization for Audience Selection in Display Advertising BHARGAV KANAGAL, AMR AHMED, SANDEEP PANDEY, VANJA JOSIFOVSKI, LLUIS GARCIA-PUEYO,
 An important problem in sponsored search advertising is keyword generation, which bridges the gap between the keywords bidded by advertisers and queried.
UOS 1 Ontology Based Personalized Search Zhang Tao The University of Seoul.
A Graph-based Friend Recommendation System Using Genetic Algorithm
Xiangnan Kong,Philip S. Yu Department of Computer Science University of Illinois at Chicago KDD 2010.
Xiangnan Kong,Philip S. Yu Multi-Label Feature Selection for Graph Classification Department of Computer Science University of Illinois at Chicago.
Probabilistic Models for Discovering E-Communities Ding Zhou, Eren Manavoglu, Jia Li, C. Lee Giles, Hongyuan Zha The Pennsylvania State University WWW.
1 Panther: Fast Top-K Similarity Search on Large Networks Jing Zhang 1, Jie Tang 1, Cong Ma 1, Hanghang Tong 2, Yu Jing 1, and Juanzi Li 1 1 Department.
Probabilistic Latent Query Analysis for Combining Multiple Retrieval Sources Rong Yan Alexander G. Hauptmann School of Computer Science Carnegie Mellon.
Towards Social User Profiling: Unified and Discriminative Influence Model for Inferring Home Locations Rui Li, Shengjie Wang, Hongbo Deng, Rui Wang, Kevin.
Discriminative Frequent Pattern Analysis for Effective Classification By Hong Cheng, Xifeng Yan, Jiawei Han, Chih- Wei Hsu Presented by Mary Biddle.
Speaker : Yu-Hui Chen Authors : Dinuka A. Soysa, Denis Guangyin Chen, Oscar C. Au, and Amine Bermak From : 2013 IEEE Symposium on Computational Intelligence.
Identifying “Best Bet” Web Search Results by Mining Past User Behavior Author: Eugene Agichtein, Zijian Zheng (Microsoft Research) Source: KDD2006 Reporter:
Learning to Rank: From Pairwise Approach to Listwise Approach Authors: Zhe Cao, Tao Qin, Tie-Yan Liu, Ming-Feng Tsai, and Hang Li Presenter: Davidson Date:
Meta-Path-Based Ranking with Pseudo Relevance Feedback on Heterogeneous Graph for Citation Recommendation By: Xiaozhong Liu, Yingying Yu, Chun Guo, Yizhou.
MINING DEEP KNOWLEDGE FROM SCIENTIFIC NETWORKS
Topical Authority Detection and Sentiment Analysis on Top Influencers
Convolutional Neural Network
Learning to Personalize Query Auto-Completion
Sofus A. Macskassy Fetch Technologies
Hyunghoon Cho, Bonnie Berger, Jian Peng  Cell Systems 
Syntax-based Deep Matching of Short Texts
Learning Triadic Influence in Large Social Networks
COMBINED UNSUPERVISED AND SEMI-SUPERVISED LEARNING FOR DATA CLASSIFICATION Fabricio Aparecido Breve, Daniel Carlos Guimarães Pedronette State University.
Hyper-parameter tuning for graph kernels via Multiple Kernel Learning
Location Recommendation — for Out-of-Town Users in Location-Based Social Network Yina Meng.
RankClus: Integrating Clustering with Ranking for Heterogeneous Information Network Analysis Yizhou Sun, Jiawei Han, Peixiang Zhao, Zhijun Yin, Hong Cheng,
Community Distribution Outliers in Heterogeneous Information Networks
Example: Academic Search
Weakly Learning to Match Experts in Online Community
Efficient Distribution-based Feature Search in Multi-field Datasets Ohio State University (Shen) Problem: How to efficiently search for distribution-based.
Performance Comparison of Tarry and Awerbuch Algorithms
Discriminative Frequent Pattern Analysis for Effective Classification
Noémi Gaskó, Rodica Ioana Lung, Mihai Alexandru Suciu
Jinhong Jung, Woojung Jin, Lee Sael, U Kang, ICDM ‘16
RECOMMENDER SYSTEMS WITH SOCIAL REGULARIZATION
Scaling up Link Prediction with Ensembles
Structural influence:
Jiawei Han Department of Computer Science
Asymmetric Transitivity Preserving Graph Embedding
GANG: Detecting Fraudulent Users in OSNs
Learning to Rank with Ties
Multiple DAGs Learning with Non-negative Matrix Factorization
Using Clustering to Make Prediction Intervals For Neural Networks
Sanguthevar Rajasekaran University of Connecticut
Bug Localization with Combination of Deep Learning and Information Retrieval A. N. Lam et al. International Conference on Program Comprehension 2017.
Hyunghoon Cho, Bonnie Berger, Jian Peng  Cell Systems 
2019/9/14 The Deep Learning Vision for Heterogeneous Network Traffic Control Proposal, Challenges, and Future Perspective Author: Nei Kato, Zubair Md.
Presentation transcript:

Example: Academic Search Cross-domain Ranking via Latent Space Learning Jie Tang+ and Wendy Hall▫ +Tsinghua University, ▫University of Southampton Cross-domain Ranking: learning to rank objects from multiple interrelated domains. Also called heterogeneous cross-domain ranking, when the domains are different. Example: Academic Search Input: Given T domains, with each having a training dataset—i.e., ranking pairs {(xt1q, yt1q), …, (xtnq, ytnq)}, where xt1q is K-dimensional feature vector to represent the relevance between an object in domain t and the query q at time t; ytnq ∈{r1, …, rlt}represents the ranking level of the object w.r.t. the query q; Learning: learn T ranking functions {ft}Tt=1 simultaneously, with each ft for predicting the rank level of unlabeled objects in the corresponding domain. Good morning, everyone. It’s my honor to introduce our paper “StructInf: mining structural influence from social streams”. In this paper, we study the structure characteristics of social influence. The first question we want to ask is : “in which structures, the target nodes, i.e., the write nodes in the figures, are most likely to be activated?” We take an example of retweet behavior of sina weibo in this figure. The red node represents a friend who retweets a message before time t ; the white node denotes the target user to be studied. The general question is how likely it is that the target user will retweet this message in a short time interval, conditioned on different influence structures. We find several interesting patterns from this retweet dataset. First, the conditional probability in the second figure increases to 150% higher than that of the first figure, suggesting more active friends can improve the retweet likelihood. On the other hand, the probability in the third figure increases to 300% higher than that of the second figure. Please note that the difference between figure two and three is the relationship between the friends. So our target is to mine the significant influence structures hidden in the huge volume of streaming behavior data.

Basic model The proposed model Dataset: LETOR 2.0 (Liu et al. 2007): a public dataset for learning to rank research. LETOR consists of three homogeneous sub datasets (i.e., TREC2003, TREC2004, and OHSUMED), with 50, 75, and 106 queries, respectively. AMiner (Tang et al. 2008): The dataset contains 14,134 authors, 10,716 papers, and 1,434 conferences. Given a query, the goal is to find experts, top conferences, and authoritative papers for the query. The second question is how to quickly mine those influential structures? Because to mine the influential structures from large social network, the exact algorithm need to build a diffusion graph from the input network and streaming actions, and enumerate all possible structures from each target action, and thus the time complexity is high. So We propose three different sampling algorithms to mine influential structures. The first one randomly sample nodes when enumerating influence patterns, the second one randomly reserve edges when building diffusion graph, and the third one combine the two methods. We can prove them to be unbiased. In the figure, we show the exact influence probabilities estimated by the exact algorithm and approximate influence probabilities estimated by one sampling algorithm. The results show that the proposed sampling algorithm can achieve a 10 speedup compared to the exact influence pattern mining algorithm, with an average error rate of only 1.0%. In conclusion, we present the concept of structural influence and propose efficient sampling algorithms to quickly estimate the influence probabilities of different structures from social stream. Welcome to our poster in the evening. Thanks.