KDD 2011 Doctoral Session Modeling Trustworthiness of Online Content V. G. Vinod Vydiswaran Advisors: Prof.ChengXiang Zhai, Prof.Dan Roth University of.

Slides:



Advertisements
Similar presentations
Relevance Feedback and User Interaction for CBIR Hai Le Supervisor: Dr. Sid Ray.
Advertisements

Jason H.D. Cho 1,2, Parikshit Sondhi 1, Chengxiang Zhai 1, Bruce R. Schatz 1,2,3 1 Department of Computer Science, 2 Institute of Genomic Biology, 3 Department.
1.Accuracy of Agree/Disagree relation classification. 2.Accuracy of user opinion prediction. 1.Task extraction performance on Bing web search log with.
Prof. Carolina Ruiz Computer Science Department Bioinformatics and Computational Biology Program WPI WELCOME TO BCB4003/CS4803 BCB503/CS583 BIOLOGICAL.
Development and validation of an instrument to measure user perceived service quality of information presenting Web portals 報告人 : 謝淳雅
Jean-Eudes Ranvier 17/05/2015Planet Data - Madrid Trustworthiness assessment (on web pages) Task 3.3.
KDD 2011 Research Poster Content - Driven Trust Propagation Framwork V. G. Vinod Vydiswaran, ChengXiang Zhai, and Dan Roth University of Illinois at Urbana-Champaign.
Search Engines and Information Retrieval
Sentiment Lexicon Creation from Lexical Resources BIS 2011 Bas Heerschop Erasmus School of Economics Erasmus University Rotterdam
© Tefko Saracevic, Rutgers University 1 EVALUATION in searching IR systems Digital libraries Reference sources Web sources.
Retrieval Evaluation. Brief Review Evaluation of implementations in computer science often is in terms of time and space complexity. With large document.
Shared Ontology for Knowledge Management Atanas Kiryakov, Borislav Popov, Ilian Kitchukov, and Krasimir Angelov Meher Shaikh.
What is the Internet? The Internet is a computer network connecting millions of computers all over the world It has no central control - works through.
Internet Resources Discovery (IRD) IBM DB2 Digital Library Thanks to Zvika Michnik and Avital Greenberg.
Online communities 1 Theory revision Complete some of the activities in this powerpoint and use the revision book to answer questions.
Enhance legal retrieval applications with an automatically induced knowledge base Ka Kan Lo.
Social Media & The U.S. Election. Important Connections TOK: –What is a citizen’s responsibility in an election? How should they participate? –How do.
Signatures As Threats to Privacy Brian Neil Levine Assistant Professor Dept. of Computer Science UMass Amherst.
Can you believe what you read online?: Modeling and Predicting Trustworthiness of Online Textual Information Ph.D. Defense V.G.Vinod Vydiswaran Department.
CS598CXZ Course Summary ChengXiang Zhai Department of Computer Science University of Illinois, Urbana-Champaign.
Challenges in Information Retrieval and Language Modeling Michael Shepherd Dalhousie University Halifax, NS Canada.
Cluster based fact finders Manish Gupta, Yizhou Sun, Jiawei Han Feb 10, 2011.
Can you believe what you read online?: Modeling and Predicting Trustworthiness of Online Textual Information V.G.Vinod Vydiswaran Department of Computer.
Search Engines and Information Retrieval Chapter 1.
2008 © ChengXiang Zhai Dragon Star Lecture at Beijing University, June 21-30, Prepare Yourself for IR Research ChengXiang Zhai Department of Computer.
RESEARCH Checking Reliable Sources. Why do I need to check if a website is reliable? Unlike most traditional written information, no one has to approve.
FaceTrust: Assessing the Credibility of Online Personas via Social Networks Michael Sirivianos, Kyungbaek Kim and Xiaowei Yang in collaboration with J.W.
BiasTrust: Teaching Biased Users About Controversial Topics V.G.Vinod Vydiswaran, ChengXiang Zhai, Dan Roth University of Illinois at Urbana-Champaign.
PAUL ALEXANDRU CHIRITA STEFANIA COSTACHE SIEGFRIED HANDSCHUH WOLFGANG NEJDL 1* L3S RESEARCH CENTER 2* NATIONAL UNIVERSITY OF IRELAND PROCEEDINGS OF THE.
WebMining Web Mining By- Pawan Singh Piyush Arora Pooja Mansharamani Pramod Singh Praveen Kumar 1.
The Process of Conducting Research
Development of a Self-Assessment Method for Patients to Evaluate Internet-based Health Information.
Context-Sensitive Information Retrieval Using Implicit Feedback Xuehua Shen : department of Computer Science University of Illinois at Urbana-Champaign.
Recommending Twitter Users to Follow Using Content and Collaborative Filtering Approaches John HannonJohn Hannon, Mike Bennett, Barry SmythBarry Smyth.
Information commitments, evaluative standards and information searching strategies in web-based learning evnironments Ying-Tien Wu & Chin-Chung Tsai Institute.
Toward A Session-Based Search Engine Smitha Sriram, Xuehua Shen, ChengXiang Zhai Department of Computer Science University of Illinois, Urbana-Champaign.
Relevance Feedback: New Trends Derive global optimization methods: More computationally robust Consider the correlation between different attributes Incorporate.
CONCLUSION & FUTURE WORK Normally, users perform search tasks using multiple applications in concert: a search engine interface presents lists of potentially.
Knowledge based Personalization by Wonjung Kim. Outline Introduction Background – InfoQuilt system Personalization in InfoQuilt Related Work Conclusions.
How is the process of publishing printed material
Here’s what you need to do in the computer lab: Myth Busting Select an Urban Myth or Legend, either one we discussed in class or one you find independently.
BiasTrust: Trusting credible information in presence of human bias V.G.Vinod Vydiswaran ChengXiang Zhai, Dan Roth Department of Computer Science University.
Truth Discovery with Multiple Conflicting Information Providers on the Web KDD 07.
Relation Alignment for Textual Entailment Recognition Cognitive Computation Group, University of Illinois Experimental ResultsTitle Mark Sammons, V.G.Vinod.
WEDNESDAY, SEPTEMBER 3, 2014 SKILLS STRATEGIES. READING BEFORE Set a purpose for reading Ask questions Predict Use prior knowledge.
Graduate School of Informatics Kyoto University, November 21, 2001 Technologies of the Interspace Peer-Peer Semantic Indexing Bruce Schatz CANIS Laboratory.
ICT-enabled Agricultural Science for Development Scenarios, Opportunities, Issues by ICTs transforming agricultural science, research & technology generation.
Assessment: Reliability & Validity. Reliability Refers to the repeatability of a given testing instrument The extent to which a student would be expected.
Measuring Behavioral Trust in Social Networks
1 Generating Comparative Summaries of Contradictory Opinions in Text (CIKM09’)Hyun Duk Kim, ChengXiang Zhai 2010/05/24 Yu-wen,Hsu.
A Classification-based Approach to Question Answering in Discussion Boards Liangjie Hong, Brian D. Davison Lehigh University (SIGIR ’ 09) Speaker: Cho,
Trust, Influence, and Noise: Implications for Safety Surveilance Bill Rand Asst. Prof. of Marketing and Computer Science Director of the Center for Complexity.
Multi-level Bootstrapping for Extracting Parallel Sentence from a Quasi-Comparable Corpus Pascale Fung and Percy Cheung Human Language Technology Center,
Web Search and Text Mining Lecture 5. Outline Review of VSM More on LSI through SVD Term relatedness Probabilistic LSI.
Mining Dependency Relations for Query Expansion in Passage Retrieval Renxu Sun, Chai-Huat Ong, Tat-Seng Chua National University of Singapore SIGIR2006.
Divided Pretreatment to Targets and Intentions for Query Recommendation Reporter: Yangyang Kang /23.
Multimedia Analytics Jianping Fan Department of Computer Science University of North Carolina at Charlotte.
Myth Busting Distinguish between factual statements and inferences. Evaluate scientifically related claims against available evidence. Reject pseudoscience.
Argumentative Writing Grades College and Career Readiness Standards for Writing Text Types and Purposes arguments 1.Write arguments to support a.
The Development of a search engine & Comparison according to algorithms Sung-soo Kim The final report.
Computer Vision Group Department of Computer Science University of Illinois at Urbana-Champaign.
Toward Entity Retrieval over Structured and Text Data Mayssam Sayyadian, Azadeh Shakery, AnHai Doan, ChengXiang Zhai Department of Computer Science University.
Navigation Aided Retrieval Shashank Pandit & Christopher Olston Carnegie Mellon & Yahoo.
Short Text Similarity with Word Embedding Date: 2016/03/28 Author: Tom Kenter, Maarten de Rijke Source: CIKM’15 Advisor: Jia-Ling Koh Speaker: Chih-Hsuan.
12 Skills and Strategies Strategic connections between nonfiction reading and concept learning increases academic curiosity and engagement as well as use.
Trends in NL Analysis Jim Critz University of New York in Prague EurOpen.CZ 12 December 2008.
Unsupervised Sparse Vector Densification for Short Text Similarity
Evaluating Information Sources
Knowledge Sharing Mechanism in Social Networking for Learning
Topic: Semantic Text Mining
Presentation transcript:

KDD 2011 Doctoral Session Modeling Trustworthiness of Online Content V. G. Vinod Vydiswaran Advisors: Prof.ChengXiang Zhai, Prof.Dan Roth University of Illinois at Urbana-Champaign Incorporating text in trust models Three directions of research Credibility assessment woes Acknowledgments My research is supported partially by the Multimodal Information Access and Synthesis (MIAS) Center at the University of Illinois at Urbana-Champaign, part of CCICADA, a DHS Science and Technology Center of Excellence, and grants from the Army Research Laboratory. Contact details Claim 1 Claim n Claim EvidenceClaims Sources Web sources Evidence passages Claim sentences  Incorporates semantics in trust computation using evidence.  Claims need not be structured tuples – they can be free-text sentences.  Framework does not assume that accurate Information Extraction is available.  A source can have different trust profile for different claims – not all claims from a source get equal weight. Advantages over traditional models Traditional two-layer fact-finder models Claim 1 Claim n Claim 2 … [Yin, et al., 2007; Pasternack & Roth, 2010]  Need to determine the truth value of a claim.  Many information types available to gauge trustworthiness  Source credibility and the power of information network  Evidence trustworthiness  Signals from community knowledge  Contrastive viewpoints for claims  Biases of users accessing the information  The goal is to recognize credible information by combining these features  Next step is to understand how human biases interact with credibility of information they access Conclusion and future research steps Community knowledge to validate claims Veracity of news reporting Trustworthiness of news stories Credibility of news sources  Building trust models over pieces of evidence  Content-driven trust propagation framework (KDD 2011)  Utilizes similarity and trustworthiness of evidence to measure trustworthiness of sources and claims.  Scoring claims based on community knowledge  Find treatment relations in health message boards and forums  Verify if the perception formed from reading forums correlates with validity of treatments (as approved by FDA)  Squashing rumors with evidence search  Find evidence for claims from a large text collection (ACL 2009)  Find contrasting evidence (ongoing)  Even reputed sources make mistakes  Some claims (and sources) are purposefully misleading  Not all claims made by a source is equally trustworthy  Often, contradictory claims are both supported by credible evidence  How to verify free-text claims? Claim DB Evidence & Support DB Match up claims to evidence Rate sites based on matching claims and their support Extract relevant claims and evidence 12 3 Contrastive evidence retrieval Lookup pieces of evidence supporting and opposing the claim Lookup pieces of evidence only on relevance Traditional search Evidence search Scalable Entailed Relation Recognizer Expanded Lexical Retrieval Entailment Recognition Text Corpus Indexes Hypothesis (Claim) Relation [Initial work at ACL 2009] [KDD-DMH 2011] [KDD 2011]