Semantic Clipboard User Interface is integrated in the Browser Architecture of the Semantic Clipboard Illustration of a license incompliant content reuse.

Slides:



Advertisements
Similar presentations
IATI Technical Advisory Group Technical Proposals Simon Parrish IATI Technical Advisory Group, DIPR March 2010.
Advertisements

CONFIDENTIAL DIGITAL WATERMARKING ALLIANCE. CONFIDENTIAL DIGITAL WATERMARKING ALLIANCE 2 Digital Watermarking Alliance Charter The Digital Watermarking.
Using Multimedia on the Web Enhancing a Web Site with Sound, Video, and Applets.
CNIT 132 – Week 9 Multimedia. Working with Multimedia Bandwidth is a measure of the amount of data that can be sent through a communication pipeline each.
Using JavaScript in Linked Data Applications Oshani Seneviratne Oct 12, 2010.
Project 1 Introduction to HTML.
3. Technical and administrative metadata standards Metadata Standards and Applications.
Enterprise Search With SharePoint Portal Server V2 Steve Tullis, Program Manager, Business Portal Group 3/5/2003.
Web Page Behavior IS 373—Web Standards Todd Will.
School of something FACULTY OF OTHER University Library The Library’s Digital Repository or Whatever happened to MIDESS? Michael Emly Jonathan Ainsworth.
Tutorial 7 Working with Multimedia. XP Objectives Explore various multimedia applications on the Web Learn about sound file formats and properties Embed.
Lesson 46: Using Information From the Web copy and paste information from a Web site print a Web page download information from a Web site customize Web.
HTML 1 Introduction to HTML. 2 Objectives Describe the Internet and its associated key terms Describe the World Wide Web and its associated key terms.
Chapter ONE Introduction to HTML.
UNIT-V The MVC architecture and Struts Framework.
Session: 11. © Aptech Ltd. 2HTML5 Audio and Video / Session 11  Describe the need for multimedia in HTML5  List the supported media types in HTML5 
DMPTool Expert Resources and Support for Data Management Planning Tao Zhang Michael Witt Purdue University Libraries 1.
Web Development & Design Foundations with XHTML Chapter 11 Key Concepts.
Internet Standard Grade Computing. Internet a wide area network spanning the globe. consists of many smaller networks linked together. Service a way of.
Chapter 1 Introduction to HTML, XHTML, and CSS
Joel Bapaga on Web Design Strategies Technologies Commercial Value.
METS-Based Cataloging Toolkit for Digital Library Management System Dong, Li Tsinghua University Library
Dr. Kurt Fendt, Comparative Media Studies, MIT MetaMedia An Open Platform for Media Annotation and Sharing Workshop "Online Archives:
Semantic Web Technologies ufiekg-20-2 | data, schemas & applications | lecture 21 original presentation by: Dr Rob Stephens
Interoperability Scenario Producing summary versions of compound multimedia historical documents.
Tutorial 7 Working with Multimedia. XP Objectives Explore various multimedia applications on the Web Learn about sound file formats and properties Embed.
XP Tutorial 8New Perspectives on HTML and XHTML, Comprehensive 1 Using Multimedia on the Web Enhancing a Web Site with Sound, Video, and Applets Tutorial.
Web Searching Basics Dr. Dania Bilal IS 530 Fall 2009.
HTTPA (Accountable Hyper Text Transfer Protocol) PhD Proposal Talk Oshani Seneviratne DIG, MIT CSAIL May 31, 2011.
Meta Tagging / Metadata Lindsay Berard Assisted by: Li Li.
Design engineering Vilnius The goal of design engineering is to produce a model that exhibits: firmness – a program should not have bugs that inhibit.
Tutorial 7 Working with Multimedia. New Perspectives on HTML, XHTML, and XML, Comprehensive, 3rd Edition 2 Objectives Explore various multimedia applications.
A bad case of content reuse Validator Website to Validate License Violations Validator – Only requires the URI of the site to check This work by Oshani.
A bad case of content reuse Validator Website to Validate License Violations Validator – Only requires the URI of the site to check for a license violation.
Web 2.0: Making the Web Work for You, Illustrated Unit B: Finding Media for Projects.
1 CS 502: Computing Methods for Digital Libraries Lecture 19 Interoperability Z39.50.
What’s MPEG-21 ? (a short summary of available papers by OCCAMM)
Linked Data: Emblematic applications on Legacy Data in Libraries.
Validator Website to Validate URI License Violations Validator – Only requires the URI of the site to check A bad case of content reuse This work by Oshani.
Plug-in Architectures Presented by Truc Nguyen. What’s a plug-in? “a type of program that tightly integrates with a larger application to add a special.
THE SEMANTIC WEB By Conrad Williams. Contents  What is the Semantic Web?  Technologies  XML  RDF  OWL  Implementations  Social Networking  Scholarly.
HTML Concepts and Techniques Fifth Edition Chapter 1 Introduction to HTML.
Semantic Web COMS 6135 Class Presentation Jian Pan Department of Computer Science Columbia University Web Enhanced Information Management.
The Semantic Web. What is the Semantic Web? The Semantic Web is an extension of the current Web in which information is given well-defined meaning, enabling.
Chapter 1 Introduction to HTML, XHTML, and CSS HTML5 & CSS 7 th Edition.
Renovation of Eurostat dissemination chain
RDFa Primer Bridging the Human and Data webs Presented by: Didit ( )
Microsoft Office 2008 for Mac – Illustrated Unit D: Getting Started with Safari.
The 2007 Microsoft Office System Servers Enterprise Content Management, Workflow and Forms Martin Parry Developer and Platform Group, Microsoft Ltd
 XML derives its strength from a variety of supporting technologies.  Structure and data types: When using XML to exchange data among clients, partners,
A centre of expertise in digital information managementwww.ukoln.ac.uk UKOLN is supported by: This work is licensed under a Attribution- NonCommercial-ShareAlike.
Web Design Principles 5 th Edition Chapter 3 Writing HTML for the Modern Web.
Windows Vista Configuration MCTS : Internet Explorer 7.0.
HTML PROJECT #1 Project 1 Introduction to HTML. HTML Project 1: Introduction to HTML 2 Project Objectives 1.Describe the Internet and its associated key.
Harnessing the Deep Web : Present and Future -Tushar Mhaskar Jayant Madhavan, Loredana Afanasiev, Lyublena Antova, Alon Halevy January 7,
Scalable Web Apps Target this solution to brand leaders responsible for customer engagement and roll-out of global marketing campaigns. Implement scenarios.
Sarah Whitcher Kansa (Open Context / Alexandria Archive Institute)
Using E-Business Suite Attachments
Chapter 1 Introduction to HTML.
Project 1 Introduction to HTML.
Policy Aware Content Reuse on the Web
Using Semantic Web Data: Proof
Scalable Web Apps Target this solution to brand leaders responsible for customer engagement and roll-out of global marketing campaigns. Implement scenarios.
Playing Audio (Part 2).
Submitted By: Usha MIT-876-2K11 M.Tech(3rd Sem) Information Technology
Managing a Web Server and Files
THREE TIER MOBILE COMPUTING ARCHITECTURE
Policy reasoning A policy is a set of norms that define optimal behavior of agents in a system What does policy reasoning usually entail ? Proving that.
Metadata The metadata contains
Objective Explain concepts used to create websites.
Presentation transcript:

Semantic Clipboard User Interface is integrated in the Browser Architecture of the Semantic Clipboard Illustration of a license incompliant content reuse Validator Website to Validate License Violations Validator This work by Oshani Seneviratne is licensed under Creative Commons Attribution - Non Commercial - Share Alike 3.0 license. Oshani Seneviratne, Tim Berners-Lee Decentralized Information Group, CSAIL, MIT {oshani, Flickr has over 100 million Creative Commons Licensed images. Given a sample of web pages which embed such images, how many of these are properly attributed as specified in their licenses? Sample 1 (67 sites, 426 images)‏ Properly attributed images = 28 Misattributed images = 333 Misattribution = 78 % Sample 2 (70 sites, 241 images)‏ Properly attributed images = 8 Misattributed images = 194 Misattribution = 80 % Sample 3 (70 sites, 466 images)‏ Properly attributed images = 6 Misattributed images = 439 Misattribution = 94 %  Assessment on the level of policy-awareness on the Web  Provide a platform to use the data exposed on the Semantic Web  A License Violations Validator for Flickr images: to check for license violations use the information given by the validator to be policy- compliant  Semantic Clipboard: to detect reusable content while browsing seamlessly integrate such content along with their metadata  Assess the level of violations with regards to other types of licenses such as ‘no commercial use’, ‘share alike’ and ‘no derivatives’  Assess the level of license violations on other types of media  Extend to licenses embedded in free-floating content  Explore new and efficient ways of license violations detection  Improve the User Interfaces of the CC license violations validator and the Semantic Clipboard Results of the experiment summarized Build Policy Aware Systems, such as:  Validators to tell users what information is missing or inaccurate  Seamlessly integrate metadata by detecting and assisting in embedding the licenses  Notify users if their content is used in an inappropriate manner Policies are pervasive in web applications as they play a crucial role in enhancing security, privacy and usability of services offered on the Web. Use of Creative Commons licenses is the widely accepted method of expressing rights of the original content creators when it comes to digital multimedia content on the Web. How can you Extract License Metadata? 1. Through APIs which expose the licenses. E.g. Flickr 2. Through RDFa (Resource Description Framework in Attributes) A simple scenario which illustrates a rights violation of a content creator: Check whether a particular site has any embedded Flickr images which are not properly attributed as specified in the Creative Commons license. Spider: This is a site crawler which searches for all the links in a given seed site using a Breadth First search algorithm to determine any embedded Flickr images. License Checker: This extracts the photo id from the Flickr image URI. Then all the information related to the photo is obtained through the Flickr API. Based on this information, the DOM of the page is checked for the proper attribution. Try it out! validator.py More Information Exchange Enable transfer of content between Web applications with minimal effort in a policy aware manner, i.e. when content is copied, license metadata is also copied and pasted appropriately in the target application. Try it Out! More Information RDFa Extractor: Extracts all the semantic information in the form of RDF attributes embedded in the HTML page the user browses. UI Enhancer: Adds visual cues to the page for easy identification of images that can be copied based on the user’s intended use. RDFa License Store: Indexes the License data of images in a given browser session. Attribution XHTML Constructor: Creates the attribution XHTML snippet as stated in the CC specification upon a copy instruction. Then it places this snippet in the system clipboard. Policy Aware Content Reuse on the Web The Problem The Solution How much of a problem is this? Background CC Attribution License Violations Validator Semantic Clipboard Goal Components ContributionsFuture Work More Information Notification System: This will pretty-print and report the images with missing attributions in a Web interface. The user can then use the missing information in his or her own work to be license compliant. User Checker (optional): This module can be used to send actual notifications to the original content creators for any violations if the system is linked to some user base. All of these components are implemented in the Tabulator, a Semantic Web Browser which can be installed as a Firefox Extension. A simple experiment was conducted to get an assessment on this, and the results are as follows: Violations Rates and Precision Reusing content saves resources and fosters creativity. However, reusing a particular piece of content without honoring the license expressed with it may violate the original content creator’s rights. There are several reasons this situation might happen. The person reusing the content may be: too lazy to check for the licenses hidden in the XHTML weary of the multi-step operations required to embed the license metadata ignorant as to what each of the licenses mean At the same time, the original content creator would also be interested in knowing whether someone has violated his or her license terms. Context menu to copy image with the license metadata Challenges  Tracking provenance of content on the Web is hard  Subsequent changes to a CC license cannot be prevented  Lack of proper definitions from CC for the scoping of the human readable attribution in the DOM, and the license granularity  Limited Flickr support for license expression  Usability vs. Operating System independence CC Licenses