Almaden Services Research © 2009 IBM Corporation COA: Finding Novel Patents through Text Analysis Mohammad Hasan, Scott Spangler, Tom Griffin, Alfredo.

Slides:



Advertisements
Similar presentations
Support.ebsco.com EBSCOhost Mobile Tutorial. Welcome to the EBSCOhost Mobile tutorial, a guide to the most popular EBSCOhost features available for use.
Advertisements

eClassifier: Tool for Taxonomies
TSpaces Services Suite: Automating the Development and Management of Web Services Presenter: Kevin McCurley IBM Almaden Research Center Contact: Marcus.
Shared Space Admin Demo March Admin demo introduces - Adding users Moderating users Moderating resources Adding communities and sub groups.
New DAITS Training and reference manual Start slide show Go to index.
MILLENNIUM STATISTICS … fun for all!! Matt Polcyn August 6, 2004.
Journal Citation Reports on the Web Don Sechler Customer Education – Science and Scholarly Research
Units can enter ranks, merit badges, and awards online.
Longhorn Council Units can enter ranks, merit badges, and awards online.
Manitoba Speech-Language Pathology Outcomes Measure A Supervisor’s Step by Step Guide to Navigating the Manitoba Speech-Language Pathology Outcomes Measure.
Search Engines and Information Retrieval
INFO 624 Week 3 Retrieval System Evaluation
DePaul Bears Try Your Luck!. Why buy this product? Approximately 1,000,000 cell phone users Approximately 2,000,000 or more people play the lottery New.
Text Mining: Finding Nuggets in Mountains of Textual Data Jochen Dijrre, Peter Gerstl, Roland Seiffert Presented by Drew DeHaas.
© Copyright 2003 United Parcel Service of America, Inc. UPS, the UPS brandmark, and the color brown are registered trademarks of United Parcel Service.
Journal Impact Factors and H index
Dear Sir As requested I have undertaken a study of the communication program I-Call. I will explain the highlights of it’s functions and make my recommendations.
Network and Active Directory Performance Monitoring and Troubleshooting NETW4008 Lecture 8.
Almaden Services Research © 2008 IBM Corporation Intellectual Property Analytics Turning Unstructured Information Into Value Jeffrey T. Kreulen, Ph.D.
Section 6.1 Explain the development of operating systems Differentiate between operating systems Section 6.2 Demonstrate knowledge of basic GUI components.
This presentation will guide you though the initial stages of installation, through to producing your first report Click your mouse to advance the presentation.
© 2012 Adobe Systems Incorporated. All Rights Reserved. Copyright 2012 Adobe Systems Incorporated. All rights reserved. ® WRITING FOR THE WEB.
Figure 1-2: Simple peer-to-peer network
Killer Web Content Author: Gerry McGovern. The Theory ContentA valuable asset and if managed well can deliver tremendous value During the 1980’s web focus.
Syllabus outcomes Describes and applies problem-solving processes when creating solutions Designs, produces and evaluates appropriate solutions.
Databases C HAPTER Chapter 10: Databases2 Databases and Structured Fields  A database is a collection of information –Typically stored as computer.
Search Engines and Information Retrieval Chapter 1.
PeopleFinder: Searching for People, not just for Documents Technologies for Knowledge Sharing ICT-Centre CSIRO Alistair McLean, Anne-Marie Vercoustre,
10 Adding Interactivity to a Web Site Section 10.1 Define scripting Summarize interactivity design guidelines Identify scripting languages Compare common.
BIO1130 Lab 2 Scientific literature. Laboratory objectives After completing this laboratory, you should be able to: Determine whether a publication can.
© C.R. Business Education Creations WebQuest The History of the Internet.
Accessing the Deep Web Bin He IBM Almaden Research Center in San Jose, CA Mitesh Patel Microsoft Corporation Zhen Zhang computer science at the University.
When Experts Agree: Using Non-Affiliated Experts To Rank Popular Topics Meital Aizen.
Karen Herter (HMG) Mike Langley (DGS) April 15, 2008 Portfolio Manager for California State Buildings Meeting the Requirements of Executive Order S
Ontology-Driven Automatic Entity Disambiguation in Unstructured Text Jed Hassell.
Theory and Application of Database Systems A Hybrid Approach for Extending Ontology from Text He Wei.
What makes a good interactive resume? Click for detailed information Multimedia Navigation Communication.
1 Estimation Function Point Analysis December 5, 2006.
Content Sharing over Smartphone-Based Delay- Tolerant Networks.
Evaluating Web Pages Techniques to apply and questions to ask.
INTERNET. Objectives Explain the origin of the Internet and describe how the Internet works. Explain the difference between the World Wide Web and the.
RM Monitor and RMAlerts Installation, Setup, and Requirements January 23, 2010 John Raffenbeul presented this live via an internet connection. These slides.
Finding Experts Using Social Network Analysis 2007 IEEE/WIC/ACM International Conference on Web Intelligence Yupeng Fu, Rongjing Xiang, Yong Wang, Min.
Living Online Lesson 3 Using the Internet IC3 Basics Internet and Computing Core Certification Ambrose, Bergerud, Buscge, Morrison, Wells-Pusins.
July What is the eCost TMS Solution ? Benefits & Features Explore the eCost Software Smart Storage Device (SSD9000 / SSD9001) - Buffers DX10 Dongle.
Citation Searching with Web of Knowledge Gabriella Netting & Louise Colver.
Chapter 7 Measuring of data Reliability of measuring instruments The reliability* of instrument is the consistency with which it measures the target attribute.
Location-Based Alert & Marketing Services IP Confidential IP Presentation 10 January 2016Confidential, Copyright and Patent Pending 2012 by Ping4 Inc.
INFO 414 Human Information Behavior Presentation tips.
1 Adaptive Subjective Triggers for Opinionated Document Retrieval (WSDM 09’) Kazuhiro Seki, Kuniaki Uehara Date: 11/02/09 Speaker: Hsu, Yu-Wen Advisor:
You Can’t Afford to be Late!
Evaluating Web Pages Techniques to apply and questions to ask.
Chapter. 3: Retrieval Evaluation 1/2/2016Dr. Almetwally Mostafa 1.
You spoke © 2008 Acquire Media We listened...
Computers Are Your Future Tenth Edition Spotlight 5: Microsoft Office Copyright © 2009 Pearson Education, Inc. Publishing as Prentice Hall1.
Patent Applications Just the Frequently Asked Questions.
Android forensics: Automated data collection and reporting from a mobile device Justin Grover Digital Investigation Volume 10, Supplement, August 2013,
Units can enter ranks, merit badges, and awards online.
Originality Check: Preventing Plagiarism
Performance Review Tool Updates College of Engineering
CCNA Routing and Switching Routing and Switching Essentials v6.0
Shared Space Admin Demo
Chapter 10: Device Discovery, Management, and Maintenance
CCNA Routing and Switching Routing and Switching Essentials v6.0
Databases.
Chapter 10: Device Discovery, Management, and Maintenance
Milena Lonati PD Quality Management DG2, European Patent Office
Chapter 6 Using Questionnaires
Presentation transcript:

Almaden Services Research © 2009 IBM Corporation COA: Finding Novel Patents through Text Analysis Mohammad Hasan, Scott Spangler, Tom Griffin, Alfredo Alba Scott Spangler IBM Almaden Services Research

Almaden Services Research © 2009 IBM Corporation The BlackBerry Patents  Five patents on the subject of RF communication with mobile processors  Judge threatened an injunction which would have forced RIM/Blackberry to shut down service  On the surface they appear to read very directly on RIM’s business  But are these patents really what they appear to be?

Almaden Services Research © 2009 IBM Corporation Problem Addressed  How do you automatically evaluate the value of Patent claims.  Most existing approaches use field of invention + citation analysis to derive an approximation  Our approach uses analysis of the claim text itself to discover indicators of patent worth.

Almaden Services Research © 2009 IBM Corporation Intuition The most valuable patents are those that are among the first to claim an important technology. Challenge: How do we discover that part of a patent claims which are most “original”

Almaden Services Research © 2009 IBM Corporation Method  Focus on the patent claims section  Find all terms occurring in the claims section  For the technical area of the patent (patent class), discover when each of these search terms first occurred in patent claims  Term originality then is defined as small difference between patent date and term first use date  Create a score that ranks highly those patents with “original” terms in their claims

Almaden Services Research © 2009 IBM Corporation Description  Build an index of patent claim words associated with time of first occurrence in patent claims  For each patent evaluated –Analyze each 1,2,3-gram in patent claims to see if it is an original usage or an “early” usage of those words in the patent claim section in that technology “area” –Look for subsequent usage of that word in more recent patents to calculate “support”  The value of a patent is based on the number of early* words with significant** support. Scored one of two ways: –Sum of support (# of patents) divided by age (# of days) –Count of # of terms with support > 2 and Age < 7 years *early = within 7 years of first occurrence **significant = at least 3 patents use the term

Almaden Services Research © 2009 IBM Corporation How we validated this approach  Three easily identifiable metrics that should correlate to patent value –Citations –Lapsed Fees –Internal IBM Attorney Rating  None of these is perfect, but all three should roughly correlate with the intrinsic value of the patents

Almaden Services Research © 2009 IBM Corporation Results  Citations are roughly correlated with COA scores  Lapsed patents have lower COA scores on average than do other patents  Patents rated 1 (by IBM attorneys) have on average significantly better COA scores then those rated 3.

Almaden Services Research © 2009 IBM Corporation Claims Originality of Blackberry Patent  All five patents have very lengthy, extensive claim language, around electronic mail devices  Very little text in these claims is original.  Taking context into consideration, the technical merit of these patents is questionable.  $120M / patent licensed an appropriate valuation? TermFirst Occurred Difference in Days Supp ort application programs stored7/25/ information added8/20/ interface stores4/29/ network storing10/15/ information network12/25/ network information12/25/ mail systems5/23/ destination transmits7/25/ processors occurs4/29/ information accessible11/6/ electronic mail7/11/ interface switch7/25/ network switch7/30/ gateway switch7/25/ transmitting originated1/1/ stored originated7/25/ interface receiving9/28/

Almaden Services Research © 2009 IBM Corporation SIMPLE Implementation Usage: 572 Invocations of COA as of 6/15/2009

Almaden Services Research © 2009 IBM Corporation Success stories from SIMPLE to date:  VOIP analysis: –Started from 13 original patents to more than 20 eventually licensed. –This drove nearly $8M in licensing revenue.  Videoconferencing analysis: – Found 2 additional patents, each of which was sold. – This drove upwards of $5M in licensing revenue.  SIMPLE has over 280 active users (both internal and external).  We continue to develop and grow the capabilities.

Almaden Services Research © 2009 IBM Corporation If you want to try this out yourself  Go to:  Username: sb_test8  Password: hello2You  Click Analyze / Claims Originality  Enter one or more patent numbers  Click Analyze button  Tell us what you think! (

Almaden Services Research © 2009 IBM Corporation Potential Future Application: Tracing the Source in Web Content

Almaden Services Research © 2009 IBM Corporation Credibility Scoring (“net cred”)

Almaden Services Research © 2009 IBM Corporation Conclusions and Future Work  We have demonstrated how text analysis in the patent space can help provide context far more effectively than manual methods  We feel these methods generalize to other types of unstructured information  The ability to provide better information context and validation will be important to individuals and organizations in a world where a smaller and smaller percentage of information comes from “authoritative” sources.

Almaden Services Research © 2009 IBM Corporation