Download presentation
Presentation is loading. Please wait.
Published byAnna Wilcox Modified over 8 years ago
1
Automatic Categorization of Patent Applications Presentation to the 3rd IPC Workshop, WIPO, Feb. 25-26, 2013 1.The need for automatic categorization of patent applications 2.The purpose of automatic categorization 3.How does automatic categorization work? 4.A quick word about the IPCCAT technology 5.Measuring categorization accuracy 6.Strategy to use an automatic categorizer 7.IPCCAT Demo
2
The Need for Automatic Categorization of Patent Applications Growth in number of patent applications o 2’140’600 patent applications worldwide in 2011 o Up from 1’050’700 in 1995 (more than doubled in 16 years) Source: WIPO IP Indicators Large (and growing) number of IPC categories o 631 Sub-Classes in IPC 2011 o 7’392 Main Groups in IPC 2011 Source: IPCCAT Help File
3
The Purpose of Automatic Categorization Should not be used as a fully automated categorizer o Average error rate at 5 to 10% at Class level, up to 20% or more at Main Group level o Batch classification is possible but requires downstream elimination of predictions which are below a given confidence threshold Rather an assistant for human examiners o Suggests most probable IPC categories o Interactions with the examiner (asking for more predictions at a different IPC level, forcing a given domain, etc.) Accelerate patent application processing at Patent Offices
4
How Does Automatic Categorization Work? Train an artificial intelligence program to recognize typical examples for each IPC category o Provide already-classified patents for training o Essential: Balancing the number of examples across categories o The more examples the better Test the program o Submit patent applications whose IPC categories are already known o Calculate categorization accuracy
5
A Quick Word About The IPCCAT Technology A strictly statistical approach o No linguistic or other human-defined rules o So it is language independent Categorization algorithm: Neural Networks of the Winnow type, improved by Simple Shift with the help of WIPO But an adaptation of the indexing method to the various languages supported (English, French, Spanish) so as to process collocations correctly Validated through several competitions on the Internet (latest one : the CLEF-2010 project) The IPCCAT project was designed, managed and financed by WIPO from 2002 to 2004
6
Measuring Categorization Accuracy To be really good we would only have to predict all the categories all the time! So we need two different ratios: o Precision: On all the predictions made, how many were correct o Recall: On all the correct categories which should have been predicted, how many did we actually find The prediction accuracy is directly correlated to : o The number of categories at each IPC level o The number of available training documents for each category
7
Strategy To Use An Automatic Categorizer If you don’t know which section or class is the most relevant : o Ask for a direct prediction at the finest possible level (Main Group); or o Ask for a prediction at a coarser level (Class) and refine it down to Sub-Class, then to Main Group If you know which section or class is the most relevant : o Force a prediction under the relevant section or class (reduces the risk of error) o Refine the prediction at the next level(s)
8
IPCCAT Demo
Similar presentations
© 2024 SlidePlayer.com Inc.
All rights reserved.