1 Are New ‘Silicon Valleys’ Emerging? The Distribution of Superstar Patents across US States Carolina Castaldi* and Bart Los** *ECIS, School of Innovation.

Slides:



Advertisements
Similar presentations
International Technological Specialization in Important Innovations: Some Industry-Level Explorations Carolina Castaldi* and Bart Los** *University of.
Advertisements

Nonlinear models Hill et al Chapter 10. Types of nonlinear models Linear in the parameters. –Includes models that can be made linear by transformation:
Brief introduction on Logistic Regression
Departments of Medicine and Biostatistics
Patterns of Research Collaboration in U.S. Universities, AAAS Meetings Denver, Colorado February 18, 2003 James D. Adams, University of Florida.
IDENTIFYING HOT BRAZILIAN SCIENCE AND TECHNOLOGY: TECH MINING METHODS FOR RELATING SOURCES OF KNOWLEDGE AND EMERGING RESEARCH AREAS EU-SPRI CONFERENCE,
Models with Discrete Dependent Variables
Funding Networks Abdullah Sevincer University of Nevada, Reno Department of Computer Science & Engineering.
Cost-Sensitive Classifier Evaluation Robert Holte Computing Science Dept. University of Alberta Co-author Chris Drummond IIT, National Research Council,
Intro to Statistics for the Behavioral Sciences PSYC 1900 Lecture 9: Hypothesis Tests for Means: One Sample.
PSY 307 – Statistics for the Behavioral Sciences
By: Kin Tat Lay Chapter 15. I. Introduction  Patents are a significant factor in the success of an enterprise for many industries  Patents are assets.
Today Concepts underlying inferential statistics
The Robustness of Indices of Child Well-being Kristin Anderson Moore, Ph.D., David Murphey, Ph.D., Tawana Bandy, and Elizabeth Lawner Child Trends.
Validation of predictive regression models Ewout W. Steyerberg, PhD Clinical epidemiologist Frank E. Harrell, PhD Biostatistician.
South Carolina Economic Summit Douglas P. Woodward Director, Division of Research Moore School of Business University of South Carolina.
1 Construction of Japanese Patent Database and Preliminary Findings on Patenting Activities in Japan Akira Goto and Kazuyuki Motohashi RCAST, University.
Marshall University School of Medicine Department of Biochemistry and Microbiology BMS 617 Lecture 12: Multiple and Logistic Regression Marshall University.
An overview of a few of the methods used in landscape ecology studies.
Review of normal distribution. Exercise Solution.
Master in Engineering Policy and Management of Technology, 8 th Edition - Science & Technology Innovation Policy 1 - By Keith Pavitt SPRU – Science Policy.
Local job multipliers in Turkey Altan AldanWilliam MaloneyJosefina PosadasTemel Taskin (Central Bank of Turkey)(World Bank)   The views expressed here.
Statistical Methods For Engineers ChE 477 (UO Lab) Larry Baxter & Stan Harding Brigham Young University.
Statistical Analysis & Techniques Ali Alkhafaji & Brian Grey.
1 Statistical Analysis - Graphical Techniques Dr. Jerrell T. Stracener, SAE Fellow Leadership in Engineering EMIS 7370/5370 STAT 5340 : PROBABILITY AND.
Population Biology: PVA & Assessment Mon. Mar. 14
1 Innovation & knowledge indicators  INTERNATIONAL PATENTS  OTHER INDICATORS OF INNOVATION, KNOWLEDGE AND HUMAN CAPITAL.
DIGEST OF KEY SCIENCE AND ENGINEERING INDICATORS 2008 Presentation Slides National Science Board.
Introduction to Operation Research
Dr Laura Bonnett Department of Biostatistics. UNDERSTANDING SURVIVAL ANALYSIS.
Patent Citation Networks Bernard Gress Fannie Mae Inc., Washington DC. Forthcoming in The.
1 Least squares procedure Inference for least squares lines Simple Linear Regression.
M. Velucchi, A. Viviani, A. Zeli New York University and European University of Rome Università di Firenze ISTAT Roma, November 21, 2011 DETERMINANTS OF.
BPS - 3rd Ed. Chapter 211 Inference for Regression.
Our Divided Patent System John R. Allison University of Texas McCombs School of Business Mark A. Lemley Stanford Law School David L. Schwartz Northwestern.
14 Elements of Nonparametric Statistics
A Comparison of On-line Computer Science Citation Databases Vaclav Petricek, Ingemar J. Cox, Hui Han, Isaac G. Councill, C. Lee Giles
+ Chapter 12: Inference for Regression Inference for Linear Regression.
Patterns of Event Causality Suggest More Effective Corrective Actions Abstract: The Occurrence Reporting and Processing System (ORPS) has used a consistent.
1 Modeling Coherent Mortality Forecasts using the Framework of Lee-Carter Model Presenter: Jack C. Yue /National Chengchi University, Taiwan Co-author:
Inference for Regression Simple Linear Regression IPS Chapter 10.1 © 2009 W.H. Freeman and Company.
+ Chapter 12: More About Regression Section 12.1 Inference for Linear Regression.
1 Chapter 9 Hypothesis Testing. 2 Chapter Outline  Developing Null and Alternative Hypothesis  Type I and Type II Errors  Population Mean: Known 
Exploring Patents & Citations Using GIS 2008Indiana GIS Conference Indiana Geographic Information Council Exploring Patents & Citations Using GIS 2008.
NCHS July 11, A Semiparametric Approach to Forecasting US Mortality Age Patterns Presenter: Rong Wei 1 Coauthors: Guanhua Lu 2, Benjamin Kedem 2.
Measuring patent quality and radicalness: new indicators
Landscape ecology methods
Course Title: Using Epi Info™ 7 Using Classic Analysis (Continuation) April Epi Info™ 7 Training Software for Public Health Epi Info™ 7 Training.
URBDP 591 I Lecture 4: Research Question Objectives How do we define a research question? What is a testable hypothesis? How do we test an hypothesis?
Rapid Penetration of COX2 Inhibitors in Non-Steroidal Antiinflammatory Drug Market: an Implication to Hospital Cost Containment Policy Supon Limwattananon,
Stats Term Test 4 Solutions. c) d) An alternative solution is to use the probability mass function and.
Review: Stages in Research Process Formulate Problem Determine Research Design Determine Data Collection Method Design Data Collection Forms Design Sample.
Chapter 9: Introduction to the t statistic. The t Statistic The t statistic allows researchers to use sample data to test hypotheses about an unknown.
Hypothesis Testing. Statistical Inference – dealing with parameter and model uncertainty  Confidence Intervals (credible intervals)  Hypothesis Tests.
A RE ICT S PEEDING U P THE G EOGRAPHIC D IFFUSION OF K NOWLEDGE ? A N A NALYSIS OF P ATENT C ITATIONS Vincenzo Spiezia OECD
BPS - 5th Ed. Chapter 231 Inference for Regression.
Marshall University School of Medicine Department of Biochemistry and Microbiology BMS 617 Lecture 13: Multiple, Logistic and Proportional Hazards Regression.
Stats 242.3(02) Statistical Theory and Methodology.
Skolkovo PRESENTATION
JRC – Territorial Development Unit Petros Gkotsis 08 March 2017
South Carolina Economic Summit
Is this the Asian Century?
EVAAS Overview.
Tabulations and Statistics
Hypothesis Testing.
Retrieval Performance Evaluation - Measures
Objectives 6.1 Estimating with confidence Statistical confidence
Objectives 6.1 Estimating with confidence Statistical confidence
Fractional-Random-Weight Bootstrap
Presentation transcript:

1 Are New ‘Silicon Valleys’ Emerging? The Distribution of Superstar Patents across US States Carolina Castaldi* and Bart Los** *ECIS, School of Innovation Sciences, Eindhoven University of Technology ** Groningen Growth and Development Center (GGDC), University of Groningen, DIMETIC Summerschool, Pécs, Hungary 7 July 2010

2 Outline of Research Project Do “Liberal Market Economies” have a comparative advantage in producing important inventions, in comparison to “Coordinated Market Economies”? (Hall & Soskice, 2001) Citation data from US Patent and Trademark Office not suitable for international comparisons. Overall objectives of the current project: –To gain knowledge about the relative technology- specific ability of US States to generate ‘superstar’ patents –To detect trends in spatial patterns of superstar invention over time

Superstars 3 Power Law/Pareto distribution: income distribution Alternative: Lognormal distribution Many other phenomena display similar statistical regularities: Size distributions of cities (Eeckhout, Levy, AER 2009) Size distributions of files on the WWW (Mitzenmacher, 2004) Distributions of citations to patents (indicator of importance of the underlying invention) are also known to have heavy tails (Silverberg & Verspagen, JEctrics, 2007)

4 Stylized fact: Fat tails Curved part: lognormally distributed Linear part: Pareto distributed Drees-Kaufmann-Lux procedure to estimate cut-off point (Silverberg & Verspagen, 2007) Some inventions act as “focusing devices” (Rosenberg 1969) or initiate new paradigms (Dosi, 1982); see Sanditov (2006) Cutoff Biotech= 17 citations (bs mean 22), Heating=33 citations (bs mean 32.3)

5 Data NBER Patent-Citations Datafile –Book by Jaffe and Trajtenberg ( MIT Press, 2001) –Update of database by Bronwyn Hall (2006) –2009 update cannot be used, since geographic data on invention is missing Numbers of citations ( ) to all utility patents granted by USPTO in Our subset: (application year) Only patents granted to a US-based first inventor Classification of patents in 31 of the 36 technological fields used in Hall et al. (2002)

6 Comparing citations received by patents: problems Point of departure: patents that receive more citations in subsequent patents have more value Problem 1: Patenting behavior varies across technology categories Problem 2: Citations are not received immediately Problem 3: Citation behavior varies over time

7 Comparing citations received by patents: solutions Top patents determined by constructing citation-based rankings by category and application year for all patents issued; A first measure: top quantile (Hall & Trajtenberg, 2005; Akkermans, Castaldi & Los, 2009, Research Policy) An data-driven measure: Distinction between superstar patents and regular patents based on stylized fact that tail of size distribution is Pareto

8 Application of tail estimation routine DK routine (based on Hill-estimator) applied for every category and year: two parameters estimated: –Cut-off point: nr superstar patents = patents with citations larger than cut-off point –Alpha: “fatness” of the Pareto tail Confidence intervals for estimated counts obtained via bootstrap (Castaldi & Los, 2008, working paper) The overall analysis revealed two problems

9

by Category Problem 1: High variability

11 Problem 2: Truncation Different citation lags for superstar vs regular patents (e.g. cited half- life for 1980 patents in “information storage”: 7 years for regular patents; 12 years for superstar patents) => not very timely indicator

12 Our proposal for a more timely indicator A probabilistic approach: developing a model which predicts the likelihood of a patent to become superstar based on a limited set of years Logistic regressions predicting probability p ak,i for patent i with –a=age (citation window, at least 5 years) –category k Regressors: category- and age-specific variables that might predict eventual ‘superstarness’ at early ages

13 Probabilistic approach ncit = number of citations received (ln(ncit+1)) frec = fraction of citations in most recent half of existence; GEN=measure of generality; Regressions were done for patents applied for in period –Age/citation window from a=5 to a=20 –To control for high variability of DK estimates, we use the bootstrap mean to single out superstar patents –Estimates used to assess the probabilities of eventual superstarness for more recent patents ( ) –Why not predictions for ? a=5, and many patents applied for in 2001/2002 are not in database because they had not been granted yet. Standardized by year

Regression Results Category k=9 information storage (bold numbers: significantly different from 0 at 5%) CONSTCITFRECGENR2R2 k=9a= a= a= a= Average patent: odds are 1:1060 that it will be superstar Average patent: odds are 1:84000 that it will be superstar

15 Truncation problem solved…

Technologies: Emergence and Demise Ratio of 3-year moving averages of numbers of superstar patents between 1994 and 1976 < 0.7: –Agriculture, food and textiles (0.59); Heating; Organic compounds; Apparel and textiles; Motors, engines and parts. Ratio of 3-year moving averages of numbers of superstar patents between 1994 and 1976 > 3.0 –Drugs; Semiconductor devices; Surgery and medical instruments; Computer peripherals; Computer hardware and software; Biotechnology (12.56, from to ) Ratios of shares of superstars in all patents ( ): –Agriculture etc. (0.62); Heating (0.93); Drugs (0.77); Semiconductor devices (0.81) Biotechnology (1.11, from 8.6% to 9.5%) 16

Shares of Superstars in Total (selected technologies)

The Geographic Aspect 18 Concentration indicators over states (all technology classes). 50 States + Washington DC + Puerto Rico

Superstar Generators (blue: 1976, red: 1994) 19 ID VT NH Numbers of superstars scaled by population (in mlns.)

States: Emergence and Demise Ratio of 3-year moving averages of numbers of superstar patents between 1994 and 1976 < 0.8: –West Virginia (0.39); Oklahoma (0.67); Delaware (0.74) Ratio of 3-year moving averages of numbers of superstar patents between 1994 and 1976 > 4.0 –Idaho (24.9); Vermont (4.70); Oregon (4.36); Georgia (4.08) Ratios of shares of superstar patents in all patents, between 1994 and 1976: –West Virginia (0.43); Oklahoma (0.76); Delaware (0.66); Idaho (4.44); Vermont (1.28); Oregon (1.64); Georgia (1.67) 20

New Silicon Valleys? No systematic summary yet, though: –Idaho: no superstar patents in semiconductors in , on average 15 per year in ; –Vermont: mainly small state effect; –Oregon: very good performance in computer hardware and software, less than 1 superstar patent per year in the first 11 years, almost 9 on average in ; –Georgia: solid superstar patenting performance in several technologies, i.e. Biotechnology, communications and computer hardware and software 21

22 Conclusions New operationalization of top inventions: Tail estimators allow endogenous determination More timely indicator thanks to probabilistic method Relative size of the tail differs across fields Results track the emergence of ‘new technologies’ => we can use patent data to identify emerging technology fields and link them US States also emerge and decline with regard to technological leadership. The trends are clearer when superstar patents are considered. Reality check: link the identified superstar patents to case studies