Presentation is loading. Please wait.

Presentation is loading. Please wait.

Patent Citation Networks Bernard Gress Fannie Mae Inc., Washington DC. Forthcoming in The.

Similar presentations


Presentation on theme: "Patent Citation Networks Bernard Gress Fannie Mae Inc., Washington DC. Forthcoming in The."— Presentation transcript:

1 Patent Citation Networks Bernard Gress http://student.ucr.edu/~gressb01 Fannie Mae Inc., Washington DC. bernard_gress@fanniemae.com Forthcoming in The Mathematica Journal http://www.mathematica-journal.com/

2 The Patent Citation Dataset Patent citations are part of the legal patent process where the patent applicant has the duty to disclose any knowledge of 'prior art' amongst previous patents. Some objectivity in the process is provided by the government patent examiner who is supposed to be an expert in the area and who approves the final citation. The network established by patent citations allows one to trace the flow of technology through time, from patent to patent, and across fields. Studies of technological spillover effects, the impact or influence of individual patents, the rates of technological development, and other such issues, can be assisted by the consideration of patent citations.

3 The Patent Citation Dataset - continued Hall, Jaffe, and Trajtenberg, and the National Bureau of Economic Research (NBER) (http://www.nber.org/patents/). The primary database (cite75_99.zip) contains 22,309,440 pairs of pair-wise patent citation dataset on more than 3 million U.S. patents granted between January 1963 and December 2002. The secondary database (pat63_02f.txt) contains records for 3,414,910 patents with 25 fields each.

4 Structure of Primary Database (cite75_99.zip)

5 Structure of Secondary Database (pat63_02f.txt)

6 Patent Numbers Issued Serially

7 Two Types of Citation Networks A Citation Lineage –all of the progenitors and descendants by citation reference, so long as no siblings are brought into the picture A Citation Neighborhood –all those patents that are within a specified network distance of the patent of interest, regardless of relationship, including all 'siblings' and 'cousins'.

8 There are 14 nodes for the 1- generation lineage of patent #3858382: PatentLineage[3858382,1] –PatentsOfInterest ® {3858382}, –PrintRules ® {1 ® 3858382, 2 ® 1794517, 3 ® 2045678, 4 ® 2069266, 5 ® 2790591, 6 ® 3044233, 7 ® 3100569, 8 ® 3468100, 9 ® 3646723, 10 ® 4085822, 11 ® 4316353, 12 ® 4750694, 13 ® 4863125, 14 ® 5054646, 15 ® 6250501} –Relations ® {3858382 ® 4085822, 3858382 ® 4316353, 3858382 ® 4750694, 3858382 ® 4863125, 3858382 ® 5054646, 3858382 ® 6250501, 1794517 ® 3858382, 2045678 ® 3858382, 2069266 ® 3858382, 2790591 ® 3858382, 3044233 ® 3858382, 3100569 ® 3858382, 3468100 ® 3858382, 3646723 ® 3858382} –Vertexes ® {3858382, 1794517, 2045678,2069266, 2790591, 3044233, 3100569, 3468100, 3646723, 4085822, 4316353, 4750694, 4863125, 5054646, 6250501} –IndexPairs ® {{1,10},{1,11},{1,12}, {1,13},{1,14}, {1,15}, {2,1},{3,1}, {4,1},{5,1},{6,1},{7,1}, {8,1},{9,1}} –IndexRules ® {1 ® 10, 1 ® 11, 1 ® 12, 1 ® 13, 1 ® 14, 1 ® 15, 2 ® 1, 3 ® 1, 4 ® 1, 5 ® 1, 6 ® 1, 7 ® 1, 8 ® 1, 9 ® 1}

9 There are 15 nodes for the 1-generation Neighborhood of patent #3858382: PatentNeighborhood[3858382,1] –PatentsOfInterest ® {3858382} –PrintRules ® {1 ® 3858382, 2 ® 1794517, 3 ® 2045678, 4 ® 2069266, 5 ® 2790591, 6 ® 3044233, 7 ® 3100569, 8 ® 3468100, 9 ® 3646723, 10 ® 4085822, 11 ® 4316353, 12 ® 4750694, 13 ® 4863125, 14 ® 5054646, 15 ® 6250501} –Relations ® {1794517 ® 3858382, 2045678 ® 3858382, 2069266 ® 3858382, 2790591 ® 3858382, 3044233 ® 3858382, 3100569 ® 3858382, 3468100 ® 3858382, 3646723 ® 3858382, 3858382 ® 4085822, 3858382 ® 4316353, 3858382 ® 4750694, 3858382 ® 4863125, 3858382 ® 5054646, 3858382 ® 6250501} –Vertexes ® {3858382, 1794517, 2045678, 2069266,2790591, 3044233, 3100569, 3468100, 3646723, 4085822, 4316353, 4750694, 4863125, 5054646, 6250501} –IndexPairs ® {{1,10}, {1,11}, {1,12}, {1,13}, {1,14}, {1,15}, {2,1}, {3,1},{4,1},{5,1},{6,1}, {7,1}, {8,1}, {9,1}} –IndexRules ® {1 ® 10, 1 ® 11, 1 ® 12,1 ® 13, 1 ® 14, 1 ® 15, 2 ® 1, 3 ® 1, 4 ® 1, 5 ® 1, 6 ® 1, 7 ® 1, 8 ® 1, 9 ® 1}

10 Mathematica has Nice Built-in Graph Visualization Functions for Unstructured Graphs: GraphPlot GraphPlot3D ShowGraph But to Plot Graphs Over Time then Have to Use My Function: PatentPlot

11 Citation Networks Over Time - continued The 2-Generation Lineage of 3858382

12 Citation Networks Over Time - continued The 2-Generation Neighborhood of 3858382

13 GraphPlot[PatentNeighborHood[ {3858382, 4597749}, 2]]

14 A nice illustration of the spread of technology over time.

15 Coloring nodes by criteria I also add functions to color nodes and edges by different patent characteristics, e.g. –Patent Technology Category (2- and 4-digit HJT) –Patent Originality/ Generality Index –Total Number of Citations

16 GraphPlot3D[PatentNeighborhood[ 3858382, 7]]

17 GraphPlot[PatentNeighborhood[3858382,12]] Colored by technology category

18 Time Constrained The 7-Generation Neighborhood of #3858382, Colored by Technology Class

19 Network Statistics and Structure Analysis Citation Lags Network Curvature Citation Count Distributions HJT Technology Categories Originality and Generality

20 Distributions of Backward Lags

21

22 Network Curvature the average number of patents reached at subsequent network distances -some simple graphs and their respective curvature plots-

23 Network Curvature the average number of patents reached at subsequent network distances

24 A much larger network of 91,000 patents over 40 years Curvature graphs for each year

25 Curvature graphs for each year, all together

26 Curvature graphs for each year, all together, different view

27 Patent Technological Composition

28 HJT Technology Category Distribution

29 Cumulative distribution of patents by tech category

30 Citation Count Distributions

31 Citation Count Distributions - continued

32

33

34 Generality and Originality where J is the number of patent classes, N i is the total number of forward citations for patent i, and N i,j is the number of forward citations in each patent class for patent i. The second term is a Herfindal-type of index. The 'Originality' of Patent 'i' is the same, except with backwards citations (i.e. citations made). "Thus if a patent cites previous patents that belong to a narrow set of technologies, the originality score will be low, whereas Citing patents in a wide range of fields would render a higher score."

35 Generality and Originality - Continued Not very interesting - at least no trends over time – and seemingly no necessary relationship to the concepts they intend to measure.

36 Conclusions Mathematica is a nice platform for networks analysis There is a lot of opportunity for research in this area Don’t know what the value of this research is to the IPI-ConfEx clientele

37 References [1] B. Hall, Jaffe, Trajtenberg, "The NBER Patent Citations Data File: Lessons, Insights and Methodological Tools," 2002, http://emlab.berkeley.edu/users/bhhall/pat/ NBERpatdata.pdf http://emlab.berkeley.edu/users/bhhall/pat/ NBERpatdata.pdf [2] S. Wolfram, A New Kind of Science, : 2002


Download ppt "Patent Citation Networks Bernard Gress Fannie Mae Inc., Washington DC. Forthcoming in The."

Similar presentations


Ads by Google