Relations Between National Research Investment Input and Publication Output: Application to an American Paradox R. D. Shelton Sponsored by a sabbatical.

Slides:



Advertisements
Similar presentations
Why is Europe so efficient at producing scientific papers, and does this explain the European Paradox? Patricia Foland and R. D. Shelton WTEC 11th International.
Advertisements

Here we add more independent variables to the regression.
Experiments and Variables
1 “China’s Remarkable Progress in Science and Technology” Akito Arima P resident of Japan Science Foundation, Former Minister of Science and Technology.
Lecture 3 HSPM J716. Efficiency in an estimator Efficiency = low bias and low variance Unbiased with high variance – not very useful Biased with low variance.
Regression Analysis Once a linear relationship is defined, the independent variable can be used to forecast the dependent variable. Y ^ = bo + bX bo is.
The Race for World Leadership of Science and Technology: Status and Forecasts R. D. Shelton and P. Foland WTEC Baltimore, Maryland, USA Word version in.
Chapter 13 Multiple Regression
Review: What influences confidence intervals?
Chapter 12 - Forecasting Forecasting is important in the business decision-making process in which a current choice or decision has future implications:
Asia’s role in the future Randy Kluver Institute for Pacific Asia Texas A&M University.
Simple Linear Regression
Contribution of ICT to Economic Growth in Asia ITS 15 th Biennial Conference Berlin, Germany, September 7, 2004 Takahito Kanamori (Waseda University) Masahiro.
Multiple Regression Models
1 Simple Linear Regression Chapter Introduction In this chapter we examine the relationship among interval variables via a mathematical equation.
Discovering and Describing Relationships
Ch. 14: The Multiple Regression Model building
C82MCP Diploma Statistics School of Psychology University of Nottingham 1 Linear Regression and Linear Prediction Predicting the score on one variable.
1 “European R&D Benchmarking (2002) “European R&D Benchmarking (2002)” Science, Technology and Innovation Policy Student Presentations Students: Miguel.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 13-1 Chapter 13 Introduction to Multiple Regression Statistics for Managers.
1 Chapter 20 Two Categorical Variables: The Chi-Square Test.
Example 10.1 Experimenting with a New Pizza Style at the Pepperoni Pizza Restaurant Concepts in Hypothesis Testing.
Introduction to Linear Regression and Correlation Analysis
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 12-1 Chapter 12 Simple Linear Regression Statistics for Managers Using.
Chapter 13: Inference in Regression
Is Europe falling behind? Copyright’s impact on data mining in academic research C. Handke, L. Guibault, J.J. Vallbé, LIBER, Institute for Information.
Linear Trend Lines Y t = b 0 + b 1 X t Where Y t is the dependent variable being forecasted X t is the independent variable being used to explain Y. In.
DIGEST OF KEY SCIENCE AND ENGINEERING INDICATORS 2008 Presentation Slides National Science Board.
Classical Economics & Relative Prices. Classical Economics Classical economics relies on three main assumptions: Classical economics relies on three main.
Estimation of Statistical Parameters
Chapter 14 Introduction to Multiple Regression
Gas class #1 OBJECTIVE: understanding the nature of gases, the KMT, the 4 variables to measure gases, and the first demo diagram for gases.
Why the US lost leadership in scientific papers, and will soon lose in other indicators R. D. Shelton and Lance Miller WTEC Sponsored by NSF coop agreement.
Managerial Economics Demand Estimation & Forecasting.
1 Psych 5500/6500 The t Test for a Single Group Mean (Part 1): Two-tail Tests & Confidence Intervals Fall, 2008.
Manijeh Keshtgary Chapter 13.  How to report the performance as a single number? Is specifying the mean the correct way?  How to report the variability.
Causal Connections Between Scientometric Indicators R. D. Shelton, Tarek Fadel, Patricia Foland Which Ones Best Explain High-Technology Manufacturing Outputs?
BPS - 5th Ed. Chapter 221 Two Categorical Variables: The Chi-Square Test.
Number System. Do not train children to learning by force and harshness, but direct them to it by what amuses their minds, so that you may be better able.
1 Making a Grope for an Understanding of Taiwan’s Scientific Performance through the Use of Quantified Indicators Prof. Dr. Hsien-Chun Meng Science and.
Do New SCI Journals Have a Different National Bias? R. D. Shelton, Patricia Foland, and Roman Gorelskyy Sponsored by a sabbatical from Loyola and NSF Grant.
Review I A student researcher obtains a random sample of UMD students and finds that 55% report using an illegally obtained stimulant to study in the past.
Welcome to MM570 Psychological Statistics
Copyright ©2011 Pearson Education, Inc. publishing as Prentice Hall 14-1 Chapter 14 Introduction to Multiple Regression Statistics for Managers using Microsoft.
EXCEL DECISION MAKING TOOLS BASIC FORMULAE - REGRESSION - GOAL SEEK - SOLVER.
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc.. Chap 14-1 Chapter 14 Introduction to Multiple Regression Basic Business Statistics 10 th Edition.
Which Scientometric Indicators Best Explain National Performance of High-Tech Outputs? R. D. SheltonTarek Fadel WTEC ITRI Presented at the Collnet Conference,
Regression. Outline of Today’s Discussion 1.Coefficient of Determination 2.Regression Analysis: Introduction 3.Regression Analysis: SPSS 4.Regression.
Introduction to Multiple Regression Lecture 11. The Multiple Regression Model Idea: Examine the linear relationship between 1 dependent (Y) & 2 or more.
Statistics for Managers Using Microsoft Excel, 5e © 2008 Prentice-Hall, Inc.Chap 14-1 Statistics for Managers Using Microsoft® Excel 5th Edition Chapter.
26134 Business Statistics Week 4 Tutorial Simple Linear Regression Key concepts in this tutorial are listed below 1. Detecting.
Bibliometric evidence for empirical trade-offs in national funding strategies Duane Shelton and Loet Leydesdorff ISSI 2011 Durban.
BPS - 5th Ed. Chapter 221 Two Categorical Variables: The Chi-Square Test.
ECONOMIC INDICATORS GROSS DOMESTIC PRODUCT (GDP) & GROSS NATIONAL PRODUCT (GNP) MEASURING ECONOMIC GROWTH.
The Law of Averages. What does the law of average say? We know that, from the definition of probability, in the long run the frequency of some event will.
26134 Business Statistics Week 4 Tutorial Simple Linear Regression Key concepts in this tutorial are listed below 1. Detecting.
Analysis of AP Exam Scores
Chapter 14 Introduction to Multiple Regression
Chapter 4 Basic Estimation Techniques
Introduction to Regression Analysis
Basic Estimation Techniques
R. D. Shelton and Hassan B. Ali WTEC
Basic Estimation Techniques
Why is Europe so efficient at producing scientific papers, and does this explain the European Paradox? Patricia Foland and R. D. Shelton WTEC 11th International.
Review: What influences confidence intervals?
National Efficiencies in Publishing Scientific Papers
Chapter 7: The Normality Assumption and Inference with OLS
R. D. Shelton and P. Foland WTEC Baltimore, Maryland, USA
R. D. Shelton Tarek Fadel WTEC ITRI
Regression Part II.
Presentation transcript:

Relations Between National Research Investment Input and Publication Output: Application to an American Paradox R. D. Shelton Sponsored by a sabbatical from Loyola and NSF Grant ENG

What’s the Problem? “European Paradox” is the perceived failure of Europe to cash in on its leadership of science publishing I use "American Paradox" to mean the long term decline in U.S. share of science publishing, despite world-leading investments in R&D, particularly as that decline has become worse “Why” is today’s subject The methods are also useful for other countries; e.g. the EU is developing the same syndrome

The American Paradox: Declining SCI Paper Shares Despite Huge R&D Investments (GERD*) *Gross domestic expenditures on R&D (OECD Constant $ PPP), paper share from NSI CD Paper Share % NSF analyzed paradox for 6 years. GERD $Billions

EU Recently Developed the Same Syndrome Paper Share % GERD $Billions

Outline 1.Is it just that the SCI is a rubber ruler? (Hint: no) 2.Which inputs are most important: a regression approach (Hint: look for the money, but not just any money.) 3.A theory that can account for U.S. (and EU) decline, and forecast the future (Hint: it's simple, m = kw)

1. Is it Just That the SCI is a Rubber Ruler? Hypothesis: In response to criticism that it is biased toward English language and the U.S., the Science Citation Index may have changed to be less favorable to the U.S. This could account for some of the rapid decline in U.S. publication since Method: For a sample of 8 fields, partition their journals into 2 sets: New journals added to the SCI after 1994, and Old added before. Then measure U.S. share of each journal to see if difference between Old and New is significant

Example Analysis of U.S. Share in SCI

Sample of 8 Fields of Science Working paper at

Conclusions from Bias Study In some fields (space science, math, and microbiology) New journals were much less favorable to the U.S. But, in some fields the opposite was true In aggregate over 8 fields the change in bias was too small to account for the sharp changes in national shares Most share differences between Old and New journals are not statistically significant Therefore, hypothesis is not proven, and is unlikely over 24 fields in the National Science Indicator CD Thus the shifts in national shares are real, and are probably not an artifact of the SCI database

2. Since Total GERD Is Not Increasing U.S. Papers, Which Inputs are Most Important? Multiple Linear Regression Can Help Identify Which National Publication Systems (The Black Box) Resources In Papers Out

A Simple Regression Scattergram for 1999 This example has only one IV, of course. Regression Line: Papers99 = GERD99 Slope is about 1 paper / $1 million, and R = U.S. and Japan are far off the graph to the upper right

Multiple Linear Regressions: Drilling Down with 2 Independent Variables (IV) per Level DV is papers out. Significance level is 0.05, but IV1 is always very significant (Sig ). R > 0.9 always. Level1. IV1 = GERD (Total national R&D) IV2 = Number of Researchers (not significant) Level2. IV1 = Government Investment part of GERD IV2 = Private Sector Investment (not significant) Level3. IV1 = Civil Part of Government Investment IV2 = Government Defense Investment (less significant and a smaller coefficient) Details in table in Appendix

Regression Findings: In Producing Papers… Research investments are much more important than the number of researchers. Government investments are much more important than those from industry. Government investments in non-defense sectors are somewhat more important than their investment in defense. For 2003 the ratio of their IV coefficients has increased to 2. Not surprising to bibliometricians, but regression quantifies these statements. Causality can’t be proven, but strong and weak associations can be found.

Is This Why the EU Passed the U.S. in Science Publishing in 1995? (Greater Government Non-Defense R&D) May account for EU gaining on U.S. before 1995, but we still need to find why loss of U.S. share accelerated then, here shown as leveling of US Papers curve

Conclusions from Regression Money is the key input variable, but some types of investments are more effective U.S. Government investments are not as efficient at producing papers as in the EU: too much defense R&D, and not enough NSF These inefficiencies may account for the EU passing the U.S. in papers, but are not large enough to account for the recent sharp decline in U.S. paper share

3. A Theory That Can Account for U.S. Decline, and Predict its (Bleak) Future Since we can’t blame the recent sharp decline in U.S. share on a rubber ruler from SCI or inefficient allocations by the U.S., we have to cast a wider net– overseas, in fact. We will look at four regions: US, EU-15, "Asian Tigers" (China, Taiwan, S. Korea, and Singapore), and rest of the world (ROW). Usually the “world” will be the OECD countries, including affiliates.

More Detailed Model of Publication System (Inside the Black Box) $ Inputs US EU AT ROW Papers Published National Research Systems -- Fairly Independent Highly Interdependent Paper Selection Journal Editors g1g1 p1p1 G (total)P (total) w i = g i /G GERD share m i = p i /P Paper share

Search for a Model of System Since SCI does not increase P much in a year, total papers is almost a zero-sum game, and is one for share m i (on a fractional count basis) Thus competition with other nations strongly influences U.S. success We know that the critical input is money, but we need a U.S. series that declines to get a strong positive R with declining U.S. paper share

US GERD has risen, but others have risen faster. Is the shape of this curve familiar? An Input Series that Involves Both the Investments of the U.S. and Others.

US Paper Share Depends More on Its Share of GERD Than Its Total GERD

m i is share of papers published (fractional basis) w i is the share of GERD for the OECD Group k i is a "constant" of proportionality; it differs by country. k i is also the efficiency of country i in producing papers per $1 million in GERD, normalized by the OECD average efficiency. For data in a single year the equation is an identity, but it is most useful over a range of years when k i is approximately constant A Simple Model for Country i m i = k i w i

Since 1998, k i Has Been Fairly Constant EU is about 40% more efficient; AT is about 40% less efficient. Source for papers: NSF S&EI 2006 (fractional counts)

Input Variable w i for m i = k i w i U.S. GERD share declined because of aggressive investments by the Asian Tigers. ROW is fairly flat. Since these add to 100%, U.S. and EU both lost GERD share to AT. Can you forecast what is going to happen next?

Paper Share Fractional Count Lately, EU shows the same decline as U.S.

The Future? Asian Tigers have announced plans to continue to rapidly increase R&D investment. The American Competitiveness Initiative is small compared to Asian plans--no net increase EU seems to be faltering in meeting Lisbon goals Thus, U.S. and EU investment shares will probably continue to be lost to the Asian Tigers Based on the theory here, leadership of science publication will probably continue to shift from the U.S. and EU to Asia But, science itself is not zero-sum; a spirited competition helps all

Appendix Also see review draft of text paper at: Complete citations are there.

Detailed Regression Results: DV is papers unless stated

More Regression Findings: USG Allocations I can’t resist drilling down yet one more level, to the R&D funding by each U.S. agency, using AAAS data U.S. government investment has been up in recent years, but most of that is in defense. Even the non-defense budget has been up some, but almost all that increase is at National Institutes of Health. Except for NIH, non-defense government research investment has been flat in constant dollars since Since there has been very little increase in U.S. publication in healthcare fields during this interval, this resulted in a loss of publication share.

No Bang Per Buck: NIH Papers and Budgets (Papers in Psych., Clinical Meds, Pharma, Immune, Neuro.) Budgets are in constant $

Now Use Model with Constant k i to See If It Could Have Predicted Paper Share Decline NSI CD whole counts, actual GERD used each year.

Accuracy of Model Using 1995 Value of k i Fractional count basis, actual GERD shares used.