Alex Wade, Microsoft Research And acknowledging Walter L. Warnick, Ph.D. Director, Office of Scientific and Technical Information U.S. Department of Energy.

Presentation transcript:

Multilingual WorldWideScience: Accelerating Discovery through Multilingual Translations
Alex Wade, Microsoft Research
And acknowledging Walter L. Warnick, Ph.D., Director, Office of Scientific and Technical Information, U.S. Department of Energy

Science Advances Only if Knowledge is Shared
“If I have seen further it is only by standing on the shoulders of giants.” Sir Isaac Newton
Corollary 1: Scientific discovery can be accelerated by accelerating access to worldwide scientific information. The case for WorldWideScience.org.
Corollary 2: Multilingual translations of science will further accelerate scientific discovery. The case for Multilingual WorldWideScience.org.

The “Accelerating” Power of WorldWideScience.org
Overcoming the researcher’s practical limitations:
1. Not knowing “what’s out there.” (examples: Korean medical journals, Australian Antarctic data, South African scientific research database)
2. Inadequate time to search scientific databases one by one. (examples: UK PubMed Central, Ginsparg’s arXiv.org)
3. Inability to sort compiled results by relevance.
By filling these gaps, WorldWideScience.org has accelerated access to scientific information.

Brief History: Federated Search and WorldWideScience.org
The Deep Web, where science is:
- hundreds of times larger than the “surface web”
- generally not “googleable,” or searchable, by major search engines

Deep Web Solution: Federated Searching
- A single user query is sent simultaneously to multiple deep web databases.
- The federated search engine sorts and presents results in relevance-ranked order.
- Overcomes the 3 practical limitations.
- No burden on individual database “owners.”
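The federated search idea described above can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the two in-memory `SOURCES`, the term-count scoring, and the function names are hypothetical stand-ins, not how WorldWideScience.org or any particular federated search engine is implemented.

```python
from concurrent.futures import ThreadPoolExecutor

# Hypothetical in-memory stand-ins for independent deep web databases.
SOURCES = {
    "source_a": ["solar cell efficiency", "wind turbine noise", "solar thermal storage"],
    "source_b": ["antarctic ice core data", "solar cell degradation"],
}


def search_source(name, query):
    """Search one source; score each record by how many query terms it contains."""
    terms = query.lower().split()
    hits = []
    for title in SOURCES[name]:
        words = title.lower().split()
        score = sum(1 for term in terms if term in words)
        if score > 0:
            hits.append((score, name, title))
    return hits


def federated_search(query):
    """Send one query to every source at once, then merge and rank the results."""
    with ThreadPoolExecutor() as pool:
        per_source = pool.map(lambda name: search_source(name, query), SOURCES)
        merged = [hit for hits in per_source for hit in hits]
    # Highest score first; ties broken by source name, then title.
    return sorted(merged, key=lambda hit: (-hit[0], hit[1], hit[2]))


results = federated_search("solar cell")
for score, source, title in results:
    print(score, source, title)
```

The sources are only queried, never copied or modified, which mirrors the point that federated search places no burden on the database owners.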

Federated Search Examples
Science.gov: searches across all U.S. federal science agencies’ databases (200 million pages)
Similar, but different, experiences outside science:
- Kayak.com: “compare hundreds of travel sites at once”
- Pricegrabber.com: comparison shopping across multiple merchants

Global Federated Search
Taking the Science.gov model global: WorldWideScience.org
- Initial partnership between U.S. Department of Energy and the British Library (2007)

Global Federated Search
- Transition to multilateral governance (WorldWideScience Alliance) and ICSTI sponsorship (2008)

WorldWideScience – Facts and Figures
Tremendous growth in search content:
- From 10 nations to 65 nations in 3 years
- More than 400 million pages
- From well-known sources (e.g., PubMed, CERN, KoreaScience) to more obscure sources (e.g., Bangladesh Journals Online)

WorldWideScience – Fills Key Niche in Scientific Discovery
In a comparison of search results from identical queries on WWS, Google, and Google Scholar, there was only 3.5% overlap (i.e., WorldWideScience is 96.5% unique).
Accelerated access → accelerated discovery: the case for WorldWideScience.org
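The “unique” figure is simply the complement of the overlap. A minimal sketch of that arithmetic follows; the document identifiers and result-set sizes are invented for illustration and are not the data behind the comparison reported above.

```python
# Hypothetical result sets; the identifiers and sizes are made up.
wws_results = {f"doc{i}" for i in range(200)}
other_results = {f"doc{i}" for i in range(7)} | {f"web{i}" for i in range(50)}

# Results that WWS shares with the other search engines.
shared = wws_results & other_results

overlap_pct = 100 * len(shared) / len(wws_results)
unique_pct = 100 - overlap_pct

print(f"overlap: {overlap_pct}%, unique: {unique_pct}%")
```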

Now, the case for Multilingual WorldWideScience.org...

Consider this... While English is the lingua franca for science, these are the world’s most widely spoken languages:

Increasing Globalization of Science Calls for Multilingual Search Capabilities...
- Is there Science beyond English? Initiatives to increase the quality and visibility of non-English publications might help to break down language barriers in scientific communication (Meneghini and Packer, Nature, 2007)
- Science’s Language Problem: As globalization increases, communication between linguistic communities could become a serious stumbling block (Barany, Business Week, 2005)
- Science on the Rise in Developing Countries (Holmgren and Schnitzer, PLoS Biology, 2004)

Of the world’s “top 400” institutional repositories, 250, or 63%, have some or all non-English content.
Examples:
- HAL CNRS: French
- Kyoto University Research Repository: Japanese
- Leiden University Digital Repository: Dutch
- CSIC (Spanish National Research Council)
(Source: Cybermetrics Lab, Spain)

Major Non-English Science “Producers”

China

Russia

Japan, France, Germany, Brazil... and many other countries.

To further accelerate access to science, multilingual translations are needed in both directions:
- Translation of English content for non-English speakers
... and ...
- Translation of non-English content for English speakers

Up until now, real-time translation of science has been limited.
- Generally limited to translating from one language into another single language at one time.
- Not deployed on deep web scientific databases.
- Results less than perfect with complex scientific language (note that it’s still not perfect but is constantly improving)

Now, we have the essential ingredients for real-time translation of science:
- National science databases in multiple languages
- Federated search
- Multilingual translation on both front and back end of the user experience
- A public-private partnership, introduced as Multilingual WorldWideScience.org Beta

Here’s how it works...
1. A Chinese scientist submits a query in Chinese to Multilingual WorldWideScience.org.
2. MWWS.org uses Microsoft to translate the Chinese query into individual languages of source databases (English, French, Portuguese, Russian, etc.)
3. MWWS.org sends the translated queries to corresponding databases, which search their contents and return results in native languages to MWWS.org.
4. MWWS.org uses Microsoft to translate native language results into Chinese and presents results to the user in relevance-ranked order.
Conversely, an English-speaking user could have a query translated into languages of non-English databases and then get results back in English.
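The four steps above can be sketched as a small pipeline. This is only an illustration under stated assumptions: the lookup-table `translate` function stands in for a real machine translation service, the `DATABASES` and their records are hypothetical, and relevance ranking is omitted for brevity. None of it reflects the actual MWWS.org or Microsoft interfaces.

```python
# Hypothetical lookup tables standing in for a machine translation service.
TRANSLATIONS = {
    ("zh", "en"): {"太阳能": "solar energy"},
    ("zh", "fr"): {"太阳能": "énergie solaire"},
    ("en", "zh"): {"Solar energy storage review": "太阳能储存综述"},
    ("fr", "zh"): {"Stockage de l'énergie solaire": "太阳能储存"},
}

# Hypothetical source databases: name -> (native language, record titles).
DATABASES = {
    "en_db": ("en", ["Solar energy storage review", "Wind power forecasting"]),
    "fr_db": ("fr", ["Stockage de l'énergie solaire"]),
}


def translate(text, src, dst):
    """Toy translator: look the text up, or return it unchanged if unknown."""
    if src == dst:
        return text
    return TRANSLATIONS.get((src, dst), {}).get(text, text)


def multilingual_search(query, user_lang):
    """Step 1 is the user's query arriving in their own language."""
    results = []
    for name, (db_lang, records) in DATABASES.items():
        # Step 2: translate the query into the database's language.
        native_query = translate(query, user_lang, db_lang)
        # Step 3: the database searches its contents in its native language.
        for title in records:
            if native_query.lower() in title.lower():
                # Step 4: translate each native-language result back for the user.
                results.append((name, translate(title, db_lang, user_lang)))
    return results


hits = multilingual_search("太阳能", "zh")
for source, title in hits:
    print(source, title)
```

Swapping `user_lang` to `"en"` and supplying English-to-source tables would cover the converse case, where an English-speaking user searches non-English databases.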

View Demo

With the launch of Multilingual WorldWideScience.org, we are...
- Opening vast reservoirs of heretofore under-utilized scientific knowledge
- Providing equal access to science for anyone on the Internet
- Promoting scientific collaboration, participation, and transparency
... and accelerating scientific discovery!