Click to edit Master text styles – Second Level – Third Level Solving Customer Problems with Big Data across Thomson Reuters Brian Director,

Slides:



Advertisements
Similar presentations
Leveraging an Integrated ERP and CRM System - Featuring Sage MAS 500 ERP and Sage SalesLogix CRM.
Advertisements

Integrating ChemAxon technology into your End User Applications Java solutions for cheminformatics Ver. Mar., 2005.
Almaden Research Center © 2006 IBM Corporation IOP 06 Open Source Intelligence Lesson Learned.
Distributed Data Processing
ERP Applications Selection in a Changing Marketplace Evaluation of Software Providers for Midsize Institutions Bill Reed Director, Special Projects Northern.
Dashboards Slide by ana’s presentation. Tired of these challenges? No centralized view of executive information from multiple functional areas and systems;
ASCR Data Science Centers Infrastructure Demonstration S. Canon, N. Desai, M. Ernst, K. Kleese-Van Dam, G. Shipman, B. Tierney.
Steve Jordan Director. Industry Solutions 05/05/14 Managing Chaos: Data Movement in 2014.
Systems Analysis and Design in a Changing World
ARCH-01: Introduction to the OpenEdge™ Reference Architecture Don Sorcinelli Applied Technology Group.
Principal Patent Analyst
Cultural Diversity Based on the example of Thomson Reuters 17th May 2011.
Semantic Web and Web Mining: Networking with Industry and Academia İsmail Hakkı Toroslu IST EVENT 2006.
Enterprise Architecture The Arkansas Approach. Key Areas What is enterprise architecture? Why is it important? How you can participate Current status.
About Thomson Reuters 2 Markets Division At $13B in 2009 revenue, we are the leading source of intelligent information for the world’s businesses and.
With the Help of the Microsoft Azure Platform, Devbridge Group Provides Powerful, Flexible, and Scalable Responsive Web Solutions MICROSOFT AZURE ISV PROFILE:
8 Systems Analysis and Design in a Changing World, Fifth Edition.
Annual World Bank Conference on Land and Poverty 2015 Gasant Jacobs; South Africa Director: Business Development March 2015 The Use of Technology in Land.
An innovative platform to allow translation and indexing of internet sites Localization World
Today’s Agenda Bill Presentment Overview Demo. Tailoring Your Invoices with Oracle’s Bill Presentment Architecture March 7, 2005.
FINANCIAL MARKETS “A MARKET IN WHICH PEOPLE AND ENTITIES CAN TRADE FINANCIAL SECURITIES, COMMODITIES, AND OTHER FUNGIBLE ITEMS OF VALUE AT LOW TRANSACTION.
Accounting Information Systems (ACCT 312) XBRL: eXtensible Business Reporting Language PowerPoint Presentations.
A Robust Health Data Infrastructure P. Jon White, MD Director, Health IT Agency for Healthcare Research and Quality
Software Developer Career. ◦ Desktop Program development ◦ Web Program Development ◦ Mobile Program Development.
Guillaume Rivalle APRIL 2014 MEASURE YOUR RESEARCH PERFORMANCE WITH INCITES.
What is BAM?. :Contents *Definition *Description *Goals and benefits *BAM Applications *BAM components.
Copyright © 2014 Oracle and/or its affiliates. All rights reserved. | Oracle Health Sciences Global Business Unit Strategy Steve Rosenberg Senior Vice.
1 Talal Abu Ghazaleh Information Technology International (TAG-ITI)
© VESP International Pty Limited To Contents Slide CLICK to advance slides/ bullet points within slides Integrated Master Planner An Overview.
The power of thought Misys Asset Management Systems Enterprise Application Integration.
Effective User Services for High Performance Computing A White Paper by the TeraGrid Science Advisory Board May 2009.
Adra Match BALANCER: Balance Sheet Reconciliation Software Powered by the Microsoft Azure Cloud MICROSOFT AZURE ISV PROFILE: ADRA MATCH Adra Match develops.
Indiana University Professional Opportunities Orientation Program September 25, 2001 Presented by: Brian Oliver Laura Bissett Sarah Leinweber
Hospitality Management. The Big Problem in Hospitality Poor task and event execution of corporate strategy costs – 2% – 5% of Annual Revenues Analysts.
CHAPTER TEN AUTHORING.
OOI CI LCA REVIEW August 2010 Ocean Observatories Initiative OOI Cyberinfrastructure Architecture Overview Michael Meisinger Life Cycle Architecture Review.
Data Warehousing Data Mining Privacy. Reading Bhavani Thuraisingham, Murat Kantarcioglu, and Srinivasan Iyer Extended RBAC-design and implementation.
Ocean Observatories Initiative Data Management (DM) Subsystem Overview Michael Meisinger September 29, 2009.
2007. Software Engineering Laboratory, School of Computer Science S E Web-Harvest Web-Harvest: Open Source Web Data Extraction tool 이재정 Software Engineering.
THE IMPORTANCE OF IPR ACROSS THE LIFECYCLE OF INNOVATION Bob Stembridge Principal Patent Analyst, IP & Science.
Advanced Semantics and Search Beyond Tag Clouds and Taxonomies Tom Reamy Chief Knowledge Architect KAPS Group Knowledge Architecture Professional Services.
Datalayer Notebook Allows Data Scientists to Play with Big Data, Build Innovative Models, and Share Results Easily on Microsoft Azure MICROSOFT AZURE ISV.
Reporting & Analytics Stephen Chan Senior Solution Consultant.
Information Integration 15 th Meeting Course Name: Business Intelligence Year: 2009.
Achieving Semantic Interoperability at the World Bank Designing the Information Architecture and Programmatically Processing Information Denise Bedford.
IoT Meets Big Data Standardization Considerations
Satisfying Requirements BPF for DRA shall address: –DAQ Environment (Eclipse RCP): Gumtree ISEE workbench integration; –Design Composing and Configurability,
Microsoft Azure and ServiceNow: Extending IT Best Practices to the Microsoft Cloud to Give Enterprises Total Control of Their Infrastructure MICROSOFT.
CONFIDENTIAL© Copyright Seal Software Limited. All Rights Reserved Contract Discovery and Analytics SR14-1: Resolution and Recovery Planning Seal.
THOMSON REUTERS PROFESSIONAL SERVICES. THOMSON REUTERS PATENT CONTENT 98% of world’s filed patents.
Enterprise Alert on Microsoft Azure Fully Automates Critical Incident Communication and Transforms It into an Intelligent, Reliable, and Mobile Experience.
Vision: Increase regional sharing and collaboration in order to expedite the delivery and adoption of energy efficiency. Conduit is brought to you by NEEA.
LEADING TAX DEPARTMENTS FORWARD John Diamond, Director Ryan Lynch, Director Tax’s World of Data February 26, 2016.
Tools for Effective Evaluation of Science InCites David Horky Country Manager – Central and Eastern Europe
Voyager Search. INTRODUCTION › Established in 2008 › Self-funded and privately owned › Geospatial search and data management › Leverages Open Source technology.
E-Business Infrastructure PRESENTED BY IKA NOVITA DEWI, MCS.
Chapter 8 Environments, Alternatives, and Decisions.
A UNIFIED ECOSYSTEM FOR MARKET DATA VISUALIZATION
THE RAPID-START ENTERPRISE SERVICE DESK
Microsoft Dynamics.
Content & the Supply Chain
Oscar AP by Massive Analytic: A Precognitive Analytics Platform for Effortless Data-Driven Decisions. Now Available in Azure Marketplace MICROSOFT AZURE.
Stop Data Wrangling, Start Transforming Data to Intelligence
Blockchain technology at Change Healthcare
Adra ACCOUNTS: Transaction Matching Software Powered by the Microsoft Azure Cloud That Helps Optimize the Accounting and Finance Processes MICROSOFT AZURE.
Collaborative Business Solutions
Jonathan Griffin, Managing Director, IFIS Publishing &
KEY INITIATIVE Financial Data and Analytics
Gartner for Sales Leaders
Jack G. Conrad, Thomson R&D
Presentation transcript:

Click to edit Master text styles – Second Level – Third Level Solving Customer Problems with Big Data across Thomson Reuters Brian Director, David Innovation Lab Thomson Reuters STRATA + HADOOP 2015

Click to edit Master text styles – Second Level – Third Level THOMSON REUTERS GLOBAL RESOURCES Who is Thomson Reuters? 2 REUTERS NEWS Powered by more than 2,800 journalists reporting in 20 languages from bureaus around the world, Reuters is the world’s largest international news organization FINANCIAL & RISK INTELLECTUAL PROPERTY & SCIENCE LEGAL Comprehensive IP & scientific information, decision support tools & services to enable governments, academia, publishers, corporations & law firms. Critical information, decision support tools, software & services to legal, investigation, business and government professionals. Critical news, information & analytics, enables transactions, and connects trading, investing, financial and corporate professionals. TAX & ACCOUNTING Integrated tax compliance and accounting information, software & services for professionals in accounting firms, corporations, law firms and government.

Click to edit Master text styles – Second Level – Third Level Data Overview: One company, Boehringer Ingelheim News Broker Research Bonds Fundamentals Press Releases Case Law Admin Decisions Public Records Dockets Arbitration 180 Editorial Analysis docs Scientific Articles Patents Trademarks Domain Names Clinical Trials Drugs Three Vs at TR: Velocity from fractions of seconds to quarterly filings. Volume: all the data needed by target professionals Variety: multiple disparate content, formats, languages.

Click to edit Master text styles – Second Level – Third Level Thomson Reuters Data Innovation Lab Started in July 2014 PhD and MS from leading universities, MIT, Columbia, UC Berkeley… Business expertise in Finance, Government, Academia, Software and Hardware Technology and Life Sciences

Click to edit Master text styles – Second Level – Third Level End User Need: Peer Detection Fairness Opinion Comparable Companies for benchmarking Buyside and sellside research M&A practitioners Supply chain Transfer Pricing Peer detection is a common task across customer segments:

Click to edit Master text styles – Second Level – Third Level Peers in Eikon (Public Companies)

Click to edit Master text styles – Second Level – Third Level Peers in Eikon (Private Companies)

Click to edit Master text styles – Second Level – Third Level Use Case: Peer detection Fundamental workflow: for any given company, which are its most similar companies? Increase the scope of companies Improve the quality of peer recommendations Provide multiple flavors of peer lists Allow end user control and customization Provide transparency and explanations for the recommendations

Click to edit Master text styles – Second Level – Third Level Key tasks in peer detection Find content sets with potential signals Classify/ extract and store signals Clean data Resolve to authorities Create a company fingerprint through a list of ranked attributes Compose a similarity metric based on the different data sources Provide an interactive user interface to visualize and fine tune the recommendations

Click to edit Master text styles – Second Level – Third Level Datasets News Trademarks Patents Wikipedia Fundamentals Deals Starmine Peers Press Releases – (TR Curated Data)

Click to edit Master text styles – Second Level – Third Level THOMSON REUTERS GLOBAL RESOURCES Patents Similarity between patent portfolios Derwent Patent database – approximately 50 million patents - Associate patents with companies - Select a set of attributes that defines a company patent portfolio - Based on these attributes establish a similarity measure - Neighbors of companies in the network can be considered peer candidates - Clustering this network gives technology areas

Click to edit Master text styles – Second Level – Third Level THOMSON REUTERS GLOBAL RESOURCES Aside: Visualizing the Derwent Ontology

Click to edit Master text styles – Second Level – Third Level THOMSON REUTERS GLOBAL RESOURCES Patent Assignees: Obfuscation and Trolls Patent “Trolls” often try to hide their status as assignee of patents. We characterize assignees by ratio of plaintiff to defendant role in patent litigation. Identifying NPE assignees requires de-obfuscating names.

Click to edit Master text styles – Second Level – Third Level Tools for normalization & access ENTITY, FACT AND EVENT EXTRACTION, TOPICAL CLASSIFICATION CONCORDANCE AND RESOLUTION SERVICES ORGANIZATION AND PEOPLE MASTERS CENTRALIZED CONTENT ACCESS

Click to edit Master text styles – Second Level – Third Level Open Calais A free to use external version of our entity, fact and event extraction engine. New Calais releases will rely on TR authorities. Assign Permanent Identifier (PermID) to entities. Better quality and disambiguation Leverage the TR identity management of entities Stay tuned for 2015

Click to edit Master text styles – Second Level – Third Level Eikon/Open Eikon The Open Eikon project is transforming Eikon into a platform for 3rd parties.

Click to edit Master text styles – Second Level – Third Level THOMSON REUTERS GLOBAL RESOURCES Demo Front end: AngularJS D3 Eikon framework Aggregation engine: Java All communications RESTful with json services

Click to edit Master text styles – Second Level – Third Level THOMSON REUTERS GLOBAL RESOURCES Lessons Learned/Agile Approach Agree on a deliverable Extensible architecture Flexible interaction –Let user determine how they want to drill into information. –One metric doesn’t fit all. Agree on a contract Start by integration Short milestones Small, self selected teams In and out of comfort zones

Click to edit Master text styles – Second Level – Third Level Wish List for the research community Increased automation for precise information integration Automated curation upon acquisition or ingest from various formats including pdf, XML into structured forms Achieving scalable inference on large graphs Managing rights and permissions Supporting accessibility and navigation Provenance tracking Data visualization at scale, across diverse data sets

Click to edit Master text styles – Second Level – Third Level Questions? Yes, we are hiring!