Realtime Financial Monitoring and Analysis System May 2010 Lietu Search Engine.

Slides:



Advertisements
Similar presentations
1 OOA-HR Workshop, 11 October 2006 Semantic Metadata Extraction using GATE Diana Maynard Natural Language Processing Group University of Sheffield, UK.
Advertisements

Modelling with expert systems. Expert systems Modelling with expert systems Coaching modelling with expert systems Advantages and limitations of modelling.
Chapter 1 Business Driven Technology
By: Mr Hashem Alaidaros MIS 211 Lecture 4 Title: Data Base Management System.
The database approach to data management provides significant advantages over the traditional file-based approach Define general data management concepts.
Unit 7: Store and Retrieve it Database Management Systems (DBMS)
Company LOGO Stock Price Forecasting with Support Vector Machines based on Web Financial Information Sentiment Analysis Run Cao School of Information Renmin.
Data Mining Glen Shih CS157B Section 1 Dr. Sin-Min Lee April 4, 2006.
Information Retrieval in Practice
April 22, Text Mining: Finding Nuggets in Mountains of Textual Data Jochen Doerre, Peter Gerstl, Roland Seiffert IBM Germany, August 1999 Presenter:
Semantic Web and Web Mining: Networking with Industry and Academia İsmail Hakkı Toroslu IST EVENT 2006.
DATA MINING CS157A Swathi Rangan. A Brief History of Data Mining The term “Data Mining” was only introduced in the 1990s. Data Mining roots are traced.
Building an Intelligent Web: Theory and Practice Pawan Lingras Saint Mary’s University Rajendra Akerkar American University of Armenia and SIBER, India.
Chapter 3 Database Management
Mastering the Internet, XHTML, and JavaScript Chapter 7 Searching the Internet.
Business Intelligence Tracy Hartley Zack Johnson Marissa McGee Tracy Hartley Zack Johnson Marissa McGee.
What it’s all about.
Fundamentals of Information Systems, Second Edition 1 Organizing Data and Information Chapter 3.
Chapter 14 The Second Component: The Database.
Databases Chapter 11.
© 2005 Prentice Hall, Decision Support Systems and Intelligent Systems, 7th Edition, Turban, Aronson, and Liang 8-1 Chapter 8 Enterprise Information Systems.
Overview of Web Data Mining and Applications Part I
Overview of Search Engines
Knowledge Process Outsourcing1 Turning Information into Knowledge... for YOU The Gyaan Team.
Chapter 2: Business Intelligence Capabilities
Media trends - market data correlation Assuming mass media events can have a significant impact to the market environment - service determines how informative.
Data Mining By Andrie Suherman. Agenda Introduction Major Elements Steps/ Processes Tools used for data mining Advantages and Disadvantages.
Databases & Data Warehouses Chapter 3 Database Processing.
More than words: Social networks’ text mining for consumer brand sentiments A Case on Text Mining Key words: Sentiment analysis, SNS Mining Opinion Mining,
Copyright © 2014 McGraw-Hill Education. All rights reserved
Enabling Organization-Decision Making
Katanosh Morovat.   This concept is a formal approach for identifying the rules that encapsulate the structure, constraint, and control of the operation.
Chapter 5 Lecture 2. Principles of Information Systems2 Objectives Understand Data definition language (DDL) and data dictionary Learn about popular DBMSs.
Search Engines and Information Retrieval Chapter 1.
@ ?!.
IS Today (Valacich & Schneider) 5/e Copyright © 2012 Pearson Education, Inc. Published as Prentice Hall 10/5/ With the help of their data warehouse.
1 Computational Linguistics Ling 200 Spring 2006.
Introduction to Web Mining Spring What is data mining? Data mining is extraction of useful patterns from data sources, e.g., databases, texts, web,
Chapter 2 Architecture of a Search Engine. Search Engine Architecture n A software architecture consists of software components, the interfaces provided.
Chapter 8 Database Applications and Implications.
Copyright © 2013 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Business Plug-In B18 Business Intelligence.
Database Design Part of the design process is deciding how data will be stored in the system –Conventional files (sequential, indexed,..) –Databases (database.
Chapter 1 Business Driven Technology MANGT 366 Information Technology for Business Chapter 1: Management Information Systems: Business Driven MIS.
Chapter 3 DECISION SUPPORT SYSTEMS CONCEPTS, METHODOLOGIES, AND TECHNOLOGIES: AN OVERVIEW Study sub-sections: , 3.12(p )
5 - 1 Copyright © 2006, The McGraw-Hill Companies, Inc. All rights reserved.
Chapter 4 Data and Databases. Learning Objectives Upon successful completion of this chapter, you will be able to: Describe the differences between data,
6.1 © 2010 by Prentice Hall 6 Chapter Foundations of Business Intelligence: Databases and Information Management.
Incorporating News into Algorithmic Market Trading Presented by Philip Gagner, Vice President RavenPack International, S.L. With the kind assistance of.
Next Generation Search Engines Ehsun Daroodi 1 Feb, 2003.
Data Mining BY JEMINI ISLAM. Data Mining Outline: What is data mining? Why use data mining? How does data mining work The process of data mining Tools.
Digital Libraries1 David Rashty. Digital Libraries2 “A library is an arsenal of liberty” Anonymous.
Semantic web Bootstrapping & Annotation Hassan Sayyadi Semantic web research laboratory Computer department Sharif university of.
Pertemuan 16 Materi : Buku Wajib & Sumber Materi :
Intelligent Database Systems Lab N.Y.U.S.T. I. M. 1 Mining knowledge from natural language texts using fuzzy associated concept mapping Presenter : Wu,
NATURAL LANGUAGE PROCESSING Zachary McNellis. Overview  Background  Areas of NLP  How it works?  Future of NLP  References.
Waqas Haider Bangyal. 2 Source Materials “ Data Mining: Concepts and Techniques” by Jiawei Han & Micheline Kamber, Second Edition, Morgan Kaufmann, 2006.
Data Mining Concepts and Techniques Course Presentation by Ali A. Ali Department of Information Technology Institute of Graduate Studies and Research Alexandria.
Integrated Departmental Information Service IDIS provides integration in three aspects Integrate relational querying and text retrieval Integrate search.
Decision Support and Business Intelligence Systems (9 th Ed., Prentice Hall) Chapter 12: Artificial Intelligence and Expert Systems.
Data mining in web applications
Information Retrieval in Practice
Popular Database Management Systems
Introduction BIM Data Mining.
Research on Knowledge Element Relation and Knowledge Service for Agricultural Literature Resource Xie nengfu; Sun wei and Zhang xuefu 3rd April 2017.
Organization and Knowledge Management
Datamining : Refers to extracting or mining knowledge from large amounts of data Applications : Market Analysis Fraud Detection Customer Retention Production.
What is Pattern Recognition?
Building Topic/Trend Detection System based on Slow Intelligence
Information Retrieval and Web Design
Presentation transcript:

Realtime Financial Monitoring and Analysis System May 2010 Lietu Search Engine

Agenda Status Quo Analysis  Software Design  Software System Structure  System Advantage  Successful Application Cases

Stock market analysis is based on large-volume,real-time information, internet information possess the due characteristics. Large-volume internet information provide large-volume valuable information, which necessitates machine-based online content-mining. Online financial information data-mining relates to internet technology, vertical search engine, information theory, machine learning, natural language process, finance, quantitative. Now it is a new cross-domain research field. Online financial information data-mining has become a very popular and important research area in information data-mining. Realtime Financial Monitoring and Analysis System

Stock recommendation Take flammable ice concept for example, many people don’t know which stock might benefit from flammable ice concept. Whilst after the event, stock price already high up there. Analysis of stock price uprising Event driven mechanism model. Analysis of event correlation Analysis of hot event impact on every stock, and the impact degrees. ”dubai”negative impact on” 振华重工 ”, but in a long term,”dubai crisis”is temporary. When “ 振华重工 ” price down low enough to stable, and “dubai crisis” dwindle away, we can buy “ 振华重工 ”. Analysis of bulk commodity price which impact on stock price Major Applications

Realtime financial monitoring and analysis system take advantage of internet search engine and machine learning technologies, real-time analyze the event that impact stock price and data-mining correlation information which influence the listed due companies, and provide some criteria for real-time predictions of stock market. Main functions include: 1 、 Recoginition and clustering of information which impact stock price, which include recoginition and clustering of policies and accidental events. 2 、 Data-mining of correlations between listed companies, which refers to mutual stock-share holdings and industry chain analysis. 3 、 Analysis and evaluation of influential event’s impact on stock-cluster and individual stock. 4 、 Analysis of stock-cluster and individual stock, finding out the underlying reasons for hot stock-cluster shaping, and facilitates continuous tracking. Main Functions

Technology rationale  Natural language processing technology One Chinese text appears as a string composed of Chinese characters. Characters can form word, word can form word cluster, word cluster can form sentense, and go on to paragraph, chapter, etc. Character, word, word cluster, sentence, paragraph…whatever level explanation, within different syntax, there might be different meanings. Generally, nearly they all can be resolved right according to the due environment, and do not have ambiguity, that’s why we can communicate fluently with natural language. However, to eliminate ambiguity, we need very large volume knowledge and reasoning. How to collect and select out such knowledge, and store it with due form to computer system, and how to effectively eliminate ambiguity with them, is a very immense and laborious work.  Internet data mining Internet data mining technology take advantage of the internet web crawler, whole text retrieval, and integrate with intelligence, pattern recognition, neural-net, and significantly differ from internet search engine in that it can automatically comprehend user’s natural information need and search out the best match through pattern matching.

Technology rationale ( Continue )  Text tendency analysis Text tendency analysis has now become a very conspicuous topic in natural language processing and machine learning domain. Through text tendency analysis, we can grasp text writer’s emotion tendency. In financial domain, news public opinion is a critical criterion that shows investor’s, trader’s and regulator’s view points and emotions, they have identical importance with all other financial statistics such as trading statistics and macro statistics. According to current references, text tendency analysis can be carried out through two factors. The first is to calculate a value for one text, the sign represents the text’s tendency, and the absolute value represents the text’s influence. The second is to classify a text to category—roughly positive or negative. From these two factors, text tendency analysis is a statistics problem, and also a classification problem. Now, many researchers do it as a classification problem. Besides title classification, text tendency classification needs more concern, which can be carried out from two factors: one is machine learning, another is semantic analysis. Now, text tendency analysis is applied to English, Chinese and Arabic.

innovation highlights Main modules of the project:  Recognition and clustering of stock-price sensitive information: real- time monitor policy issues and accidental events on internet, extract and cluster them.  Data-mining of inter-relationship of public listed companies: construct public listed companies stock holding relationship and industry chain relationship, and evaluate their mutual influence within the relationship context.  Evaluation of stock-price sensitive information impact on stock cluster and individual stock: according to public listed companies inter- relationship and industry chain relationship, find out the stock cluster and stocks which may be influenced by stock-price sensitive policy or event, calculate the influence degree of the event impact on individual stock.  Stock cluster and individual stock analysis: aiming at stock-price sensitive policy or event and the industry chain relationship, provide analysis of stock cluster and individual stock, including short-term evaluation and long-term evaluation.

Module Component Structure Search Engine Database Search Engine Management DBMS Text Mining Knowledge Database Output InterfaceProgram Interface Console Classification, Clustering, Correlation Analysis......

Product Frame Finance Information Mining Basic Function Applications Segment Tagger Emotion Recognition Page Anti- noise Text Abstraction Correlation Analysis Text Clustering Text Classification Trend Analysis Association Analysis Display Functions Chart Generation Report Generation Control Console Others Backbone System Index Database

DEMO

Case study  Lietu Internet Monitor  Lietu Enterprise Search

Thanks!