From OSM-L to JAVA Cui Tao Yihong Ding. Overview of OSM.


1 From OSM-L to JAVA Cui Tao Yihong Ding

2 Overview of OSM

3 OSM
• OSM (Object-oriented Systems Model)
– Used for system analysis, specification, design, implementation, and evaluation
– Structural components: object sets and relationship sets
  Object set: generalization/specialization
  Relationship set: n-ary relationships, cardinality constraints
– Usually shown graphically

4 Sample OSM for Cars (Graphic Version)
[Diagram. Object sets: Car, Year, Price, Make, Mileage, Model, Feature, PhoneNr, Extension. Relationship-set labels: has, is for. Participation constraints shown: 0..1, 0..*, 1..*.]

5 OSM-L and Ontology
• OSM-L: a textual language for representing OSM application models.
• Ontology: a program written in OSM-L that provides the database schema, relationship sets, and a knowledge base to the extractor.
• Each application domain requires a new ontology, written according to the user's request.

6 Car-Ads Ontology
Car [->object];
Car [0..1] has Year [1..*];
Car [0..1] has Make [1..*];
Car [0..1] has Model [1..*];
Car [0..1] has Mileage [1..*];
Car [0..*] has Feature [1..*];
Car [0..1] has Price [1..*];
PhoneNr [1..*] is for Car [0..*];
PhoneNr [0..1] has Extension [1..*];
Year matches [4] constant
  { extract "\d{2}";
    context "([^\$\d]|^)[4-9]\d,[^\d]";
    substitute "^" -> "19"; },
…
End;
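A declaration such as `Car [0..1] has Year [1..*];` is regular enough to be read mechanically. The sketch below is only an illustration, assuming a binary declaration of the form `Name [min..max] verb Name [min..max];`. The class `RelationshipDecl` and its field names are hypothetical and are not taken from OSM-L or from the authors' parser.

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical holder for one binary relationship-set declaration.
final class RelationshipDecl {
    // A participation constraint such as [0..1] or [1..*].
    private static final String CARD = "\\[(\\d+)\\.\\.(\\d+|\\*)\\]";
    // Name [min..max] verb-phrase Name [min..max];
    private static final Pattern DECL = Pattern.compile(
            "\\s*(\\w+)\\s*" + CARD + "\\s+(.+?)\\s+(\\w+)\\s*" + CARD + "\\s*;\\s*");

    final String from, fromMin, fromMax, verb, to, toMin, toMax;

    private RelationshipDecl(Matcher m) {
        from = m.group(1); fromMin = m.group(2); fromMax = m.group(3);
        verb = m.group(4);
        to = m.group(5); toMin = m.group(6); toMax = m.group(7);
    }

    // Parses one declaration line or rejects it with an exception.
    static RelationshipDecl parse(String line) {
        Matcher m = DECL.matcher(line);
        if (!m.matches()) {
            throw new IllegalArgumentException("Not a binary relationship declaration: " + line);
        }
        return new RelationshipDecl(m);
    }

    public static void main(String[] args) {
        RelationshipDecl d = parse("PhoneNr [1..*] is for Car [0..*];");
        System.out.println(d.from + " [" + d.fromMin + ".." + d.fromMax + "] "
                + d.verb + " " + d.to + " [" + d.toMin + ".." + d.toMax + "]");
    }
}
```

The upper bound is kept as a string so that `*` (unbounded) needs no special numeric encoding.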

7 Data Extraction

8 Information Exchange
[Diagram: a Source and a Target, annotated "Leverage this …" (Information Extraction) and "… to do this" (Schema Matching).]

9 Extracting Pertinent Information from Documents

10 Recognition and Extraction

Car   Year  Make    Model      Mileage  Price  PhoneNr
0001  1989  Subaru  SW                  $1900  (363)835-8597
0002  1998          Elandra                    (336)526-5444
0003  1994  HONDA   ACCORD EX  100K            (336)526-1081

Car   Feature
0001  Auto
0001  AC
0002  Black
0002  4 door
0002  tinted windows
0002  Auto
0002  pb
0002  ps
0002  cruise
0002  am/fm
0002  cassette stero
0002  a/c
0003  Auto
0003  jade green
0003  gold

11

12 OSM
Object Set
Relationship Set { connection { object set, constraint } }
Structure: Nonlexical, Lexical { object name, data frame }
Data frame { extraction rule, context rule, substitution rule, keyword }
Schema Generation Interface
Schema implements Table-Insertion Interface { relational database tables, insert methods }
Matching Process
Retrieved Data
Database Population Interface

13 Parser and Symbol Table
• Generate the parse tree
• Design the structure of the symbol table
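The slide does not say how the symbol table is structured. As one possible shape, assuming the table is keyed by object-set name and records the relationship sets and data-frame rules attached to each name, a minimal sketch might look like this. `SymbolTable`, `Entry`, and all method names are hypothetical.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical symbol table: one entry per object set named in the ontology.
final class SymbolTable {
    static final class Entry {
        final String name;
        // Relationship-set declarations this object set takes part in.
        final List<String> relationships = new ArrayList<>();
        // Data-frame rules (extract, context, substitute, keyword) for lexical object sets.
        final List<String> dataFrameRules = new ArrayList<>();

        Entry(String name) { this.name = name; }
    }

    // Insertion order is kept so that later stages see names in declaration order.
    private final Map<String, Entry> entries = new LinkedHashMap<>();

    // Returns the existing entry for a name, or creates it on first use.
    Entry declare(String name) {
        return entries.computeIfAbsent(name, Entry::new);
    }

    // Returns null when the name has never been declared.
    Entry lookup(String name) {
        return entries.get(name);
    }

    int size() {
        return entries.size();
    }

    public static void main(String[] args) {
        SymbolTable table = new SymbolTable();
        table.declare("Car").relationships.add("Car [0..1] has Year [1..*]");
        table.declare("Year").relationships.add("Car [0..1] has Year [1..*]");
        table.declare("Year").dataFrameRules.add("extract \"\\d{2}\"");
        System.out.println(table.size() + " object sets declared");
    }
}
```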

14 Data Extraction

15 Extraction Rules
Define the expected pattern of the string to extract.

16 Context Rules
Define the context in which the target pattern must appear.

17 Substitution Rules
Define how an extracted string is rewritten, where a substitution applies.
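The three rule kinds on slides 15 to 17 can be tried out together on the Year data frame from the Car-Ads ontology. The sketch below reuses the three expressions shown there. The order of application (find the context first, then extract inside it, then substitute) is an assumption made for illustration, and `YearDataFrame` and `extractYear` are hypothetical names.

```java
import java.util.Optional;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical application of the Year data frame from the Car-Ads ontology.
final class YearDataFrame {
    // context "([^\$\d]|^)[4-9]\d,[^\d]": a two-digit year followed by a comma,
    // not preceded by a dollar sign or another digit.
    static final Pattern CONTEXT = Pattern.compile("([^\\$\\d]|^)[4-9]\\d,[^\\d]");
    // extract "\d{2}": the two digits themselves.
    static final Pattern EXTRACT = Pattern.compile("\\d{2}");

    // Assumed order: context match, then extraction within it, then substitution.
    static Optional<String> extractYear(String text) {
        Matcher context = CONTEXT.matcher(text);
        if (!context.find()) {
            return Optional.empty();
        }
        Matcher value = EXTRACT.matcher(context.group());
        if (!value.find()) {
            return Optional.empty();
        }
        // substitute "^" -> "19": prefix the century at the start of the value.
        return Optional.of(value.group().replaceFirst("^", "19"));
    }

    public static void main(String[] args) {
        System.out.println(extractYear("Good cond. 89, Subaru SW"));
        System.out.println(extractYear("Price $89,000 firm"));
    }
}
```

With these expressions the first ad yields 1989, while the price in the second ad is rejected because the digits are preceded by a dollar sign and followed by more digits.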

18 Keywords
Define keywords that resolve ambiguity when it arises.
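As an illustration of keyword-based disambiguation, suppose a bare number could be either a mileage or a price. The sketch below looks for keywords in a small window around the candidate value. The keyword lists, the window approach, and the names `KeywordDisambiguator` and `classify` are all assumptions for this example and are not taken from the presentation.

```java
import java.util.regex.Pattern;

// Hypothetical keyword check for deciding which object set a value belongs to.
final class KeywordDisambiguator {
    // Assumed keyword lists, for illustration only.
    static final Pattern MILEAGE_KEYWORDS =
            Pattern.compile("\\b(miles|mileage|mi)\\b", Pattern.CASE_INSENSITIVE);
    static final Pattern PRICE_KEYWORDS =
            Pattern.compile("\\$|\\b(price|asking|obo)\\b", Pattern.CASE_INSENSITIVE);

    // Classifies the candidate at text[start, end) by the keywords found
    // within `window` characters on either side of it.
    static String classify(String text, int start, int end, int window) {
        int from = Math.max(0, start - window);
        int to = Math.min(text.length(), end + window);
        String neighborhood = text.substring(from, to);
        boolean mileage = MILEAGE_KEYWORDS.matcher(neighborhood).find();
        boolean price = PRICE_KEYWORDS.matcher(neighborhood).find();
        if (mileage && !price) return "Mileage";
        if (price && !mileage) return "Price";
        return "Unknown"; // no keyword, or conflicting keywords
    }

    public static void main(String[] args) {
        String ad = "1994 HONDA ACCORD EX, 100K miles";
        int start = ad.indexOf("100K");
        System.out.println(classify(ad, start, start + 4, 12));
    }
}
```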

19 Knowledge Representation
• Current knowledge base
– Static
– Needs peripheral programs
• Our proposed knowledge base
– Functional
– Adaptive
– Object-oriented

20 Schema Generation
• Domain
• Attribute
• Relation
• Constraint

21 Schema Generation
if (!existTable("car"))
    createStatement(createTable("createCar"));
createCar = "create table Car(
    ObjNr char(4) primary key,
    VIN char(4) unique,
    Make char(10),
    :
    PhoneNr char(20)
)";

22 Schema Generation
if (!existTable("Feature"))
    createStatement(createTable("createFeature"));
createFeature = "create table Feature(
    ObjNr char(4) primary key,
    Feature char(20)
)";

23 Schema Generation
if (!existTable("Extension"))
    createStatement(createTable("createExtension"));
createExtension = "create table Extension(
    PhoneNr char(14) primary key,
    Extension char(3)
)";
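The create-table strings on slides 21 to 23 all follow one pattern, so they could be generated from a list of column definitions instead of being written by hand. The sketch below shows one way to do that, assuming each column is given as a name plus a type and constraint string. `SchemaGenerator` and `createTableSql` are hypothetical names, and no database connection is involved.

```java
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.StringJoiner;

// Hypothetical generator for the create-table statements shown on the slides.
final class SchemaGenerator {
    // Builds "create table T(col1 def1, col2 def2)" with no trailing comma.
    // Column order follows the iteration order of the map.
    static String createTableSql(String table, Map<String, String> columns) {
        StringJoiner cols = new StringJoiner(", ", "create table " + table + "(", ")");
        for (Map.Entry<String, String> column : columns.entrySet()) {
            cols.add(column.getKey() + " " + column.getValue());
        }
        return cols.toString();
    }

    public static void main(String[] args) {
        Map<String, String> extension = new LinkedHashMap<>();
        extension.put("PhoneNr", "char(14) primary key");
        extension.put("Extension", "char(3)");
        System.out.println(createTableSql("Extension", extension));
    }
}
```

Using a joiner with a separator avoids the dangling comma before the closing parenthesis, which most SQL dialects reject.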

24 Insert Data
• Collect all the values available for each object
• Find the position of each inserted value
• Insert the values for each object
Data record table: Data.attribute, Data.value, Data.objNr
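The three steps on this slide can be sketched as follows, assuming each extracted record carries the `objNr`, `attribute`, and `value` fields named on the slide, and that the column order of the target table is known. `RowBuilder`, `buildRows`, and `insertSql` are hypothetical names. The SQL text is built by hand only to keep the example self-contained; with a live JDBC connection a `PreparedStatement` would be the safer choice.

```java
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical row assembly from extracted (objNr, attribute, value) records.
final class RowBuilder {
    static final class Data {
        final String objNr, attribute, value;

        Data(String objNr, String attribute, String value) {
            this.objNr = objNr;
            this.attribute = attribute;
            this.value = value;
        }
    }

    // Steps 1 and 2: collect the values per object and place each value
    // at the position of its column. Missing values stay null.
    static Map<String, String[]> buildRows(List<String> columns, List<Data> records) {
        Map<String, String[]> rows = new LinkedHashMap<>();
        for (Data d : records) {
            int position = columns.indexOf(d.attribute);
            if (position < 0) continue; // attribute is not a column of this table
            String[] row = rows.computeIfAbsent(d.objNr, k -> new String[columns.size()]);
            row[position] = d.value;
        }
        return rows;
    }

    // Step 3: one insert statement per object, with ObjNr as the first column.
    static String insertSql(String table, String objNr, String[] row) {
        StringBuilder sql = new StringBuilder("insert into " + table + " values ('" + objNr + "'");
        for (String v : row) {
            sql.append(", ").append(v == null ? "null" : "'" + v.replace("'", "''") + "'");
        }
        return sql.append(")").toString();
    }

    public static void main(String[] args) {
        List<String> columns = Arrays.asList("Year", "Make", "Model");
        List<Data> records = Arrays.asList(
                new Data("0001", "Make", "Subaru"),
                new Data("0001", "Year", "1989"),
                new Data("0001", "Model", "SW"));
        buildRows(columns, records).forEach(
                (objNr, row) -> System.out.println(insertSql("Car", objNr, row)));
    }
}
```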

25 Populate Database

