
1 © NCSR, Paris, December 5-6, 2002. WP1: Plan for the remainder (1)

Ontology
  - Enrich the lexicons for the 1st domain based on partners' remarks (RTV) (mid December)
  - Finalise section in D1.3(a) (2nd domain, portability, maintenance, constraint checking, evaluation, enhancement of ontology role) (RTV) (mid December)
  - Examine the use of the PROTÉGÉ API instead of the XML files (NCSR, RTV) (mid January)
  - Ontology and lexicons for the 2nd domain (RTV coordinates, partners send to RTV) (mid January)
  - Final report on ontologies for D1.3(b) (RTV) (mid February)

Corpus formation for the needs of page filtering
  - New version of corpus formation tool and guidelines for the 2nd domain (use of new ontology and lexicons) (NCSR sends to partners) (mid January)
  - Corpus formation for the 2nd domain (partners send to NCSR) (end of January)
  - Final report on corpus formation task for 2nd domain for D1.3(b) (NCSR) (early February)

Web spidering (NCSR)
  - Finalise site navigator (early January)
  - Finalise page filtering experiments for the 2nd domain (mid February)
  - Finalise link scoring experiments for the 2nd domain (mid February)
  - Final report on web spidering for D1.3(b) (mid February)

2 WP1: Plan for the remainder (2)

Focused Crawling Tool
  - Language identification for 1st domain: examine Lingway’s tool, evaluation, related work on LI for Web sites (EDIN, Lingway, partners annotate the 150 sites) (mid December)
  - Final evaluation results for 1st domain (EDIN, NCSR) (mid December)
  - Final report of Focused Crawling for 1st domain for D1.3(a) (EDIN) (mid December)
  - Focused crawling for 2nd domain, Evaluation Results (EDIN) (end January)
  - Final report of Focused Crawling for 2nd domain for D1.3(b) (EDIN) (mid February)

Other tools for web pages collection
  - Meta-TIDY for 2nd domain (NCSR, Lingway) (end January)
  - Cross-merge for 2nd domain (NCSR) (end January)
  - Documentation (NCSR) (mid February)

Integration of all the tools involved in the collection process (NCSR)
  - Integration for the 1st domain, Report for D1.3(a) (mid December)
  - Integration for the 2nd domain, Report for D1.3(b) (mid February)

3 WP1: Plan for the remainder (3)

Corpus collection for the needs of NERC and FE
  - Collection of web pages for the 2nd domain according to the methodology agreed in Paris (NCSR sends to partners) (end January)
  - Report on the corpus collection task for 2nd domain for D1.3(b) (NCSR) (mid February)

Web Annotator
  - Final version for 2nd domain (NCSR sends to partners) (end January)
  - Report for D1.3(b) (NCSR) (mid February)

Deliverable D1.3 (NCSR)
  - D1.3(a) (mid December)
  - D1.3(b) (end February)

4 WP2: Plan for the remainder (1)

NERC DTD
  - Finalise issues on 2nd domain DTD (JOB_TITLE, EDU_TITLE) (Lingway) (mid December)
  - Report on NERC DTD for both domains for D2.3 (take into account HR-XML) (EDIN) (mid December)

Corpus annotation for the needs of NERC
  - Corpus annotation for the 2nd domain according to the annotation methodology agreed (Velti-EL, RTV-I, Lingway-F, EDIN-EN, NCSR checks the annotations) (starts end January, ends mid March)
  - Final NERC annotation guidelines for the 2nd domain (NCSR) (end January)
  - Report on corpus annotation task for 2nd domain (NCSR) (end March)

NERC v.2
  - FNERC v.2 (Lingway) (mid December)
  - Finalise NERC v.2 evaluation as agreed in Paris (per page type, without demarcator) (mid December)
  - Finalise report for NERC v.2 for D2.3 (no evaluation for name matching, normalisation; these will be presented in D3.2) (EDIN) (mid December)

5 WP2: Plan for the remainder (2)

NERC v.3 (incorporation of mechanisms for rapid adaptation to new domains, use of demarcator)
  - First evaluation results (end March)
  - Final evaluation results (mid April)
  - Final reports for D2.4 (mid April)

NERC-based demarcator (NCSR)
  - Finalise evaluation for the 1st domain (English, Italian) (mid December)
  - Final report for D2.3 (mid December)
  - New version for 1st and 2nd domain (exploit machine learning techniques) (early March)
  - Final report for D2.4 (end March)

Deliverables (EDIN)
  - D2.3 (mid December)
  - D2.4 (end April)

6 WP3: Plan for the remainder (1)

FE schema
  - Report on FE schema for both domains according to what was agreed in Paris (RTV) (mid December)

Corpus annotation for the needs of FE
  - EDIN sends to NCSR the final Fact annotated corpus for the 1st domain (mid December)
  - Final Fact annotation guidelines for the 1st domain (NCSR) (mid December)
  - Report on Fact annotation task for the 1st domain (NCSR) (mid December)
  - Final Fact annotation guidelines for the 2nd domain (NCSR) (end March)
  - Corpus annotation for the 2nd domain according to the annotation methodology agreed (Velti-EL, RTV-I, Lingway-F, EDIN-EN, NCSR checks the annotations) (starts early April, ends early May)
  - Report on Fact annotation task for the 2nd domain (NCSR) (mid May)

7 WP3: Plan for the remainder (2)

Wrapper Induction
  - STALKER-based wrapper induction (NCSR)
      - Evaluation results for English, Italian for the 1st domain (mid December)
      - Final report for D3.1 (mid December)
      - Provide the trained monolingual modules to the partners (early January)
      - New version exploiting other types of information (linguistic, images) (early February)
      - New version evaluation results for 4 languages for the 1st domain (mid February)
      - Final report for D3.2 (mid February)
  - Boosted (?) Wrapper Induction (EDIN)
      - Final report for D3.1 (mid December)
      - New version exploiting other types of information (linguistic, images) (early February)
      - New version evaluation results for 4 languages for the 1st domain (mid February)
      - Final report for D3.2 (mid February)
      - Provide the trained monolingual modules to the partners (end February)
  - WHISK (RTV)
      - Final report for D3.1 (mid December)
      - New version exploiting other types of information (linguistic, images) (early February)
      - New version evaluation results for 4 languages for the 1st domain (mid February)
      - Final report for D3.2 (mid February)
      - Provide the trained monolingual modules to the partners (end February)

8 WP3: Plan for the remainder (3)

Handling of images
  - Final report on the techniques that can be used for D3.1 (NCSR, Lingway) (mid December)
  - v.1 for 1st domain (NCSR, Lingway) (end January)
  - Evaluation of v.1 for the 1st domain (NCSR, Lingway) (mid February)
  - Report for D3.2 (NCSR, Lingway) (mid February)

Deliverables (NCSR)
  - D3.1 (mid December)
  - D3.2 (includes also name matching, normalisation) (end February)

9 WP4: Plan for the remainder (1)

System Architecture
  - Final Report on the refined architecture (Velti) (mid December)

End-User Interface
  - Help texts, localisation, End-user interface for the 1st prototype (Velti) (mid December)
  - Enrichment of the products database for the 1st domain (Velti) (mid January)
  - Final report for 1st prototype (Velti) (mid December)
  - UI for the 2nd prototype (exploit user modeling techniques, personalisation agent) (Velti) (end January)

10 WP4: Plan for the remainder (2)

System Integration
  - Report for the 1st integrated prototype (Velti) (mid December)
  - 2nd prototype (end January)
  - Report for the 2nd integrated prototype (Velti) (mid February)

Evaluation
  - Evaluation methodology for D4.2 (Velti, Lingway) (mid December)
  - Evaluation results for the integrated IE systems of the 1st prototype (based on the STALKER monolingual modules) (mid January)
  - 3 Evaluation Workshops for the 2nd prototype (students at NCSR, RTV, EDIN) (mid February) (Velti coordinates)
  - Final report for the evaluation of the 2nd prototype for D4.3 (Velti) (end February)

Deliverables (Velti)
  - D4.2 (mid December)
  - D4.3 (end February)

11 WP5: Plan for the remainder (1)

Signing of the consortium agreement (mid December)

Management reports
  - 7th Quarterly report (NCSR sends draft by mid December, final to EC early January)
  - Annual report (NCSR sends draft by mid December, final to EC early January)

Formation of a user group (????)

12 WP5: Plan for the remainder (2)

Publication in Conferences, Journals …
  - WWW, IJCAI, ACL, ???????

CROSSMARC presentation in a large event
  - IST-2003 ???

Updated Deliverable 5.2 “Exploitation and Use Plan” (mid December)

Date and place of next meeting (Edinburgh, 6-7 March 2003)

