2005-06-20/21LIRICS IAG Meeting Barcelona LIRICS IAG Meeting 2005-06-20/21 Universitat Pompeu Fabra Barcelona Introduction Gerhard Budin.

Slides:



Advertisements
Similar presentations
ISO TC 37 Terminology and Other Language and Content Resources OntoIOp Telecon Sue Ellen Wright October 20,
Advertisements

Towards a National Quality Assurance Framework: the UN Statistical Commission Initiative CCSA Conference on Data Quality for International Organizations.
9 th Open Forum on Metadata Registries Harmonization of Terminology, Ontology and Metadata 20th – 22nd March, 2006, Kobe Japan. ISO TC 37 Tutorial Gerhard.
Computational Paradigms in the Humanities – eHumanities and their role and impact in transdisciplinary research Gerhard Budin University of Vienna.
International Standards for Software & Systems Documentation Ralph E. Robinson R 2 Innovations.
Technical update on ISO 9001:2015 Colin MacNee Duncan MacNee Limited
Final Study Report on Semantic Metadata Mapping Procedure June 22, 2009 Tae-Sul Seo and Tae-Hoon Lim ISO/IEC JTC1/SC32.
ISOcat introduction 19 June 20121CLARIN-NL ISOcat workshop.
ISO/TC 37 Terminology and other language and content resources ISO WD Terminology Policies – Development and Implementation.
Terminology standards – enhancing language ISO/TC 37  Semantic Interoperability ISO TC 37 Secretariat c/o Infoterm Christian Galinski Bamako (Mali) /07.
ISO 9001: Countdown to 2015 Preparing for the next revision of the ISO 9001 standard on quality management systems.
(TERMINOLOGY) STANDARDIZATION at the service of diversity reasons, institutions, results, implementation Anja Drame.
ANSI TAG 37 Committee F43 Language Services and Products Interagency Language Roundtable September 30, 2011 Sue Ellen Wright ISO TC 37, Terminology and.
MLIF: A Metamodel to Represent and Exchange Multilingual Textual Information ISO TC37 SC4 WG Samuel Cruz-Lara, Gil Francopoulo, Laurent Romary,
Galia Angelova Institute for Parallel Processing, Bulgarian Academy of Sciences Visualisation and Semantic Structuring of Content (some.
LIRICS International Standards in Lexicography Gerhard Budin University of Vienna August 2005.
SC38 Liaison Report to SC32 at SC32 meeting, Oct 24-28, 2011 Crete Baba Piprani/Canada SC38  SC32 Liaison 1 ISO/IEC JTC1/SC32/WG2 N1599.
GSC16-OBS-03 ITU-T GSC – 16 Observer Presentation Karen Higginbottom, JTC 1 Chair.
Knowledge Science & Engineering Institute, Beijing Normal University, Analyzing Transcripts of Online Asynchronous.
ISO 9001:2015 Revision overview - General users
Tutorial for SC 32/WG 1 e-Business Standards Prepared for: SC Kunming Plenary Meeting Wenfeng Sun, Convenor ISO/IEC JTC1 SC32 WG1 (eBusiness)
(Terminology Summer School) TSS. Term Net Inforterm TSS Terminology Management Terminology.
9 th Open Forum on Metadata Registries Harmonization of Terminology, Ontology and Metadata 20th – 22nd March, 2006, Kobe Japan. Commonalities and Differences.
Provo, 16 Aug 2007 LMF meeting 1 Lexical Markup Framework: ISO Provo meeting Gil Francopoulo.
_________________________________________________________________________ CEOS/ISPRS Workshop on Rad/Geo Cal December 3, 2003 in Gulfport, MS, U.S.A. Wolfgang.
IEEE S2ESC Report1 Software And Systems Engineering Standards Committee (S2ESC) Paul R. Croll S2ESC Sponsor Chair June 2004 Report.
CLARIN web services and workflow Marc Kemps-Snijders.
Sharing linguistic multi-media resources Jacquelijn Ringersma Paul Trilsbeek Max Planck Institute for Psycholinguistics Nijmegen, The Netherlands.
LEFIS W2 Posgraduate Workshop 1 LEFIS, WG 2 Postgraduate studies Meeting, Rotterdam.
ISO TC 232: Language Training Interagency Language Roundtable September 30, 2011 William P. Rivers, Ph.D. Chair, US Technical Advisory Group to ISO TC.
LIRICS mid-term review 1 LIRICS WP3: Morpho-syntactic and syntactic annotations Thierry Declerck DFKI-LT - Saarbrücken 23rd May 2006.
Experiments with ODD outside the TEI framework Laurent Romary & Piotr Banski The ISO-TEI connection.
Marina Signore Head of Service “Audit for Quality Istat Assessing Quality through Auditing and Self-Assessment Signore M., Carbini R., D’Orazio M., Brancato.
LIRICS Mid-term Review 1 LIRICS WP2 – NLP Lexica Monica Monachini CNR-ILC - Pisa 23rd May 2006.
Quality assurance / Quality control system for the Greek GHG emissions inventory Yannis Sarafidis, Elena Georgopoulou UNFCCC Workshop on National Systems.
InWEnt | Qualified to shape the future1 Internet based Human Resource Development Management Platform Human Resource Development Programme in Natural Disaster.
24 Jan 2005 Kick off meeting (Luxembourg) 1 LIRICS Linguistic Infrastructure for Interoperable Resources and Systems ►Kick off meeting presentation ►Proposal.
TOSCA Technical Committee Kick-off December 12, 2011.
Conformity assessment – Standards and CEOC’s involvement Annual Conference 31 of May 2010 in Vienna Dipl.-Ing. Gerd-Hinrich Schaub CEOC International.
24 Jan 2005 Kick off meeting (Luxembourg) 1 LIRICS Linguistic Infrastructure for Interoperable Resources and Systems ►Kick off meeting presentation ►Proposal.
Towards multimodal meaning representation Harry Bunt & Laurent Romary LREC Workshop on standards for language resources Las Palmas, May 2002.
Overview of ISO NP Stella G Dextre Clarke Convenor, IDT/2/2 Working Group of BSI and Project Leader for ISO NP
ISOcat introduction 20 March 20121CLARIN-NL ISOcat workshop.
The Codex Alimentarius Commission
9 th Open Forum on Metadata Registries Harmonization of Terminology, Ontology and Metadata 20th – 22nd March, 2006, Kobe Japan. Presentation Title: Day:
Introduction 1. Purpose of the Chapter 2. Institutional arrangements Country Practices 3. Legal framework Country Practices 4. Preliminary conclusions.
1The ISO Concept database Reinhard Weissinger 16 August International Organization for Standardization.
WG9 Report ISO/TC211 Plenary Meeting Montreal, Canada
Towards a Glossary of Activities in the Ontology Engineering Field Mari Carmen Suárez-Figueroa and Asunción Gómez-Pérez {mcsuarez, Ontology.
Overview of SC 32/WG 2 Standards Projects Supporting Semantics Management Open Forum 2005 on Metadata Registries 14:45 to 15:30 13 April 2005 Larry Fitzwater.
Catia Cucchiarini, Walter Daelemans and Helmer Strik Strengthening the Dutch Language and Speech Technology Infrastructure Catia Cucchiarini, Walter Daelemans.
ISO/TC37/SC4/N377 secretary report
Towards a roadmap for standardization in language technology Laurent Romary & Nancy Ide Loria-INRIA — Vassar College.
Provenance and Context Content Standard (Emerging) – Status of Activities H. K. Ramapriyan Science Systems and Applications, Inc. & ESDIS Project, Code.
1 Serbian Association of Accountants and Auditors (SAAA) IFRS and ISA TRANSLATION.
ISO/TC37/SC4 Draft Resolution
Introduction A field survey of Dutch language resources has been carried out within the framework of a project launched by the Dutch Language Union (Nederlandse.
Formats, interoperability and standards Marc Kemps-Snijders.
1 February 2012 ILCAA, TUFS, Tokyo program David Nathan and Peter Austin Hans Rausing Endangered Languages Project SOAS, University of London Language.
SNOMED CT Vendor Introduction 27 th October :30 (CET) Implementation Special Interest Group Tom Seabury IHTSDO.
Dr. Trevor Smith Chair ISO/TC 176 Quality Management & Quality Assurance Worldwide Quality Director Global Manufacturing & Logistics Eastman Kodak Company.
ISOcat introduction 10 May /20111CLARIN-NL ISOcat workshop.
James W. Moore Liaison Representative IEEE Computer Society June 2004
ISO/IEC JTC 1/SC 7 Working Group 42 - Architecture Johan Bendz
IEEE CS SAB, Mar 2009 IEEE Computer Society Category A Liaison to ISO/IEC JTC 1/SC 40: Status Report Annette Reilly IEEE Computer Society
Lirics mid-term review
Introduction to ISO/IEC JTC 1 SC7
A Global Consensus Process
Item 7.11 SDMX Progress report
Presentation transcript:

/21LIRICS IAG Meeting Barcelona LIRICS IAG Meeting /21 Universitat Pompeu Fabra Barcelona Introduction Gerhard Budin

/21LIRICS IAG Meeting Barcelona Agenda Monday 14:00 14:45 Presentation of all attendees 14:45 15:00 Presentation of the ISO process to publish a standard (Gerhard) 15:00 15:15 State-of-the-art for ISO-TC37/SC2+3+4 standards (Gerhard) 15:15 16:15 Presentation of MAF+SynAF (Thierry) 16:15 16:30 Coffee break 16:30 18:15 Terminology Markup Framework (TMF), Data categories and Lexical Markup Framework (LMF) (Gil) Tuesday 09:00 11:00 Expression of needs by IAG members and discussion Presentations by IAG members 11:00 12:00 Conclusions and objectives for the rest of the project (Gerhard)

/21LIRICS IAG Meeting Barcelona Linguistic Infrastructure for Interoperable Resources and Systems GOALS: LIRICS provides a common standards framework for language engineering by translating requirements from European language industry into ISO standards on the basis of ongoing R&D work LIRICS provides input, on the basis of the cooperation and interaction between research consortia and industry groups, to ongoing standards work in ISO/TC 37, mainly focusing on lexicons, morpho-syntax, syntax and, to a certain extent, semantic content. These standards will be accompanied by a set of test suites in nine European languages to facilitate their implementation and an open source implementation platform allowing common-format, multi- lingual language processing compatible with legacy systems and tools

/21LIRICS IAG Meeting Barcelona

/21LIRICS IAG Meeting Barcelona Self-presentation by workshop participants Gerhard Budin PhD in linguistics, research, publications and teaching in terminology management, translation technologies, language & knowledge engineering Professor for Terminology Studies and Translation Technologies Deputy Director of the Center for Translation Studies, University of Vienna Chair of ISO/TC 37/SC 2 Terminography and Lexicography Chair of CEN/ISSS/WS ADNOM Administrative Nomenclature Chair of Austrian Standards Committee on Terminology (ON FNA 033) Since 1991 many EU-funded, international and national projects on terminology, language technologies, ontologies, E-Learning, knowledge engineering, and related topics

/21LIRICS IAG Meeting Barcelona Creating ISO standards 1.Viable idea, existing documents (including de-facto industry standards) representing real needs and requirements from society (industry/trade, consumers, research, social and cultural institutions, etc.) 2.National standards committees (AFNOR, BSI, DIN, AENOR, ON, etc.) or international committees (ISO) present a New Work Item Proposal (NWIP) for vote (certain requirements to be fulfilled) 3.Assignment of the NWI to a working group of a (sub-)committee, NWI to be edited by project editor in cooperation with a project team within a working group (experts to be nominated by national member committees, plus liaison representatives) 4.Presentation of WD (Working Draft) for vote to become a CD (Committee Draft), receiving comments to be resolved for presenting the CD for vote to become a DIS (Draft International Standard), receiving comments to be resolved for presenting the FDIS (Final Draft International Standard) to become an IS (International Standard) 5.Standards to be reviewed and updated at regular intervals 6.Fast-track procedure, Vienna Agreement (CEN-ISO)

/21LIRICS IAG Meeting Barcelona ISO/TC 37 Terminology and other language and content resources Founded in 1936/re-established in 1951 Scope: Standardization of principles, methods and applications relating to terminology and other language and content resources in the contexts of multilingual communication and cultural diversity –SC 1 Principles and Methods (chair: L.-J. Rousseau, Secr. Sweden) –SC 2 Terminography and Lexicography (chair: G. Budin, Secr. Canada) –SC 3 Computer applications (chair: B. Nistrup Madsen, Secr. Germany) –SC 4 Language Resource Management (chair: L. Romary, Secr. Korea) Each SC has several working groups which run at least one project Based on practical needs horizontal cooperation and coordination is to be guaranteed by SC chairs

/21LIRICS IAG Meeting Barcelona Language Resource Management Standardization Standardization is needed for language resources (mono- and multilingual), e.g. speech data, written (full) text corpora, lexical (general language) corpora and their processing methods Relevant research areas are computational linguistics and computational lexicography, language engineering, etc., which have provided industrial best practices to be turned into official standards This process will contribute to the further development of the language industries at large As is the case with terminologies, language resources in general are often multilingual, multimedia and multimodal

/21LIRICS IAG Meeting Barcelona ISO/TC 37/SC 1 The following standards are under the direct responsibility of ISO/TC 37/SC 1: ISO 704:2000Terminology work – Principles and methods ISO 860:1996Terminology work – Harmonization of concepts and terms ISO :2000Terminology work – Vocabulary – Part 1: Theory and application The following standards are under preparation: ISO/CD 704Terminology work – Principles and methods ISO/CD 860Terminology work – Harmonization of concepts and terms ISO/PWI Terminology work – Vocabulary – Part 1: Theory and application ISO/WD 22134Practical guide for socioterminology

/21LIRICS IAG Meeting Barcelona ISO/TC 37/SC 2 Title: Terminography and lexicography Scope: Standardization of terminological and lexicographical working methods, procedures, coding systems, workflows, and cultural diversity management, as well as related certification schemes

/21LIRICS IAG Meeting Barcelona ISO/TC 37/SC 2 (2) The following standards are under the direct responsibility of ISO/TC 37/SC 2: ISO 639-1:2002Codes for the representation of names of languages – Part 1: Alpha-2 code ISO 639-2:1998Codes for the representation of names of languages – Part 2: Alpha-3 code ISO 1951:1997Lexicographical symbols and typographical conventions for use in terminography ISO 10241:1992International terminology standards -- Preparation and layout ISO 12199:2000Alphabetical ordering of multilingual terminological and lexicographical data represented in the Latin alphabet ISO 12616:2002Translation-oriented terminography ISO 15188:2001Project management guidelines for terminology standardization

/21LIRICS IAG Meeting Barcelona ISO/TC 37/SC 2 (3) The following standards are under preparation: ISO/DIS Codes for the representation of names of languages Part 3: Alpha-3 code for comprehensive coverage of languages ISO/CD Codes for the representation of names of languages Part 4: Implementation guidelines and general principles for language coding ISO/CD Codes for the representation of names of languages Part 5: Alpha-3 code for language families and groups ISO/WD Codes for the representation of names of languages Part 6: Extension coding for language variation ISO/FDIS 1951 Presentation/representation of entries in dictionaries ISO/CD Terminological entries in standards – Part 1: General requirements ISO/CD Terminological entries in standards ISO 12615Bibliographic references and source identifiers for terminology ISO/NWI TR Quality assurance guidelines for terminology products ISO/NP Assessment and benchmarking of terminological holdings

/21LIRICS IAG Meeting Barcelona ISO/TC 37/SC 3 (1) title: Terminology management systems and content interoperability scope: Standardization of principles and requirements for semantic interoperability, terminology and content management systems, and knowledge ordering tools

/21LIRICS IAG Meeting Barcelona ISO/TC 37/SC 3 (2) The following standards are under the direct responsibility of ISO/TC 37/SC 3: ISO :2000Terminology work – Vocabulary – Part 2: Computer applications ISO 6156:1987Magnetic tape exchange format for (withdrawn)terminological/ lexicographical recordsISO 6156:1987 ISO 12200:1999Computer applications in terminology – Machine-readable terminology interchange format (MARTIF) – Negotiated interchange ISO 12620:1999Computer applications in terminology – Data categories ISO 16642:2003Computer applications in terminology – Terminological markup framework

/21LIRICS IAG Meeting Barcelona ISO/TC 37/SC 3 (3) The following standards are under preparation: ISO/NWI TR 12618Computational aids in terminology – Design, implementation and use of terminology management systems ISO/CD Computer applications in terminology – Data categories – Part 1: Model for description and procedures for maintenance of data category registries for language resources ISO/CD Computer applications in terminology – Data categories – Part 2: Terminological data categories

/21LIRICS IAG Meeting Barcelona ISO/TC 37/SC 4 (1) Title: Language resource management Scope: Standardization of specifications for computer-assisted language resource management linguistic infrastructures are being established or re-enforced as part of the rapidly evolving information and communication society; professional activities involving language resource sharing and standardization are increasing in diverse areas: –governmental or non-governmental organizations, public or private institutions, educational institutions, commercial enterprises, etc., –both, globalization and localization necessitate multilingual communication; there is an increasing need for new standardization as well as urgent recognition of existing de facto standards and their transformation into International Standards

/21LIRICS IAG Meeting Barcelona ISO/TC 37/SC 4 (2) The following standards are under preparation: ISO/NWI Terminology for language resources ISO/NP Word segmentation of written texts for mono-lingual and multi-lingual information processing – Part 1: General principles and methods ISO/NP Word segmentation of written texts for mono-lingual and multi-lingual information processing – Part 2: Word segmentation for Chinese, Japanese and Korean ISO/CD Language resource management – Feature structures – Part 3: Word segmentation for other languages ISO/WD 24611Language resource management – Morpho-syntactic annotation framework ISO/WD 24612Language Resource Management – Linguistic Annotation Framework ISO/WD 24613Language resource management – Lexical markup framework

/21LIRICS IAG Meeting Barcelona Conclusions Industry requirements Feedback loops/co-operation schemes Time table