13. - 15. 2. 2008 FIIT STU Bratislava Classification and automatic concept map creation in eLearning environment Karol Furdík 1, Ján Paralič 1, Pavel Smrž.

Slides:



Advertisements
Similar presentations
Search in Source Code Based on Identifying Popular Fragments Eduard Kuric and Mária Bieliková Faculty of Informatics and Information.
Advertisements

SEVENPRO – STREP KEG seminar, Prague, 8/November/2007 © SEVENPRO Consortium SEVENPRO – Semantic Virtual Engineering Environment for Product.
Text mining Extract from various presentations: Temis, URI-INIST-CNRS, Aster Data …
DEVELOPMENT OF CASCOT 5.0 (a multi-language text coding tool) Presentation to the DASISH project meeting, Gothenburg, November 2014 Peter Elias Margaret.
Dr Gordon Russell, Napier University Unit Data Dictionary 1 Data Dictionary Unit 5.3.
GENERATING AUTOMATIC SEMANTIC ANNOTATIONS FOR RESEARCH DATASETS AYUSH SINGHAL AND JAIDEEP SRIVASTAVA CS DEPT., UNIVERSITY OF MINNESOTA, MN, USA.
Using language services to enrich the LOs' descriptions Dr. Vassilis Protonotarios University of Alcala, Spain 10 th Strategic Seminar / Conference 6-7.
Template and information based on data provided by DERI 1 Mondeca ITM presentation & possible (?) collaboration Semantic Web Portal Project Michael Stollberg.
Search Engines and Information Retrieval
Xyleme A Dynamic Warehouse for XML Data of the Web.
Mobile Web Search Personalization Kapil Goenka. Outline Introduction & Background Methodology Evaluation Future Work Conclusion.
TAXONOMY-BASED ANNOTATION OF XML DOCUMENTS Application to e-Learning Resources Nicolas Spyratos University of Paris-South France Joint work with B. Gueye.
AceMedia Personal content management in a mobile environment Jonathan Teh Motorola Labs.
Information retrieval Finding relevant data using irrelevant keys Example: database of photographic images sorted by number, date. DBMS: Well structured.
Xiaomeng Su & Jon Atle Gulla Dept. of Computer and Information Science Norwegian University of Science and Technology Trondheim Norway June 2004 Semantic.
Intelligent Content Management. Concept Glossary Manager - Topic Maps Engine and Navigator Jakub Strychowski Rodan Systems SA
©Ian Sommerville 2004Software Engineering, 7th edition. Chapter 8 Slide 1 Tools of Software Development l 2 types of tools used by software engineers:
Chapter 5: Information Retrieval and Web Search
Overview of Search Engines
Knowledge Science & Engineering Institute, Beijing Normal University, Analyzing Transcripts of Online Asynchronous.
GL12 Conf. Dec. 6-7, 2010NTL, Prague, Czech Republic Extending the “Facets” concept by applying NLP tools to catalog records of scientific literature *E.
Architecture Design of a System for Collaborative Policy Modelling Karol Furdik, Peter Butka, Marian Mach Karol Furdik 1,2, Peter Butka 1, Marian Mach.
Špindlerův Mlýn, Czech Republic, SOFSEM Semantically-aided Data-aware Service Workflow Composition Ondrej Habala, Marek Paralič,
CONTI’2008, 5-6 June 2008, TIMISOARA 1 Towards a digital content management system Gheorghe Sebestyen-Pal, Tünde Bálint, Bogdan Moscaliuc, Agnes Sebestyen-Pal.
Overview of the Database Development Process
Challenges in Information Retrieval and Language Modeling Michael Shepherd Dalhousie University Halifax, NS Canada.
Ihr Logo Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization Turban, Aronson, and Liang.
Search Engines and Information Retrieval Chapter 1.
An Integrated Approach to Extracting Ontological Structures from Folksonomies Huairen Lin, Joseph Davis, Ying Zhou ESWC 2009 Hyewon Lim October 9 th, 2009.
1 The BT Digital Library A case study in intelligent content management Paul Warren
VŠB - Technická univerzita Ostrava Text Mining Services for Trialogical Learning Pavel Smrž 1, Ján Paralič 2, Peter Smatana 2, Karol.
Learning Object Metadata Mining Masoud Makrehchi Supervisor: Prof. Mohamed Kamel.
Funded by: European Commission – 6th Framework Project Reference: IST WP 2: Learning Web-service Domain Ontologies Miha Grčar Jožef Stefan.
Mobile Topic Maps for e-Learning John McDonald & Darina Dicheva Intelligent Information Systems Group Computer Science Department Winston-Salem State University,
1st Workshop on Intelligent and Knowledge oriented Technologies Universal Semantic Knowledge Middleware Marek Paralič,
A Simple Unsupervised Query Categorizer for Web Search Engines Prashant Ullegaddi and Vasudeva Varma Search and Information Extraction Lab Language Technologies.
University of Economics Prague Information Extraction (WP6) Martin Labský MedIEQ meeting Helsinki, 24th October 2006.
Jennie Ning Zheng Linda Melchor Ferhat Omur. Contents Introduction WordNet Application – WordNet Data Structure - WordNet FrameNet Application – FrameNet.
1 Technologies for (semi-) automatic metadata creation Diana Maynard.
CROSSMARC Web Pages Collection: Crawling and Spidering Components Vangelis Karkaletsis Institute of Informatics & Telecommunications NCSR “Demokritos”
Košice, 10 February Experience Management based on Text Notes The EMBET System Michal Laclavik.
Chapter 6: Information Retrieval and Web Search
WIKT 2006, , Bratislava Service-based architecture of Access-eGov system {Martin.Tomasek, InterSoft, a.s.,
BAA - Big Mechanism using SIRA Technology Chuck Rehberg CTO at Trigent Software and Chief Scientist at Semantic Insights™
Task-oriented approach to information handling support within web-based education Lora M. Aroyo 15 November 2001.
Facilitating Document Annotation using Content and Querying Value.
A Scalable Machine Learning Approach for Semi-Structured Named Entity Recognition Utku Irmak(Yahoo! Labs) Reiner Kraft(Yahoo! Inc.) WWW 2010(Information.
Personalized Interaction With Semantic Information Portals Eric Schwarzkopf DFKI
Semantic Annotation of Services and Processes in Business Alliances Karol Furdík 1,2 1 InterSoft, a.s., Floriánska 19, Košice, Slovakia
META-LEARNING FOR AUTOMATIC SELECTION OF ALGORITHMS FOR TEXT CLASSIFICATION Karol Furdík, Ján Paralič, Gabriel Tutoky {Jan.Paralic,
User Profiling using Semantic Web Group members: Ashwin Somaiah Asha Stephen Charlie Sudharshan Reddy.
CASE (Computer-Aided Software Engineering) Tools Software that is used to support software process activities. Provides software process support by:- –
Semantic web Bootstrapping & Annotation Hassan Sayyadi Semantic web research laboratory Computer department Sharif university of.
Achieving Semantic Interoperability at the World Bank Designing the Information Architecture and Programmatically Processing Information Denise Bedford.
Combining Text and Image Queries at ImageCLEF2005: A Corpus-Based Relevance-Feedback Approach Yih-Cheng Chang Department of Computer Science and Information.
2 Software CASE tools state-of-the-art UML modeling Partially automatic code generation Refactoring browsers (occasionally) Context-sensitive search and.
Technology-enhanced Learning: EU research and its role in current and future ICT based learning environments Pat Manson Head of Unit Technology Enhanced.
Realtime Financial Monitoring and Analysis System May 2010 Lietu Search Engine.
1 CS 8803 AIAD (Spring 2008) Project Group#22 Ajay Choudhari, Avik Sinharoy, Min Zhang, Mohit Jain Smart Seek.
Facilitating Document Annotation Using Content and Querying Value.
September 2003, 7 th EDG Conference, Heidelberg – Roberta Faggian, CERN/IT CERN – European Organization for Nuclear Research The GRACE Project GRid enabled.
Towards a framework for architectural design decision support
Designing Cross-Language Information Retrieval System using various Techniques of Query Expansion and Indexing for Improved Performance  Hello everyone,
Presented by: Hassan Sayyadi
SMART GROUND platform overview
Tools of Software Development
Presented by: Prof. Ali Jaoua
ece 627 intelligent web: ontology and beyond
BUILDING A DIGITAL REPOSITORY FOR LEARNING RESOURCES
SDMX IT Tools SDMX Registry
Presentation transcript:

FIIT STU Bratislava Classification and automatic concept map creation in eLearning environment Karol Furdík 1, Ján Paralič 1, Pavel Smrž 2 1: Technical University of Košice, Centre for Information Technologies, Letná 9, Košice, Slovakia {Jan.Paralic, 2: Brno University of Technology, FIT, Božetěchova 2, Brno, University of Economics, Prague, W.Churchill Sq.4, Praha, Czech Republic,

, FIIT STU Bratislava Classification and automatic concept map creation in eLearning environment Karol Furdík, Ján Paralič, Pavel Smrž # 2 Contents Context KP-Lab project: Collaborative eLearning PoZnaŤ project: Adaptation to the Slovak education Semantic Annotation of Knowledge Artefacts Text Mining Services Creation of Concept maps Classification Functional specification, Architecture Implementation, API - signatures of methods Mining Engine Console Testing, Evaluation Future work

, FIIT STU Bratislava Classification and automatic concept map creation in eLearning environment Karol Furdík, Ján Paralič, Pavel Smrž # 3 Attributes: Collaborative work Sharing of learning materials (knowledge artefacts) Trialogical learning, Activity theory: knowledge creation activities rely on usage, manipulation, and evolution of shared knowledge artefacts KP-Lab project, : integrated EU funded FP6 IST project ( ) objective: development of eLearning system aimed at facilitating innovative practices of sharing, creating, and working with knowledge in education and workplaces Collaborative eLearning

, FIIT STU Bratislava Classification and automatic concept map creation in eLearning environment Karol Furdík, Ján Paralič, Pavel Smrž # 4 Full title: Support of the processes of innovative knowledge creation Duration: 30 months (February July 2009) Web: Project is supported by the Slovak Research and Development Agency under the contract No. RPEU Builds on the integrated EU project KP-Lab Focused on: eLearning, Knowledge management. Main goal: adaptation of the KP-Lab tools for trialogical learning to the conditions of Slovak higher education. Project PoZnaŤ

, FIIT STU Bratislava Classification and automatic concept map creation in eLearning environment Karol Furdík, Ján Paralič, Pavel Smrž # 5 Semantic Annotation, Text Mining Knowledge artefacts are semantically annotated (by means of ontologies/taxonomies/concept maps) and are collaboratively investigated in the shared learning space. Text mining services - intelligent access and manipulation with the knowledge artefacts; to assist users in creating or updating the semantic descriptions of KP-Lab knowledge artefacts. TMS fundamental tasks: Ontology learning - extraction of conceptual maps (clustering), i.e. an automatic extraction of significant terms from KA's textual descriptions and converting them to a structure of concepts and their relationships. Classification of knowledge artefacts - grouping a given set of artefacts into predefined or ad hoc categories.

, FIIT STU Bratislava Classification and automatic concept map creation in eLearning environment Karol Furdík, Ján Paralič, Pavel Smrž # 6 Semantic annotation of a knowledge artefact Knowledge artefacts in the Shared Space KP-Lab Shared Space Suggested terms Other terms from dictionary

, FIIT STU Bratislava Classification and automatic concept map creation in eLearning environment Karol Furdík, Ján Paralič, Pavel Smrž # 7 Concept map creation Concept maps can be used in learning for: open-ended research questions about a certain topic, questions coming out of practical experiences that are to be explored. Procedure: Related materials are uploaded to the Shared Space. System analyses the data (using Text Mining services) and provides a hierarchy of concepts extracted from documents.

, FIIT STU Bratislava Classification and automatic concept map creation in eLearning environment Karol Furdík, Ján Paralič, Pavel Smrž # 8 General Functionality (1) Concept map services: Pre-process documents, produce internal representation, store it into the Mining Object Repository, Manage the Mining Object Repository (insert / update / delete), Find clusters in a set of documents, Identify concept candidates and rank them according to the estimated relevance, Given a set of concepts, find related concepts from the documents provided by the user. Return a ranked list of candidate relations together with their types, Build the concept map and generate the RDF graph.

, FIIT STU Bratislava Classification and automatic concept map creation in eLearning environment Karol Furdík, Ján Paralič, Pavel Smrž # 9 General Functionality (2) Classification services: Creation of a training data set from already annotated knowledge artefacts to a pre-defined set of categories, Creation of a classification model, based on the selected algorithm and on a given training data set, Modification (tuning) of the classification model, by changing the texts and/or categories in the training data set, as well as by editing the settings of the algorithm or switching to another algorithm, Provision of basic measures for existing classification model, e.g. by means of precision and recall, Verification and validation of the classification model. Classification of unknown (not annotated) artefacts to the categories used for training. The output of this function is a set of weighted categories (concepts, terms) for each of the classified artefacts.

, FIIT STU Bratislava Classification and automatic concept map creation in eLearning environment Karol Furdík, Ján Paralič, Pavel Smrž # 10 Java 1.5, Service-oriented architecture - Web Services JBowl Library: platform for pre-processing (incl. NLP methods) and indexing of large textual collections; functions for creation and evaluation of text mining models (for both supervised or unsupervised algorithms). GATE Framework: an architecture, or organisational structure, for NLP software; a framework, or class library, which implements the architecture; a development environment built on top of the framework Implementation

, FIIT STU Bratislava Classification and automatic concept map creation in eLearning environment Karol Furdík, Ján Paralič, Pavel Smrž # 11 Pre-process service - implemented as a pipeline of processing resources on top of the GATE engine. Additional NLP resources integrate language-dependent tasks such as parsing, keyword extraction, co-occurrence statistics and semantic-distance computation: –URI preprocess(String[] artefacts, String[] seed) –void delete(URI preprocessedData) Clustering and concept map service - interfaces to unsupervised text mining methods: –String[] findClusters(String[] artefacts) –String[] findConceptCandidates(URI preprocessedData) –String[] findRelatedConcepts(URI preprocessedData, String[] concepts) –String[] buildConceptMap(URI preprocessedData) API reference: API - signatures of methods (1)

, FIIT STU Bratislava Classification and automatic concept map creation in eLearning environment Karol Furdík, Ján Paralič, Pavel Smrž # 12 Pre-process Service

, FIIT STU Bratislava Classification and automatic concept map creation in eLearning environment Karol Furdík, Ján Paralič, Pavel Smrž # 13 ConceptMapCreation Service

, FIIT STU Bratislava Classification and automatic concept map creation in eLearning environment Karol Furdík, Ján Paralič, Pavel Smrž # 14 LearningClassification service - methods for creation, verification, modification, and removal of a classification model: –String createModel(String settings, String[] artefactURI) –String verifyModel(String modelURI, String[] artefactURI) –String modifyModel(String[] settings, String[] artefactURI) –String deleteModel(String modelURI) Classify service - method for classification of artefacts: –String[] classify(String modelURI, String[] artefactURI) API reference, WSDL, demo web interface: API - signatures of methods (2)

, FIIT STU Bratislava Classification and automatic concept map creation in eLearning environment Karol Furdík, Ján Paralič, Pavel Smrž # 15 LearningClassification Service

, FIIT STU Bratislava Classification and automatic concept map creation in eLearning environment Karol Furdík, Ján Paralič, Pavel Smrž # 16 Classify Service

, FIIT STU Bratislava Classification and automatic concept map creation in eLearning environment Karol Furdík, Ján Paralič, Pavel Smrž # 17 Mining Engine Console Web interface (JSP), Access to the TMS for users, Maintenance of the Mining Object Repository and classification models, Perform classification tasks, View statistics, generate reports.

, FIIT STU Bratislava Classification and automatic concept map creation in eLearning environment Karol Furdík, Ján Paralič, Pavel Smrž # 18 Testing, Evaluation Three categorized sets of documents: EID UU: University of Utrecht, The Netherlands. 138 PDF and MS Word documents classified into 5 categories. Language: Dutch. UHE: University of Helsinki, Finland. 56 MS Word and plain text documents classified into 5 categories. Language: English. VSE: University of Economics in Prague, Czech Republic. 97 PDF and MS Word documents classified into 7 categories. Language: English.

, FIIT STU Bratislava Classification and automatic concept map creation in eLearning environment Karol Furdík, Ján Paralič, Pavel Smrž # 19 Full integration of the Text Mining Services into the whole KP-Lab system is planned for spring Enlarge data collection for testing: build a corpora of learning materials, create the suite of tools for processing of Slovak language. Implement more text mining algorithms, compare their efficiency: a mechanism of automatic selection of learning algorithms and their settings is currently investigated. Enhance the Mining Engine Console for usage within the real learning process: the console will be tested in summer semester 2008 on Technical University of Košice, within the lessons of Knowledge Management. Future Work

, FIIT STU Bratislava Classification and automatic concept map creation in eLearning environment Karol Furdík, Ján Paralič, Pavel Smrž # 20 Thank you ! Questions? Further information: