Presentation on theme: "Prezentácia pre šéfa do Krakowa Ladislav Hluchý Institute of Informatics of the Slovak Academy of Sciences."— Presentation transcript:
Prezentácia pre šéfa do Krakowa Ladislav Hluchý Institute of Informatics of the Slovak Academy of Sciences
Institute of Informatics – Research Directions and Departments Research directions –Parallel and distributed information processing –Design and testing of digital systems and their technological realisations –Nano- and micro-structures and their realisation with the use of electron beam lithography –Discrete processes modelling and control –Robotics (kinematic structures, animation), mobile systems, sensors and effectors –Speech processing, Image processing, Numerical methods and algorithms –The Institute of Informatics also deals with development of experimental laboratory equipment and printed circuit boards. Departments –Parallel and distributed computing –Design and diagnostics of digital systems –Numerical methods and algorithms –Speech analysis and synthesis –Electron beam lithography –Discrete processes modelling and control –Sensor systems 2
3 Department of Numerical methods and algorithms Fundamental & applied research on: - Discrete sinusoidal transforms and cosine modulated filter banks: theory, fast algorithms, applications, implementation and integer realization, - Fire research, principles, models and methods: 1. forest fire computer simulation, fire behaviour modeling, reconstruction and prediction of forest fire under real conditions, 2. fire in buildings, departments, personal motor car fires and fires in road tunnels, - Other research activities: linear algebra, signal and image processing, parallel processing.
APVT project: Fire modelling and simulation, APVT-51-037902 (2004 - 2006) APVT project: Fire modelling and simulation, APVT-51-037902 (2004 - 2006) The aim of project: (Institute of Informatics SAS, Bratislava) -To simulate real forest fire growth in a given topographical area with different simulated weather conditions by numerical computation on PC, using FARSITE software and data gathered in a localities in Slovak Republic; further improvement of applied models, too, -To simulate past fires – reconstruction active fires - decision support potential fires - prevention, -Other practical goals creation of a decision support system, as means provided to a training centre, for educational purposes, ecosystem institutions, insurance companies, etc. 4
5 The aim of project: (Technical university at Zvolen, collaborative partner ) - Description of the space situation on digital elevation model with resolution 25 and 10 m, - Stands description (its tree species composition, age, crown closure, height, diameter, quality ), - Definition of “fuel” parameters for the afforested territory. To use and verify the processes suggested at “growth models”, - Verify all available forest data by the information from high resolution materials and pictures, - Accessibility of the territory, capacities and source location - key importance in forest fire defense and decision support system, - Education of students and PhD students for real using of the results of this project and application at praxis. APVT project: Fire modelling and simulation, APVT-51-037902 (2004-2006)
6 Huygens’ principle with application of the envelope theory Changed wind direction, constant fuel Constant wind direction, variable fuel
8 Differential form of a fire front b+c, b-c, a – forward, backward, and lateral rate of the fire spread - the angle of a wind direction
9 After 6 hoursAfter 2 dayscombustion 6 people’s lives lost Forest fire 23.10.2000 in the National park where people’ were entrapped by a fire (reconstruction)
10 APVV project: Personal motor car fires, computer simulation and its experimental verification, APVV-0452-06 (2008 - 2010) ??? Why motor car fires ??? “ The latest estimate made by the EC state that the number of motor cars fires in Europe increases about 6 times faster than current number of cars on roads “ [Verdunner, H. : Road traffic accidents - what can be done to save lives ?, Fire and Rescue, No 4, pp. 10-14, 2005] Statistics of fires in SR between 2001 - 2004 : 2529 Motor cars fires 25 people’s lives lost 53 people’s were injured during these fires 10.1 million euro is the total price of the damage [ Year-book’s, Fire safekeeping, 2001-2004] In 2005 : Int. Tunneling Association - Committee on Operation safety of Undeground Facilities was establish, ITA - COSUF
11 APVV project: Personal motor car fires, computer simulation and its experimental verification, APVV-0452-06 (2008 - 2010) The general aims of the project are : -Experimental examination of the course and expansion of individual phases of motor car fire and investigation of possibility of flashover to adjacent cars in open and non open space, too, - The obtain data necessary for simulation of selected motor car fires by computer program based on CFD and visualize the results of simulation, - Knowledge and facility obtained during these task to use to more general utilization to modeling potential fire risk, - To test the software on multi-processor computers
12 APVV project: Personal motor car fires, computer simulation and its experimental verification, APVV-0452-06 (2008 - 2010) The aims of the project (Institute of Informatics SAS, Bratislava) : -To understand the simulation methods and the software conditions correct using ( Fire Dynamics Simulator, NIST US) -To prepare the input data for the simulation, -To model the development of selected types of fire on computer, - To visualize the results of simulation and their interpretation, - To compare the results of experiments and simulations and make verification.
13 The aims of the project : (Faculty of Special Engineering, University of Žilina, collaborative partner) : - Initiation of fire in selected place of care, - Examination of the course and expansion of individual phases of burning, - Monitoring of spreading changes and development phases of the fire, and possibilities of flashover to adjacent cars, - Measurement of the released energy from a unit surface, the thermal radiation intensity, the rate of heat and smoke intensity, - Evaluation all experiments. APVV project: Personal motor car fires, computer simulation and its experimental verification, APVV-0452-06 (2008 - 2010)
Department of Design and Diagnostics of Digital Systems Research interests Design field : Architectures for testable & reliable design of digital systems (FPGA, CMOS ASIC) Hardware implementations of specific algorithms Results: IP digital cores: data encryption and decryption algorithm designs, modified discrete cosine transform (MDCT) algorithm low power design for MP3 (350, 90 nm) Test field Algorithms for test pattern generation for digital systems o at the behavioural level (using VHDL) o defect-oriented level Algorithms for system diagnosis in multiprocessor or distributed systems Test compression methods for delay faults in digital circuits Methods and techniques of memory testing and self-repairing Testability of asynchronous circuits Results: random and deterministic defect oriented ATPG tool, ATPG tool for delay faults, tools for automatic testability and BIST structures synthesis 14
LPP-0140-09 Virtual design laboratory for high schools VILAB (granted by Slovak Research and Development Agency) Project objective: Developing partnership of researchers, teachers and students: it will enhance the chances for innovation of expert subject curricula of teachers, it will give new knowledge and skills to students, thus enhancing their interest in the study and in practice in technological branches. 15
VILAB - goals of the Institute of Informatics Transfer of research knowledge in microelectronics and of currently available technologies to secondary school teaching process to support its innovation. Promoting modern processes and means in the field of application of digital integrated circuits and systems on chip. 16
VILAB - goals of the schools Creating a virtual laboratory for the design of digital circuits for students of high schools focused on electrical engineering and/or applied informatics. Providing professional design software and hardware environment for digital design for FPGA technologies with new teaching materials. Delivering special lessons for selected parts of the school subject electrical engineering, electrical technology and computer systems. Specifying tasks for student theses, competitions, as well as for courses of design software usage with methodical handbooks. 17
Applets for education Training tools for BIST and DFT applications applets for BIST and MemBIST construction applet for test wrapper construction applets BIST, Wrapper a MemBIST in education at SUT FIIT Bratislava, TU in Tallinn and TU in Jonkoping accessible at www.ui.savba.sk/diag/tools/testability.html Testing system DefSim for defect testability education in CMOS circuits (technology CMOS 0,8 μm) designed and produced with partners in the REASON project granted by IST-2000-30193 REASON: Research a training actions for system on chip design (2002-2005) Organisation of tutorials, courses, workshops, joint research, training tools development in the field of testing and testability of digital systems. 18
Laboratory of E-beam Lithography R&D Field: Micro- and nanoscience and engineering Research subject: Electron-Beam Lithography & Pattern Transfer Application: The fabrication of submicrometer and nanometer patterns The fabrication of binary photomasks for photolithography, deep-UV lithography Mix-and-Match Lithography Facilities: e-beam equipment ZBA21 (variable shaped spot size 100 nanometer - 3 micrometer, substrates size 4 – 6-inch) http://www.ui.sav.sk/ebl/ Submicrometer structures Photomask
Laboratory of E-beam Lithography Projects: Centre of Excellence for New technologies in Electrical Engineering The strategic objective: to increase research and development potential of the Bratislava region in micro- and nanotechnologies Facilities of the Centre: - Laboratory for metal coating (evaporation system) - Laboratory for the preparation and shaping of nanostructures (electron and ion dual beam FIB, Quanta 3D 200i from FEI) - Laboratory for the detection of nanoclusters (x-ray based LAB- GISAXS) Partners: Slovak Academy of Sciences (Institute of Electrical Engineering, Institute of Physics Institute of Measurement Science, Institute of Informatics), Slovak University of Technology in Bratislava
SEMCO-WS Semantic Composition of Web and Grid Services Goals –To create a distributed, easily-deployable semantic repository for all information pertinent to SOI resources –To extent workflow composition logic towards data –Compose workflow not only from existing application services, but also from existing (pre-computed) data; don’t repeat computations –To enable workflow description by WS-BPEL –A business semi-standard; the middleware will gain much larger user base –To extend/specialize other parts of K-Wf Grid middleware –Service quality prediction, user collaboration –We have dropped monitoring infrastructure of K-Wf Grid 21
22 K-Wf Grid: Objectives Objectives of K-Wf grid –To enable users to create complex workflows and use grid resources without detailed knowledge of grid –To construct workflows optimized for underlying infrastructure, using its advantages and avoiding its bottlenecks –To (semi-)automatically construct workflows based on user’s requirements, using semantic annotation of services, data, applications and resources –To constantly renew information about the grid by using complex monitoring network – to learn from experience –To provide simple, easy-to-use interface to K-Wf Grid services
23 K-WfGrid Consortium www.kwfgrid.net K-WfGrid Consortium www.kwfgrid.net Bratislava Athens Genova Kraków Innsbruck Berlin
SEMCO-WS: publikácie, Ph.D., diplomové práce 26
DMM Data Mining Meteo (DMM) – Forecast and Detection Methods of Significant Meteorological Events based on Data Mining Goals –Enhance and extend a forecating application developed by MICROSTEP-MIS –Deploy novel and experimental data integration and data mining techniques in a business environment –Experiment with data mining of meteorological/environmental data Partners –MICROSTEP-MIS –Insitute of Informatics of the Slovak Academy of Sciences –Faculty of Electrical Engineering and Informatics of the Technical University in Košice 27
ADMIRE – Concept & Goals Goals of ADMIRE: –Easy-to-use data mining technology –Abstract view of data integration and data mining For better coping with data and resource heterogeneity Concept of ADMIRE: –Low-level infrastructure for managing distributed resources –Middle-level data integration covered by OGSA-DAI –High-level model, language, architecture These are the main research objectives of ADMIRE
ADMIRE Consortium & Timeline Consortium –University of Edinburgh, UK - data integration, OGSA-DAI, architecture –Vienna University, Austria - high-level model and language –UPM, Spain - ontologies for data integration and data mining –Fujitsu Labs Europe, UK - low-level WS-RF-based infrastructure (USMT) –Institute of Informatics, SAS, Slovakia - pilot application, knowledge management, user interfaces –COMARCH SA, Poland - pilot application, use case definition Timeline –M1 to M12 – first prototype of pilot applications and architecture, some tools –M12 to M18 – second prototype of architecture, user interface, tools, ADMIRE white paper version 1 –M18 to M36 – iteration of the prototype-evaluation-prototype process; new version of ADMIRE white paper, architecture; stabilization of the DMI-L language for data mining process description
ADMIRE Architecture High-level architecture –A series of gateways, interconnecting distributed resources, concentrating, streaming and processing DMI requests
Knowledge management, knowledge technology Basis for the research in the area of knowledge technology comes from long term study performed within the following projects: Pelllucid – A New Platform for Organisationally Mobile Public Employees - (2002-2004) EU 5FP RTD project IST-2001-34519 NAZOU - Tools for Acquisition, Organization and Maintenance of Knowledge in an Environment of Heterogeneous Information Resources, (2004-2007) Slovak National Project SPVV 1025/04 RAPORT - Research and development of a knowledge based system to support workflow management in organizations with administrative processes, (2005-2007) Slovak National Project APVT-51-024604
PELLUCID A Platform for Organisationally Mobile Public Employees IST-2001-34519, 5FP (2002-2004) http://www.sadiel.es/Europa/pellucid/
Overview of Pellucid The overall objective was to develop an adaptable platform for assisting organizationally mobile employees. Pellucid : –improves organisation effectiveness and efficiency by formalisation, recording, storage and preservation of experience and knowledge; and –supports workers during integration in a new department or role by giving access to specific knowledge and experience accumulated in the past. Use of a new agent-based 3-layered architecture Focus on Administrational Processes
The Pellucid Consortium II SAS, Bratislava Slovak Republic MT Department of Genova, Italy Mancomunidad de Municipios del Bajo Guadalquivir CCLRC Rutherford Appleton Laboratory Oxfordshire, England Regional Ministry of the Presidency of the Junta de Andalucía
Objectives and Motivations Organisationally mobile employees move, as a normal part of their career progression, to different departments or units in a sector where they do not necessarily have direct experience. They need to develop an understanding of the new working environment and its processes in order to master the new situation rapidly. Question: How to assist such employees? The answer: PELLUCID – the experience of previous occupants of the role would be valuable for them, if only it could be made accesible and tailored to their individual needs.
PELLUCID: Basic System Architecture Pellucid Agents Pellucid System Organizational Memory Workflow Management System User Interface
PELLUCID: Possibilities for Synergies Possibilities for synergies in the following areas: –Workflow Based Systems –Management of Administrational Processes –Knowledge Management –Intelligent Software Agents –Coordination –Learning –Monitoring –etc.
NÁZOU Tools for acquisition, organisation, and maintenance of knowldge in an environment of heterogenous information resources http://nazou.fiit.stuba.sk/home/?page=about
Objectives models of heterogeneous environment (uncertainty, systems for modelling imperfect information, models of application domain, user models, context models, navigation models, metadata and ontologies, multilanguage sources, and multiagent systems), knowledge acquisition (information recommendation, acquisition of user model/environment model, special languages for flexible query, and ontology creation), knowledge organization (ontologies, various inductive methods, and small world networks), and knowledge presentation (adaptive navigation, adaptive content presentation, and virtual and enriched reality).
Partners Slovak technology university Faculty of informatics and information technology in Bratislava Institute of informatics, Slovak Academy of Sciences, Bratislava University of Pavol Jozef Šafárik, Košice Softec, Ltd., Bratislava
Research activities Research and development of methods for knowledge acquisition, organization, maintenance, and presentation Design of software architecture for intelligent knowledge management Design and development of the corporate memory Design and development of the components for semantic annotation Research and design of processes ensuring data security Integration and validation of the software architecture Extension of results into other domains
Methods for acquisition The group of methods for the acquisition of data and offers includes those methods that serve for: the acquisition of data from the Internet (downloading offers), the supportive functions important for mere acquisition of the offers from the Internet (offers' sources identification, basic evaluation of relevancy), direct offers identifications from acquired documents in required formats (individual offers' extraction, documents' conversion). The group consists of: Method for Relevant Internet Data Resource Identification (tool RIDAR) Method for Job Offer Fetching (tool WebCrawler) Method for Internet Documents Relevance Estimation (tool ERID) Method for Web Page Wrapping (tool Wrapper Suite) Job Offer Portal (tool JOP)
The chain is responsible for acquisition of job offers from the Internet and their processing from non-structured textual representations to (semi)structured representation in relational database and domain ontology. Integration itself is realized by a CRON job, which executes regularly in succession the methods RIDAR, WebCrawler, DocConverter, RFTS, OnTea and supporting tool RDB2Onto. Other methods have a supporting role in the process and are executed by the methods listed in the CRON job.
Methods for organisation and maintenance Group of methods for the organization and the maintenance of data and offers includes tools which analyze and process acquired data and offers in such a way that they would be ready for the presentation and search connected with it. The main concern is documents' annotation with the aim to find the parts of a document with certain meaning, clustering of documents (probabilistic, fuzzy), the analysis with the aim to search communities, extracting information by inductive logical programming, finding the best objects out of more well-ordered lists, indexing, and also the tools that enable search in text documents. The group consists of about 20 implemented software tools, e.g. OnTeA, OntoSim, OntoCase, IGAP, TopK, etc. that apply developed methods for annotation, clustering, and various search mechanisms.
Determining the best offer for a user A sequence diagram of the implemented chain
Methods for organisation and maintenance The chain deals with user dependent search, i.e. finding most relevant objects for many actual users in parallel. Every user can have unique, different notion of suitable objects. We represent this notion as user preferences based on fuzzy sets. User preferences can be obtained directly from user, learned from ranked objects or generated from user index. They are subsequently used for finding most relevant objects. This approach is combined with full text searching for keywords.
Methods for presentation and personalization The methods for the presentation of offers utilize the suggested frame for the presentation of offers represented by ontology (Prescott, Presentation-Cocoon-Ontology), which represents the method that in general provides the presentation of the information represented by ontology, (or also by the XML language with the following transformation via XSL patterns). This tool provides unified presentations of offers with the use of presentation ontology Fresnel. The group consists of 7 tools: Method for User Preferences Acquisition (tool UPreA) Method for Client-side User Action Logging (tool Click) Method for Semantic Logging (tool SemanticLog) Method for Analyzing User Action Logs (tool LogAnalyzer) Method for Adaptive Faceted Ontology Browsing (tool Factic) Method for Cluster-based Graph Browsing (tool ClusterNavigator) Method for Presentation of Ontology-based Job Offers (tool Prescott)
Adaptive presentation of offers Addressed problem Some properties of large open information spaces are dynamic – new content is added, existing content is removed or modified, and the diversity of their user base. This often results in many irrelevant search results as well as the infamous navigation problem, which occurs when users loose track of their position in hyperspace. Furthermore, a single common user interface is unable to accommodate the specific needs of individual users thus reducing overall user comfort and productivity.
The architecture of the personalized presentation layer.
The personalized presentation layer description The used personalized presentation layer consists of a set of interconnected tools that perform specific tasks: User modeling by means of client-and server-side automatic user action acquisition with semantics and the subsequent evaluation of acquired data into meaningful user characteristics Personalized search using the respective user characteristics as input for queries and the evaluation of search results relevance. Personalized navigation via an adaptive faceted semantic browser with support for integrated searching and browsing, facet and restriction adaptation and recommendation, and rating and presentation of domain ontology instances.
The main outcome of the project Design and implementation of tools for information and knowledge processing within the domain of job offers. The tools operate with data that can be seen in the sequence: from primary data on the Internet, or given by user, through acquired documents, the documents containing relevant data with respect for application domain (which are, in our case, job offers), in direction towards the offers chosen from documents with job opportunity offers, up to effective presentation of the offers to user.
The outputs into the educational process Defended PhD thesis: Ing. Marián Babík: Knowledge based systems for automated composition of web services, Ústav informatiky SAV, 2008, supervisor Ing. Ladislav Hluchý, CSc. Ing. Zoltán Balogh: Knowledge based performance prediction for Workflow Scheduling and Composition of Grid Services, Ústav informatiky SAV, 2006, školiteľ Ing. Ladislav Hluchý, CSc. Ing. Marek Ciglan: Data access optimization in grid systems, Ústav informatiky SAV, 2008, supervisor Ing. Ladislav Hluchý, CSc. Zoltán Fazekas, Improving Variability in Software Configuration by Separation of Concerns, Ústav informatiky a softvérového inžinierstva, FIIT STU, supervisor prof. P. Návrat RNDr. Tomáš Horváth: User Preference Learning by Induction of Fuzzy and Annotated Logic Programs, PF UPJŠ, 2008, školiteľ prof. P. Vojtáš RNDr. Michal Laclavík: Ontology and Agent based Approach for knowledge management, Ústav informatiky SAV, 2006, supervisor Ing. Ladislav Hluchý, CSc. RNDr. Rastislav Lencses: Efektívne a distribuované algoritmy v oblasti vyhľadávania informácií s podporou relačnej databázy, Ústav informatiky, PF UPJŠ, 2005, supervisor prof. P. Vojtáš RNDr. Peter Gurský: Searching Top-k objects for many users, PF UPJŠ, 2008, školiteľ prof. P. Vojtáš Viktor Oravec: Manažovanie koalície logických agentov, Ústav informatiky SAV, supervisor Prof. B. Frankovič, DrSc.
The outputs into the educational process 2 RNDr. (rerum naturalium doctor )thesis: Michal Mati: Spracovanie prirodzeného jazyka, Ústav Informatiky, Prírodovedecká fakulta, Univerzita Pavla Jozefa Šafárika v Košiciach, 2006 Jana Pribolová: Formálne metódy na popis ontológií. Ústav Informatiky, Prírodovedecká fakulta, UPJŠ, Košice, 2007 Veronika Vaneková: Preferenčné dopytovanie pre sémantický web, Prírodovedecká fakulta, UPJŠ, Košice, 2008 Michal Laclavík: Experience Management based on Ontology and Text Notes -The EMBET System, Ústav Informatiky, Prírodovedecká fakulta, Univerzita Pavla Jozefa Šafárika v Košiciach, September 2005; obhajoba 10. februára 2006 Marek Ciglan: Data management in grid environment, Ústav informatiky, Prírodovedecká fakulta, Univerzita Pavla Jozefa Šafárika v Košiciach, September 2005, obhajoba 10. február 2006 Róber Novotný: Stromová algebra vo formálnej logike, Ústav Informatiky, Prírodovedecká fakulta, Univerzita Pavla Jozefa Šafárika v Košiciach, 2006
Benefits of the research for educational processes Joint publications of students and experienced researchers Organization of workshops and meetings for students and young researchers New interesting topics for master and diploma thesis Cooperation of students, young and experienced researchers Creation of study materials that address high actual and up-to-date issues from informatics
Research and development of a knowledge based system to support workflow management in organizations with administrative processes RAPORT Institute of Informatics Slovak Academy of Sciences (UI SAV), Bratislava
RAPORT presentation, Krakow 57 Introduction Project consortium Institute of Informatics, Slovak Academy of Sciences (Bratislava) National Academy of Defense (Liptovský Mikuláš) Faculty of Informatics and Information Technologies, Slovak Technical University (Bratislava) Pilot application in CST NAD LM Project of APVV Agency Slovakia (2005 – 07) in cooperation of The II SAS and NAD in Liptovský Mikuláš, Goal – Support of Military Exercises Planning for CAX in The Centre of Simulation Technologies NAD in Liptovský Mikuláš.
RAPORT presentation, Krakow 58 Project Goal and Motivation Goal The RAPORT system is a supporting system for administrative workflow processes. It answers the purpose of acquisition, storing and presenting of knowledge, information and experiences by the administrative workflow processes execution. Motivation Employees of organizations with administrative workflow processes deal with many activities that repeatedly use the same information, like e.g. contact information, predefined forms and styles of written documents, information about approval regulations and directives, etc. Problems occur, when one or more employees are missing. To reduce problems with substitution of a missing employee (either long term - e.g. retirement, job changing, etc., or short term - vacation, illness, etc.), the proposed system enables acquiring of each employee's knowledge and experience in order to present them to the others in the same or similar context.
RAPORT presentation, Krakow 59 Realization of Military Exercise CAX in CST NAD LM – System Description Basic CST tasks in the 1st period: 1.CAX Definition - Determination of the content, aims and tasks of a military exercise. 2.CAX Planning - Determination of the scenario, preparation of simulation and technical plan, terrain digital maps, etc. 3.Preparation of the simulation software and exercise logistics. Basic CST tasks in the 2nd period: 1. Performance of a simulated computer fight. Basic CST tasks in the 3rd period: 1.Evaluation of the officers involved. 2.Evaluation of the quality of the military exercise preparation by CST NAD.
RAPORT presentation, Krakow 60 Requirements for Knowledge Oriented Management in CAX and Reasons for Project CST NAD organizes a higher number of time-overlapping exercises during the training year (approx. 12). There are several fundamental problems problems with an organization of time-overlapping military exercises, problems with a large number of documents for different time - overlapping military exercises, problems with the coordination and deadlines of individual processes and tasks during the exercise planning and organization, problems with underlimited personal staff (this planning is performed only by 6-7 persons) What is the solution for this situation? Development of a knowledge system for the support of military exercises preparation based on knowledge representation, electronic communication and web technologies (System RAPORT)
RAPORT presentation, Krakow 61 Secricom Conference, Warsaw, May 12, 200961 Requirements for Knowledge Oriented Management in the System RAPORT Work quality increasing – work organization demands worldwide knowledge of the rules related to the organization and realization of military exercises Decreasing the costs of the organization of the exercise processes – it must be processed by a large amount of information, documents, e-mails,..., which causes untransparency Time saving – documents must be processed and accessible in the required deadlines and users often have no overview of the accessible documents in the required time intervals Personal State Stabilization – employee fluctuation due to the absence, illness or change of position can cause untransparency in cooperation and coordination of work during the military exercise preparation
RAPORT presentation, Krakow 62 Objective: To Propose a System Supports automating of activities during the military exercise preparation, which enables to: Remove the duplicity of activities and saves work, Inform the user about the state of the exercise preparation and enables an effective approach to documents, Provide hints and suggestions how to continue the work and in this way it warns about some difficulties and lists some possible solutions how to successfully solve problems of military exercise preparation.
RAPORT presentation, Krakow 63 Structure of the Military Exercise Preparation - Workflow of Administrative Processes CST NAO LM
RAPORT presentation, Krakow 64 Model of System – inter-connection of all models Personal model – description of the personal structure of CST NAD and training army units, Data model – description of the documents, e-mails, faxes, etc., which means the structure of the information, Process model – description of the activities and events in the CAX preparation (planning) phase, Communication model – description of the user, process and document communication, Knowledge model – modeling integration of the military exercises preparation on the basis of ontologies. Process model Data model Communication model Personal model Ontology
RAPORT presentation, Krakow 65 RAPORT System Architecture Presentation layer provides transparent and user- friendly middleware for knowledge and information presentation to users Process layer ensures event monitoring and notification analyses the incoming information from presentation the layer creates active notes based on the current working context and previous experiments prepares the ontological data and required data (e-mails, documents, …) Data layer ensures functionality of the database and the file system repository ontology model is stored in OWL format
RAPORT presentation, Krakow 66 Evaluation of system RAPORT for CAX Planning Knowledge management collects experience from users by using Active Notes presents experience to other users working in similar context by Active Notes Schedule management checks current training plans for important deadlines Templates distribution prepares necessary information (predefined e-mails, documents, formulas, …) before deadlines with user’s awareness Teamwork support supports user’s experience exchange and collaboration
RAPORT presentation, Krakow 67 Outputs to the educational process Diploma thesis: –Bc. Rudolf Dačo, Mobile agents protection, FIIT STU 2007, supervisor Ing. I. Kapustík –Bc. Michal Bebjak, Aspect-oriented implementation of web aplication changes, FIIT STU 2007, supervisor Dr. V. Vranič –Bc. Peter Šimo, Semanticaly supported web browsing, FIIT STU 2007, supervisor Prof. P. Návrat –Bc. Jakub Vaňo, Colaborative editing of source files in team work, FIIT STU 2007, supervisor RNDr. V. Šimáková PhD thesis - defended: –M. Laclavík, Ontology and Agent based Approach for knowledge management, Ústav informatiky SAV, 2006, supervisor Ing. Ladislav Hluchý, CSc. –Viktor Oravec: Manažovanie koalície logických agentov, Ústav informatiky SAV, supervisor Prof. B. Frankovič, DrSc. –Zoltán Balogh: Knowledge based performance prediction for Workflow Scheduling and Composition of Grid Services, Ústav informatiky SAV, 2006, supervisor doc. Ing. Ladislav Hluchý, CSc. –Radoslav Forgáč, Zniťovanie dimenzie klasifikačného priestoru pomocou ITNN, II SAS 2007, Supervisor Prof. I. Mokriš PhD students still working on thesis - 3 Presentations and workshops for students and researchers - 7
Thank You for Your Attention! Ladislav Hluchý firstname.lastname@example.org Institute of Informatics Slovak Academy of Sciences, Bratislava (UI SAV)
Information Retrieval and Information Extraction lectures
IR Lectures Introduction to Information Retrieval Text Operations, Text Analysis, stemming Crawling, link processing IR Models, Indexing techniques IR Software libraries and systems Ranking by Graph Algorithms (PageRank, HITS, …) and Searching Information Extraction Regular Expressions Large Scale Data Processing on MapReduce Architecture Multimedia Information Retrieval Evaluation Techniques, Precision, Recall Google Semantics and IR, Semantic Web Standards 70
Lectures conditions Every students gets project focused on –Crawling –Indexing –Ranking –Information Extraction –Large Scale information Processing They have to consult project 3 times during semester Availability of data from day one Lectures are available at: –http://vi.ikt.ui.sav.sk/Témy_prednášokhttp://vi.ikt.ui.sav.sk/Témy_prednášok 71
73 Commius Vison Commius vision of interoperability –A software system adaptable customisable –For the SMEs With “zero-cost entry” into interoperability Delivered as an easy-to-use always available service (ISU) –Supporting collaboration and bottom-up, flexible, approach to interoperability
74 Commius Objectives Use e-mail protocols (SMTP, POP3, IMAP) to support System Interoperability Design and develop software components to support Semantic and Process Interoperability Delivered as an easy-to-use always available service (ISU) Develop an Open Source framework –With seamless integration within a network enterprise environment –Modular and adaptable to different ICT infrastructure standards Understan d Communicat e Work together Easy to use
75 Commius Objectives Demonstrate a pilot application, within three industrial use cases –Business Case 1 – Interoperability within the Spanish Technology Centers, Fedit (S) –Business Case 2 – SMEs Technological District, Aitek and Techfin (I) –Business Case 3 – Enterprise Resource Planning integration (SL) Direct involveme nt of the end-users Demonstrat e the results
76 Commius Partners 1.Softeco Sismat - Italy (coordinator) 2.University of Manchester - United Kingdom 3.German Research Center for Artificial Intelligence - Germany 4.Institute of Informatics, Slovak Academy of Sciences - Slovakia 5.Vienna University of Technology - Austria 6.Atos Origin - Spain 7.SingularLogic - Greece 8.Fedit, Federación Española de Entidades de Innovación y Tecnología - Spain 9.Aitek - Italy 10. Techfin - Italy
77 Commius business scenario Altamoda manager We need ICT to improve Altamoda business Altamoda is an SMEs clothing wholesale seller Altamoda uses email and internet but the business is carried on manually Altamoda manager is thinking he needs some simple ICT tools to support its business Altamoda is an SMEs clothing wholesale seller Altamoda uses email and internet but the business is carried on manually Altamoda manager is thinking he needs some simple ICT tools to support its business
78 Commius business scenario Altamoda manager Commius Dissemination is effective Commius community Zero cost easy-to-install and use always available Step by step
79 Commius business scenario Altamoda manager Commius community Altamoda manager decides to try Commius He downloads Commius and install it
80 Commius business scenario Altamoda manager decides to try Commius He downloads Commius and install it Altamoda manager Commius community
81 Commius business scenario Commius monitors emails workflow Commius starts as simple workflow assistant Commius monitors emails workflow Commius starts as simple workflow assistant Altamoda manager Altamoda customer Purchase order
82 Commius business scenario Altamoda manager You received a Purchase order Click to send the order to the Purchasing manager You received a Purchase order Click to send the order to the Purchasing manager
83 Commius business scenario Altamoda manager You received a Purchase order Click to send the order to the Purchasing manager You received a Purchase order Click to send the order to the Purchasing manager Altamoda Purchasing manager Process purchase order Vendors
84 Commius business scenario Altamoda manager likes Commius He decides to configure Commius He asks for services to the Commius community Altamoda manager likes Commius He decides to configure Commius He asks for services to the Commius community Altamoda manager Commius community
85 Commius business scenario A sw expert configures Commius for the company specific business Altamoda manager Commius community Commius sw expert
86 Commius business scenario Commius support the full internal process order workflow Delivery Altamoda manager Purchase order Altamoda customer Vendors Purchasing manager
87 Commius business scenario Customers and Vendors have installed Commius Commius support inter-organisational workflow Customers and Vendors have installed Commius Commius support inter-organisational workflow Altamoda manager Delivery Altamoda customer Vendors Purchasing manager
88 Commius business scenario Other services are available, such as business information, security provision… Altamoda manager Service provider Purchasing manager Vendors Altamoda customer
89 IPR and licensing The following roles are envisaged Maintain and upgrade the Commius OS platform > no profit Configure and Customise Commius Business modules according to the SMEs specific needs > make profit from services Develop specific system connectors to support integration with ERP and legacy systems > sells proprietary software Develop Business modules addressing specific needs > sells proprietary software Provide a set of on-line services accessible through the web > make profit from services, or free services (f.i download) OS community Local service provider SW developer proprietary SW developer proprietary On line Services provider
90 WP4 System Interoperability Objective To deliver technical interoperability by integrating Commius with email and SMEs infrastructure as well as serving Semantic and Process interoperability layers Approach Integrating with email and SME environment the way that users can use their favourite email client and web browser and thus do not change working practices WP4 is delivering methods and open source components for Integration with email infrastructure Module Execution, Management and Discovery Information Extraction System Connectors
91 Task 4.1 covers: –E-mail Gateway Plugin (EGP), –Module Management, –Connection to E-mail infrastructure Achievements –Client and server version deployed and tested –Code and software release as SourceForge project http://acoma.sf.net http://acoma.sf.net –infrastructure and components developed in Task T4.1 User guides Technical guides Software Release including JavaDocs Task 4.1 – Interoperability over SMTP
92 Task 4.1 – E-mail Gateway Plugin and Module Management E-Mail Gateway Plug-in Achievements SMTP and POP3 implementation Email Gateway plugin/framework Message Decomposition Attachments Processing A converter/processor is executed on every type of attachment New type of email message added – MPP component html output, 4 different type of emails Module Management Achievements Modules management based on OSGi container embedded inside framework Updating existing modules Executing modules Modules Interfaces Developed boolean check(String rawEmailText) Set execute(String path, String acomaid, String args) Module management (installing, uninstalling, executing) Web based management interface Directory based
93 Task 4.1 – Connection with Email Servers “Connection with E-Mail Infrastructure” Achievements E-mail Infrastructure Connection Concepts Research Server-Side Connection –MX Dedicated Commius Box –non-MX Dedicated Commius Box –Encapsulated Commius “Commius Server” deployed –Installed as a OVZ virtual server, Postfix MTA + Maildrop MDA Easy to clone, migrate or reconfigure –Commius intermediates SMTP communication –RMEC - Rule-based Module Execution Component A Control Panel Component Filter = Rule + Module Execution + Config
94 T4.2 Information Extraction (IE) Motivation To analyze and preprocess text To create meta data from text Text Preprocessing Analysis, text operations, Segmentation, indexing Pattern based IE Patterns can be used to extract various objects from text => valid for content in enterprise, interoperability and email context Results are: key - value pairs and its transformation Social Network and other Graph Extraction of communicating parties and relation to other objects and graph analysis Achievements Design and prototype implementation available at Ontea.sourceforge.net CVS pattern based IE based on key - value pairs and its transformation Simple segmentation techniques social network extraction from mailboxes Interconnection with Task 5.2 Semantic Annotation and whole WP5 Publications
95 T4.2 Information Extraction Model Extraction based on JAVA Regular Expressions Model supports: named backreferences macros Result of extraction is set of Key=>Value pairs Key=>Value pairs (Results) are further processed Extraction patterns are defined in XML ( we have XSD Schema ) Macros can be used unlimited times in any pattern Macros in macros ( any level ) Results can be enhanced by GATE annotations ( e.g. gazetteer lookups ) Macros could be used to create new patterns only by clicking
T4.2 Information Extraction Model 96 Address and product patterns Extraction Processing 3 words macro ZIP macro Street number macro Street name macro City name macro Country macro Address patterns
T4.2 Text Segmentation Simple segmenters (regex based): Words Sentences Paragraphs Email citations Segmentation based on result position Statistical segmenter Finds the most similar repeating tokens, which could be considered as segment dividers Uses Levenshtein distance, to compute similarity of dividing tokens Considers line frequency of dividers, number of dividers Gate segmentation Tokens SpaceTokens Splits ( between paragraphs ) Sentences Segments are represented as results ( position, length, key, value ) among other results and can be further processed 97 Bank statement
98 T4.3 Message Post Processing- Progress Task 4.3 covers Email builder (enhanced email message as static HTML) EGP produce enhanced email from module results Web GUI (AJAX web application) Employed GWT and GWT-Ext libraries Running on Jetty server integrated in OSGi container Standardized module output (XML Schema) Email builder and Web GUI use the same XML Schema Achievements Integration efforts with WP7, EGP and email annotation Improved look & feel of MPP visual components Email building integration schema Email attachments processing
99 T4.3 Hints/Actions and Email GUI XML-based email integration Email builder process module outputs according XML references (note concept) Action description URL link specifies actions for custom module invocation date, module type, position in text, etc. XML example
100 T4.3 MPP User Interface Prototype HTML part of e-mail MPP GUI component
T4.4 What are System Connectors? SC are an abstraction layer of following interactions: –Information from the exchanged emails lead to ‘supportive actions’ available through COMMIUS UI –Actions are tightly connected to the legacy systems. To achieve these actions, specific interactions with the legacy systems have to be accomplished Connectors Chaining –the output and the input of each SC is a sequence of tuples accompanied by metadata Specific types of SC identified up to now –Web Connector –Database & Hibernate Connector –Spreadsheet Connector –Web Service System SC –Geocoding/Reverse GeoCoding SC SCs must be configured before usage
102 T4.4 System connectors Achievements: –Prototype of SC for Hybernate DB Connector RDBs, WebApplications, Spreadsheets –A chained-usability schema has been proposed SC1 SC2 SC3… SCn –Meta-connector for integration with Information Extraction key-value pair approach Meta-Connector Web Connector SpreadSheet Connector Database Connector Key-value Transformed Key-value
103 WP4 Results Prototype Being developed and available as SourceForge.net project Ontea: CVS working code – release will be in November Acoma: release available Publications Journal and International pub: CAI, IEEE CEC 2009 - E3C Workshop, FIS 2008, ICCS 2008, CGW 2008 Michal Laclavik, Martin Seleng, Marek Ciglan, Ladislav Hluchy: Ontea: Platform for Pattern based Automated Semantic Annotation In Computing and Informatics, Vol. 28, 2009, 555–579 Michal Laclavik, Diana Maynard: Motivating intelligent email in business: an investigation into current trends for email processing and communication research; In Workshop on Emails in e-Commerce and Enterprise Context (E3C); IEEE Conference on Commerce and Enterprise Computing; DOI 10.1109/CEC.2009.47; pp. 476-482, 2009 Michal Laclavik, Martin Seleng, Emil Gatial, Ladislav Hluchy: Future Email Services and Applications; Proceedings of the Poster and Demonstration Paper Track of the 1st Future Internet Symposium (FIS'08), CEUR-WS, ISSN 1613-0073, Vol-399, pages 33-35, 2008. Michal Laclavik, Martin Seleng, Ladislav Hluchy: Towards Large Scale Semantic Annotation Built on MapReduce Architecture; In Proceedings of ICCS 2008; M. Bubak et al. (Eds.): ICCS 2008, Part III, LNCS 5103, pp. 331-338, 2008. Michal Laclavik, Martin Seleng, Marek Ciglan, Ladislav Hluchy: Supporting Collaboration by Large Scale Email Analysis; To appear in CGW 2008 proceedings (2009) Local: ITAT 2009 (1), WIKT 2009 (3), Znalosti 2009 (1), WIKT 2008 (2)
104 Commius Conclusion Support and infrastructure for: –Email based interoperability –Intelligent Email Processing Research Challenges addressed –Email Content Processing –Pattern based Information Extraction –support for Semantic Annotation –External and Legacy System Integration –Email GUI – Human Computer Interaction Issues –Architectures –Large Scale Information Processing Achievements –Deliverable D4.1.2 –prototypes including all tasks ready for good quality M22 Deliverables –Several scientific publications –prototype as release on Acoma.sf.net
AIIA - Adaptive Interoperability Framework for Private and Public Sector Slovak National Project
Project Partners Institute of Informatics, Slovak Academy of Sciences Anasoft SANET 106
Work Packages WP1 Architecture WP2 Email Enabled Interoperability Infrastructure WP3 Information Proc and Ext Components WP4 Semantic Processing Components WP5 System Conectors Components WP6 Itneroperability Modules WP7 Pilot Applictions 107
Secure Agent Infrastructure Institute of Informatics Slovak Academy of Sciences, Bratislava (UI SAV)
109 Introduction Objective new smart functionscommunication more effective and helpful allow creating a pervasive and trusted communication infrastructurecrisis management “Add new smart functions to existing services which will make the communication more effective and helpful for users. Smart functions will be provided by distributed IT systems based on an agents’ infrastructure.” Thus we should allow creating a pervasive and trusted communication infrastructure fulfilling requirements of crisis management users and ready for immediate application. Requirements distributed agent paradigmconfidentiality and access to resourcesProvision of a distributed agent paradigm to achieve confidentiality and access to resources. smart negotiating systemparameterisation and independent handling of access requestsIt should also provide a smart negotiating system for parameterisation and independent handling of access requests to achieve rapid reaction. collection of software services execute in a secure mannerShould be designed as a collection of software services with agent-like features (mobility, pro-activity) which would execute in a secure manner. access legacy IT systemsEnable access legacy IT systems whilst keeping information confidential. security should be providedSecure docking moduleThe security should be provided by a specialized component called “Secure docking module” (SDM).
110 Role of II SAS in the Project WP2 System analysis and design –T 2.1.2. Secure agent infrastructure analysis, detailed needed infrastructure definition (UI SAV, HIT) –Our effort 11 PM WP4 Secure agent infrastructure –WP Leader –Our effort 38 PM WP7 Integration of research results –Our effort 4 PM WP10 Demonstrator creation and presentation –Our effort 8 PM WP11 Dissemination of research results –Our effort 5 PM
112 Hardware Security Core element of the SAI is the Secure Docking Station (SDS) Realized by two complementary devices called the Secure Docking Module (SDM) and the Trusted Docking Station (TDS) SDS = TDS + SDM Secure Docking Station Technical University Graz Design and development of SDM and TDS is being done by Technical University Graz
115 SAI Subsystems SubsystemBasic description and functionality Distributed Secure Agent Platform (DSAP) The core agent platform. Will provide means for agent deployment, execution, migration and communication. Process Management Subsystem (PMS) Based on the plan collected from users will generate a plan of activities. Executes the plan. User Communication Agent (UCA) Will communicate with users in a form of guided dialog through electronic device. Will include authentication and interface to authorization of the user. Agent Repository (AR) Database of system users, agents and their certificates. Process of accreditation of agents. Public Key Infrastructure (PKI) Certification and verification of agents, users and resources. Resource Inquire System (RIS) Will provide information which system to query for specific information about resources.
116 Common Requirements Fail-over Ensure fail-over system components especially in case of TS network. All subsystems will need to address this issue individually. Interfaces to external systems Security and Certificates with the Docking Station. Interfaces to legacy systems for crises management and related systems.
EGEE NA3: User training and Induction Overal goal: to stimulate and support the development of training mechanisms associated with the emerging national and region Grid organizations to support local training provision Sub-tasks: –Course content creation, scheduling, organization, and delivery - providing training material and courses for grid users and application developers –Coordinated trainer support mechanisms - online training support framework by provision of digital library with a SOA containing course material and documentations (e-Learning provision) –Training-Infrastructure provision - the operation of GILDA testbed, updates of the gLite middleware and user level services and tools, monitoring t-Infrastructure and the first level support –Activity management and partner coordination
Goals of NA3 training courses Induction course –to give an understanding of grid computing –to give an understanding of the EGEE project and its activities –to give an understanding of grid security –to provide an understanding of grid middleware gLite –to enable basic practical use of gLite services Application developer course –to make familiar with fundamental methods of creating new grid applications and porting legacy code applications to the grid data management in grid –to present some high-level grid tools and frameworks designed for developing more complex grid applications
II SAS – NA3 Activity GCCP – International Workshop on Grid Computing for Complex problems Main sessions: –Grid Computing national and international grid projects, grid research infrastructures, grid security, grid access –Distributed Computing and Large Scale Applications grid, service-oriented, knowledge-based computing, data movement mechanisms, workflows bio, life, earth, chemistry, environmental, astronomy & astrophysics, high energy, business financial challenges –High Performance Computing and Large Scale Simulations –Grid Course & Tutorial
II SAS – NA3 Activity Induction and Application Developer Courses –total number of courses: 17 –participants: potential grid users, researchers, students –number of participants: 6-20 –Training material in Slovak language –Main subjects of courses: Introduction to grids and EGEE project Principles of parallel and distributed computing Grid security mechanisms Access to grids & Virtual Organizations Development of grid applications (gridification process) Grid middleware gLite High-level grid tools (Genius portal, DIANE) On-line demonstration of a grid application Hands-on tutorial (gLite on Gilda testbed: JDL, job submission, job monitoring, retrieving job output, data management)
II SAS – NA3 Activity Talks, Discussions, and Consultations –Main themes: grid computing, how to involved to EGEE, demonstrating the capabilities of EGEE Grid technology, grid applications E-Learning – SlovakGrid Web portal –Relevant documents about Grids, clusters and distributed computing –References to EGEE documentations –Quick-start documents about: Certificates, Virtual Organizations, Gilda, Genius, gLite, etc. –Descriptions of several gridified scientific applications Maintenance and updating the II SAS GILDA training site
EGEE NA4: User community support and expansion Overal goal: to ensure that Grid technology and the EGEE infrastructure are used successfully by a broad spectrum of scientific collaborations Sub-taks: –Support Virtual Organizations suport Application porting support Direct user support (Help desk) –Strategic Discipline Clusters (High energy physics, Life science, Earth science, Grid observatory, Computational chemistry, Astronomy and Astrophysics, Fusion) Specific support concerning domain-specific software Development and evaluation of higher-level services to address advanced or specific needs Evolution of gLite and testing –Activity Coordination
123 Collection of study materials on grid computing for beginners: Manuals translated to Slovak language (gLite, data management, GENIUS, P-GRADE, …) Examples of grid programs Workload management animation, and many other animations Course presentations – certificates, VO, middleware Tutorials Applications How to start: step by step directions for beginners with links to screenshots More than 180 screenshots in Windows, Linux, and gEclipse environment Videos and other colorful materials on HPC EGEE infosheets translated to Slovak language Grid Computing (CD) - eLearning material in Slovak