LINGUISTICS RESEARCH AND ANALYSIS OF THE BULGARIAN FOLKLORE. EXPERIMENTAL IMPLEMENTATION OF LINGUISTIC COMPONENTS IN BULGARIAN FOLKLORE DIGITAL LIBRARY.


Konstantin Rangochev 1, Maxim Goynov 1, Desislava Paneva-Marinova 1, Detelin Luchev 2
1 Institute of Mathematics and Informatics, BAS
2 Ethnographic Institute with Museum, BAS
International Conference on Information Research and Applications, June 2010, Varna, Bulgaria

Presentation overview
- Linguistics research and analysis of the Bulgarian folklore
- National research project: “Knowledge Technologies for Creation of Digital Presentation and Significant Repositories of Folklore Heritage” (FolkKnow)
- Functionality of the “Bulgarian folklore digital library” (BFDL) multimedia digital library
- Experimental implementation of a linguistic component in BFDL

Linguistics research and analysis of the Bulgarian folklore (1)
The main component of linguistic research on Bulgarian folklore is the analysis of its lexical structure:
- How many tokens does it contain, and which ones?
- Do some groups of tokens dominate, and are others missing?
- Paradigmatic relationships among the folklore lexemes
- Lexemes in context / folklore language formulas
- Frequency of the lexemes; the verses or sentences in which they occur; the number of those verses or sentences and their position in the song, etc.
- Word forms
- Regional characteristics of the folklore lexical structure, etc.

Linguistics research and analysis of the Bulgarian folklore (2)
Tools that formalize the folklore analysis: frequency dictionaries
- A general frequency dictionary contains all the lexical units in a folklore object repository.
- A regional frequency dictionary contains all the text units that come from a particular folklore region or a specific settlement.
- A functional frequency dictionary contains all the text units that share the same function: descriptions of rites, various types of songs, narratives, etc.
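The three dictionary types can be illustrated with a minimal Python sketch. It is not the BFDL implementation: the record fields (`region`, `function`, `text`), the sample data and the simple regex tokenizer are all assumptions made for illustration, and the counts are over word forms rather than lemmatized lexemes.

```python
import re
from collections import Counter, defaultdict

# Hypothetical sample records; field names and values are illustrative
# assumptions, not the BFDL schema.
OBJECTS = [
    {"region": "Rhodopes", "function": "song",
     "text": "Fifty heroes are drinking wine"},
    {"region": "Rhodopes", "function": "narrative",
     "text": "The heroes came to the village"},
    {"region": "Thrace", "function": "song",
     "text": "Marko is drinking wine"},
]


def tokenize(text):
    """Lowercase the text and split it into word tokens (Cyrillic works too)."""
    return re.findall(r"\w+", text.lower())


def general_dictionary(objects):
    """Count every word form across the whole repository."""
    counts = Counter()
    for obj in objects:
        counts.update(tokenize(obj["text"]))
    return counts


def grouped_dictionary(objects, key):
    """Build one frequency dictionary per value of `key` (region or function)."""
    groups = defaultdict(Counter)
    for obj in objects:
        groups[obj[key]].update(tokenize(obj["text"]))
    return dict(groups)


general = general_dictionary(OBJECTS)
regional = grouped_dictionary(OBJECTS, "region")
functional = grouped_dictionary(OBJECTS, "function")
```

The regional and functional dictionaries share one grouping function here because, in this sketch, they differ only in the field used to partition the repository.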

Linguistics research and analysis of the Bulgarian folklore (3) Table: Comparison of the Bulgarian folklore and spoken languages.

Linguistics research and analysis of the Bulgarian folklore (4)
Concordance dictionaries show a lexeme in its context.
- Example for songs: “Fifty heroes are drinking wine”. The underlined lexeme is the one examined, and the lexemes in italics are its context.
- Example for narrative text: in the description of the rituals, one complete sentence (from full stop to full stop) is the context of the observed lexeme.
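A concordance lookup along these lines could be sketched as follows. The sketch assumes that the context unit for a song is the verse (one line of text) and that narrative sentences end with `.`, `!` or `?`; the function names are hypothetical and matching is on exact word forms only.

```python
import re


def words(text):
    """Lowercased word tokens of a text."""
    return re.findall(r"\w+", text.lower())


def song_concordance(text, lexeme):
    """Return (verse number, verse) for every verse containing the lexeme."""
    target = lexeme.lower()
    hits = []
    for number, verse in enumerate(text.splitlines(), start=1):
        if target in words(verse):
            hits.append((number, verse.strip()))
    return hits


def narrative_concordance(text, lexeme):
    """Return every complete sentence that contains the lexeme."""
    target = lexeme.lower()
    # Split after sentence-final punctuation followed by whitespace.
    sentences = re.split(r"(?<=[.!?])\s+", text.strip())
    return [s for s in sentences if target in words(s)]


song = "Fifty heroes are drinking wine\nMarko is sitting at the table"
narrative = ("The bread is baked at dawn. The guests bring wine. "
             "Then the dance begins.")

song_hits = song_concordance(song, "wine")
narrative_hits = narrative_concordance(narrative, "wine")
```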

FolkKnow project
- FolkKnow project: “Knowledge Technologies for Creation of Digital Presentation and Significant Repositories of Folklore Heritage” (contract number: IO-03-03/2006)
- Supported by the National Science Fund of the Bulgarian Ministry of Education and Science
- Partners: Institute of Mathematics and Informatics - BAS, Institute for Folklore - BAS, Veliko Tarnovo University
- Module 3: “Development of Digital Libraries and Information Portal with Virtual Exposition - Bulgarian Folklore Heritage”

FolkKnow project

Bulgarian folklore digital library Web address:

Main services (1)
Functional modules for:
- adding a folklore object
- editing a folklore object
- deleting a folklore object
- previewing a folklore object
- creating a collection
- previewing a collection

Description of folklore object Folklore object preview

Main services (2)
Simple search:
- by signature or archival number
- by title
- by language
- by annotation
- by type of folklore object
- by file type
- by record information
Search in the record information:
- by situation
- by interviewer's name
- by recorder's name
- by record date
- by place where the record was made

Main services (3) Extended search through all the object’s characteristics

Main services (4)
Modules for:
- Managing and monitoring users' data and activities: registration, logs, data changes, level setting; actions related to object manipulation (search, preview, delete, add, edit, select, etc.); administrative actions
- File format conversion
- XML export of the BFDL objects
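As a rough illustration of what an XML export of one object might look like, the sketch below serializes a record with Python's standard `xml.etree.ElementTree`. The element names, the `signature` attribute and the sample record are assumptions for illustration only; the actual BFDL export schema is not given in the slides.

```python
import xml.etree.ElementTree as ET

# Hypothetical record; the keys mirror the simple-search fields but are
# not the real BFDL data model.
SAMPLE = {
    "signature": "AIF-001",
    "title": "Крали Марко",
    "language": "bg",
    "type": "song",
    "annotation": "Heroic song about Marko",
}


def export_object(obj):
    """Serialize one folklore object to an XML string."""
    root = ET.Element("folkloreObject", attrib={"signature": obj["signature"]})
    for field in ("title", "language", "type", "annotation"):
        child = ET.SubElement(root, field)
        child.text = obj.get(field, "")
    # encoding="unicode" returns a str and keeps Cyrillic characters readable.
    return ET.tostring(root, encoding="unicode")


xml_text = export_object(SAMPLE)
```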

Linguistic search in text folklore objects
- Search for a word in the different types of dictionaries
- Search for two or more words, to find verbal formulas in the folklore lexis: “Drinking wine”, “Marko seated”
- Search for a group of words, to investigate the paradigmatic relations in the folklore lexis (river - stream - brook - rill…)
- Search for the root of a word, to study folklore word formation: “drink” (I am drinking, I have drunk, they have drunk…)
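The four search modes listed above can be sketched in Python over a small in-memory list of texts. All function names and sample texts are hypothetical. Root search is approximated by prefix matching, which is an assumption: it finds “drinking” and “drinks” but not “drunk”, so real word-formation studies would need a stemmer or morphological analyser.

```python
import re


def tokenize(text):
    """Lowercased word tokens of a text."""
    return re.findall(r"\w+", text.lower())


def find_word(texts, word):
    """Indices of the texts that contain the exact word form."""
    word = word.lower()
    return [i for i, t in enumerate(texts) if word in tokenize(t)]


def find_formula(texts, formula):
    """Indices of the texts where the formula's words occur consecutively."""
    needle = tokenize(formula)
    if not needle:
        return []
    hits = []
    for i, t in enumerate(texts):
        tokens = tokenize(t)
        last_start = len(tokens) - len(needle)
        if any(tokens[j:j + len(needle)] == needle
               for j in range(last_start + 1)):
            hits.append(i)
    return hits


def find_group(texts, group):
    """Map each word of a paradigmatic group to the texts containing it."""
    return {w: find_word(texts, w) for w in group}


def find_root(texts, root):
    """Word forms that start with the root (crude stand-in for stemming)."""
    root = root.lower()
    return sorted({tok for t in texts for tok in tokenize(t)
                   if tok.startswith(root)})


TEXTS = [
    "Fifty heroes are drinking wine",
    "Marko seated by the river",
    "She drinks from the brook",
]
```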

Experimental implementation of a linguistic component in BFDL
Frequency dictionary functional specification:
- Linguistic analysis of the available set of test folklore objects
- Determining how frequently the lexemes occur in text folklore objects
- Creating lists of the lexemes
  - in frequency order
  - in alphabetical order
- Counting the lexical units
- Counting the repetitions of the lexical units
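One possible reading of this specification, sketched in Python: the function below returns both orderings together with the number of distinct lexical units and the total number of occurrences. The function name and the result keys are hypothetical, and alphabetical order is plain code-point order, which is an assumption; a production system might prefer locale-aware collation.

```python
import re
from collections import Counter


def frequency_report(text):
    """Summarize word-form frequencies for one text or a joined test set."""
    tokens = re.findall(r"\w+", text.lower())
    counts = Counter(tokens)
    return {
        # Most frequent first; ties are broken alphabetically.
        "by_frequency": sorted(counts.items(),
                               key=lambda item: (-item[1], item[0])),
        "alphabetical": sorted(counts.items()),
        # Number of distinct lexical units.
        "lexical_units": len(counts),
        # Total occurrences, repetitions included.
        "occurrences": len(tokens),
    }


report = frequency_report("wine wine heroes drinking wine heroes")
```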

Experimental implementation of a linguistic component in BFDL Sequence Diagram

Experimental implementation of a linguistic component in BFDL Analysis class diagram for the BFDL linguistic component

Implementation of the Bulgarian folklore digital library
The main tools and languages used:
- Operating system: Microsoft Windows Server 2008 x64 Standard
- Web server: Apache HTTP Server v 2.2, PHP v 2.2.9
- Database management system: MySQL v 5.1 Standard
- Tools for the additional modules: FFmpeg, VMware, HTML, JavaScript, AJAX
- Database query language: SPARQL