Information-Analytical System “Manuscript”: technologies and tools of creation of electronic collections of ancient and medieval documents Victor BARANOV.


1 Information-Analytical System “Manuscript”: technologies and tools of creation of electronic collections of ancient and medieval documents. Victor BARANOV. Linguistics Department, Izhevsk State Technical University; Laboratory of Computer-Aided Philological Research, Udmurtia State University

2 Dagstuhl, December 2006, Digital Historical Corpora. Title page of the portal of IAS “Manuscript”

3 Model of hierarchies and subnets of manuscript and text units

4 Net of linguistic relationships. [Diagram: the sample text «се быша дроузи мои» broken down into nested units («быша дроузи мои», «дроузи мои», and the individual word-forms). Unit labels: Text; Predicate part; Syntactic group; Word-combination; Word-form. Relationship labels: Co-ordination; Dependence. Legend: Relationship; Means of relationship; End of the "single" relationship; End of the "multiple" relationship.]
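
The net of units and relationships named on this slide could be modelled roughly as follows. This is a minimal sketch, not the data model of IAS “Manuscript”: the class names `Unit` and `Relationship`, their fields, and the particular nesting and dependence link chosen for the sample text are all assumptions made for illustration.

```python
from __future__ import annotations
from dataclasses import dataclass, field


@dataclass
class Unit:
    """A text unit at one level of the hierarchy (hypothetical model)."""
    level: str                      # e.g. "text", "predicate part", "word-form"
    text: str
    children: list[Unit] = field(default_factory=list)


@dataclass
class Relationship:
    """A typed link with a "single" end (source) and a "multiple" end (targets)."""
    kind: str                       # e.g. "co-ordination" or "dependence"
    source: Unit
    targets: list[Unit]


def word_forms(unit: Unit) -> list[str]:
    """Collect the word-form leaves under a unit, in reading order."""
    if unit.level == "word-form":
        return [unit.text]
    forms: list[str] = []
    for child in unit.children:
        forms.extend(word_forms(child))
    return forms


# Sample text from the slide; the nesting below is an illustrative assumption.
se = Unit("word-form", "се")
bysha = Unit("word-form", "быша")
drouzi = Unit("word-form", "дроузи")
moi = Unit("word-form", "мои")
group = Unit("syntactic group", "дроузи мои", [drouzi, moi])
predicate = Unit("predicate part", "быша дроузи мои", [bysha, group])
text = Unit("text", "се быша дроузи мои", [se, predicate])

# One assumed dependence link: the possessive depends on the noun.
dependence = Relationship("dependence", source=drouzi, targets=[moi])
```

Keeping relationships as separate objects, rather than as fields on the units, lets the same unit take part in several hierarchies (geometric and linguistic) at once.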

5 Model of the Manuscript system

6 Editor OldEd: main panels

7 Editor OldEd: text input and editing

8 Editor OldEd: fragmentation of the manuscript texts into units and linking them to the dictionary units. Panels: Dictionary of fragments; Properties of fragments; Fragments

9 Editor OldEd: visualization of unit relationships. Labels: Symbol; Geometric hierarchy: Line, Page; Linguistic hierarchy: word-form, normalized forms; Dictionary: Lemma; Dictionary: word-forms of texts; Properties and values of the Lemma

10 Editor OldEd: page layout

11 Resulting layout as displayed on the site. Label: Marginalia

12 Automated lemmatization and establishing relationships between words and lemmas

13 Electronic edition: search page. Labels: Search criteria; Collections & Manuscripts; Search result

14 Search result: word index and concordance
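
As a rough illustration of the two result views named on this slide, a word index maps each word-form to its positions in the text, and a concordance shows each occurrence with its neighbouring words. The sketch below is an assumption about the general technique, not the system's own code; the function names `build_word_index` and `concordance` and the `width` parameter are hypothetical.

```python
from collections import defaultdict


def build_word_index(tokens):
    """Map each word-form to the list of token positions where it occurs."""
    index = defaultdict(list)
    for pos, tok in enumerate(tokens):
        index[tok].append(pos)
    return dict(index)


def concordance(tokens, word, width=1):
    """Return (left context, word, right context) for every occurrence of word."""
    lines = []
    for pos, tok in enumerate(tokens):
        if tok == word:
            left = " ".join(tokens[max(0, pos - width):pos])
            right = " ".join(tokens[pos + 1:pos + 1 + width])
            lines.append((left, tok, right))
    return lines


# Sample text taken from the slide on linguistic relationships.
tokens = "се быша дроузи мои".split()
index = build_word_index(tokens)
kwic = concordance(tokens, "дроузи", width=1)
```

A real index over manuscripts would store page and line references rather than bare token positions; plain positions are used here only to keep the sketch short.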

15 Retrieval module: selecting the text

16 Retrieval module: selecting the unit

17 Retrieval module: setting the unit properties and values

18 Retrieval module: saving the query

19 Retrieval module: specifying the composition of the query result

20 Comparative index of the word-forms

21 Comparative index of the fragments

22 Grammar dictionaries. Diagram labels: Grammar dictionary of the modern Russian language; Grammar dictionary of the Old Russian language; Grammar dictionary of the Old Slavonic language; Grammar dictionary pseudo-elements; Text 1, Text 2, Text 3, Text 4, Text 5, Text 6, ..., Text N

23 Grammar dictionaries: retrieval form

24 Grammar dictionaries: reducing Old Russian word-forms to the lemma

25 Grammar dictionaries: obtaining the paradigm of a lemma

26 Electronic editions

27 Electronic edition: reverse index of word-forms and context
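
Assuming that "reverse index" here means an index ordered by word endings (as in a reverse dictionary), it can be sketched by sorting the word-forms on their reversed spelling. The function name `reverse_index` is hypothetical, and the ordering uses plain Unicode code points, which is only an approximation of a proper Old Slavonic alphabetical order.

```python
def reverse_index(word_forms):
    """Return the unique word-forms sorted by their spelling read right to left."""
    # Sorting on the reversed string groups forms that share an ending.
    # Note: this compares Unicode code points, not a language-specific alphabet.
    return sorted(set(word_forms), key=lambda w: w[::-1])


# Sample word-forms taken from the slide on linguistic relationships.
forms = ["се", "быша", "дроузи", "мои"]
ordered = reverse_index(forms)
```

Grouping by endings in this way puts forms with the same inflection next to each other, which is the usual reason for consulting such an index.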

28 Acknowledgment. The work on the creation of IAS “Manuscript” is being carried out with support from the Russian Foundation for Basic Research (Grant # 05-07-90217в). The work on the creation of the automated morphological analyzer is being carried out with the support of the Russian Foundation for the Humanities (Grant # 05-04-12408в).

29 Contacts. Victor Baranov, baranov@udm.ru, http://manuscripts.ru/index_en.html. Laboratory of Computer-Aided Philological Research, Udmurtia State University; Linguistics Department, Izhevsk State Technical University; Izhevsk, Russia

