D-Square Digital Databases and Digital Tools for WBD and WLD, Folkert de Vriend, 17-05-06
Outline: Digitisation project (briefly). Plans and ideas for papers: A. Data-driven clustering; B. Open Language Resources; C. Cartography
People: CLST: Lou Boves, Henk van den Heuvel, Folkert de Vriend. CLS: Roeland van Hout, Joep Kruijsen, Jos Swanenberg. Polderland: Theo van de Heuvel
WBD page ->
Data conversion overview ->
[Diagram: data conversion overview. Labels: Vol. III; MS-Word; Editors/Management; Users; Analog; Digital; Analog; (parts of) Vol. I+II; MS-Word; Filing cards; Website WBD/WLD with tools for searching and cartography; Enriched data; XML; Raw data; FileM Pro; (parts of) Vol. I+II; MacWrite; Questionnaires Nijmegen and Leuven; Questionnaires (chiefly) Meertens; Raw data; Vol. I + II; Vol. III; Edited data; Specialized print editions (dialect atlas or local dictionary); Online DB WBD (Polderland); Edited data; XML; Vol. III; FileM Pro; SGV on CD (Polderland); Vol. III]
Web access: Taxonomic access to data. Search interface
Research ideas and plans
A: Data-driven clustering. Human interpretation of patterns vs. computational clustering based on distances (lexical or phonetic).
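To make "computational clustering based on distances" concrete, here is a minimal sketch. It assumes, purely for illustration, that the distance between two locations is the mean length-normalised Levenshtein distance between their word forms, and that locations are grouped by average-linkage agglomerative clustering. The slides do not name a specific measure or algorithm; the function names, location names and word forms below are invented and are not taken from the WBD or WLD.

```python
from itertools import combinations


def levenshtein(a: str, b: str) -> int:
    """Edit distance: minimum number of insertions, deletions, substitutions."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1, curr[j - 1] + 1, prev[j - 1] + cost))
        prev = curr
    return prev[-1]


def location_distance(forms_a: dict, forms_b: dict) -> float:
    """Mean length-normalised edit distance over the concepts both locations share."""
    shared = sorted(set(forms_a) & set(forms_b))
    if not shared:
        return 1.0  # assumption: no shared concepts counts as maximally distant
    total = 0.0
    for concept in shared:
        x, y = forms_a[concept], forms_b[concept]
        total += levenshtein(x, y) / max(len(x), len(y), 1)
    return total / len(shared)


def cluster(data: dict, n_clusters: int) -> list:
    """Average-linkage agglomerative clustering of locations into n_clusters groups."""
    clusters = [[name] for name in sorted(data)]
    dist = {
        frozenset((a, b)): location_distance(data[a], data[b])
        for a, b in combinations(data, 2)
    }

    def linkage(c1, c2):
        # Average pairwise distance between the members of two clusters.
        return sum(dist[frozenset((a, b))] for a in c1 for b in c2) / (len(c1) * len(c2))

    while len(clusters) > n_clusters:
        i, j = min(
            combinations(range(len(clusters)), 2),
            key=lambda p: linkage(clusters[p[0]], clusters[p[1]]),
        )
        clusters[i] = sorted(clusters[i] + clusters[j])
        del clusters[j]  # j > i, so index i stays valid
    return clusters


# Invented toy data: location -> {concept: word form}.
TOY = {
    "loc_A": {"potato": "erpel", "house": "hoes"},
    "loc_B": {"potato": "erpel", "house": "huus"},
    "loc_C": {"potato": "patat", "house": "huis"},
    "loc_D": {"potato": "petat", "house": "huis"},
}

if __name__ == "__main__":
    print(cluster(TOY, 2))
```

The resulting groups could then be compared with areas drawn by hand, which is the "human interpretation vs. computational clustering" contrast raised on the slide.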
B: Open Language Resources. "Wikipedia-style" language resource (LR). Digitisation is not the end of the evolution of an LR. The Web seems to be evolving towards "Social Computing". Think of railroads -> cars.
Policing: How can policing of open (language) resources be automated? Perhaps "distant" entries/edits are more suspicious. When an entry is distant -> notify the police.
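One way to read the "distant -> notify" idea is sketched below. It assumes that "distant" means a large string distance between a newly submitted form and the forms already recorded for the same concept. The slide does not define the term, so this reading is an assumption, as are the threshold of 0.5, the function names and the example forms.

```python
def levenshtein(a: str, b: str) -> int:
    """Edit distance: minimum number of insertions, deletions, substitutions."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1, curr[j - 1] + 1, prev[j - 1] + cost))
        prev = curr
    return prev[-1]


def nearest_distance(new_form: str, existing_forms: list) -> float:
    """Normalised distance from new_form to its closest already-recorded form."""
    if not existing_forms:
        return 1.0  # assumption: nothing to compare against counts as distant
    return min(
        levenshtein(new_form, f) / max(len(new_form), len(f), 1)
        for f in existing_forms
    )


def needs_review(new_form: str, existing_forms: list, threshold: float = 0.5) -> bool:
    """True if the edit is 'distant' and should be passed to a human reviewer."""
    return nearest_distance(new_form, existing_forms) > threshold


if __name__ == "__main__":
    known = ["hoes", "huus", "huis"]  # invented forms for one concept
    for candidate in ["hues", "xyzzy"]:
        status = "notify" if needs_review(candidate, known) else "accept"
        print(candidate, status)
```

A check like this would only rank edits for human attention; a genuinely new dialect form can be far from every recorded form and still be correct.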
C: Cartography. Cartography as a tool, not just for illustration -> Google Earth. Advantages: different views on the data; easy to link different resources (also for the end user).
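Google Earth reads KML files, so one possible route from dictionary data to a map is to export each attested form as a KML placemark. The sketch below assumes a simple record layout with the fields place, concept, form, lon and lat. These field names, the place names and the coordinates are invented for illustration and do not reflect the actual WBD/WLD data model.

```python
import xml.etree.ElementTree as ET

KML_NS = "http://www.opengis.net/kml/2.2"


def build_kml(records: list) -> str:
    """Return a KML document with one Placemark per record."""
    ET.register_namespace("", KML_NS)  # serialise KML as the default namespace
    kml = ET.Element(f"{{{KML_NS}}}kml")
    doc = ET.SubElement(kml, f"{{{KML_NS}}}Document")
    for rec in records:
        pm = ET.SubElement(doc, f"{{{KML_NS}}}Placemark")
        ET.SubElement(pm, f"{{{KML_NS}}}name").text = rec["form"]
        ET.SubElement(pm, f"{{{KML_NS}}}description").text = (
            f'{rec["concept"]} ({rec["place"]})'
        )
        point = ET.SubElement(pm, f"{{{KML_NS}}}Point")
        # KML expects longitude first, then latitude.
        ET.SubElement(point, f"{{{KML_NS}}}coordinates").text = (
            f'{rec["lon"]},{rec["lat"]}'
        )
    return ET.tostring(kml, encoding="unicode")


# Invented sample records; coordinates are placeholders, not real survey points.
SAMPLE = [
    {"place": "Place A", "concept": "house", "form": "hoes", "lon": 5.0, "lat": 51.5},
    {"place": "Place B", "concept": "house", "form": "huis", "lon": 5.5, "lat": 51.0},
]

if __name__ == "__main__":
    with open("forms.kml", "w", encoding="utf-8") as fh:
        fh.write(build_kml(SAMPLE))
```

Because each placemark carries its own description, links to other resources could be placed there, which fits the "easy to link different resources" advantage listed on the slide.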
Implementation: Short term: paper on data-driven clustering; paper on cartography. Longer term: paper(s) on Open LR / Social Computing.