Presentation on theme: "Susan Gehr Cell/text (707)"— Presentation transcript:
1 Susan Gehr firstname.lastname@example.org Cell/text (707) 599-2719 Toolbox 1Susan GehrCell/text (707)Welcome to Toolbox 1
2 With gratitudeAlbert Bickford, Toolbox instructor for InField 2008, 2010 & CoLang 2012Neil Brinneman, Shoebox instructor 2003Shoebox/Toolbox Field Linguist’s Toolbox Google GroupSchoolmates at University of Oregon, especially Connie DickinsonMy teachers and sources of Toolbox support.
3 Course Goals Review of Toolbox for use in creating lexical databases Introduction to connections to the topics related to dictionary projects (workflow, lexicography, community involvement)Information about how to learn more and get help after this classStudy of how Toolbox is part of the Karuk dictionary projectOther related software1. I’ll teach about using Toolbox assuming that you won’t have someone setting up the project for you. There is substantial preparation to do in advance of entering data.
4 Getting to know each other What languages and projects are you working on with Toolbox?Did you bring your own computer and your own data to work with?Will you be using Toolbox on the computers in Watson 419? If so, I need to ask Jari to install the New Project Package.What are your current Toolbox-related questions or learning goals?Will someone else be setting up your project for you, or do you need to do it yourself?I now understand that no one here has prior experience with Toolbox. That’s ok. I
5 LogisticsMain class: Tuesday June 19 to Friday June 22 2:15 to 3:45 PMLocation of Class: 419 Watson LibraryPossibilities for consultation outside of class
6 Course Outline June 19, Tuesday Introductions to our Toolbox projects Discussion of the parts and preparation behind every Toolbox ProjectLanguage Encoding (orthography, sort order)Database Type developmentWorkflowProject ManagementToolbox life cycle, considerations for migrating data
7 Course Outline 2 June 20, Wednesday June 21, Thursday Database types, continuedEstablishing a data structure / cheatsheetWhat are your fields? What rules do they require?Documenting all decisions that you make about your workflow, whether you work alone or as part of a teamGetting to work – Using ToolboxJune 21, ThursdayMore on Using Toolbox – Based on class need & interest
8 Course Outline 3 Friday, June 22 Related software (operating systems, Microsoft Word, Windows virtualization software for Mac users, backup software, etc.)Outputting dataFor online and print use (dictionaries, wordlists made with filters)Working with a publisher or a printer
9 Karuk Toolbox Project Started in November 2003 Took 2-week course at JAARSWilliam Bright data and participationNailing down the language encoding and revising it in 2009Publication of a printed dictionary (2005) and an online dictionary (ongoing with UC Berkeley Linguistics)
10 Toolbox Project Preparation Before data entry can start, you need to ‘teach’ Toolbox some things about your project.Language EncodingOrthographySort OrderUnicode compatibilityShow the Karuk 2009 Language Encoding.Open the language encoding document as a plain text document. Talk about the process of establishing a sort order.Talk about being careful with the fonts: Unicode Compatibility
11 Language Encoding Features 2/21/04Language Encoding FeaturesThe order for sorting (using that script).Upper and lower case forms of the characters (if any).Special groupings of characters, variables, which are useful in examining or searching the data.A font to represent the character shapes.Often, a special keyboard to facilitate entering the characters.
12 Exercise: Creating your sort order 2/21/04Exercise: Creating your sort orderWhat is the order of your writing system?Will your audience expect it to sort as English does, or will they expect some other sort order?
13 On the Computer Go to the Project menu > Language Encodings… 2/21/04On the ComputerGo to the Project menu > Language Encodings…Select the vernacular Language Encoding and click the Copy button.Type your sort order into the Primary Characters box.
15 Toolbox Project Preparation Database typeWhat types of information do you want to keep track of?MDF – the Multi-Dictionary FormatterWhat is it?A database type that can function as a data structure standard and a data content standardReadMDFFields Toolbox Project and then read recommended parts ofMDF_2000.pdfFrom MDF Fields Toolbox Project:The original MDF manual, "Making Dictionaries: A guide to lexicography and theMulti-Dictionary Formatter", by Coward and Grimes, was published in 1995 andgave instructions for a set of independent CC tables running in the DOS operatingsystem. While still an extremely valuable document, there are parts which are nolonger relevant, and some which have been changed.\txt The manual itself is only available as a PDF file, though the revision of chaptertwo is a Word document. They can be obtained from ...\txt The following are suggestions for reading the manual:\txt Chapter 1 - Skip (specifics of installation of the original DOS version)\txt Chapter 2 - see revised version: MDF-ch2-revised.doc. Has a description of allMDF markers. Ignore references to "older approaches". The section on gloss vsreversal vs definition is useful.\txt Chapter 3 - Skip. Specifics of running the original DOS version.\txt Chapter 4 Theory, which needs to be considered.\txt Chapter 5 - Read section 5.3 about categories of markers. The rest is mostlyabout why to use Shoebox instead of Word, and about sorting, which is donedifferently in Toolbox.\txt Chapter 6 discusses: principles for choosing the headword, example sentences,homonym vs different senses, semantic categories, dialect information. Note,sometimes gives alternative ways to handle the same information. Check with adictionary consultant if you have doubts.\txt Chapter 7 Various possible lexical function relationships. Usefulness of semanticrelationships as an approach for eliciting data.\txt Chapter 8 Lots of good theory on making good entries; good qestions to ask.Special considerations for activities & events, states & processes, loans &etymologies, ritual speech, etc.\txt Chapter 9 Discussion on part of speech.\txt Chapter 10 A dictionary is more than a list of words. Discusses preface material,appendices, etc.\txt Appendices A-E include lists of markers, recommended order of fields, lists ofsemantic domains and lexical functions, and some commonly used abbreviations.\txt Appendix F-I - Skip. Specific to the original version. Not relevant now.
16 Toolbox Project Preparation Database typeWhat is your data structure?What are your rules about entering data?Do you keep a cheatsheet data structure and a notebook?Karuk examples in Word and in Notebook
17 What are databases & database types? 2/21/04What are databases & database types?In Toolbox, databases can be:dictionariesone text or a collection of textsIn Toolbox, a database type is a file that:includes a collection of properties that defines various fields of the database and some of the methods used for manipulating records.
18 More about database types 2/21/04More about database typesA Dictionary database type might contain:A recommended set of Field MarkersFilters used for finding particular records in a lexicon (eg, all nouns, a particular morpheme, words with homonyms, etc)The Date Stamp field marker \dt
19 Making your own Dictionary type 2/21/04Making your own Dictionary typeYou can & should make a database type for your language by copying the MDF type and modifying the copy (see Karuk MDF)
20 Fields in a Dictionary Database 2/21/04Fields in a Dictionary Database
21 2/21/04What fields might I use?See the list of all MDF fields pp , Making Dictionaries, a guide to lexicography and MDF.Take time before you start making lots of entries into your database to, otherwise you might have to either correct them by hand or have someone write a CC Table to correct all the records.
22 Exercise: Choose field markers & make a cheatsheet 2/21/04Exercise: Choose field markers & make a cheatsheetBasic minimum set\lx - Lexeme\a - Alternate form\u - Underlying form\ps - Part of speech\ge - Gloss\de - Definition\sd - Semantic domain\nt - Notes\dt - Date Last Edited
24 Toolbox Project Preparation Workflow mattersDo you work by yourself?Are you working on a team?One dictionary, one language, one team,but different locations? Version control and file management turn chaos into quality∗Charles E. Grimes Linguistics, RSPAS, ANU, and SIL International
25 Toolbox Project Preparation Project managementPrinted dictionaries have front and backmatterOnline dictionaries have websites with supplemental text, graphics and sound filesHow will you keep track of master files, working files, passwords, contact information, and other valuable records?One dictionary, one language, one team,but different locations? Version control and file management turn chaos into quality∗Charles E. Grimes Linguistics, RSPAS, ANU, and SIL International
26 Toolbox Project Preparation Toolbox life cycle“Toolbox is nearing the end of its life-cycle.” Albert Bickford, SIL, June 2008.So Toolbox must be really near the end here in 2012?“the [Toolbox] programmer is still active…working on a kind of large new feature which is taking its own sweet time…Our assignment for the foreseeable future is Toolbox. All programs and people are mortal, but we are doing our best to keep ourselves and Toolbox among the living.” Toolbox Support. May 1, 2012For now, I will keep using Toolbox, keeping my eye out for the day that it can’t be run on contemporary computers/operating systems and support dries up.
27 End of Day 1?Suggestions & adjustments for following days based on student need and preparation for Toolbox 2.Work on Language Encodings and Database Types
28 Day 2 Work on Language Encodings and Database Types Database types & workflow matters, continuedEstablishing a data structure / cheatsheetWhat are your fields? What rules do they require?What does the MDF book say about your chosen fields?What if, 5,000 records in, you want to add new fields?Documenting all decisions that you make about your workflow, whether you work alone or as part of a team
29 Day 3 June 21, Thursday Data structure matters continued Related software (operating systems, Microsoft Word, Windows virtualization software for Mac users, backup software, etc.)
30 Day 4 Friday, June 22 Outputting data User testing For online and print use (dictionaries, wordlists made with filters)Working with a publisher or a printerUser testingDoes your main audience like your work?