Presentation on theme: "From Words to Meaning to Insight Copyright Leximancer 2011."— Presentation transcript:
From Words to Meaning to Insight Copyright Leximancer 2011
Control Panel Generate Concept Seeds
Leximancer automatically generates seeds on first pass Determines word frequencies (how often a word occurs), determines Name-like, word-like, and compounds This interface provides a method to control how this step works Text Processing Settings specifies how actual text is processed Concept Seeds Settings controls concept seed generation Generate Concept Seeds
Generate Concept Seeds Text Processing Check tagging for folder names and/or file names to make tags for later comparison. Finds tags in the loaded data. Ex: Mike: and Therapist: associates text following a speaker label with that person until next speaker label. Prose off (set to 0) for conversations, tweets, all research Performs stemming but should be off for first run
Generate Concept Seeds Concept Seeds Normally do not change these options.
Control Panel Generate Thesaurus
In this phase, Leximancer develops tailored dictionaries for the concept seeds (starting words) It then generates a list of weighted terms (a thesaurus) that constitute evidence for the presence of each concept High-relevance evidence words occur commonly in contexts where the concept is discussed in the text, and rarely where it is not (these evidence words are what you see in the thesaurus with their z-scores) Sentences may contain several concepts Generate Thesaurus
Concept Seeds allows you to edit, remove, and add to the concepts generated from Leximancer. You can also add your own seeds to guide Leximancer towards your particular concepts of interest. Thesaurus Settings gives you the opportunity to control how the thesaurus is created. Generate Thesaurus cont
Generate Thesaurus: Concept Seeds Auto Concepts: Concepts found by Leximancer. You can edit and remove. Auto Tags: Tags found by Leximancer. These came from your tag settings for folder and file names, spreadsheet column headers, dialog labels. User Defined Concepts: Place to add your own concepts. Used if Leximancer did not find these numerically important. User Defined Tags: To locate keywords relative to concepts, but no thesaurus is generated. Example: Insert codeword(s) into your text and Leximancer will identify concepts around that word. Download and Upload: Save and restore.
The Sentiment Lens looks at positive and negative sentiment in your text Clicking this button loads predefined list of sentiment terms good, like, bad, poor _favterms common favorable terms _unfavterms negative terms _negation terms under User Defined Tags (don t want as concepts) Never, no, not, nothing Sentiment Lens
In thesaurus, click on _unfavterms Lists words found with unfavorable terms score is z-score Sentiment Lens Example
Generate Thesaurus Thesaurus Settings Normally do not change these options.
Control Panel Run Project
Run Project Compound Concepts This editor allows for the combination of like concepts, synonyms, and other possibilities into a single concept. Leximancer will add this new concept as another single concept for the Concept Map, Network Map, and all statistics. The original two concepts will remain. Thus, thesaurus and map for individuals and combination for comparison. For this example, Fukishimi and tsunami are combined.
Run Project Concept Coding Settings The most common use of this is to tag or kill concepts. Kill removes all data where that concept occurs. Warning: Advanced feature, will remove data. Be careful! Tags on the map.
Run Project Project Output Settings Normally do not change these options. Concept Map provides control for the Concept Map buttons defaults. We will cover Insight Dashboard as a separate topic.
The concept map is linked to a text browser Valuable exploratory tool From the browser you can drill down into the text Learn what the concepts refer to Investigate the nature of concept relationships Query Tab Queries entire data set Syntax WORD: NAME: TERM: implicit coding in results Booleans AND and OR and NOT supported Data Querying
Researcher creates all seeds for concepts Useful for large text collection where only a few themes are to be explored; zoom into particular issue Process 1. Load data 2. Turn off Automatic Concept Identification 3. Enter your concepts into Edit Emergent Seeds 4. Tell Thesaurus Learning how many related concepts 5. Run Concept Map with related concepts and thesaurus created for your concepts; everything on that map relates to your issues Profiling
Detailed data; mostly quantitative report Quadrant Diagram (more later) Must have set up Tags (folders, files, spreadsheet: auto-generated) Particularly useful when you want to examine Attributes (independent variables) Certain categories (dependent variables) The Dashboard
Control Panel Run Project->Edit Project Output Settings->Insight Dashboard Tab Almost always click Generate Insight Dashboard Generate Quadrant Report Include all Content Blocks Auto Scale Categories: Select red text (Tags) of interest and click arrow <=== Attributes: Select concepts. Often All Concepts (word-like) and click arrow <=== Dashboard Configuration Button on Control Panel/ Top Right
Quadrant Report Relative Frequency 99% 3% 33%4% Strength Tag1 Tag 2 Legend Frequency: If text is in category (tag), probability text mentions the attribute. What is talked about most frequently on Fox News. Strength: If attribute is in section of text, probability it comes from that category (tag). What topics are unique to Fox News.
Quadrant Report cont Relative Frequency 99% 3% 33%4% Strength Legend Percentages Categories here mean: Occur often and unique to tag Categories here mean: Occur seldom, not unique to tag Tag1 Tag 2 Categories here mean: Occur seldom and unique to tag Categories here mean: Occur often, not unique to tag
Data Exports Data exports available in most tabs in Concept Map view Export concepts, counts, scores Useful for comparison, sharing Backups Other graphics programs (even Excel or Numbers) Data exports available in most tabs in Concept Map view Export concepts, counts, scores Useful for comparison, sharing Backups Other graphics programs (even Excel or Numbers)
Most processed data available Download into spreadsheet Your own graphics package SPSS Save data Not sure how often this is done Data Exports Button on Control Panel/ Top Right
Researcher Logbook Many methodologies require detailed notes Logs good for traceability Some researchers use notes, bookmarks, task lists Processing Log View Log on bottom right of Control Panel Screen Provides information of what Leximancer is doing Good source for total data counts for research papers Pathways Useful in theory-driven study start_point end-> end_point (indirect mediating and moderating relationships) Miscellaneous