Presentation is loading. Please wait.

Presentation is loading. Please wait.

From Words to Meaning to Insight

Similar presentations


Presentation on theme: "From Words to Meaning to Insight"— Presentation transcript:

1 From Words to Meaning to Insight
Copyright Leximancer 2011

2 In-depth Leximancer

3 Control Panel Generate Concept Seeds

4 Generate Concept Seeds
Leximancer automatically generates seeds on first pass Determines word frequencies (how often a word occurs), determines Name-like, word-like, and compounds This interface provides a method to control how this step works Text Processing Settings specifies how actual text is processed Concept Seeds Settings controls concept seed generation

5 Generate Concept Seeds Text Processing
Check tagging for folder names and/or file names to make tags for later comparison. Prose looks for English grammar and sentences. If data are not clean or sparse, turn this to 0. Dialog tags are also used to span 2 sentence text blocks. Prose off (set to 0) for conversations, tweets, all research Finds tags in the loaded data. Ex: Mike: and Therapist: associates text following a speaker label with that person until next speaker label. Performs stemming but should be off for first run

6 Generate Concept Seeds Concept Seeds
Normally do not change these options.

7 Control Panel Generate Thesaurus

8 Generate Thesaurus In this phase, Leximancer develops tailored dictionaries for the concept “seeds” (starting words) It then generates a list of weighted terms (a thesaurus) that constitute evidence for the presence of each concept High-relevance evidence words occur commonly in contexts where the concept is discussed in the text, and rarely where it is not (these evidence words are what you see in the thesaurus with their z-scores) Sentences may contain several concepts

9 Generate Thesaurus cont
Concept Seeds allows you to edit, remove, and add to the concepts generated from Leximancer. You can also add your own seeds to guide Leximancer towards your particular concepts of interest. Thesaurus Settings gives you the opportunity to control how the thesaurus is created.

10 Generate Thesaurus: Concept Seeds
Auto Concepts: Concepts found by Leximancer. You can edit and remove. Auto Tags: Tags found by Leximancer. These came from your tag settings for folder and file names, spreadsheet column headers, dialog labels. User Defined Concepts: Place to add your own concepts. Used if Leximancer did not find these numerically important. User Defined Tags: To locate keywords relative to concepts, but no thesaurus is generated. Example: Insert codeword(s) into your text and Leximancer will identify concepts around that word. Download and Upload: Save and restore.

11 List of positive and negative terms tailored to your dataset.
Sentiment Lens The Sentiment Lens looks at positive and negative sentiment in your text Clicking this button loads predefined list of sentiment terms good, like, bad, poor _favterms common favorable terms _unfavterms negative terms _negation terms under User Defined Tags (don’t want as concepts) Never, no, not, nothing The negation handle not good no happiness The lens will create _favterms and _unfavterms concepts with relationships and frequencies in the thesaurus. List of positive and negative terms tailored to your dataset.

12 Sentiment Lens Example
In thesaurus, click on _unfavterms Lists words found with unfavorable terms score is z-score

13 Generate Thesaurus Thesaurus Settings
Normally do not change these options. If very small text collection, may turn off Learn Concept Thesaurus for keyword coding. Hand coders typically like to do this.

14 Control Panel Run Project

15 Run Project Compound Concepts
This editor allows for the combination of like concepts, synonyms, and other possibilities into a single concept. For this example, Fukishimi and tsunami are combined. Leximancer will add this new concept as another single concept for the Concept Map, Network Map, and all statistics. The original two concepts will remain. Thus, thesaurus and map for individuals and combination for comparison.

16 Run Project Concept Coding Settings
The most common use of this is to tag or kill concepts. Kill removes all data where that concept occurs. Warning: Advanced feature, will remove data. Be careful! Tags on the map.

17 Run Project Project Output Settings
Normally do not change these options. Concept Map provides control for the Concept Map buttons defaults. We will cover Insight Dashboard as a separate topic.

18 Data Querying The concept map is linked to a text browser
Valuable exploratory tool From the browser you can drill down into the text Learn what the concepts refer to Investigate the nature of concept relationships Query Tab Queries entire data set Syntax WORD: NAME: TERM: implicit coding in results Booleans AND and OR and NOT supported TERM is keyword

19 Profiling Researcher creates all seeds for concepts Process
Useful for large text collection where only a few themes are to be explored; zoom into particular issue Process Load data Turn off Automatic Concept Identification Enter your concepts into Edit Emergent Seeds Tell Thesaurus Learning how many related concepts Run Concept Map with related concepts and thesaurus created for your concepts; everything on that map relates to your issues

20 The Dashboard Detailed data; mostly quantitative report
Quadrant Diagram (more later) Must have set up Tags (folders, files, spreadsheet: auto-generated) Particularly useful when you want to examine Attributes (independent variables) Certain categories (dependent variables) what tags are auto-generated?

21 Dashboard Configuration Button on Control Panel/ Top Right
Control Panel Run Project->Edit Project Output Settings->Insight Dashboard Tab Almost always click Generate Insight Dashboard Generate Quadrant Report Include all Content Blocks Auto Scale Categories: Select red text (Tags) of interest and click arrow <=== Attributes: Select concepts. Often All Concepts (word-like) and click arrow <===

22 Quadrant Report 99% Strength: If attribute is in section of text, probability it comes from that category (tag). What topics are unique to Fox News. Strength Frequency: If text is in category (tag), probability text mentions the attribute. What is talked about most frequently on Fox News. 3% Relative Frequency 4% 33% Legend Tag1 Tag 2

23 Quadrant Report cont 99% Categories here mean:
Occur often and unique to tag Categories here mean: Occur seldom and unique to tag Strength Categories here mean: Occur seldom, not unique to tag Categories here mean: Occur often, not unique to tag 3% Percentages Relative Frequency 4% 33% Legend Tag1 Tag 2

24 Data Exports Data exports available in most tabs in Concept Map view
Export concepts, counts, scores Useful for comparison, sharing Backups Other graphics programs (even Excel or Numbers) The ones grayed out mean that ... ????

25 Data Exports Button on Control Panel/ Top Right
Most processed data available Download into spreadsheet Your own graphics package SPSS Save data Not sure how often this is done

26 Miscellaneous Researcher Logbook Processing Log Pathways
Many methodologies require detailed notes Logs good for traceability Some researchers use notes, bookmarks, task lists Processing Log View Log on bottom right of Control Panel Screen Provides information of what Leximancer is doing Good source for total data counts for research papers Pathways Useful in theory-driven study start_point end-> end_point (indirect mediating and moderating relationships)


Download ppt "From Words to Meaning to Insight"

Similar presentations


Ads by Google