Download presentation
Presentation is loading. Please wait.
1
LING 581: Advanced Computational Linguistics Lecture Notes January 19th
2
Administrivia New room – Shantz 338 – (I have asked Jennifer Columbus to investigate refund: however, I’m told it may not happen) Marshall 480 Shantz 338
3
Penn Treebank Availability – Source: Linguistic Data Consortium (LDC) U. of Arizona is a (fee-paying) member of this consortium Resources are made available to the community through the main library URL – http://sabio.library.arizona.edu/search/X
4
Penn Treebank (V3) Call Record
5
Penn Treebank 1.Tagging Guide 2.Arpa94 paper 3.Parse Guide 1.Tagging Guide 2.Arpa94 paper 3.Parse Guide
6
Penn Treebank
7
sections 00-24
8
Penn Treebank
9
tregex Tregex is a Tgrep2-style utility for matching patterns in trees. written In Java written In Java run-tregex-gui.command shell script -mx flag, the 300m default memory size will need to be increased depending on the platform
10
tregex Select the PTB directory – TREEBANK_3/parsed/mrg/wsj/ Browse Deselect any unwanted files
11
tregex Search
12
tregex Help
13
tregex Help
14
tregex Help
15
tregex Help
16
tregex Help
17
tregex Pattern: – (@NP <, (@NP $+ (/,/ $+ (@NP $+ /,/=comma))) <- =comma)
18
tregex Help
19
tregex
20
Different results from: – @SBAR < /^WH.*-([0-9]+)$/#1%index << (@NP < (/^-NONE-/ < /^\*T\*-([0-9]+)$/#1%index))
21
tregex Example: WHADVP also possible (not just WHNP)
22
Ungraded Homework Exercise Search for NP trace relative clauses as defined below: Be ready to compare search pattern and number found next time in class
Similar presentations
© 2024 SlidePlayer.com Inc.
All rights reserved.