How to Tag a Corpus Using Stanford Tagger. Accuracy All tokens: 97.32% Unknown words: 90.79%



What You Need JRE: sp?locale=en

To make sure that Windows can find the Java compiler and interpreter: Select Start -> Computer -> System Properties -> Advanced system settings -> Environment Variables -> System variables -> PATH. [In Vista, select Start -> My Computer -> Properties -> Advanced -> Environment Variables -> System variables -> PATH.] [In Windows XP, select Start -> Control Panel -> System -> Advanced -> Environment Variables -> System variables -> PATH.] Add C:\Program Files\Java\jdk1.6.0_27\bin; to the beginning of the PATH variable, keeping the semicolon so that it stays separate from the existing entries. Click OK three times.
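As a rough illustration of what "adding to the beginning of PATH" means, the sketch below builds the new PATH value as a plain string. It does not change any Windows setting. The function name `prepend_to_path` is hypothetical, and the folder is the one named on the slide.

```python
def prepend_to_path(path_value, new_dir, sep=";"):
    """Return path_value with new_dir placed first, without duplicating it."""
    def norm(entry):
        # Windows paths are case-insensitive; ignore a trailing backslash too.
        return entry.rstrip("\\").lower()

    # Drop empty entries and any existing copy of new_dir.
    entries = [e for e in path_value.split(sep) if e and norm(e) != norm(new_dir)]
    return sep.join([new_dir] + entries)


# Folder named on the slide; adjust it if your installed version differs.
JDK_BIN = r"C:\Program Files\Java\jdk1.6.0_27\bin"

# Example with a made-up existing PATH value (illustration only).
print(prepend_to_path(r"C:\Windows\system32;C:\Windows", JDK_BIN))
```

Entries are separated by semicolons, which is why the slide tells you to keep the trailing `;` when typing the folder in by hand.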

Installing Java (JRE) on your computer: Click Start, type cmd, and press Enter. This opens the Command Prompt window. Type java -version and press Enter. You will get a message such as: java version "1.7.0" (or possibly an older version). If you do not get this message, Java was not installed correctly. Ask for help.
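The same check can be scripted. The following is a minimal sketch, assuming Python 3.7 or later is available; the function name `java_version_banner` is hypothetical. It looks for `java` on PATH and, if found, captures whatever `java -version` prints.

```python
import shutil
import subprocess


def java_version_banner():
    """Return the text printed by `java -version`, or None if java is not on PATH."""
    java = shutil.which("java")
    if java is None:
        return None
    # `java -version` writes its banner to stderr rather than stdout.
    result = subprocess.run([java, "-version"], capture_output=True, text=True)
    return (result.stderr or result.stdout).strip()


banner = java_version_banner()
print(banner if banner else "java was not found on PATH; check the installation.")
```

If the script reports that java was not found, the PATH step above is the first thing to recheck.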

Install the Stanford POS Tagger Basic English Stanford Tagger Version 3.1.3: postagger tgz

Installing Basic English Stanford Tagger: Click on the link provided above and download the zip file. Unzip the file to Documents using archive manager software such as WinRAR, 7-Zip, or WinZip. You might want to rename the unzipped folder to stanTagger. I do this because the original name is too long: stanford-postagger

Create a Corpus Folder: In the stanTagger folder, create two folders to hold your files. I name them myCorpus and myTaggedCorpus. Now put some text files (or your corpus) in myCorpus. Make sure there are no spaces in your file names: for example, writtenArgument.txt instead of written Argument.txt. Move the stanTagger folder directly under C: so that you can find it easily.
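The folder setup and the file-name check can be sketched in Python as follows. The function names are hypothetical, and the demonstration runs in a throwaway temporary folder rather than in C:\stanTagger so that it is safe to try anywhere.

```python
import tempfile
from pathlib import Path


def prepare_corpus_folders(base):
    """Create myCorpus and myTaggedCorpus under base and return both paths."""
    base = Path(base)
    corpus = base / "myCorpus"
    tagged = base / "myTaggedCorpus"
    corpus.mkdir(parents=True, exist_ok=True)
    tagged.mkdir(parents=True, exist_ok=True)
    return corpus, tagged


def names_with_spaces(corpus):
    """List the .txt file names in corpus that contain a space."""
    return sorted(p.name for p in Path(corpus).glob("*.txt") if " " in p.name)


# Demonstration in a temporary folder with two made-up sample files.
with tempfile.TemporaryDirectory() as demo_base:
    corpus, tagged = prepare_corpus_folders(demo_base)
    (corpus / "written Argument.txt").write_text("sample")
    (corpus / "writtenArgument.txt").write_text("sample")
    flagged = names_with_spaces(corpus)

print(flagged)
```

Any name this check reports should be renamed before tagging, because the Command Prompt treats a space as a separator between arguments.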

Tagging Files: Start your command window as described above. Go to C: by typing the command cd.. twice. Go into stanTagger by typing cd stanTagger

Tagging files: To run the Stanford Tagger on every file automatically, we need to do some programming. We can do this with Perl or other programming languages, such as Java, PHP, Python, and so on. However, I found programming the Command Prompt to be the simplest, and I will share the code I prepared.

Tagging files: Code to be used in Command Prompt:
FOR %a IN (C:\stanTagger\myCorpus\*.txt) DO stanford-postagger models\left3words-wsj tagger myCorpus\%~nxa >myTaggedCorpus\%~nxa
You can simply copy the above code and paste it into the Command Prompt.

New Code!
FOR %a IN (C:\stanTagger\myCorpus\*.txt) DO stanford-postagger models\wsj left3words.tagger myCorpus\%~nxa >myTaggedCorpus\%~nxa

Newest Code!
FOR %a IN (C:\stanTagger\myCorpus\*.txt) DO stanford-postagger models\english-left3words-distsim.tagger myCorpus\%~nxa >myTaggedCorpus\%~nxa
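Since the slides mention Python as an alternative to the Command Prompt loop, here is a minimal sketch of the same idea. It is not the author's code. The launcher name and model path are copied from the command above (with the line-break spaces in the model name removed), the function names are hypothetical, and the use of `shell=True` so that Windows resolves the launcher by its bare name is an assumption. The demonstration only builds and prints the commands; it does not run the tagger.

```python
import subprocess
import tempfile
from pathlib import Path

TAGGER = "stanford-postagger"  # launcher name as written on the slide
MODEL = r"models\english-left3words-distsim.tagger"  # model path from the slide


def build_jobs(corpus_dir, tagged_dir, tagger=TAGGER, model=MODEL):
    """Return one (command, output_path) pair per .txt file in corpus_dir."""
    jobs = []
    for text_file in sorted(Path(corpus_dir).glob("*.txt")):
        command = [tagger, model, str(text_file)]
        jobs.append((command, Path(tagged_dir) / text_file.name))
    return jobs


def run_jobs(jobs, working_dir):
    """Run each command from working_dir, sending its output to a file like `>` does."""
    for command, output_path in jobs:
        with open(output_path, "w", encoding="utf-8") as out:
            # Assumption: on Windows, shell=True lets cmd find the launcher by bare name.
            subprocess.run(command, shell=True, cwd=working_dir, stdout=out, check=True)


# Demonstration: build (but do not run) the jobs for one made-up sample file.
with tempfile.TemporaryDirectory() as demo:
    corpus = Path(demo) / "myCorpus"
    tagged = Path(demo) / "myTaggedCorpus"
    corpus.mkdir()
    tagged.mkdir()
    (corpus / "writtenArgument.txt").write_text("A short sample sentence.")
    jobs = build_jobs(corpus, tagged)

for command, output_path in jobs:
    print(command, "->", output_path.name)
```

On a real setup, `run_jobs(build_jobs(r"C:\stanTagger\myCorpus", r"C:\stanTagger\myTaggedCorpus"), r"C:\stanTagger")` would correspond to the loop on the slide, with the working folder set to stanTagger so that the relative model path resolves.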

Each file may take about 2-3 seconds, and at the end you will see that myTaggedCorpus contains the tagged files.
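One way to confirm that every file was tagged is to compare the two folders. The sketch below uses a hypothetical function name and a temporary demo folder with made-up files. It reports corpus files whose tagged counterpart is missing or empty; an empty file is worth checking for because the `>` redirection creates the output file even when the command itself fails.

```python
import tempfile
from pathlib import Path


def untagged_files(corpus_dir, tagged_dir):
    """Names of corpus .txt files with no non-empty counterpart in tagged_dir."""
    missing = []
    for text_file in sorted(Path(corpus_dir).glob("*.txt")):
        tagged_file = Path(tagged_dir) / text_file.name
        if not tagged_file.is_file() or tagged_file.stat().st_size == 0:
            missing.append(text_file.name)
    return missing


# Demonstration: a.txt has output, b.txt has an empty output, c.txt has none.
with tempfile.TemporaryDirectory() as demo:
    corpus = Path(demo) / "myCorpus"
    tagged = Path(demo) / "myTaggedCorpus"
    corpus.mkdir()
    tagged.mkdir()
    for name in ("a.txt", "b.txt", "c.txt"):
        (corpus / name).write_text("sample text")
    (tagged / "a.txt").write_text("placeholder for tagged text")
    (tagged / "b.txt").write_text("")
    problems = untagged_files(corpus, tagged)

print(problems)
```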