IDigBio Augmenting OCR Workshop October 1, 2012 Plants, Herbivores, and Parasitoids NSF ADBC Digitization TCN Kimberly Watson.

Slides:



Advertisements
Similar presentations
HATHITRUST A Shared Digital Repository Bibliographic Metadata and HathiTrust ALCTS CaMMS Catalog Management Interest Group Meeting American Library Association.
Advertisements

The West` Washington Idaho 1 Montana Oregon California 3 4 Nevada Utah
NSF ADBC Digitization TCN-TTD Plants, Herbivores, and Parasitoids A Model System for the study of Tri-Trophic Associations Ten months later… presentation.
TOTAL CASES FILED IN MAINE PER 1,000 POPULATION CALENDAR YEARS FILINGS PER 1,000 POPULATION This chart shows bankruptcy filings relative to.
HATHITRUST A Shared Digital Repository HathiTrust: A Second Life for Library Collections Jeremy York Exploring Humanities Cyberinfrastructure April 30,
The New York Botanical Garden The Macroalgae Digitization Project Advancing online algal collections at the New York Botanical Garden and beyond Stephen.
Plants, Herbivores, and Parasitoids A Model System for the study of Tri-Trophic Associations Katja Seltmann, NSF ADBC Digitization TCN, iDigBio Paleocollections.
Plants, Herbivores, and Parasitoids A Model System for the study of Tri-Trophic Associations NSF ADBC Digitization TCN Melissa Tulig, Toby Schuh & Rob.
IDigBio Botany 2012 Digitization Workshop July 12, 2012 Plants, Herbivores, and Parasitoids NSF ADBC Digitization TCN Kimberly Watson, Melissa Tulig.
BINARY CODING. Alabama Arizona California Connecticut Florida Hawaii Illinois Iowa Kentucky Maine Massachusetts Minnesota Missouri 0 Nebraska New Hampshire.
Digitizing California Arthropod Collections Peter Oboyski, Phuc Nguyen, Serge Belongie, Rosemary Gillespie Essig Museum of Entomology University of California.
The Role of Small Herbaria in Large Digitization Projects Chris Neefus, Albion Hodgdon Herbarium (NHA) University of New Hampshire, Durham, New Hampshire,
The Macroalgal Herbarium Consortium ACCESSING 150 YEARS OF SPECIMEN DATA TO UNDERSTAND CHANGES IN THE MARINE/AQUATIC ENVIRONMENT.
What are the states in the Northeast Region?
NSF EF Welcome to Summit III University of Florida Florida State University.
U.S. Civil War Map On a current map of the U.S. identify and label the Union States, the Confederate States, and U.S. territories. Create a map key and.
Integrative research using digitized specimens: examples from the Consortium of California Herbaria Brent Mishler University and Jepson Herbaria University.
1st iDigBio – BRIT Hackathon iDigBio Augmenting Optical Character Recognition Working Group (AOCR wg) February 13 – 14, 2013.
IDigBio is funded by a grant from the National Science Foundation’s Advancing Digitization of Biodiversity Collections Program (Cooperative Agreement EF ).
Update from the Entomological Society of America (ESA) Systematics, Evolution, and Biodiversity (SysEB) Section Symposium: From Voucher.
2.3 million specimens, 65 institutions, 1 year later DIGITIZING 'ALL' NORTH AMERICAN LICHEN AND BRYOPHYTE SPECIMENS Corinna Gries Edward Gilbert Thomas.
This chart compares the percentage of cases filed in Maine under chapter 13 with the national average between 1999 and As a percent of total filings,
The Macroalgal Digitization Project Chris Neefus, Department of Biological Sciences University of New Hampshire, Durham, New Hampshire.
SCAN Survey Results: Engaging the Public with Insect Digitization Workflows Dr. Melody Basham Hasbrouck Insect Collection Outreach Specialist Project Director.
Plants, Herbivores, and Parasitoids: A Model System for the Study of Tri-trophic Associations Robert Naczi 1, Melissa Tulig 1, Richard Rabeler 2, Robert.
Presented by: Michael Bevans Information Manager for Digitization
Map Review. California Kentucky Alabama.
OCR implementation in The Caribbean Plants Digitization Project A project to image and catalog over 150,000 Caribbean specimens at the New York Botanical.
1. AFL-CIO What percentage of the funds received by Alabama K-12 public schools in school year was provided by the state of Alabama? a)44% b)53%
University of Florida Florida State University
Edward Gilbert Corinna Gries Thomas H. Nash III Robert Anglin.
Directions: Label Texas, Arkansas, Louisiana, Mississippi, Tennessee, Alabama, Georgia, Florida, South Carolina, North Carolina, Virginia--- then color.
The Macroalgal Herbarium Consortium ACCESSING 150 YEARS OF SPECIMEN DATA TO UNDERSTAND CHANGES IN THE MARINE/AQUATIC ENVIRONMENT.
CHAPTER 7 FILINGS IN MAINE CALENDAR YEARS 1999 – 2009 CALENDAR YEAR CHAPTER 7 FILINGS This chart shows total case filings in Maine for calendar years 1999.
Corinna Gries Edward Gilbert Thomas H. Nash III. Lichens Bryophytes Climate Change  NSF ADBC funding 2011 ~ 2.3 million specimen (90%) ○ 900,000 lichens.
2.3 million specimens, 65 institutions, 1 year later DIGITIZING 'ALL' NORTH AMERICAN LICHEN AND BRYOPHYTE SPECIMENS Corinna Gries Edward Gilbert Thomas.
Plants, Herbivores, and Parasitoids A Model System for the study of Tri-Trophic Associations Katja Seltmann, TTD-TCN Project Manager Public Participation.
The Macroalgal Herbarium Consortium Accessing 150 Years of Specimen Data to Understand Changes in the Marine/Aquatic Environment Janet Sullivan and Chris.
The William and Linda Steere Herbarium The New York Botanical Garden
Hawaii Alaska (not to scale) Alaska GeoCurrents Customizable Base Map text.
STATE of the STATES Evaluating US Regional AV siobhan hagan, university of baltimore lynette stoudt, georgia historical society anne wells, chicago film.
 Research Question  Goals and Scope  Digitization Workflow  Geo-referencing  Dissemination  Outreach and Crowd Sourcing.
US MAP TEST Practice
©CSCOPE 2007 Economic Regions of the United States Economic Regions of the United States.
TOTAL CASE FILINGS - MAINE CALENDAR YEARS 1999 – 2009 CALENDAR YEAR Total Filings This chart shows total case filings in Maine for calendar years 1999.
The student will use maps locating the 50 states and the cities most significant to the historical development of the United States Cities serve as centers.
1st Hour2nd Hour3rd Hour Day #1 Day #2 Day #3 Day #4 Day #5 Day #2 Day #3 Day #4 Day #5.
8/10/16 Lesson 1-1: States and Regions
2c: States grouped by region
The United States Song Wee Sing America.
Supplementary Data Tables, Utilization and Volume
USAGE OF THE – GHz BAND IN THE USA
AMR Additional Questions
Tri-Trophic Thematic Collection Network
The States How many states are in the United States?
Department of Environmental Quality
Table 2.3: Beds per 1,000 Persons by State, 2013 and 2014
Regions of the United States
DO NOW: TAKE OUT ANY FORMS OR PAPERS YOU NEED TO TURN IN
Regions of the United States
Supplementary Data Tables, Utilization and Volume
Regions How many do you know?.
Slave States, Free States
Presidential Electoral College Map
WASHINGTON MAINE MONTANA VERMONT NORTH DAKOTA MINNESOTA MICHIGAN
States During the Civil War
INHS Insect collection digitization workflow
CBD Topical Sales Restrictions by State (as of May 23, 2019)
Wisconsin State Herbarium (WIS) – University of Wisconsin Madison’s
Percent of adults aged 18 years and older who have obesity †
Presentation transcript:

iDigBio Augmenting OCR Workshop October 1, 2012 Plants, Herbivores, and Parasitoids NSF ADBC Digitization TCN Kimberly Watson

Crop Plants Pierce plant stems and leaves; specialize on one species or numerous. Reduce plant vigor, transmit disease, reduce harvest yield. Hymenoptera (Parasitoid wasps) Lay eggs inside aphid; larva consumes host from the inside out; emerges from “mummy” as an adult. Plants A Tri-Trophic Example Herbivores Parasitoids Photo: Hemiptera (e.g. Aphids) Photo: Produce fruits and tubers of significant agricultural and economic importance. Poaceae: corn, wheat, rice Fabaceae: soybean, hay Solanaceae: tomato, potato

Species of Interest: North American Biota Family# species Apiaceae250 Asteraceae2,400 Chenopodiaceae250 Cupressaceae30 Cyperaceae850 Fabaceae850 Fagaceae97 Grossulariaceae53 Juglandaceae17 Lamiaceae240 Oleaceae35 Pinaceae66 Poaceae1,400 Polygonaceae440 Rhamnaceae75 Rosaceae360 Salicaceae123 Scrophulariaceae430 Solanaceae85 Zygophyllaceae15 Total8,066 Hemiptera# species Coccoidea (scale insects)986 Aphidoidea (plant lice)1,532 Psylloidea (jumping plant lice)176 Auchenorrhyncha (cicadas, hoppers)4,629 Heteroptera3,827 Total11,150 Hymenoptera# species Aphelinidae212 Encyrtidae490 Mymaridae187 Signiphoridae19 Trichogrammatidae131 Total1,039 Herbivores Plants Parasitoids

Insect Specimen Digitization Institutions (18) Specimens databased % Georeferenced Prior funding Specimens to be databased American Museum of Natural History30,000100NSF-PBI333,000 B. P. Bishop Museum, Honolulu00 70,000 California Academy of Sciences4,000100NSF-PBI40,000 California Dept. Food & Agriculture1,000100NSF-PBI75,000 Carnegie Museum, Pittsburgh01 15,000 Colorado State University01 15,000 Cornell University01 30,000 Illinois Natural History Survey36,000100NSF-REVSYS73,000 Mississippi State University00 50,000 North Carolina State University1,000100NSF-BRC75,000 Oregon State University1, ,000 Texas A&M University15,000100NSF-PBI150,000 Univ. of California, Berkeley, Essig Museum12,00092NSF-PBI, NSF-BRC45,000 University of California, Riverside14,000100NSF-PBI, NSF-DBI75,000 University of Delaware2, ,000 University of Kansas00 50,000 University of Kentucky00 35,000 University of Massachussetts, Amherst10, ,000 Total126,000 1,206,000 Grand Total 1,332,000

Plant Specimen Digitization Institutions (14) Specimens databased % Georeferenced Prior funding Specimens to be databased Eastern Michigan University0010,000 Illinois Natural History Survey308, ,000 Iowa State University46, ,000 Miami University14,000535,000 Missouri Botanical Garden247,00025NSF-BRC101,000 New York Botanical Garden102,00030NSF-BRC, NSF-PBI274,000 University of Colorado51,000067,000 University of Illinois0030,000 University of Kansas129, ,000 University of Maine100,000034,000 University of Michigan26, ,000 University of Minnesota93,00010NSF- BRC70,000 University of Texas105, ,000 University of Wisconsin120, ,000 Total1,341,0001,224,000 GRAND TOTAL2,565,000

Catalog skeletal records Barcode Scientific (“Filed As”) name Use Tropicos® authority files Average ± /hr Send existing data to NY Complete records Georeferenced (if available) Darwin Core format Rapid Data Entry

Photograph every specimen 21 megapixel DSLR camera Macro lens, 55 mm Photo-Box, even illumination Barcode = Image file name Average ±80-120/hr Send JPG images to NY Rapid Image Capture

>1 barcode per sheet Crop to lower right Crop to label Export JPGs of labels Batch Image Post-Processing JPG images compiled at NY

ABBYY Hot Folder Run Once/Recurring Automatically Analyze Autoselect Language Save as text files Barcode.txt Batch OCR ABBYY FineReader 11 Corporate Edition

Using the OCR data Merge individual text files into single Excel worksheet using a Powershell script Search, group, enter data for several collections at once

Partner Institutions NYBGAMNH 7 7 Image specimens barcode.jpg Complete Plant Data + Complete Insect Data 7 OCR barcode.txt Skeletal Data Barcode “Filed-As” Name Existing Complete Data Plant Specimen Digitization Workflow Duplicate matching Complete and skeletal records combined at NYBG Populate skeletal records using OCR data, duplicate matching, crowd sourcing Populate records from images Data sort & parse 7 Image Crop Crowd Sourcing DATABASE Complete Data Skeletal Data Images OCR.txt

Tri-Trophic TCN Partners BOTANY – Robert Naczi, New York Botanical Garden – Robert Magill, Missouri Botanical Garden – Richard Rabeler, University of Michigan – Melissa Tulig, New York Botanical Garden – Barbara Thiers, New York Botanical Garden – Kim Watson, New York Botanical Garden – Margaret Koopman, Eastern Michigan University – Loy Phillippe, Illinois Natural History Survey – Deborah Lewis, Iowa State University – Michael Vincent, Miami University – Timothy Hogan, University of Colorado – Mary Ann Feist, University of Illinois – Craig Freeman, University of Kansas – Christopher Cambell, University of Maine – Anita Cholewa, University of Minnesota – Beryl Simpson, University of Texas – Kenneth Cameron, University of Wisconsin Data Contributors – Consortium of Pacific Northwest Herbaria – Consortium of California Herbaria – Southwest Biodiversity Consortium ENTOMOLOGY – Randall Schuh, American Museum of Natural History – Christine Johnson, American Museum of Natural History – Christiane Weirauch, University of California, Riverside – John Heraty, University of California, Riverside – Charles Bartlett, University of Delaware – Benjamin Normark, University of Massachusetts, Amherst – Katja Seltmann, American Museum of Natural History – Neal Evenhuis, BP Bishop Museum, Honolulu – David Kavanaugh,California Academy of Sciences – Stephen D. Gaimari,California Dept. Food and Agriculture – Chen Young, Carnegie Museum, Pittsburg – Boris C. Kondratieff, Colorado State University – James K. Liebherr, Cornell University – Dmitry Dmitriev, Illinois Natural History Survey – Richard Brown, Mississippi State University – Andy Deans, North Carolina State University – David Maddison, Oregon State University – Christopher Marshall, Oregon State University – John Oswald, Texas A&M University – Kipling Will, University of California, Berkeley – Caroline Chaboo, University of Kansas – Michael Sharkey, University of Kentucky – John Pickering, University of Georgia Data Contributors – Canadian National Collection, Ottawa – University of California, Davis – Kansas State University NSF Award#