Morphological Identification of P. maniculatus and P. leucopus Mice with Java ABSTRACT Efficient methods of distinguishing sister taxa in areas of sympatry.

Slides:



Advertisements
Similar presentations
♦We now know that our PCR protocol is working. ♦From here, we will work on obtaining a larger sample size from local woodlots. ♦If indeed we find only.
Advertisements

Winter Reproduction of Peromyscus in Rider Park, Lycoming Co., PA K.W Hopkins, A.K. Smolarek, and D.R. Broussard Department of Biology, Lycoming College,
Automatically Annotating and Integrating Spatial Datasets Chieng-Chien Chen, Snehal Thakkar, Crail Knoblock, Cyrus Shahabi Department of Computer Science.
GenomePixelizer - a visualization tool for comparative genomics within and between species. A. Kozik, E. Kochetkova, and R. Michelmore (Department of Vegetable.
Detecting Computer Intrusions Using Behavioral Biometrics Ahmed Awad E. A, and Issa Traore University of Victoria PST’05 Oct 13,2005.
Funding Networks Abdullah Sevincer University of Nevada, Reno Department of Computer Science & Engineering.
Jeff Shen, Morgan Kearse, Jeff Shi, Yang Ding, & Owen Astrachan Genome Revolution Focus 2007, Duke University, Durham, North Carolina Introduction.
Combining Human and Machine Capabilities for Improved Accuracy and Speed in Visual Recognition Tasks Research Experiment Design Sprint: IVS Flower Recognition.
Traffic Sign Recognition Jacob Carlson Sean St. Onge Advisor: Dr. Thomas L. Stewart.
Artificial Neural Networks (ANNs)
Sparsity, Scalability and Distribution in Recommender Systems
Geometric Crossovers for Supervised Motif Discovery Rolv Seehuus NTNU.
Computer Science Prof. Bill Pugh Dept. of Computer Science.
Online Stacked Graphical Learning Zhenzhen Kou +, Vitor R. Carvalho *, and William W. Cohen + Machine Learning Department + / Language Technologies Institute.
Chapter 12: Simulation and Modeling Invitation to Computer Science, Java Version, Third Edition.
© 2013 IBM Corporation Efficient Multi-stage Image Classification for Mobile Sensing in Urban Environments Presented by Shashank Mujumdar IBM Research,
Attention Deficit Hyperactivity Disorder (ADHD) Student Classification Using Genetic Algorithm and Artificial Neural Network S. Yenaeng 1, S. Saelee 2.
Personalized QoS-Aware Web Service Recommendation and Visualization.
1 Template-Based Classification Method for Chinese Character Recognition Presenter: Tienwei Tsai Department of Informaiton Management, Chihlee Institute.
Tennessee Technological University1 The Scientific Importance of Big Data Xia Li Tennessee Technological University.
Analysis of Molecular and Clinical Data at PolyomX Adrian Driga 1, Kathryn Graham 1, 2, Sambasivarao Damaraju 1, 2, Jennifer Listgarten 3, Russ Greiner.
1 Research Groups : KEEL: A Software Tool to Assess Evolutionary Algorithms for Data Mining Problems SCI 2 SMetrology and Models Intelligent.
Introduction to Data Mining Group Members: Karim C. El-Khazen Pascal Suria Lin Gui Philsou Lee Xiaoting Niu.
Improved Gene Expression Programming to Solve the Inverse Problem for Ordinary Differential Equations Kangshun Li Professor, Ph.D Professor, Ph.D College.
Steps for Extraction Creation of cyan intensity image in preparation for thresholding: Although many dorsal fin images exhibit a good contrast between.
Using SAS® Information Map Studio
Recognizing Activities of Daily Living from Sensor Data Henry Kautz Department of Computer Science University of Rochester.
Dan Rosenbaum Nir Muchtar Yoav Yosipovich Faculty member : Prof. Daniel LehmannIndustry Representative : Music Genome.
CISC Machine Learning for Solving Systems Problems Presented by: Alparslan SARI Dept of Computer & Information Sciences University of Delaware
PreDetector : Prokaryotic Regulatory Element Detector Samuel Hiard 1, Sébastien Rigali 2, Séverine Colson 2, Raphaël Marée 1 and Louis Wehenkel 1 1 Department.
Document Clustering for Forensic Analysis: An Approach for Improving Computer Inspection.
240-Current Research Easily Extensible Systems, Octave, Input Formats, SOA.
Performance evaluation on grid Zsolt Németh MTA SZTAKI Computer and Automation Research Institute.
Dissertation Defense Machine Assisted Grading of Rare Collectibles through the COINS framework By: Rick Bassett July 24, 2003.
Editing Graphics Effects and Improvements. Editing Graphics Graphics editors have features for changing the sizes of images as well as their colors and.
2005/12/021 Content-Based Image Retrieval Using Grey Relational Analysis Dept. of Computer Engineering Tatung University Presenter: Tienwei Tsai ( 蔡殿偉.
Learning from Positive and Unlabeled Examples Investigator: Bing Liu, Computer Science Prime Grant Support: National Science Foundation Problem Statement.
A NOVEL METHOD FOR COLOR FACE RECOGNITION USING KNN CLASSIFIER
TEMPLATE DESIGN © E-Eye : A Multi Media Based Unauthorized Object Identification and Tracking System Tolgahan Cakaloglu.
Implementation of a Relational Database as an Aid to Automatic Target Recognition Christopher C. Frost Computer Science Mentor: Steven Vanstone.
Project 1: Classification Using Neural Networks Kim, Kwonill Biointelligence laboratory Artificial Intelligence.
POSTER TEMPLATE BY: Background Objectives Psychophysical Experiment Smoothness Features Project Pipeline and outlines The purpose.
Learning Photographic Global Tonal Adjustment with a Database of Input / Output Image Pairs.
Final Report (30% final score) Bin Liu, PhD, Associate Professor.
December 13, G raphical A symmetric P rocessing Prototype Presentation December 13, 2004.
Robodog Frontal Facial Recognition AUTHORS GROUP 5: Jing Hu EE ’05 Jessica Pannequin EE ‘05 Chanatip Kitwiwattanachai EE’ 05 DEMO TIMES: Thursday, April.
A New Generation of Artificial Neural Networks.  Support Vector Machines (SVM) appeared in the early nineties in the COLT92 ACM Conference.  SVM have.
A SCRIPT FOR ARCHIVING DIGITAL RESEARCH DATA IMPROVING ACCURACY AND EFFICIENCY IN THE DATAVERSE NETWORK ABSTRACT SUMMARY Rachel Carriere, Thu-Mai Christian,
Multi-Area Load Forecasting for System with Large Geographical Area S. Fan, K. Methaprayoon, W. J. Lee Industrial and Commercial Power Systems Technical.
Experience Report: System Log Analysis for Anomaly Detection
Big data classification using neural network
Computational and Biological Vision Course, Semester A,
ANOMALY DETECTION FRAMEWORK FOR BIG DATA
Can Computer Algorithms Guess Your Age and Gender?
Computer pattern recognition of
Hybrid Features based Gender Classification
Department of Computer Science
Brain Hemorrhage Detection and Classification Steps
Using Tensorflow to Detect Objects in an Image
Students: Meiling He Advisor: Prof. Brain Armstrong
Analysis and classification of images based on focus
Biometric technology.
Interactive Visual System
What's New in eCognition 9
Using Tensorflow to Detect Objects in an Image
Machine Learning for Visual Scene Classification with EEG Data
Applying principles of computer science in a biological context
What's New in eCognition 9
Pooja Pun, Avdesh Mishra, Simon Lailvaux, Md Tamjidul Hoque
What's New in eCognition 9
Presentation transcript:

Morphological Identification of P. maniculatus and P. leucopus Mice with Java ABSTRACT Efficient methods of distinguishing sister taxa in areas of sympatry are important for testing theories of niche partitioning, character displacement, and hybridization. Peromyscus leucopus and P. maniculatus are sympatric in many areas of the eastern United States. Previous methods of morphological identification have not matched the accuracy of genetic analysis. Although genetic analysis may serve as the most accurate speciation technique, it remains costly and time consuming. Computer analysis allows additional rigor in morphological identification, so an algorithm was developed in Java to distinguish these species. The user can import a photo of a Peromyscus mouse, with the full body and tail shown, and run a 4- point analysis of fur color, tail to body ratio, ear to body ratio, and dorsal-ventral tail gradient. Logistic regression is applied to these traits to achieve a species classification. Using a data set of 48 species-identified mice, this program accurately classifies 95% of P. leucopus and 90% of P. maniculatus photos. With additional data and statistical optimization, we believe that these results can be improved. These results are promising because they have lowered the implicit bias in morphological identification. This program has many applications in character displacement studies in areas with and without hybridization. Margo Morton 1, Brian Shelburne 1, Richard Phillips 2 OBJECTIVE To determine whether P. leucopus and P. maniculatus mice could be differentiated by morphological characteristics alone, and to determine whether automated characteristic analysis decreased inherent observer bias in previous morphological identification methods. METHODS RESULTS CONCLUSIONS BACKGROUND The idea for a computer program that could identify species of mice was originally proposed by Professor Richard S. Phillips, who believes that field biologists could benefit from an efficient classification algorithm. P. maniculatus and P. leucopus mice are used widely in biological and ecological research, but are sympatric in many areas of the United States and can be difficult to distinguish even by trained biologists. Currently, biologists must rely on genetic sequencing which is time consuming and expensive. This semester, we sequenced six mice, at a cost of $242. This cost of $40 per mouse would not be practical for large character studies involving hundreds of mice. If mice could be accurately identified based on physical characteristics alone, then an algorithm would be much more practical. According to leading biological literature on mouse identification, P. maniculatus and P. leucopus can be distinguished through several characteristics. These features would be subjective and hard to quantify by a human but objective to a computer, such as coloration patterns on a tail. Research led me to conclude that the best way to identify mice was a 4-point analysis of fur color, tail to body ratio, ear to body ratio, and dorsal-ventral tail gradient. Figure 1: P. maniculatusFigure 2: P. leucopus A dataset was developed from images of mice trapped and identified by Prof. Phillips, as well as mouse images from online databases. The final dataset included 48 species-identified mice. The species test was a sequence-based analysis of a hypervariable region of the mitochondrial 16S rRNA gene. Java was selected to code this software, due to the fact that the final software could be executed and run on any machine. Java AWT and Swing libraries were used in order to develop a straightforward and robust user interface, with functionality that feels familiar to users of graphical editing programs such as Photoshop. User interface functions include the ability to load images into the program, analyze with tools, scale the image, and undo/redo actions. The user can import a photo of a Peromyscus mouse, with the full body and tail shown, and run a 4-point analysis of fur color, tail to body ratio, ear to body ratio, and dorsal- ventral tail gradient. Logistic regression is applied to these traits to achieve a species classification. 1. An area of the fur is averaged to get a fur color value, then a distance formula is used to compute the distance to the average color of P. leucopus and P. maniculatus in 3D Colorspace Color Distance = ((r2 - r1) 2 + (g2 - g1) 2 + (b2 - b1) 2 ) 1/2 2. The tail, ear, and body length are measured in pixels, and divided to get ratios of these values 3. A pixel from the dorsal and ventral part of the tail is selected, and a distance formula is used to get a tail gradient value Figure 3. These blocks of color represent the average fur color of a P. leucopus mouse and a P. maniculatus mouse. A computer can use the distance equation to determine how close a fur color is to these averages. Caballas, Rauleen, Stephanie Fore, and Hyun-Joo Kim. "Regional Model for Identifying Peromyscus Leucopus and Peromyscus Maniculatus." Thesis. Truman State University, n.d. Print. Choate, Jerry R. "Identification and Recent Distribution of White-Footed Mice (Peromyscus) in New England." Journal of Mammalogy 54.1 (1973): JSTOR. Web. 27 Mar Lindquist, Erin S., Charles F. Aquadro, Deedra McClearn, and Kevin J. McGowan. "Field Identification of the Mice Peromyscus Leucopus Noveboracensis and P. Maniculatus Gracilis in Central NewYork." Canadian Field- Naturalist (2003): Print. Rich, Stephen M., William Kilpatrick, Jodi L. Shippee, and Kenneth L. Crowell. "Morphological Differentiation and Identification of Peromyscus Leucopus and P. Maniculatus in Northeastern North America." Journal of Mammalogy 77.4 (1996): JSTOR. Web. 27 Mar I believe this research has shown that computer software can accurately classify P. maniculatus and P. leucopus based on physical characteristics alone. Computer classification, when compared to the morphologic identification methods proposed by Truman, Cabalas and Rich, yields similar accuracy for P. leucopus and vastly increased accuracy for P. maniculatus mice. It has also shown to be faster and less costly than traditional genetic analysis methods. This program could be extended to classify additional species of mice, such as P. gossypinus or P. polionotus given a set of unique morphologic identifiers. Travel funding provided by Wittenberg’s Student Development Board. 1. Wittenberg University, Department of Mathematics and Computer Science, Box 720, Springfield, OH Wittenberg University, Department of Biology, Box 720, Springfield, OH ACKNOWLEDGEMENTS SOURCES Using a data set of 48 species-identified mice, this program accurately classifies 95% of P. leucopus and 90% of P. maniculatus photos. With additional data and statistical optimization, we believe that these results can be improved. The following logistic regression equation was developed for this dataset: In the future, the algorithm may be modified to dynamically generate logistic regression coefficients, in order to more accurately classify new data sets. These results are promising because they have lowered the implicit bias in morphological identification. This software has many applications in character displacement studies in areas with and without hybridization. Mouse Image SOFTWARE OVERVIEW Analyzing the Image Data Saved in Excel Figure 5: Screencaps demonstrating the use of the software. A photograph is selected, loaded into the program, analyzed, and the results are stored in an excel file. Software available for download at github.com/mortonm FUTURE RESEARCH This software has the potential for improvement, and is available under the GNU General Public License v3.0 for anyone to contribute to the development. Possible improvements to the program include: Automation of mouse classification through feature recognition Dynamic generation of new logistic regression coefficients Demonstration of result reproducibility through Receiver Operating Characteristic (ROC) plots Testing the software with a larger image dataset