Modeling molecular evolution Jodi Schwarz and Marc Smith Vassar College Biol/CS353 Bioinformatics.

Slides:



Advertisements
Similar presentations
Willing to spend the time! Self motivated! Self responsibility! (If you need something Ask For IT!!!!!) Ability to communicate! (Vocabulary) Write,
Advertisements

A Systems Approach To Training
Ability-Based Education at Alverno College. Proposed Outcomes for Session 1. To introduce you to Alvernos approach to designing integrative general education.
E-Science Data Information and Knowledge Transformation Thoughts on Education and Training for E-Science Based on edikt project experience Dr. Denise Ecklund.
Program Goals Just Arent Enough: Strategies for Putting Learning Outcomes into Words Dr. Jill L. Lane Research Associate/Program Manager Schreyer Institute.
Evolution of genomes.
Is it Mathematics? Linking to Content Standards. Some questions to ask when looking at student performance Is it academic? – Content referenced: reading,
Assessment Report Biology School of Science and Mathematics Rey Sia, Chair Laurie B. Cook, Assessment Coordinator.
Statistics in Bioinformatics May 2, 2002 Quiz-15 min Learning objectives-Understand equally likely outcomes, Counting techniques (Example, genetic code,
May 2, May 2, 2015May 2, 2015May 2, 2015 Azusa, CA Sheldon X. Liang Ph. D. Software Engineering in CS at APU Azusa Pacific University, Azusa, CA.
PBL Post-Project Review. 1. Student Engagement2. Project Idea3. Student Learning4. Authenticity of Project Tasks and Products5. Quality and Use of Driving.
GCAT-SEEK Workshop, 2013 Dr. Tammy Tobin, Susquehanna University Adapted from Mary J. Allen, SACS-COC Summer Institute Developing and Using Rubrics to.
 Aim in building a phylogenetic tree is to use a knowledge of the characters of organisms to build a tree that reflects the relationships between them.
. Class 1: Introduction. The Tree of Life Source: Alberts et al.
Using Bioinformatics to Make the Bio- Math Connection The Confessions of a Biology Teacher.
Bioinformatics: a Multidisciplinary Challenge Ron Y. Pinter Dept. of Computer Science Technion March 12, 2003.
Bioinformatics and Phylogenetic Analysis
Computer Science Department Program Improvement Plan December 3, 2004.
Integration of Bioinformatics into Inquiry Based Learning by Kathleen Gabric.
Algorithms and Efficiency of Algorithms February 4th.
Evaluating Assessments of Novice Programming Environments Paul Gross and Kris Powers Tufts University.
Bioinformatics Unit 1: Data Bases and Alignments Lecture 3: “Homology” Searches and Sequence Alignments (cont.) The Mechanics of Alignments.
Turning a Disciplinary Lens to Teaching and Learning Preparing Tomorrow’s Professionals for a Complex World The integrity of the discipline leads to a.
Biology core curriculum vertical alignment study Shana Kerr Biology Teaching Retreat 2015.
Seeing the Destination So We Can Direct Others to It
T. E. Potok - University of Tennessee Software Engineering Dr. Thomas E. Potok Adjunct Professor UT Research Staff Member ORNL.
Introduction To System Analysis and design
1 CSC 221: Computer Programming I Fall 2004 course overview  what did we set out to learn?  what did you actually learn?  where do you go from here?
1 UTeach Professional Development Courses. 2 UTS Step 1 Early exposure to classroom environment (can be as early as a student’s first semester)
Unit 2: Engineering Design Process
Bioinformatics Sean Langford, Larry Hale. What is it?  Bioinformatics is a scientific field involving many disciplines that focuses on the development.
CriteriaExemplary (4 - 5) Good (2 – 3) Needs Improvement (0 – 1) Identifying Problem and Main Objective Initial QuestionsQuestions are probing and help.
CS 360 Lecture 3.  The software process is a structured set of activities required to develop a software system.  Fundamental Assumption:  Good software.
Introduction to Bioinformatics Prologue. Bioinformatics Living things have the ability to store, utilize, and pass on information Bioinformatics strives.
BLAST: A Case Study Lecture 25. BLAST: Introduction The Basic Local Alignment Search Tool, BLAST, is a fast approach to finding similar strings of characters.
1 Chapter 5 Software Engineering Practice. 2 What is “Practice”? Practice is a broad array of concepts, principles, methods, and tools that you must consider.
These courseware materials are to be used in conjunction with Software Engineering: A Practitioner’s Approach, 6/e and are provided with permission by.
Coming up: Software Engineering: A Practitioner’s Approach, 6/e Chapter 5 Practice: A Generic View copyright © 1996, 2001, 2005 R.S. Pressman & Associates,
Lecture 25 - Phylogeny Based on Chapter 23 - Molecular Evolution Copyright © 2010 Pearson Education Inc.
What is HQPD?. Ohio Standards for PD HQPD is a purposeful, structured and continuous process that occurs over time. HQPD is a purposeful, structured and.
Assessing the Frequency of Empirical Evaluation in Software Modeling Research Workshop on Experiences and Empirical Studies in Software Modelling (EESSMod)
Achievethecore.org 1 Setting the Context for the Common Core State Standards Sandra Alberti Student Achievement Partners.
Bioinformatics: Theory and Practice – Striking a Balance (a plea for teaching, as well as doing, Bioinformatics) Practice (Molecular Biology) Theory: Central.
How do these groups of students think about and learn genetics?
I Power Higher Computing Software Development The Software Development Process.
Chris DeWald Science Instructional Coordinator Montana Office of Public Instruction.
Construction of Substitution Matrices
BLAST: Basic Local Alignment Search Tool Altschul et al. J. Mol Bio CS 466 Saurabh Sinha.
Overview of Bioinformatics 1 Module Denis Manley..
Evolution of BINF Bioinformatics Certificate Foothill College R. Cormia, L. English, K. Erickson.
The E ngineering Design Process Foundations of Technology The E ngineering Design Process © 2013 International Technology and Engineering Educators Association,
1 Software Engineering: A Practitioner’s Approach, 6/e Chapter 5 Practice: A Generic View Software Engineering: A Practitioner’s Approach, 6/e Chapter.
1. Administrators will gain a deeper understanding of the connection between arts, engagement, student success, and college and career readiness. 2. Administrators.
Bioinformatics Curriculum Issues, goals, curriculum.
1 CSC 221: Computer Programming I Spring 2008 course overview  What did we set out to learn?  What did you actually learn?  Where do you go from here?
The E ngineering Design Process Advanced Design Applications The E ngineering Design Process Teacher Resource – The First Five Days: Day 2 © 2014 International.
Research Problem In one sentence, describe the problem that is the focus of your classroom research project about student learning: The traditional approach.
Student Learning Objectives (SLO) Resources for Science 1.
ITEAMS is designed to provide teachers with the tools to develop inquiry-based lessons, and the opportunity to develop a deep understanding of implementation.
1 CS 501 Spring 2002 CS 501: Software Engineering Lecture 27 Software Engineering as Engineering.
Integration of Bioinformatics into Inquiry Based Learning by Kathleen Gabric.
Software Development Process CS 360 Lecture 3. Software Process The software process is a structured set of activities required to develop a software.
Teaching Bioinformatics in Concert Anya L. Goodman and Alex Dekhtyar Department of Chemistry and Biochemistry California Polytechnic State University.
Planning for and Attending an Important Meeting Advanced Social Communication High School: Lesson Seven.
Board on Science Education Draft released 15 July 2011
Using core competencies in curriculum design
Bioinformatics Overview
Assessment Planning and Learning Outcome Design Dr
Applying principles of computer science in a biological context
Curriculum Coordinator: Patrick LaPierre February 3, 2017
Presentation transcript:

Modeling molecular evolution Jodi Schwarz and Marc Smith Vassar College Biol/CS353 Bioinformatics

Team taught Biol and CompSci course 7 students: – CS experience: 3 yes, 4 no – Bio experience: 5 yes, 2 no Project-based course; no exams Worked in Biol/CS pairs on projects I3U near end of course; last project before independent research projects

Common approach for all projects Biological question Algorithm design – Step-by-step approach to complete a task or solve the problem Implementation – The actual programming “script” that will carry out the steps of the algorithm Evaluation of implementation and algorithm Revision or augmentation

I3U: added an experimental component to our basic approach Previous projects focused on pattern finding, mining whole genome data Goal of I3U: Model a biological/evolutionary process Test the model with empirical data Perform computational experiments

Model molecular evolution Step 1: model the effect of random vs targeted nucleotide substitutions on a protein sequence – What do we mean by random? – determine the similarity of the original protein sequence to the “evolved” sequence Step 2: Assess the real nt diversity at positions 1, 2, 3 of codons in real homologs (HSP70) – Construct alignment of homologs and determine nt diversity at each position Evaluate the models using the empirical data

Learning goals CS students: To apply their knowledge of data structures and algorithms to a biological domain Biology students: To apply their knowledge of the biology to design algorithms For the collaboration: – To become familiar with modeling a biological process: a simple model must be constructed and tested first – To test the model using empirical data

Assessment Assignments – Alignment assignment – 2 Perl scripts Model random vs targeted substitution pattern Determine the codon nt diversity in HSP70 genes – Output from the 2 Perl scripts Raw output Graphs summarizing data Observation – Collaboration – Critical thinking

Random substitutions substitutions targeted to 3 rd psn Example student results Effect of random vs targeted substitutions on a protein sequence (compared the “ancestral” sequence to the “evolved” sequence ) 100 runs

Example student results of empirical data Average diversity by nucleotide position within codons: Codon position 1: 1.50 Codon position 2: 1.29 Codon position 3: 2.32 Most variation occurs in position 3

Collaboration across disciplines How we tried to teach collaboration: – We defined the meaning of collaboration CS students do not need to become biologists and vice versa Each person contributes a different set of expertise Learning how to speak each other’s language Communication – We modeled it Overt reliance on each other’s expertise Spontaneous discussions – Giving students lots of experience collaborating: several shifts in pairs over the semester

Assessment of collaboration Attitude : reluctant vs eager At beginning (self) vs. during project (experience) Gradational Assessment of Collaboration ScoreSelfExperience 0reluctantavoided 1eagerproblems 2reluctantpositive 3eagerpositive StudentScoreTeam ScoreTeams A02A+C B14B+F C26E+G D3 E33D worked alone F3 G3

1 how a genomics approach crosses levels of biological organization 2 how genomic-level science is conducted 3 how computational approaches are deployed to answer genomic questions? 4 how to find potential functional /evolutionary patterns in DNA/protein sequence 5 independently use bioinformatic tools to address biological/genomic questions. 6 examine the output of a bioinformatic analysis and relate it to a biological question. 7 provide one or more clear examples of how genomics uses an interdisciplinary approach Most improvement: questions that are explicitly bioinformatic Least: questions that are more broadly about genomics (CS) Likert Scale (1-5)

What worked well Overall approach was great: question, algorithm, implementation, analysis, iteration Use of starter code allowed students to – Undertake much more sophisticated projects – see examples of more advanced algorithm/code Encountering unanticipated results and problems – Gaps in alignments not in groups of 3 – Spontaneous discussions leading to AHA moments Students enjoyed the modeling process – One student’s final project focused on modeling molecular evolution

What didn’t work as well Some collaborations are not successful Ran out of time: insufficient analysis and reflection For the I3U: Assessment strategy not well developed – Can we retroactively extract more informative assessment?

Assessing biology knowledge Algorithm development – Ability to help partner understand different mutation vs selection – Ability to recognize assumptions of model – Ability to use the empirical data to evaluate model

Assessing the CS Variables – Abstraction: representing information as data – Types of data: predefined, atomic, aggregate – Scope: declaration, initialization, mutation Algorithms – Control flow: unconditional, conditional, repetition – Input/Output and regex (pattern matching) – Top-down design: subroutines – To reuse or not to reuse (code)? Incremental development / experimentation Elegance: readability and maintainability

Biological question – What pattern of nucleotide substitution occurs in protein-coding genes? Algorithm – What does we know about mutation, nt/AA sequences? – Assumptions Implementation – Instructors provided “starter code” – Students read and ran the code to see what it did – Pairs discussed how to add and refine it, and did so Evaluation – Analyze the CS: Did it run and did it do the job we asked? – Analyze the biology: Did it accurately represent the biological process? Testing the models against empirical evidence – Aligned HSP70 genes and evaluated the pattern of substitution Which model most closely matched the biology?