Standardized Workflows (II) Carlos Oscar Sorzano Techn. Director I 2 PC Natl. Center Biotechnology (CSIC)

Slides:



Advertisements
Similar presentations
Digital Photography Week Two Review Week Two Review.
Advertisements

NICS 2 NICS 2 UV-optimized spectrograph UV-optimized spectrograph high resolution IR spectrograph high resolution IR spectrograph call for proposal: deadline.
Query optimisation.
Comp325 Algorithmic and Game Theoretic Foundation for Internet Economics/Xiaotie Deng Lecture 19 Matching Market Equilibrium
Welcome to Web Treasure HUNT Team Name:. Basic rules Recommended search Google.com Points vary for each question Maximum of 3 negatives for full.
Chapter 18 Methodology – Monitoring and Tuning the Operational System Transparencies © Pearson Education Limited 1995, 2005.
Understanding Tables on the Web Jingjing Wang. Problem to Solve A wealth of information in the World Wide Web Not easy to access or process by machine.
TIPS TO USE THIS FILE Always click Yes to enable macros. Only save this file as a.ppt (for PowerPoint 03 and 07) or.pptm (for PowerPoint 10+. Do NOT save.
The Pigeonhole Principle
We will multiply by multiples of 10, 100, and 1000.
DEPTH OF FIELD PHOTO II MRS. KAMPF. SHUTTER SPEED CONTROLS THE AMOUNT OF TIME LIGHT HAS TO ENTER THE CAMERA AND EXPOSE THE FILM. SHUTTER SPEEDS ARE.
Kinetics (Reaction Rate)
Hydrological information systems Svein Taksdal Head of section, Section for Hydroinformatics Hydrology department Norwegian Water Resources and Energy.
© Dr I M Bradley CG109 - Individual Project (Undergraduate) Overview Briefing.
Working with images and scenes CS 5010 Program Design Paradigms “Bootcamp” Lesson 2.5 TexPoint fonts used in EMF. Read the TexPoint manual before you delete.
3/5/2009Computer systems1 Analyzing System Using Data Dictionaries Computer System: 1. Data Dictionary 2. Data Dictionary Categories 3. Creating Data Dictionary.
Function Tables 02/12/12 lntaylor ©. Table of Contents Learning Objectives Linear Equations Build a Function Table Build a T Chart Reading a Function.
Raster Data. The Raster Data Model The Raster Data Model is used to model spatial phenomena that vary continuously over a surface and that do not have.
© 2002 by Prentice Hall 1 David M. Kroenke Database Processing Eighth Edition Chapter 2 Introduction to Database Development.
PAWN V0.7 University of Maryland Institute for Advanced Computer Studies.
Integrating data sources on the World-Wide Web Ramon Lawrence and Ken Barker U. of Manitoba, U. of Calgary
Physical design. Stage 6 - Physical Design Retrieve the target physical environment Create physical data design Create function component implementation.
Introduction to Programming with C++ Fourth Edition
Particle picking and Screening (Practical work)
Scipion: Toward software integration, reproducibility and validation in EM image processing Biocomputing Unit, Instruct Image Processing Center, CNB-CSIC.
Spatial data models (types)
The Relational Database Model
Department of Physics and Astronomy DIGITAL IMAGE PROCESSING
Chapter 3 Sections 3.5 – 3.7. Vector Data Representation object-based “discrete objects”
LSBR Single particle processing workflow in Bsoft Bernard Heymann.
PHP meets MySQL.
Standardized Workflows (I) Carlos Oscar Sorzano Techn. Director I 2 PC Natl. Center Biotechnology (CSIC)
Network of Excellence Benchmark Carlos Oscar Sorzano Techn. Director I 2 PC Natl. Center Biotechnology (CSIC)
McGraw-Hill Career Education© 2008 by the McGraw-Hill Companies, Inc. All Rights Reserved. 2-1 Office PowerPoint 2007 Lab 2 Modifying and Refining a Presentation.
Elastic Image Registration To Fully Explore Macromolecular Dynamics By Electron Microscopy Qiyu Jin 1, Carlos Oscar Sanchez Sorzano 2, Isabelle Callebaut.
Databases. Not All Tables Are Created Equal Spreadsheets use tables to store data and formulas associated with that data The “meaning” of data is implicit.
WEEK# 2 Haifa Abulaiha August 24,
XP Chapter 1 Succeeding in Business with Microsoft Office Access 2003: A Problem-Solving Approach 1 Preparing To Automate Data Management Chapter 1 “You.
CIS 210 Systems Analysis and Development Week 6 Part II Designing Databases,
LAV Software Status Emanuele Leonardi – Tommaso Spadaro Photon Veto WG meeting – 2015/03/24.
1 CSE 2337 Introduction to Data Management Access Book – Ch 1.
CONTENTS Processing structures and commands Control structures – Sequence Sequence – Selection Selection – Iteration Iteration Naming conventions – File.
+ Information Systems and Databases 2.2 Organisation.
Information Interchange José María Carazo Director I 2 PC Natl. Center Biotechnology (CSIC)
Dictionary based interchanges for iSURF -An Interoperability Service Utility for Collaborative Supply Chain Planning across Multiple Domains David Webber.
CS130 Visual Basic Project 4 Lecture Fall New topics in project 4 Database, file (table), records, fields. Application that contains menus, submenus,
Assignment 3 Your going to modify your last Best Movies page and Recipe page Your going to modify your last Best Movies page and Recipe page You will need.
An Introduction to Programming with C++ Sixth Edition Chapter 12 Two-Dimensional Arrays.
Online Programming| Online Training| Real Time Projects | Certifications |Online Classes| Corporate Training |Jobs| CONTACT US: STANSYS SOFTWARE SOLUTIONS.
DAY 2 Haifa Abulaiha January 13,
Mastering Autodesk Revit MEP 2016
SysML v2 Formalism: Requirements & Benefits
Final Report Alpha Squad Seven
knowledge organization for a food secure world
Week 12 Option 3: Database Design
ACL SCREEN Start with Standard ACL Project Screen.
Welcome to Physical Science!
1. You work for the University of Tawara and are going to create a database to analyse data about student courses and accommodation. Dates are to be stored.
Chapter 2 Matter and Change.
Building the BEST Structure Challenge
Volume 23, Issue 9, Pages (September 2015)
JSON Data Demo.
ITS Workbench Two Problems, One Open Standards Based Solution
Metadata The metadata contains
Rubén Díaz-Avalos, Donald L.D. Caspar  Biophysical Journal 
Volume 20, Issue 12, Pages (December 2012)
Volume 18, Issue 1, Pages (January 2010)
Working with images and scenes
Erika J Mancini, Felix de Haas, Stephen D Fuller  Structure 
Presentation transcript:

Standardized Workflows (II) Carlos Oscar Sorzano Techn. Director I 2 PC Natl. Center Biotechnology (CSIC)

Interchange Points need an interchange standard Specific proposal in the discussion

Some interchange points CTF estimation of micrographs: List of Micrograph, Voltage, DefocusU, DefocusV, AngleUX, Cs Particle picking: List of Micrograph, micrographXcoor, micrographYcoor Particle extraction: List of images 2D Alignment/Classification: 3D Alignment/Classification: List of image/volume [2D or 3D alignment] [A list of class representatives] [Class representative assignment]

Interchange Proposal Interchange information: – Data: images, volumes and stacks – MetaData: list of … Data structure – Data: Array of real values with X varying faster – MetaData: Table with specific column names Data format – Data: MRC file – MetaData: STAR file with specific block names

Data: Images, volumes and stacks.mrcMRC 2D image.mrcMRC 3D image (can be distinguished from 2D by header).mrcsMRC Stack of 2D images Heymann, J. B.; Chagoyen, M. & Belnap, D. M. J. Structural Biology, 2005, 151, HeaderVal0Val1ValN Val0Val1 x y (0,0) x y

MetaData: Interchange format

MetaData: Components UID: Unique Identifier Micrograph (one or many motifs) Image (motif of interest) Volume CTF Coordinates 2D Alignment 3D Alignment Comment

MetaData: UID, Micrograph, Comment Any entry in the metadata file has a unique entry number listOfMicrographs.3dem: # 3DEM_STAR_1 General file comment # data_block_1 loop_ _UID _micrographLocator _comment 1 InputData/micrograph1.mrc Comment 1 2 InputData/micrograph2.mrc Comment 2 data_block_2 loop_ _UID _micrographLocator 3 InputData/micrographA.mrc 4 InputData/micrographB.mrc

MetaData: CTF Any entry in the metadata file has a unique entry number listOCTFs.3dem: # 3DEM_STAR_1 * # data_block_1 loop_ _UID _Voltage _DefocusU _DefocusV _AngleUX _Cs (in kV) (in μm) (in degrees) (in mm.) X Y U V

MetaData: Images and Volumes Images can be in individual files (mrc) or stacks (mrcs). Volumes are in individual files (mrc). listOfImages.3dem: # 3DEM_STAR_1 * # data_block loop_ _UID _imageLocator 1 image00001.mrc 2 image00002.mrc 3 4 listOfVolumes.3dem: # 3DEM_STAR_1 * # data_block loop_ _UID _volumeLocator 1 volume00001.mrc 2 volume00002.mrc

MetaData: Coordinates Heymann, J. B.; Chagoyen, M. & Belnap, D. M. J. Structural Biology, 2005, 151, x y (0,0) listOfCoordinates.3dem: # 3DEM_STAR_1 * # data_block loop_ _UID _micrographXcoor _micrographYcoor

MetaData: 2D Alignment x y Δx=+5 Δy=-15 Δψ=60° listOfAlignedImages.3dem: # 3DEM_STAR_1 * # data_block loop_ _UID _imageLocator _imageXOff _imageYOff _imagePsiOff

MetaData: 2D Alignment x y listOfAlignedImages.3dem: # 3DEM_STAR_1 * # data_alignedImageList_A loop_ _UID _imageLocator _imageXOff _imageYOff _imagePsiOff _imageFlip Δx=+5 Δy=-15 Δψ=60° flip

MetaData: 2D Alignment x y Δx=+5 Δy=-15 Δψ=60° flip

MetaData: 3D Alignment List of 3D aligned images.3dem: # 3DEM_STAR_1 * # data_block loop_ _UID _imageLocator _homogeneousEulerMatrix I1 mage00001.mrc [a11 a12 a13 Xoff a21 a22 a23 Yoff a31 a32 a33 0] Heymann, J. B.; Chagoyen, M. & Belnap, D. M. J. Structural Biology, 2005, 151, List of 3D aligned volumes.3dem: # 3DEM_STAR_1 * # data_block loop_ _UID _volumeLocator _homogeneousEulerMatrix 1 volume00001.mrc [a11 a12 a13 Xoff a21 a22 a23 Yoff a31 a32 a33 Zoff]

MetaData: Relationships listOfImagesAndDownsampledImages.3dem: # 3DEM_STAR_1 * # data_fullSize_images loop_ _UID _imageLocator 1 2 data_downsampledSize_images loop_ _UID _imageLocator 3 4 data_correspondingDownsampleImage loop_ _UID _UID1 _UID

MetaData: Interchange format There can be any number of blocks within a Star file. Each block can combine any number of components Examples: Micrograph and locations: micrographLocator, micrographXcoor, micrographYcoor Micrograph and CTF: micrographLocator, Voltage, DefocusU, DefocusV, AngleUX, Cs Micrograph, image, location and CTF: micrographLocator, imageLocator, micrographXcoor, micrographYcoor, Voltage, DefocusU, DefocusV, AngleUX, Cs

Some interchange points CTF estimation of micrographs: Block name: micrographCTFs micrographLocator, Voltage, DefocusU, DefocusV, AngleUX, Cs Particle picking: Block name: particlePicking micrographLocator, micrographXcoor, micrographYcoor Particle extraction: Block name: imageList imageLocator 2D Alignment/Classification: 3D Alignment/Classification: A block per class: class_00001, class_00002, … Class blocks: imageLocator [2D or 3D alignment] [A block of class representatives] Blockname: class_representatives imageLocator [Class representative assignment] Blockname: class_representative_assignment UID1, UID2

How to check? Automatic check of results Syntactically and Semantically

Discussion Is this the correct strategy? (interchange points + interchange format) Which interchange points? Which interchange format? How to reach a standard?

Interchange objects CTF determination Micrograph screening Particle picking Particle extraction and screening 2D alignment and classification Initial volume construction 3D Model refinement Micrograph phase correction 3D Model amplitude correction

What do we want to interchange? What is the minimum amount of information needed? How are we going to store it?