Interactive Series Baseline Correction Algorithm

Slides:



Advertisements
Similar presentations
Instrumental Analysis
Advertisements

The world leader in serving science TQ ANALYST SOFTWARE Putting your applications on target.
Application of NIR for counterfeit drug detection Another proof that chemometrics is usable: NIR confirmed by HPLC-DAD-MS and CE-UV Institute of Chemical.
Regression analysis Relating two data matrices/tables to each other Purpose: prediction and interpretation Y-data X-data.
LIGO- G Z Optimally Combining the Hanford Interferometer Strain Channels Albert Lazzarini LIGO Laboratory Caltech S. Bose, P. Fritschel, M. McHugh,
Quantification – Concepts and Integration
Microspectrophotometry Validation. Reasons for Changing Instruments Reduced reliability. Limited efficiency. Limited availability and cost of replacement.
Budapest May 27, 2008 Unifying mixed linear models and the MASH algorithm for breakpoint detection and correction Anders Grimvall, Sackmone Sirisack, Agne.
S-SENCE Signal processing for chemical sensors Martin Holmberg S-SENCE Applied Physics, Department of Physics and Measurement Technology (IFM) Linköping.
Sergey Kucheryavski Raman spectroscopy Acquisition, preprocessing and analysis of spectra.
World Health Organization
ANALYTICAL CHEMISTRY ERT 207
QUALITY CONTROL OF COMPOSITION OF BLACK POLYMERES.
Two cases of chemometrics application in protein crystallography European Molecular Biology Laboratory (EMBL), Hamburg, Germany Andrey Bogomolov.
Proposal for a Standard Representation of the Results of GC-MS Analysis: A Module for ArMet Helen Fuell 1, Manfred Beckmann 2, John Draper 2, Oliver Fiehn.
Basic Questions Regarding All Analytical & Instrumental Methods (p 17-18) What accuracy and precision are required? How much sample do I have available,
CALIBRATION Prof.Dr.Cevdet Demir
© Tefko Saracevic, Rutgers University 1 EVALUATION in searching IR systems Digital libraries Reference sources Web sources.
Initial estimates for MCR-ALS method: EFA and SIMPLISMA
Elaine Martin Centre for Process Analytics and Control Technology University of Newcastle, England The Conjunction of Process and.
PATTERN RECOGNITION : PRINCIPAL COMPONENTS ANALYSIS Prof.Dr.Cevdet Demir
1 CPAC: May 11, 2006 NeSSIfying GC a proposal to the Washington Technology Center Brian Rohrback, Infometrix, Inc. CPAC Team David Veltkamp, Brian Marquardt,
Atomic Absorption Spectroscopy
Yuri Kalambet, Ampersand Ltd., Moscow, Russia
CALIBRATION METHODS.
PAT Validation Working Group Process and Analytical Validation Working Group Arthur H. Kibbe, Ph.D. Chair June 13, 2002.
Microarray Gene Expression Data Analysis A.Venkatesh CBBL Functional Genomics Chapter: 07.
Function BIRN: Quality Assurance Practices Introduction: Conclusion: Function BIRN In developing a common fMRI protocol for a multi-center study of schizophrenia,
1 A Local and Remote Radio Frequency Identification Learning Environment Andrew Shields & David Butcher Wireless and Mobility Research Group, Institute.
To determine the rate constants for the second order consecutive reactions, a number of chemometrics and hard kinetic based methods are described. The.
Spectroscopy Chromatography PhysChem Naming Drawing and Databasing Enterprise Solutions Software for Interactive Curve Resolution using SIMPLISMA Andrey.
Chemometric functions in Excel
Spectroscopy Chromatography PhysChem Naming Drawing and Databasing Enterprise Solutions STRUCTURALLY AWARE APPROACH TO THE INTERPRETATION OF VIBRATIONAL.
ArrayCluster: an analytic tool for clustering, data visualization and module finder on gene expression profiles 組員:李祥豪 謝紹陽 江建霖.
Threeway analysis Batch organic synthesis. Paul Geladi Head of Research NIRCE Chairperson NIR Nord Unit of Biomass Technology and Chemistry Swedish University.
EFFICIENT ROAD MAPPING VIA INTERACTIVE IMAGE SEGMENTATION Presenter: Alexander Velizhev CMRT’09 ISPRS Workshop O. Barinova, R. Shapovalov, S. Sudakov,
Understanding The Semantics of Media Chapter 8 Camilo A. Celis.
Digital Media Lab 1 Data Mining Applied To Fault Detection Shinho Jeong Jaewon Shim Hyunsoo Lee {cinooco, poohut,
Metabolomics Metabolome Reflects the State of the Cell, Organ or Organism Change in the metabolome is a direct consequence of protein activity changes.
Quality Assurance How do you know your results are correct? How confident are you?
CALIBRATION METHODS. For many analytical techniques, we need to evaluate the response of the unknown sample against the responses of a set of standards.
Lecture 07: Dealing with Big Data
MOS Data Reduction Michael Balogh University of Durham.
PATTERN RECOGNITION : PRINCIPAL COMPONENTS ANALYSIS Richard Brereton
QUANTITATIVE ANALYSIS OF POLYMORPHIC MIXTURES USING INFRARED SPECTROSCOPY IR Spectroscopy Calibration –Homogeneous Solid-State Mixtures –Multivariate Calibration.
Two-Dimensional Infrared Correlation in Time-resolved Spectroscopy Sadeq M. Al - Alawi Department of Chemistry University of Bahrain October 24, 2002.
Raman Spectra Processing Prepared by: Ian Adam, Nathaniel Maor, Jonathan Phung Drexel University, School of Biomedical Engineering BMES 546: Biocomputational.
Lecture 8 Peak Parameters and Quantitative chromatography
Date of download: 6/22/2016 Copyright © 2016 SPIE. All rights reserved. Glucose sensor architecture. The lamp provides broadband electromagnetic radiation.
Date of download: 6/25/2016 Copyright © 2016 SPIE. All rights reserved. Lensfree imaging module. (a) Schematic illustrating the principle of lensfree image.
VIDYA PRATISHTHAN’S COLLEGE OF ENGINEERING, BARAMATI.
Profiling: What is it? Notes and reflections on profiling and how it could be used in process mining.
Research Design and Methods (METHODOLOGY)
Course survey: what has been done, and what should be done
Applications of UV-Vis Spectroscopy
Fig. 1. proFIA approach for peak detection and quantification
Fourier Transform Infrared Spectroscopy
Digital image self-adaptive acquisition in medical x-ray imaging
Rotational Ambiguity in Hard-Soft Modeling Method
Data Reduction and Analysis Techniques
Research Design and Methods (METHODOLOGY)
Strategies for Eliminating Interferences in Optical Emission Spectroscopy Best practices to optimize your method and correct for interferences to produce.
Interval selection complexity
Data Reduction and Analysis Techniques
Chapter 1: The Nature of Analytical Chemistry
X.2 Linear Discriminant Analysis: 2-Class
Analytics – Statistical Approaches
Gas chromatography Software
TQ Analyst Gamal A. Hamid.
On-Line Prediction and Classification using The Unscrambler On-Line Predictor (OLUP) and Classifier (OLUC)
Presentation transcript:

Interactive Series Baseline Correction Algorithm Andrey Bogomolova, Willem Windigb, Susan M. Geerc, Debra B. Blondellc, and Mark J. Robbinsc a ACD/Labs, Russian Chemometrics Society, Moscow, Russia b Eigenvector Research Inc., Rochester, NY, USA c Eastman Kodak Company, Rochester, NY, USA

Baseline (Background) Problem Baseline is an “eternal” issue in analytical data processing “Baseline” or “background”? no clear distinction baseline is associated with a smooth line reflecting a “physical” interference background tends to be used in a more general sense to designate ANY unwanted signal including noise and chemical components Our preference is given to the term “baseline” because smoothness of the background signal is the main assumption of the proposed correction algorithm

Classical Approach to the Baseline Correction Problem Classical baseline correction algorithms with respect to single curve are almost exhaustively elaborated in the literature A baseline to be subtracted is fitted by a linear (polynomial) function to the nodes that belong to signal-free regions The nodes can be automatically detected by the software or manually placed by the user These methods are advantageous for half-automatic processing where software-generated results need to be revised by a human expert

Serial (Batch) Methods Development of two-dimensional spectroscopy and hyphenated techniques demanded new methods applicable to data matrices Early works in this direction applied automated baseline correction algorithms to every individual curve in a matrix dataset The main problem with this approach is that it neglects internal (inter-spectral) correlations Instead of the expected rank reduction it may introduce additional variance into the dataset It is a “black-box” routine that is difficult to control

Multivariate Background Correction Multivariate data analysis produced a revolutionary impact onto the baseline problem in general The paradigmatic shift from hard- (knowledge-driven) to soft- or self- (data-driven) modeling has opened new horizons and introduced new concepts PLS introduces the means to address the background without its subtraction in the calibration context OSC by S. Wold turns the problem inside out eliminating the variance that is irrelevant for calibration (orthogonal to Y) from the data (X) A number of other excellent algorithms…

Our Objectives The researchers are typically concentrated at the development of fully automated background correction methods Statement: fuzzy character of the baseline problem in general puts in doubt the feasibility of automated (expert-free) baseline correction routines In contrast, we present an alternative approach that tends to maximize the means of control for a human operator simplicity visualization interactive stepwise algorithm

The Method The method is applied to a series of curves (e.g., spectra or chromatograms) The method consists of two distinct steps First, a prototype baseline is constructed from linear segments by selecting a set of nodes To aid in the node selection the mean values are calculated to represent the entire series: Second, the prototype baseline is used to construct individual baselines to be subtracted from the series curves by adjusting the nodes vertically to the corrected curve

HPLC/DAD: Sample Data Raw Corrected Subtracting the baseline Selecting nodes Calculating the mean

2nd Derivative for Node Selection

Baseline Correction for Curve Resolution Baseline correction is an application-specific preprocessing technique The present baseline correction algorithm has been developed to improve the performance of SIMPLISMA (SIMPLe-to-use Interactive Self-modeling Mixture Analysis) curve resolution technique The algorithm has been used at Eastman Kodak Company over 10 years for routine analysis of TGA/IR data that represent a challenging case for curve resolution: a lot of components high degree of overlap intensive background signal

TGA/IR Sample Data Reprinted with permission from Eastman Kodak Company, 2005

Baseline Nature in TGA/IR The most common reasons for TGA/IR baseline drift: Temperature fluctuations over time Instrument drift Material scattering Impurities Inappropriate background, etc. In the present dataset - miscellaneous reasons Spectral domain is more suitable for series baseline correction because of narrow peaks and explicit baseline areas

TGA/IR: Baseline Correction Raw Corrected Subtracting “Snapping” the baseline Raw spectral series Calculating the mean Reprinted with permission from Eastman Kodak Company, 2005

TGA/IR: Corrected Data Map Reprinted with permission from Eastman Kodak Company, 2005

TGA/IR: SIMPLISMA Curve Resolution Reprinted with permission from Eastman Kodak Company, 2005

IR Library Identification Reprinted with permission from Eastman Kodak Company, 2005

Conclusions A new interactive approach to the baseline correction problem has been suggested It allows for adapting traditional automated single-scan baseline correction routines or for performing manual correction on matrix data as if they were a single curve Advantages of the method include “transparency” of the process and the means for extensive operator interaction The method has passed long-term testing in an industrial laboratory and was integrated into a professional software package In spite of the simplicity of the algorithm, it allows for successful elimination of baselines – even in complex cases such as TGA/IR data

Acknowledgements Antony Williams for his friendly support, and Michel Hachey for his help and valuable ideas