A Comparative Analysis of Software Refinement Techniques Ion IVAN Adrian VISOIU.

Slides:



Advertisements
Similar presentations
Linkage and Genetic Mapping
Advertisements

Better Software Defect Prediction Using Equalized Learning With Machine Learners Kim Kaminsky Gary D. Boetticher Department of Computer Science University.
Correlation and Linear Regression.
Regression Analysis Once a linear relationship is defined, the independent variable can be used to forecast the dependent variable. Y ^ = bo + bX bo is.
edgard.gnansounou The bio-refinery Concept Contacts Process Synthesis and Life Cycle Edgard Gnansounou +41 (0)21 / epfl.ch Laboratory.
Chapter 13 Multiple Regression
Non-Linear Problems General approach. Non-linear Optimization Many objective functions, tend to be non-linear. Design problems for which the objective.
ISSPIT Ajman University of Science & Technology, UAE
Improving code generation. Better code generation requires greater context Over expressions: optimal ordering of subtrees Over basic blocks: Common subexpression.
Chapter 12 Multiple Regression
Software Metrics II Speaker: Jerry Gao Ph.D. San Jose State University URL: Sept., 2001.
Genetic Algorithms Can Be Used To Obtain Good Linear Congruential Generators Presented by Ben Sproat.
Improving Code Generation Honors Compilers April 16 th 2002.
Simple Linear Regression Analysis
1 Cost Estimation CIS 375 Bruce R. Maxim UM-Dearborn.
CISC673 – Optimizing Compilers1/34 Presented by: Sameer Kulkarni Dept of Computer & Information Sciences University of Delaware Phase Ordering.
Chapter 11 Simple Regression
Statistics for Business and Economics 8 th Edition Chapter 11 Simple Regression Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall Ch.
Dr M F Abbod Using Intelligent Optimisation Methods to Improve the Group Method of Data Handling in Time Series Prediction Maysam Abbod and Karishma Dashpande.
Chapter 14 – Correlation and Simple Regression Math 22 Introductory Statistics.
Estimate of Swimming Energy Expenditure Utilizing an Omnidirectional Accelerometer and Swim Performance Measures Jeanne D. Johnston and Joel M. Stager,
Improved Gene Expression Programming to Solve the Inverse Problem for Ordinary Differential Equations Kangshun Li Professor, Ph.D Professor, Ph.D College.
16-1 Linear Trend The long term trend of many business series often approximates a straight line.
Product Metrics An overview. What are metrics? “ A quantitative measure of the degree to which a system, component, or process possesses a given attribute.”
Copyright © 2015 McGraw-Hill Education. All rights reserved. No reproduction or distribution without the prior written consent of McGraw-Hill Education.
Unit 3 We are learning to use a linear model to examine part-whole relationships and their connection to percents. We are developing strategies to find.
1 ECE 453 – CS 447 – SE 465 Software Testing & Quality Assurance Lecture 23 Instructor Paulo Alencar.
Statistical Methods Statistical Methods Descriptive Inferential
Neural and Evolutionary Computing - Lecture 9 1 Evolutionary Neural Networks Design  Motivation  Evolutionary training  Evolutionary design of the architecture.
Bashkir State Univerity The Chair of Mathematical Modeling , Ufa, Zaki Validi str. 32 Phone: ,
Simple Linear Regression. The term linear regression implies that  Y|x is linearly related to x by the population regression equation  Y|x =  +  x.
Disciplined Software Engineering Lecture #3 Software Engineering Institute Carnegie Mellon University Pittsburgh, PA Sponsored by the U.S. Department.
1 Program Testing (Lecture 14) Prof. R. Mall Dept. of CSE, IIT, Kharagpur.
© Copyright McGraw-Hill Correlation and Regression CHAPTER 10.
Chapter 13 Multiple Regression
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 13-1 Introduction to Regression Analysis Regression analysis is used.
ECON 338/ENVR 305 CLICKER QUESTIONS Statistics – Question Set #8 (from Chapter 10)
Operations Research The OR Process. What is OR? It is a Process It assists Decision Makers It has a set of Tools It is applicable in many Situations.
Economics 173 Business Statistics Lecture 10 Fall, 2001 Professor J. Petry
Alice E. Smith and Mehmet Gulsen Department of Industrial Engineering
Random Interpretation Sumit Gulwani UC-Berkeley. 1 Program Analysis Applications in all aspects of software development, e.g. Program correctness Compiler.
AS Research Methods - REVISION. Methods and Techniques Pilot Studies – used why? Experimental Method –THREE types of experiment? –S&W of each? Correlational.
1 Simple Linear Regression and Correlation Least Squares Method The Model Estimating the Coefficients EXAMPLE 1: USED CAR SALES.
Genetic Algorithms based Parameter Identification of Yeast Fed-Batch Cultivation Angelova M., Tzonkov St., Pencheva T.
Bivariate Correlation and Regression. For Quantitative variables measured on the Interval Level Which one is the First? –Correlation or Regression.
CS223: Software Engineering Lecture 21: Unit Testing Metric.
Chapter 14 Introduction to Regression Analysis. Objectives Regression Analysis Uses of Regression Analysis Method of Least Squares Difference between.
Tree and Forest Classification and Regression Tree Bagging of trees Boosting trees Random Forest.
EVOLUTIONARY SYSTEMS AND GENETIC ALGORITHMS NAME: AKSHITKUMAR PATEL STUDENT ID: GRAD POSITION PAPER.
Evolutionary Design of the Closed Loop Control on the Basis of NN-ANARX Model Using Genetic Algoritm.
Implementation & Integration Research Paper Presentation Nick Conway.
Determining How Costs Behave
Multiple Regression.
Software Testing.
REGRESSION (R2).
Regression Analysis Module 3.
ENME 392 Regression Theory
What is Correlation Analysis?
Function Point Analysis
How to Predict More with Less: Defect Prediction Using Machine Learners in an Implicitly Data Starved Domain Kim Kaminsky Gary D. Boetticher Department.
Simple Linear Regression
Correlation and Regression
Genetic Algorithm Optimization for Selecting the Best Architecture of a Multi-Layer Perceptron Neural Network. A Credit Scoring Case Alejandro Correa,
Linear Regression.
Multiple Regression.
DRQ 8 Dr. Capps AGEC points
Presented by Trey Brumley and Ryan Carter
Chapter 19 Technical Metrics for Software
Cases. Simple Regression Linear Multiple Regression.
Regression and Correlation of Data
Presentation transcript:

A Comparative Analysis of Software Refinement Techniques Ion IVAN Adrian VISOIU

Contents Model Structures Model Generators Model Complexity Refinement through Variable Elimination Refinement through Complexity Decrease Refinement through Genetic Algorithms Experimental Results Refined Model Validation Conclusions

Model structures Obtaining a model phenomenon/process analysis establishing influence factors connecting factors to variables measuring variables using an experiment plan generating analytical forms establishing model performance criterion making estimations computing performance criterion for each model selecting a model class model orderingrefined model validation Criterion fulfilled

Model Generators Inputs –list of independent variables –the dependent variable –dataset –restrictions –performance criterion Outputs –list of generated models ordered by the performance criterion Classes: linear, nonlinear, lagged variables

Model complexity For the model M C(M)=n 1 log n 1 + n 2 log n 2 where n 1 - number of operands (variables and coefficients); n 2 - number of operators

Variable Elimination Independent X 1, X 2,.., X n Dependent Y Initial model y = one by one variable elimination y = a i *x i, i=1, 2,.., n

COMPLEXITY DECREASE power terms -> variables function calls -> variables function composition -> direct aggregation y=ax 2 +bz 2 +cu 2 +e is replaced by y = Ax+BZ+CU+E

Genetic algorithms Symbolic regression Population Chromosome – codes an expression Genes – subexpressions – operands, operators, constants Evolution: selection, mutation, transposition, recombination

Experimental results Dataset ERR –number of defects a module LOC –source code lines RM – branch count CC – cyclomatic complexity

Experimental results Variable elimination ERR_EST= a*LOC + b*RM + c*CC + d R 2 adjusted =0,7174 ERR_EST = a*LOC +b*CC+c R 2 adjusted =0,725 Complexity decrease ERR_EST = a*LOC b + c*CC d + e R2 = ERR_EST = a*LOC +b*CC+c R 2 adjusted =0,725 Validation For a new dataset correlation between the recorded values and the estimated ones is 0,8204

Conclusions cyclic character of refinement process refinement modules,model generators -> modelbase refined models: simple, easy to interpret

Contact