
1 Computational Methods in Physics PHYS 3437 Dr Rob Thacker Dept of Astronomy & Physics (MM-301C) thacker@ap.smu.ca

2 Today's Lecture
Interpolation & Approximation III
LU decompositions for matrix algebra
Least squares polynomial fitting
Fitting to data
Fitting to functions – the Hilbert Matrix
Use of orthogonal polynomials to simplify the fitting process

3 The standard approach
For a linear system of n equations with n unknowns, Gaussian elimination is the standard technique.
As we all know, G.E. is an effective but time-consuming algorithm – applying it to an n×n matrix takes ~n³ operations (O(n³)).
While we can't change the order of the calculation, we can reduce the overall number of calculations.

4 The LU Decomposition method
A non-singular matrix can be written as the product of a lower triangular and an upper triangular matrix, A = LU, with L and U of the forms shown below.
The simple nature of L & U means that it is actually fairly straightforward to relate their entries to those of A.
We can then find the l_ij and u_ij fairly easily.
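The matrices themselves are not reproduced in this transcript; in the Crout-style convention implied by the next two slides (diagonal entries carried by L, unit diagonal on U), they would presumably be

```latex
L = \begin{pmatrix}
l_{11} & 0      & \cdots & 0      \\
l_{21} & l_{22} & \cdots & 0      \\
\vdots & \vdots & \ddots & \vdots \\
l_{n1} & l_{n2} & \cdots & l_{nn}
\end{pmatrix},
\qquad
U = \begin{pmatrix}
1      & u_{12} & \cdots & u_{1n} \\
0      & 1      & \cdots & u_{2n} \\
\vdots & \vdots & \ddots & \vdots \\
0      & 0      & \cdots & 1
\end{pmatrix}
```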

5 By examination
If A = LU, multiply out the product to express each entry of A in terms of the l_ij and u_ij.
Equate the first column: now we know all l_i1, i = 1,…,n.
Equate the first row: use l_11 to get all u_1j, j = 2,…,n.
Equate the second column: use the previous steps to get l_i2, i = 2,…,n.

6 Alternate from columns to rows
Thus expanding each column of A allows us to equate a column of values in L.
Similarly, expanding a row of A allows us to equate a row of values in U.
However, most entries of A are combinations of L and U values, so we must alternate between columns and rows to ensure each expression contains only one unknown.

7 General Formulae (Exercise)
Provided we consider column values on or below the diagonal, we obtain a formula for the k-th column of L.
Similarly, for row values to the right of the main diagonal, we obtain a formula for the k-th row of U.
Convince yourself of these two relationships…
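The formulas are not reproduced in the transcript; in the Crout convention above they are presumably l_ik = a_ik − Σ_{s<k} l_is u_sk for i = k,…,n, and u_kj = (a_kj − Σ_{s<k} l_ks u_sj) / l_kk for j = k+1,…,n. A minimal Python sketch of the resulting decomposition (no pivoting, hypothetical test matrix):

```python
import numpy as np

def crout_lu(a):
    """Crout-style LU decomposition, A = L U, with a unit diagonal on U.

    A sketch of the alternating column/row scheme described on the slides;
    no pivoting is performed, so the leading minors of A must be non-singular.
    """
    a = np.asarray(a, dtype=float)
    n = a.shape[0]
    lower = np.zeros((n, n))
    upper = np.eye(n)
    for k in range(n):
        # k-th column of L (entries on or below the diagonal)
        for i in range(k, n):
            lower[i, k] = a[i, k] - lower[i, :k] @ upper[:k, k]
        # k-th row of U (entries to the right of the diagonal)
        for j in range(k + 1, n):
            upper[k, j] = (a[k, j] - lower[k, :k] @ upper[:k, j]) / lower[k, k]
    return lower, upper

# quick check on a small (made-up) test matrix
A = np.array([[4.0, 3.0, 2.0],
              [2.0, 4.0, 1.0],
              [1.0, 2.0, 3.0]])
L, U = crout_lu(A)
print(np.allclose(L @ U, A))   # expect True
```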

8 Utilizing the LU decomposition
Writing Ax = b as L(Ux) = b and setting z = Ux, we need to solve eqn (5), the lower triangular system Lz = b, for z first.
Once z is known we can then solve eqn (4), the upper triangular system Ux = z, for x.
Sounds like a lot of work! Why is this useful?
The triangular matrices are effectively already in eliminated form, so both systems can be solved by substitution alone.

9 Consider solving Lz = b
If we write out the equations, the first row gives z_1 directly, and each subsequent row involves only one new unknown.
This system is solved via "forward substitution" – exactly analogous to the back substitution we saw for tridiagonal solvers.
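A short Python sketch of forward substitution (assuming L is lower triangular with non-zero diagonal entries):

```python
import numpy as np

def forward_substitution(lower, b):
    """Solve the lower triangular system L z = b, working from the top row down."""
    n = len(b)
    z = np.zeros(n)
    for i in range(n):
        z[i] = (b[i] - lower[i, :i] @ z[:i]) / lower[i, i]
    return z
```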

10 Next solve Ux = z
Again, if we write out the equations, the last row gives x_n directly, and each row above it involves only one new unknown.
We solve by back substitution again, working upwards from the last row.
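The matching back-substitution step, and how the two combine to solve Ax = b once A = LU is known (a sketch; forward_substitution is the helper from the previous slide's example):

```python
import numpy as np

def back_substitution(upper, z):
    """Solve the upper triangular system U x = z, working from the bottom row up."""
    n = len(z)
    x = np.zeros(n)
    for i in range(n - 1, -1, -1):
        x[i] = (z[i] - upper[i, i + 1:] @ x[i + 1:]) / upper[i, i]
    return x

# solving A x = b with a precomputed factorization A = L U:
#   z = forward_substitution(L, b)
#   x = back_substitution(U, z)
```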

11 Least squares fit of a polynomial
We've already seen how to fit an n-th degree polynomial to n+1 points (x_i, y_i), i = 0,…,n – the Lagrange interpolation polynomial.
For noisy data we would not want a fit to pass through all the points.
Instead we want to find a best-fit polynomial of order m.
We then minimize the sum of the squares of the residuals.
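The sum-of-squares expression itself is not reproduced in the transcript; for a fit p(x) = c_0 + c_1 x + … + c_m x^m it is presumably

```latex
S(c_0,\dots,c_m) = \sum_{i=0}^{n} \left( c_0 + c_1 x_i + \cdots + c_m x_i^{m} - y_i \right)^2
```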

12 Minimize the "error"
We next minimize S by taking partial derivatives with respect to the coefficients c_j and setting them to zero – e.g. for c_0:
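The resulting condition (not reproduced in the transcript) is presumably

```latex
\frac{\partial S}{\partial c_0}
  = 2 \sum_{i=0}^{n} \left( c_0 + c_1 x_i + \cdots + c_m x_i^{m} - y_i \right) = 0
```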

13 More examples
For c_1 we find a similar condition, and so on…
Taking partials with respect to the m+1 coefficients and equating to zero gives m+1 equations in m+1 unknowns (the c_j, j = 0,…,m).

14 Matrix form
These are the so-called "normal equations", in analogy with normals to the level set of a function.
They can be solved using the LU decomposition method we just discussed to find the coefficients c_j.
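A sketch of forming and solving the normal equations in Python (hypothetical noisy data; np.linalg.solve stands in for the LU solve described above):

```python
import numpy as np

# made-up data drawn from a quadratic plus noise
rng = np.random.default_rng(0)
x = np.linspace(0.0, 1.0, 20)
y = 1.0 + 2.0 * x - 3.0 * x**2 + 0.05 * rng.standard_normal(x.size)

m = 2
V = np.vander(x, m + 1, increasing=True)    # columns 1, x, x^2
c = np.linalg.solve(V.T @ V, V.T @ y)       # normal equations (V^T V) c = V^T y
print(c)                                    # should be close to [1, 2, -3]
```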

15 How do we choose m?
Higher order fits are not always better.
Many dependent variables are inherently following some underlying "law."
If the underlying relationship were quadratic, fitting a cubic would be largely pointless.
However, we don't always know the underlying relationship.
So we need to look at the data…

16 Strategy
Plot the data.
If linear looks OK then start with m=1.
If a curve is visible try m=2 – if a higher order seems needed, try a log-log graph to estimate m.
[Example plots: one data set where a linear fit looks reasonable (start with m=1), one with a strong suggestion of an underlying curve (start with m=2).]

17 Strategy cont.
After each fit evaluate S_m and plot the residuals r_i = p(x_i) − y_i.
If S_m ~ S_(m−1) then this indicates that the m-th (or (m−1)-th) order fit is about as good as you can do.
Plotting the residuals will also give intuition about the quality of the fit.
Residuals should scatter around zero with no implicit "bias".
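A small sketch of this loop in Python (data arrays x, y are assumed to exist; the plotting call is indicative only):

```python
import numpy as np
import matplotlib.pyplot as plt
from numpy.polynomial import polynomial as P

def fit_and_residuals(x, y, m):
    """Least squares fit of order m; returns coefficients, residuals and S_m."""
    c = P.polyfit(x, y, m)
    r = P.polyval(x, c) - y
    return c, r, np.sum(r**2)

# compare successive orders, then inspect the residuals of the chosen fit:
#   for m in (1, 2, 3):
#       _, r, S = fit_and_residuals(x, y, m)
#       print(m, S)
#   plt.plot(x, r, "o"); plt.axhline(0.0); plt.show()
```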

18 Residuals
[Two example residual plots: in one, the residuals are scattered about zero with a large number of sign changes – an example of a pretty good fit; in the other, the residuals clearly have a positive bias with fewer sign changes – this plot suggests you need an (m+1)-order fit.]

19 Extending the approach to fitting functions
We can use the method developed above to fit a continuous function f(x) on an interval [a,b].
Rather than a sum of squares we now have an integral that we need to minimize.
As an example, let f(x) = sin(πx) and the limits be [0,1]; find the best quadratic (c_0 + c_1 x + c_2 x²) fit.
We need to minimize the integral given below.
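Written out (the slide's equation is not reproduced in the transcript), the quantity to minimize is presumably

```latex
S(c_0,c_1,c_2) = \int_0^1 \left( c_0 + c_1 x + c_2 x^2 - \sin(\pi x) \right)^2 dx
```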

20 Compute derivatives
We work in the same way as for the earlier fit, taking partial derivatives of S with respect to c_0, c_1 and c_2 and setting them to zero.
All the integrals are tractable, and in matrix form we get a 3×3 linear system for the coefficients.
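A sketch of setting up and solving that system numerically (the coefficient matrix is the 3×3 Hilbert matrix discussed two slides below; quad is used for the right-hand-side integrals rather than their analytic values):

```python
import numpy as np
from scipy.integrate import quad

# normal equations for fitting c0 + c1*x + c2*x**2 to sin(pi*x) on [0, 1]:
# entry (j, k) of the matrix is the integral of x**(j+k) over [0, 1]
H = np.array([[1.0, 1/2, 1/3],
              [1/2, 1/3, 1/4],
              [1/3, 1/4, 1/5]])
b = np.array([quad(lambda x, k=k: x**k * np.sin(np.pi * x), 0.0, 1.0)[0]
              for k in range(3)])
c = np.linalg.solve(H, b)
print(c)   # coefficients c0, c1, c2 of the least squares quadratic
```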

21 Comparing to Taylor expansion
If we solve the system on the previous slide we obtain a best-fit quadratic.
We can compare this to the Taylor expansion to second order about x = 1/2.
Notice that the least squares fit and the Taylor expansion are different.
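For reference, the second-order Taylor expansion about x = 1/2 is presumably

```latex
\sin(\pi x) \approx 1 - \frac{\pi^2}{2}\left(x - \tfrac{1}{2}\right)^2
```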

22 The least squares fit is more accurate globally. The Taylor expansion is more accurate around the expansion point but becomes progressively worse further away.

23 Hilbert Matrices
The specific form of the coefficient matrix is called a "Hilbert Matrix" and arises because of the polynomial fit.
Key point: f(x) only affects the right hand side.
For a polynomial fit of degree m the Hilbert Matrix has dimension m+1.
As m increases the determinant tends to zero, which creates significant numerical difficulties.
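A quick illustration of the shrinking determinant (and growing condition number) using SciPy's built-in Hilbert matrix:

```python
import numpy as np
from scipy.linalg import hilbert

# how quickly the Hilbert matrix becomes ill conditioned as the fit order m grows
for m in (2, 4, 8):
    H = hilbert(m + 1)
    print(m + 1, np.linalg.det(H), np.linalg.cond(H))
```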

24 Hilbert Matrices: problems & solutions
The Hilbert Matrix is a classic example of an ill conditioned matrix.
By m=4 (i.e. a 5×5 Hilbert matrix) the small determinant causes difficulty in single precision.
By m=8 (i.e. a 9×9 Hilbert matrix) there are problems even at double precision (real*8).
We can improve upon the fitting method by choosing to fit a sum of polynomials, p(x) = Σ α_i p_i(x), where the p_i are i-th degree polynomials.

25 Fitting over basis polynomials
We'll postpone the nature of the p_i(x) for a moment…
We now need to find the α_i by minimizing the least squares function S.

26 Choosing the p_i(x)
In general, choosing the polynomials appropriately will ensure that the coefficient matrix is not a Hilbert Matrix.
We can again use an LU solve to find the α_i and hence determine the fitting polynomial.
However, if we had a system of polynomials for which ∫ p_i(x) p_j(x) dx = 0 whenever i ≠ j, the system would be trivial to solve!
Such polynomials are said to be orthogonal.
Note that the integral is directly analogous to a dot product of rank-N vectors (the integral has "infinite rank").

27 Simplifying the normal equations
If we have orthogonal polynomials, then from (1) the off-diagonal terms in the normal equations vanish and each α_i can be found independently.

28 Further simplification
In the case that the polynomials are orthonormal, so that ∫ p_i(x) p_j(x) dx = δ_ij, eqn (3) reduces to α_i = ∫ f(x) p_i(x) dx.
There is no need to solve a matrix in this case!
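A sketch of computing these α_i numerically for a hypothetical example function, using orthonormalized Legendre polynomials on [-1, 1]:

```python
import numpy as np
from scipy.integrate import quad
from numpy.polynomial import legendre

f = lambda x: np.sin(np.pi * x)           # example function to expand

def alpha(i):
    """Coefficient of the i-th orthonormalized Legendre polynomial on [-1, 1]."""
    p_i = legendre.Legendre.basis(i)       # standard P_i, whose squared integral is 2/(2i+1)
    norm = np.sqrt((2 * i + 1) / 2.0)      # rescale so the polynomial is orthonormal
    return quad(lambda x: f(x) * norm * p_i(x), -1.0, 1.0)[0]

coeffs = [alpha(i) for i in range(4)]
print(coeffs)   # no matrix solve needed – each alpha_i is an independent integral
```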

29 Orthogonality definition: weighting functions
In practice it is often useful to include a weighting function, w(x), in the definition of the orthogonality requirement.
Orthonormality is then ∫ w(x) p_i(x) p_j(x) dx = δ_ij.
We can include the weighting function in the definition of the least squares formula for S.

30 Examples of Orthogonal (Orthonormal) functions
  [a,b]      w(x)              symbol    Name
  [-1,1]     1                 P_n(x)    Legendre polynomials
  [-1,1]     (1-x²)^(-1/2)     T_n(x)    Chebyshev I
  [-1,1]     (1-x²)^(1/2)      U_n(x)    Chebyshev II
  [0,∞)      exp(-x)           L_n(x)    Laguerre
  (-∞,∞)     exp(-x²)          H_n(x)    Hermite
See Arfken.

31 Summary
LU decomposition is a simple approach to solving matrix equations that uses both forward and backward substitution.
Least squares fitting requires setting up the normal equations to minimize the sum of the squares of the residuals.
Fitting functions with a simple polynomial results in ill conditioned Hilbert matrices.
More general fitting using a sum over orthogonal/orthonormal polynomials can reduce the fitting problem to a series of integrals.

32 Next Lecture
Introduction to numerical integration

