Engineering Probability and Statistics - SE-205 -Chap 1 By S. O. Duffuaa
Course Objectives w Introduce the students to basic probability and statistics and demonstrate its wide application in the area of Systems Engineering.
Main Course Outcomes w Students should be able to perform: Summarize and present data Describe probability distributions Compute probabilities using density/mass functions Conduct interval estimation Make inference about populations Use statistical package/Minitab
Text Book and References w “Applied Statistics and Probability for Engineers “ by D. C. Montgomery and Runger, w “Probability and Statistics for Engineers and Scientists” 5 th by Walpole and Mayers. w Statistics by Murry Speigel
Course Policy w Home-works and attendance 15% w Quizzes 15% w Exam1 20% w Exam II 20% w Final Exam 30%
SE- 205 Place in SE Curriculum w Central Course w Prerequisite for 7 SE courses SE 303, SE 320, SE 323, SE 325, SE 447, SE 480, SE 463 and may be others. See SE Curriculum Tree
Engineering Problem Solving w Develop clear and concise problem description w Identify the important factors in the problem. w Propose a model for the problem w Conduct appropriate experimentation w Refine the model
Engineering Problem Solving w Validate the solution w Conclusion and recommendations
Statistics Science of data collection, summarization, presentation and analysis for better decision making. How to collect data ? How to summarize it ? How to present it ? How do you analyze it and make conclusions and correct decisions ?
Role of Statistics w Many aspects of Engineering deals with data – Product and process design w Identify sources of variability w Essential for decision making
Data Collection w Observational study Observe the system Historical data w The objective is to build a system model usually called empirical models w Design of experiment Plays key role in engineering design
Data Collection w Sources of data collection: Observation of something you can’t control (observe a system or historical data) Conduct an experiment Surveying opinions of people in social problem
Statistics w Divided into : Descriptive Statistics Inferential Statistics
Forms of Data Description w Point summary w Tabular format w Graphical format w Diagrams
Point Summary 1) Central tendency measures Sample Mean x = x i /n Population Mean(µ) Median --- Middle value Mode --- Most frequent value Percentile
Point Summary 2) Variability measures Range = Max x i - Min x i Variance = V = S 2 = (x i – x ) 2 / n-1 also = Standard deviation = S S = Square root (V) Coefficient of variation = S/ x Inter-quartile range (IQR) (x i 2 ) – {[( x i ) 2 ]/n} n -1
Diagrams: Dot Diagram w A diagram that has on the x-axis the points plotted : Given the following grades of a class: 50, 23, 40, 90, 95, 10, 80, 50, 75, 55, 60,
Dot Diagram w A diagram that has on the x-axis the points plotted : Given the following grades of a class: 50, 23, 40, 90, 95, 10, 80, 50, 75, 55, 60,
Graphical Format Time Frequency Plot The Time Frequency Plot tells the following : 1) The Center of Data 2) The Variability 3) The Trends or Shifts in the data Control Chart
Time Frequency Plot
Lower control limit = x = Upper control limit = Control Charts
Central Line = Average ( X ) Lower Control Limit (LCL)= X – 3S Upper Control Limit (UCL)= X + 3S Control Charts
Lecture Objectives w Sample and population w Random sample w Present the following: w Stem-leaf diagram w The frequency distribution w Histogram
Population and Sample w Population is the totality of observations we are concerned with. w Example: All Engineers in the Kingdom, All SE students etc. w Sample : Subset of the population 50 Engineers selected at random, 10 SE students selected at random.
Mean and Variance w Sample mean X-bar w Population mean µ w Sample variance S 2 w Population variance σ 2
Stem-And –Leaf Diagram w Each number x i is divided into two parts the stem consisting of one or two leading digits w The rest of the digits constitute the leaf. w Example if the data is 126 then 12 is stem and 6 is the leaf. What is the stem and leaf for 76
Data Table 1.1 Compressive Strength of 80 Aluminum Lithium Alloy
Stem-And-Leaf f Stem leaf frequency
Number of Stems Considerations StemLeaf
Stem number considerations Stem leaf 6L U L U L U L2 3 9U5
Number of Stems w Between 20 and 5 w Roughly n where n number of data points
Percentiles w Pth percentile of the data is a value where at least P% of the data takes on this value or less and at least (1-P)% of the data takes on this value or more. w Median is 50 th percentile. ( Q 2 ) w First quartile Q 1 is the 25 th percentile. w Third quartile Q 3 is the 75 th percentile.
Percentile Computation : Example Data : 5, 7, 25, 10, 22, 13, 15, 27, 45, 18, 3, 30 Compute 90 th percentile. 1. Sort the data from smallest to largest 3, 5, 7, 10, 13, 15, 18, 22, 25, 27, 30, Multiply 90/100 x 12 = 10.8 round it to to the next integer which is 11. Therefore the 90 th percentile is point # 11 which is 30.
Percentile Computation : Example w If the product of the percent with the number of the data came out to be a number. Then the percentile is the average of the data point corresponding to this number and the data point corresponding to the next number. w Quartiles computation is similar to the percentiles.
w Pth percentile = (P/ 100)*n = r double (round it up & take its rank) (r) integer (take Avg. of its rank & # after) w Inter-quartile range = Q 3 – Q 1 w Frequency Distribution Table : 1) # class intervals (k) = 5 < k < 20 k ~ n 2) The width of the intervals (W) = Range/k = (Max-Min) / n
Class Interval (psi) Tally (# data in this interval ) Frequency Relative Frequency = (Frequency/ n) Cumulative Relative Frequency 70 ≤ x < 90|| ≤ x < 110||| ≤ x < 130|||| | ≤ x < 150|||| |||| |||| ≤ x < 170|||| |||| |||| |||| || ≤ x <1 90|||| |||| |||| || ≤ x < 210|||| ≤ x < 230|||| ≤ x < 250||
Compressive Strength (psi) Frequency
Histogram: is the graph of the frequency distribution table that shows class intervals V.S. freq. or (Cumulative) Relative freq
1.5 IQR IQR1.5 IQR Outliers Extreme OutliersOutliers First QuartileSecond Quartile Third Quartile Whisker extends to largest data point within 1.5 interquartile ranges from third quartile Whisker extends to smallest data point within 1.5 interquartile ranges from first quartile Extreme Outliers
Strength
Plant Quantity Index