Use of imputed tree lists for FVS landscape projections: An overview of some issues and opportunities. Eric L. Smith Forest Health Technology Enterprise.


1 Use of imputed tree lists for FVS landscape projections: An overview of some issues and opportunities. Eric L. Smith Forest Health Technology Enterprise Team U.S. Forest Service Fort Collins, CO

2 Problem: We would like to run FVS simulations for large landscapes, but we only have plot data for some of the stands. One solution: For each uninventoried stand, use imputation techniques to find plot data taken from a similar site, and use that data as if it were taken from the uninventoried stand.

3 Imputation
“Imputation” is a generic term for methods that estimate missing data. There are many ways to do this. For example, in FVS you can provide a tree height but, if you don’t, FVS can impute it: it estimates the height from a model of height as a function of dbh.
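A minimal sketch of this kind of single-value imputation follows. It assumes a height-diameter curve of the form height = 4.5 + exp(b0 + b1 / (dbh + 1)); the curve form, the coefficients, and the function names are illustrative assumptions, not values taken from the presentation or from any FVS variant.

```python
import math

# Hypothetical coefficients for illustration only; real values are
# species- and region-specific and are not given in the presentation.
B0, B1 = 4.8, -9.0

def impute_height(dbh_in, b0=B0, b1=B1):
    """Estimate total height (ft) from dbh (in) with an assumed curve form."""
    return 4.5 + math.exp(b0 + b1 / (dbh_in + 1.0))

def fill_heights(trees):
    """Return tree records with missing heights filled in from dbh."""
    filled = []
    for tree in trees:
        height = tree.get("height")
        if height is None:
            # Height was not measured, so estimate it from dbh.
            height = impute_height(tree["dbh"])
        filled.append({**tree, "height": height})
    return filled

trees = [{"dbh": 10.0, "height": 62.0}, {"dbh": 14.0, "height": None}]
result = fill_heights(trees)
```

Measured heights pass through unchanged; only the missing value is replaced by the model estimate.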

4 Nearest Neighbor Imputation
“Nearest Neighbor” (NN) imputation is a statistical technique that fills in many missing values at once by substituting the values from another sample plot that resembles the plot with the missing data. Similarity is judged from the information we do have about the plot with the missing values. The kind of information we do have (or can get) includes mapped data in GIS coverages and satellite data.
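The idea can be sketched in a few lines. This example uses plain Euclidean distance on standardized mapped variables, which is a simplification chosen for illustration; it is not the MSN or GNN distance measure. The plot IDs, variables (elevation, slope, a satellite band value), and tree lists are all hypothetical.

```python
import math

def column_stats(rows):
    """Return per-column means and standard deviations for scaling."""
    cols = list(zip(*rows))
    means = [sum(c) / len(c) for c in cols]
    # Fall back to 1.0 when a column has no variation.
    sds = [math.sqrt(sum((v - m) ** 2 for v in c) / len(c)) or 1.0
           for c, m in zip(cols, means)]
    return means, sds

def scale(x, means, sds):
    """Standardize one vector of mapped variables."""
    return [(v - m) / s for v, m, s in zip(x, means, sds)]

def impute_nearest(target_x, reference):
    """Return the reference plot closest to the target in mapped-variable space."""
    means, sds = column_stats([p["x"] for p in reference])
    tz = scale(target_x, means, sds)
    return min(reference, key=lambda p: math.dist(tz, scale(p["x"], means, sds)))

# Hypothetical reference plots: x = (elevation m, slope %, band value).
reference = [
    {"id": "P1", "x": (1500.0, 10.0, 0.30), "trees": [("PIPO", 12.0), ("PIPO", 8.5)]},
    {"id": "P2", "x": (2400.0, 35.0, 0.18), "trees": [("PICO", 6.0), ("ABLA", 4.2)]},
    {"id": "P3", "x": (1900.0, 20.0, 0.25), "trees": [("PSME", 15.0)]},
]

# An uninventoried stand has mapped data but no tree list.
donor = impute_nearest((1550.0, 12.0, 0.29), reference)
imputed_tree_list = donor["trees"]
```

The whole tree list of the donor plot is carried over, rather than estimating each tree attribute separately.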

5 Why NN Imputation?
In general, use of an entire plot sample ensures that the group of data elements represents a realistic combination of conditions.
For use in FVS, we need the whole tree list and sometimes additional plot data.

6 Process example Gradient Nearest Neighbor from Ohmann and Gregory, 2002

7 Example mapped data
Landsat: Bands, transformations, texture
Climate: Means, seasonal variability
Topography: Elevation, slope, aspect, solar
Soil: Soil texture, drainage, mineral type
Disturbance: Past fires, harvest, &ID
Location: Lat., Long.
Ownership: Federal, state, forest industry, other private
Adapted from Ohmann and Gregory

8 Mapped data information
Physiographic variables relate to “potential vegetation” or successional pathway.
Satellite data relate to current tree sizes and density (pathway state).
If management (or fire) has created variation in understory conditions which is hidden from the satellite by the overstory, this could be a problem.

9 The status of NN for FVS
The NN technique most associated with FVS, Most Similar Neighbor (MSN), has been around for over 10 years; additional techniques are being added to the software by Crookston and others.
There is increased recognition of the need for landscape simulations for fire and other applications.
FIA annual data are increasingly available for all forested lands, while recent stand exam data are decreasing.
Adequate computer storage, processing power, software, and GIS-based mapped data are now widely available to perform large imputation projects.

10 Some current major NN efforts
Crookston et al., RMRS, Moscow
– MSN support, new YAImpute package
Ohmann et al., PNWRS, Corvallis
– Gradient NN (GNN), mapping in CA, OR, WA
McRoberts & Finley, NRS, St. Paul
– Faster processing (ANN), variance estimation
Twombly, NRIS, have INFORMS, will travel
– MSN inside INFORMS, creates Nat’l Forest maps
LeMay et al., UBC, Vancouver
– Various applications in Canada

11 Large NN imputations are here
[Map panels: PNW (Ohmann); MN (McRoberts); PA (Lister); NFs (Twombly)]

12 Scale: Compartments to States
Applying NN imputation to fill in a (small?) number of uninventoried stands in a small landscape takes place in a very different information context than the NN allocation of large-scale inventory plots to a large area (sub-state to multi-state).

13 Small area application
Can know conditions and history
Can gather more ground information
Can relate imputation results to the on-the-ground reality
Inventory often linked to purpose and reasonably intensive
Homogeneous areas (stands) can be predefined and be a sampled unit
Data and relationships between data are likely to be consistent

14 Large area application
Too large to have direct knowledge about
Sampling intensity is generally low
Homogeneous areas are not predefined, but can be delineated (using image analysis and GIS tools)
Data and relationships between data are often inconsistent across the area
Can gather more information, but through existing sources of remote sensing and other mapped data
Inventories may not be linked to the desired applications of the users
However, inventory design may provide statistically reliable population estimates

15 Scale shifts focus to map data
Fine-scale details are less reliable as sample intensity decreases and the geographic range of the imputation increases. But, from the standpoint of the inventory estimates, imputation allows:
(1) more precise estimation of inventory data for small areas;
(2) the estimation of additional types of summary variables for post-stratified conditions;
(3) the FVS projection of inventory subpopulations using associated tree lists by area and adjusted for a range of site conditions.

16 Error and Variance
Goodness-of-fit measures are needed to evaluate the relative quality of procedures.
An understanding of the sources of error that contribute to variance is needed to know if and how they can be reduced.
Variance estimates for NN results are complex and difficult, and under active investigation.
There are different approaches used by different disciplines.
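The slides do not name a specific goodness-of-fit measure. One commonly used option, assumed here purely for illustration, is leave-one-out cross-validation: each reference plot is withheld in turn, imputed from its nearest remaining neighbor, and the imputed value is compared with the observed one. The plot data and the `ba` (basal area) field below are hypothetical.

```python
import math

def loo_rmse(reference, response_key):
    """Leave-one-out RMSE of a response variable under 1-NN imputation."""
    sq_errors = []
    for i, held_out in enumerate(reference):
        # Impute the held-out plot from every other reference plot.
        others = reference[:i] + reference[i + 1:]
        nearest = min(others, key=lambda p: math.dist(held_out["x"], p["x"]))
        sq_errors.append((nearest[response_key] - held_out[response_key]) ** 2)
    return math.sqrt(sum(sq_errors) / len(sq_errors))

# Hypothetical plots: x is one (already scaled) mapped variable.
plots = [
    {"x": (0.0,), "ba": 100.0},
    {"x": (1.0,), "ba": 110.0},
    {"x": (3.0,), "ba": 150.0},
    {"x": (3.5,), "ba": 140.0},
]

rmse = loo_rmse(plots, "ba")
```

A measure like this allows two procedures to be compared on the same reference data, but it inherits any sampling and positional error in the plots themselves.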

17 FIA Plot Design
Trees 5 inches and over are measured on 4 subplots, each 1/24th acre.
Trees 1 to 5 inches are measured on 4 microplots, each 1/300th acre.
Eventually, there should be at least one plot per 6000 forested acres, nationwide.

18 Spatial scale: FIA vs. Landsat
Landsat pixels are 30 x 30 meters (900 m²)
Each FIA subplot (>5 in.) is 167 m² (19% of the pixel)
Each FIA microplot (1 to 5 in.) is 13.7 m² (1.5% of the pixel)
This difference in scale may result in underestimating the accuracy of the imputation if the sample estimates are assumed to be “true”
In addition, there is positional error and other sampling and measurement error associated with FIA plot data, Landsat data, and other map data
[Image from McRoberts, 2006: 30 m x 30 m pixel]
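The area comparison can be checked with a short calculation. This sketch uses the nominal 1/24-acre and 1/300-acre sizes from the FIA plot design slide and a conversion of 4046.86 m² per acre; it gives values within about 2 m² of the slide's figures and the same percentages after rounding.

```python
SQ_M_PER_ACRE = 4046.86  # square meters in one acre

pixel_m2 = 30 * 30                    # Landsat pixel
subplot_m2 = SQ_M_PER_ACRE / 24       # nominal FIA subplot (trees 5 in. and over)
microplot_m2 = SQ_M_PER_ACRE / 300    # nominal FIA microplot (trees 1 to 5 in.)

subplot_pct = 100 * subplot_m2 / pixel_m2
microplot_pct = 100 * microplot_m2 / pixel_m2

print(f"subplot:   {subplot_m2:.1f} m2, {subplot_pct:.0f}% of a pixel")
print(f"microplot: {microplot_m2:.1f} m2, {microplot_pct:.1f}% of a pixel")
```

Either way, a single subplot samples less than a fifth of the pixel it is matched to.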

19 k Nearest Neighbor
The k Nearest Neighbor technique allows the selection of more than one reference data set, usually averaged to estimate target conditions (using the 3 closest neighbors would be “k=3”).
In FVS, the kNN approach could treat the multiple near neighbors as imputed sub-plots.
This may be desirable in the case of a scale mismatch between the intensive plot and the map data. It also creates more variation across the landscape, perhaps better representing transitions between conditions.
A kNN option is included in YAImpute.
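One way the sub-plot idea could work is sketched below: the k nearest reference plots are selected, and each contributes its tree list as one equally weighted sub-plot, so every tree's per-acre expansion is divided by k. The equal weighting, the record layout, and the data are assumptions made for illustration; this is not the YAImpute interface.

```python
import math

def k_nearest(target_x, reference, k):
    """Return the k reference plots closest to the target (x assumed pre-scaled)."""
    return sorted(reference, key=lambda p: math.dist(target_x, p["x"]))[:k]

def combine_as_subplots(neighbors):
    """Merge neighbor tree lists, treating each neighbor as one of k sub-plots."""
    k = len(neighbors)
    combined = []
    for sub_id, plot in enumerate(neighbors, start=1):
        for species, dbh, tpa in plot["trees"]:
            # Each sub-plot represents 1/k of the stand, so scale trees per acre.
            combined.append({"subplot": sub_id, "species": species,
                             "dbh": dbh, "tpa": tpa / k})
    return combined

# Hypothetical reference plots; trees are (species, dbh in., trees per acre).
reference = [
    {"id": "A", "x": (0.0, 0.0), "trees": [("PIPO", 12.0, 30.0), ("PIPO", 8.0, 60.0)]},
    {"id": "B", "x": (1.0, 0.0), "trees": [("PSME", 10.0, 45.0)]},
    {"id": "C", "x": (0.0, 2.0), "trees": [("PICO", 6.0, 90.0)]},
    {"id": "D", "x": (5.0, 5.0), "trees": [("ABLA", 4.0, 150.0)]},
]

neighbors = k_nearest((0.2, 0.1), reference, k=3)
tree_list = combine_as_subplots(neighbors)
total_tpa = sum(t["tpa"] for t in tree_list)
```

The combined list keeps every neighbor's trees rather than averaging them into a single summary, which preserves within-stand variation for the projection.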

20 Questions:
Would k > 1 be a good tradeoff between real mixes of plot conditions and the sample uncertainty of plots smaller than pixels?
Could additional pixel-sized information be gathered at sample point locations (e.g. photo-interpreted crown cover or cover type) and included in the multivariate data analysis?

21 How much does it matter?
The quality of the imputation needs to be considered in the context of the simulation: the use of the results and the models’ sensitivity to the lack of accuracy.
Model applications have a range of sensitivity
Analysis projects have a range of sensitivity
Sensitivity tests can be performed

22 Envision project using imputed data
This imputation application has a low sensitivity to error.
Crystal Lakes Fuel Trt Project, Arapaho-Roosevelt NF, as seen from road intersection

23 A fire-beetle project using MSN
This FFE WWPB application has catastrophic and contagion behaviors, and may be sensitive to imputation errors.
Five Buttes Analysis Area, Deschutes Nat’l Forest

24 Imputation Sensitivity Analysis
In this analysis, two landscapes were imputed, one with high and one with low pine beetle risk, based on the risk ratings of stands which fell in each of many stand classifications. These maps represent the no-action, “after beetle outbreak” BA for each.
Red River pine beetle analysis, Nez Perce National Forest
[Map panels: 2011 HIGH, 2011 LOW]

25 Sensitivity: High minus Low
The difference between the two extremes shows how much the results might have changed if better data were available, and where the uncertainty is manifested on the landscape. This is the “no action” alternative, so a comparison can also be made of the sensitivity of the action vs. no-action difference to these 2 extreme landscape ranges.
[Map panels: 2011 H-L, 1986 H-L]
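The high-minus-low comparison amounts to a per-stand difference between two projected outputs. The sketch below uses made-up stand IDs and basal area values; the actual analysis was done on mapped FVS outputs that are not reproduced here.

```python
# Hypothetical projected basal area (sq ft/acre) under each imputed landscape.
ba_high = {"stand_101": 62.0, "stand_102": 118.0, "stand_103": 40.0}
ba_low = {"stand_101": 95.0, "stand_102": 121.0, "stand_103": 88.0}

def high_minus_low(high, low):
    """Per-stand difference between the high-risk and low-risk projections."""
    return {stand: high[stand] - low[stand] for stand in high}

diff = high_minus_low(ba_high, ba_low)

# Stands ordered by how much the result depends on the imputation choice.
ranked = sorted(diff, key=lambda s: abs(diff[s]), reverse=True)
```

Stands near the top of the ranking are where better data would change the result most; stands near the bottom are insensitive to the imputation.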

26 An additional challenge
What is “most similar” depends on what aspects are considered in the analysis. If these products are used in decision making, we face the challenge of producing understandable, useful products which can be integrated with other corporate resource data systems and analyses. (It’s not so good to have several different estimates of where something important might be. It drives the boss crazy, but the appellants’ lawyers love it.)

27 Acknowledgements
Nick Crookston
Andrew McMahan
Ron McRoberts
Ken Pierce
Al Stage
and to all of you from out of town, who are here on Valentine’s Day, away from those you hold dear

