Presentation on theme: "The National Mortgage Database (NMDB) Robert Avery, Ken Brevoort, Theresa DiVenti, Carla Inclan, Ian Keith, Jessica Lee, Lexian Liu, Ismail Mohamed, Forrest."— Presentation transcript:
The National Mortgage Database (NMDB) Robert Avery, Ken Brevoort, Theresa DiVenti, Carla Inclan, Ian Keith, Jessica Lee, Lexian Liu, Ismail Mohamed, Forrest Pafenberg, Jay Schultz, Cynthia Waldron, Xun Wang, Claudia Wood, Peter Zorn Federal Housing Finance Agency Consumer Financial Protection Bureau Freddie Mac Urban Institute June 11, 2013 The views expressed are those of the authors and do not necessarily represent those of the Consumer Financial Protection Bureau, the Federal Housing Finance Agency, Freddie Mac or their staff.
What is the NMDB? A new, nationally representative, loan-level mortgage database jointly funded and managed by the FHFA and CFPB based on a prototype developed by Freddie Mac. »1 st lien mortgages reported to the credit bureaus are used as both the sampling frame and the source of performance data. No new data is collectedthe NMDB will make better use of data that already exists. »The database is a 1/20 sample (not a registry of loans). »Because the credit bureaus archive their data, the NMDB recovers data that would have been available had the project been started years ago. The initial 1/20 sample is representative of all mortgages open at any time from January 1998 to June 2012 and (with weights) any borrower who had at least one mortgage during that period. »Going forward, a 1/20 representative sample of newly originated mortgages will be added each quarter, and terminated mortgages will exit the sample. »10.1 million mortgages are in the initial historic database. In the future the database will track about 3.5 million active mortgages. Credit bureau data are comprehensive. However, they are raw servicing data which requires significant cleaning to make them useful. Also need to add data from other sources. »Major commitment of government staff to do this. Never done before. »Working with active cooperation of credit bureau staff. NMDB will also have a survey component. Each quarter a representative subset of borrowers associated with loans newly added to the database will be sent a mail survey soliciting information on their mortgage shopping and origination experience. National Mortgage Database2
Four Overlapping Databases The basic unit of observation is the mortgage. »The database will contain full credit information for all borrowers associated with the sampled mortgages. »Borrower data will be gathered from one year prior to sampled mortgage origination to one year after termination and tracked quarterly. »Performance on the sample mortgages will be collected monthly. The NMDB will also make available an historic data base containing full credit data (including scores) from 1998 to 2012 of a representative sample of borrowers associated with an active mortgage during the 1998 to 2012 period. »The database will contain information on all mortgages taken out by these borrowers during the 1998 to »Data also gathered on all other credit obligations active during this period. »Performance for each mortgage will be tracked from 2000 to The NMDB will maintain a separate database of a representative 1-in-20 sample individuals who have ever had an active mortgage from 1998 onward. »Quarterly information will be maintained on these individuals from one-year prior to taking out their first mortgage (or 1998) until they die. »Persons will be added to the database when they take out their first mortgage.. The NMDB origination survey data will also be maintained as a separate database. National Mortgage Database3
Why is the NMDB Needed? HMDA: –Not fully reasonably representativedoes not include HMDA non-reporters. –Lacks detailed borrower, loan or performance data. –Available only 9 to 21 months after mortgages are originated. LPS McDash and/or CoreLogic: –Servicing files from 26 large servicers versus 2,000 servicers in credit bureaus. –Not representativepoor coverage of portfolio loans. –Same problems as underlying NMDB dataduplication, hanging performance and servicing salesbut not cleaned as the NMDB will be, so you dont know it. –No information on other obligations, previous or subsequent mortgages, or borrowers. Problems with NY Fed Equifax: –Similar source as NMDB, but unit of observation is borrowers not loans. –Same problems as underlying NMDBduplication, hanging performance and servicing salesbut not cleaned as the NMDB will be, so you dont know it. –Little supplementation with other data and difficult to link files over time. National Mortgage Database4
What is Missing in Bureau Data? Key items missing are property value (LTV) and characteristics, borrower characteristics (e.g. age, race, income, gender), and some mortgage characteristics (e.g. ARM status, PMI, origination channel). The database is being supplemented with information obtained from matching to existing external sources (some still under negotiation): »Home Mortgage Disclosure Act (HMDA) (70% match rate gives income/race). »Property transaction (deed/title) data (55% match rate). »MLS data (useful for purchase price in non-disclosure states). »Property appraisal data. »Household moving/address information on last three addresses. »Third party servicing data (e.g. LP, LPS). Private label MBS data. Maybe securities data as well (e.g. Ginnie Mae, GSEs). »Administrative files (FHA, VA, RHS, GSEs, home loan banks and possibly large banks). 47% of sample loans are Govt-backed. An additional 17% of borrowers have a non-sampled Govt backed loan. »Data on age, gender and marital status from public records collected by the credit bureau. National Mortgage Database5
What Specific Fields will NMDB Have? For each sample mortgage the database will have: »MonthlyPerformance (delinquent, current, foreclosure); balance; scheduled payment; actual payment; escrow payment; amortizing contract rate. »Fixed CharacteristicsDate opened; term; amount borrowed; number of borrowers; mortgage purpose (home purchase, refinance, new mortgage on free and clear property); owner occupancy status; type of mortgage (FHA/VA/RHS/home improvement/manufactured housing/other); GSE (Fannie/Freddie/Ginnie/Private MBS); servicer type; balloon amount and date; appraised property value, APR, CLTV, LTV and DTI used in underwriting; ARM status; PMI; date closed, payoff amount, and termination form (if closed). »Modification/foreclosure statusdate entered modification/foreclosure; change in terms; special program (HARP/HAMP); part of bankruptcy; charge-off amount. For each sample mortgage co-signer the database will have: »Age (date of birth); gender; marital status; deceased indicator; race/ethnicity (from HMDA); income at the time of origination (from HMDA); »Quarterly Vantage Credit Score, bankruptcy, and income estimator »Do they live in property associated with mortgage; first-time homebuyer; census- tract/zip code and timing of last three addresses National Mortgage Database6
Specific Fields (continued) For the property associated with each sample mortgage the database will have: »QuarterlyLTV; CLTV; and value (from AVM model). »Fixed CharacteristicsDate purchased; purchase amount; location (census tract, MSA and Zip Code); type of property (e.g. single family); age of structure; square footage; assessed value; owner-occupied. For all concurrent 2 nd liens on the property associated with each sample mortgage the database will have: »MonthlyPerformance (delinquent, current, foreclosure); balance; scheduled payment; actual payment; escrow payment; amortizing contract rate; credit limit (if a HELOC). »Fixed CharacteristicsDate opened (piggie-back or not); term; open- or closed- end; amount borrowed (or credit limit); number of borrowers; same servicer as 1st; date closed, payoff amount, and termination form (if closed). National Mortgage Database7
Specific Fields (continued) For all other mortgages, credit cards, installment loans, student loans, auto loans, lines of credit, and other consumer loans associated with sample mortgage co-borrowers the database will have: »MonthlyPerformance (delinquent, current, foreclosure); balance; scheduled payment; actual payment; escrow payment; amortizing contract rate; credit limit (if open-ended). »Fixed CharacteristicsType of credit; date opened; term; open- or closed-end; amount borrowed (or credit limit); number of borrowers; same servicer/property as sample mortgage; date closed, payoff amount, and termination form (if closed). Information on inquiries and public records for all borrowers associated with sample mortgages will also be gathered. An origination survey will be sent (mailed) to a representative subset of new mortgagees in the database each quarter. The survey has been pre-tested three times with response rates of 60 and 45 percent for the last two pilots. The survey is designed to pick up information on issues like loan shopping and suitability that are not available from any other source. National Mortgage Database8
Timeline Contract signed with Experian on September 27, Initial data delivery took place in December 20121/20 sample of all loans in existence between January 1998 and June 2012 (10.1 million loans and 14.7 million borrowers after preliminary cleaning). An analytic group at FHFA, Freddie Mac and CFPB is processing and cleaning the data and will match it to external sources, impute data for loans that cannot be matched, and develop a series of regular reports and queries to facilitate use of the NMDB. »It will likely take until next spring to finish cleaning the data. »8 FTEs working on the projectmajor commitment of FHFA. »Many challenges in following people and mortgages (e.g. servicing is sold; people die or are added to mortgages). An existing pilot prototype dataset in development for 2 ½ years funded by Freddie Mac (1/500 sample of loans outstanding since 2003). »Prototype will be maintained and updated until at least summer »Already used in FHFAs 2012 HERA-mandated report. »Pilot testing of an additional Origination survey and a Delinquency Survey. National Mortgage Database9
Access and the Future NMDB is being set up as a public good. We believe that the contract signed with Experian is a model for data access. The challenge is to (1) protect borrower/lender personally identifiable information and (2) provide useful data. Local geography is critical for mortgage analysis. Our solution: Data is physically housed only on a FHFA/CFPB server. Access, however, is allowed for any federal government/reserve bank/GSE employee going through access process: »Must sign an agreement not to reverse engineer identity of borrower or lender. Severe penalties for violations of agreement. »All work behind a firewalldata cant be removed. »NMDB software must support a variety of purposessimple queries (number of new mortgages in California) to complex research projects. »We are working to allow broader academic/research public access via Census-style programs. National Mortgage Database10
Examples of how NMDB can be used Example 1: Second liens Example 2: Loan performance transition matrix Example 3: Credit tightening Example 4: Market Comparisons All examples with 2010 data using the Prototype National Mortgage Database11
Example 1: Second liens NMDB coverage is more extensive than HMDAs National Mortgage Database12
Example 1: Second liens (continued) Default rates are higher for firsts with seconds National Mortgage Database13
Example 1 continued Performance of firsts w/ different types of seconds National Mortgage Database14 88% of firsts and their associated seconds perform similarly When performance diverges, seconds tend to out-perform their associated firsts. GSE firsts and their associated seconds perform better than non-GSE loans. Firsts with piggyback non-HELOC (closed end) seconds have the highest default rates.
Example 2: Loan performance transition matrix 60D+ loans tend to worsen in performance National Mortgage Database15 95% of current mortgages remain current the next month. Slightly over 40% of 30-day delinquent loans remain 30-day delinquent the next month (the mode), with roughly equal percentages transitioning into current and 60- day delinquent. The disproportionate share of loans delinquent 60 or more days transition into an even worse performing state the next month.
Example 2: Loan performance transition matrix (continued) First without seconds cure more frequently National Mortgage Database16
Example 3 Credit quality of originations National Mortgage Database17 Score distributions are the tightest (lowest risk) since » The Vantage score cutoff, as measured by the 5 th percentile of the score distribution, is currently set higher for both purchase and refinance loans than at any point since Refinance mortgages appear to face an especially high score cutoff.
Example 3 continued Credit quality of originationsGSE comparison National Mortgage Database18 The credit quality of GSE loans significantly exceeds that of non-GSE loans, especially for purchase money mortgages.
Example 4: Market comparisons Comparison of FHA Originations National Mortgage Database19 The credit quality of FHA/VA market originations is consistently lower than that of non-FHA/VA market originations. This difference in quality diminished somewhat during the height of the boom (2004 through 2006), and has increased since 2007.
Example 4: Market comparisons (continued) Monitoring and benchmarking FHA National Mortgage Database20 MonitoringAs of June 2010 (for loans originated since 2003): –13.4% of FHA loans were either in a state of delinquency or were closed with a loss. –11.5% of all open FHA loans were in a state of delinquency. –Comparable figures for VA were 8.5% and 6.9%, respectively; –Comparable figures for RHS were 6.9% and 6.5%, respectively. BenchmarkingControlling for loan size, geography (state) and cohort: –FHA is underperforming. The average delinquency rate of loans with FHAs mix of loan size, state and cohort is 7.9% and 6.2%, respectively. –FHAs worst performing book year is 2007, with an excess delinquency rate of 12.4% above average. –Newly eligible FHA loans (above old limits) are performing worse than market by about 4 percentage points. However, this may be market effectFHA loans in same markets but below old limits have about the same excess delinquency. –VA and RHS are performing as predicted (+/- 0.5 percentage points).