diff --git a/NEED/README.md b/NEED/README.md
index 01b80e387ac9f8e818b5e4bb1f022fe4f772e8e1..8678883c0d4efefbdce21df22cc7f6b0357b44aa 100644
--- a/NEED/README.md
+++ b/NEED/README.md
@@ -14,7 +14,7 @@ Notes (mostly to self):
  * EULF is a semi-random sample of the 8m records which have an Energy Performance Certificate.
  * It includes only those with valid values on key variables (Property Age, Property Type, Floor Area Band and Energy Efficiency Band) and (especially) valid observations for electricity in 2012.
  * Records were selected based on the frequency of household type in the dataset relative to the total dwelling stock so that uncommon property types are over-represented, common types are under-represented and the supplied weight corrects for this.
- * Implications for sample bias unclear - there may be other systematic biases not capture by the weight?
+ * Implications for sample bias unclear - there may be other systematic biases not captured by the weight?
  * UPRN = unique property reference = linkage mechanism (uses AddressBase)
  * Bias caused by linkage failure is unknown although the DECC NEED Data Framework report from 2011 suggest match rates of 94%-100% (https://www.gov.uk/government/uploads/system/uploads/attachment_data/file/209264/Annex_B_-_Quality_Assurance.pdf)