From 13809aa1fbaf769272fcdc65f31ca988354d113c Mon Sep 17 00:00:00 2001
From: Ben Anderson <b.anderson@soton.ac.uk>
Date: Mon, 15 Sep 2014 18:16:49 +0100
Subject: [PATCH] updated readme

---
 NEED/README.md | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/NEED/README.md b/NEED/README.md
index f984d08..3107586 100644
--- a/NEED/README.md
+++ b/NEED/README.md
@@ -13,7 +13,7 @@ Notes (mostly to self):
 * The End User License file (EULF) dataset is a sample of just over 4 million households 
 * EULF is a semi-random sample of the 8m records which have an Energy Performance Certificate. 
  * It includes only those with valid values on key variables (Property Age, Property Type, Floor Area Band and Energy Efficiency Band) and (especially) valid observations for electricity in 2012. 
- * Records were selected based on the frequency of household type in the dataset relative to the total dwelling stock so that uncommon property types (e.g. older detached properties) are over-represented and common types (e.g. flats where turnover is high) are under-represented. The supplied weight corrects for this for descriptive analaysis. 
+ * Records were selected based on the frequency of household type in the dataset relative to the total dwelling stock so that uncommon property types (e.g. older detached properties) are over-represented and common types (e.g. flats where turnover is high) are under-represented. The supplied weight corrects for this in descriptive analysis. 
  * Implications for sample bias unclear - there may be other systematic biases not captured by the weight?
 * UPRN = unique property reference = linkage mechanism (uses AddressBase)
 * Bias caused by linkage failure is unknown although the DECC NEED Data Framework report from 2011 suggest match rates of 94%-100% (https://www.gov.uk/government/uploads/system/uploads/attachment_data/file/209264/Annex_B_-_Quality_Assurance.pdf)
-- 
GitLab