LAName %like% "Isle of Wight" | LAName %like% "Hampshire"]
return(saveDT)
}
```
# Intro
Fractional weights are sufficient for the calculation of area level weighted statistics. However a number of applications such as local electricity network demand modelling or agent-based transport/mobility models require household units - implying the need for integerisation of weights to enable the selection of specific households to fit a given area. Lovelace & Ballas introduced a new approach to taking fracitonal weights generated using IPF and integerising them to give exact numbers of households per unit area. Their work showed that their method produced better fitting populations than other integerisation approaches (internal validation) but did not provide external validation to demonsrate the extent to which their approach reproduced the accuracy of fractional weights when calculating area level statistics. This paper estimates the mean and median annual household electricity consumption (in kWh) for English MSOAs using standard IPF and Lovelace and Ballas' approaches. It then comapres the results with observed MSOA level electricity consumption for areas where residential consumption is more than 90% of total electricity consumption.
Fractional weights are sufficient for the calculation of area level weighted statistics. However a number of applications such as local electricity network demand modelling or agent-based transport/mobility models require household units - implying the need for integerisation of weights to enable the selection of specific households to fit a given area. Lovelace & Ballas introduced a new approach to taking fracitonal weights generated using IPF and [integerising them](https://spatial-microsim-book.robinlovelace.net/smsimr.html#sintegerisation) to give exact numbers of households per unit area. Their work showed that their method produced better fitting populations than other integerisation approaches (internal validation) but did not provide external validation to demonsrate the extent to which their approach reproduced the accuracy of fractional weights when calculating area level statistics. This paper estimates the mean and median annual household electricity consumption (in kWh) for English MSOAs using standard IPF and Lovelace and Ballas' approaches. It then comapres the results with observed MSOA level electricity consumption for areas where residential consumption is more than 90% of total electricity consumption.
# Data prep
...
...
@@ -77,33 +89,45 @@ Various datasets can be linked at these levels:
* MSOA
* BEIS domestic gas/elec (annual kWh)
* BEIS non-domestic gas/elec (annual kWh)
Note that the [BEIS documentation](https://www.gov.uk/government/publications/regional-energy-data-guidance-note) states:
* Gas:
* Annualised and weather corrected
* Consumers using less than 73,200 kWh a year are classified as domestic.
* Electricity:
* Annualised, not weather corrected
* Excludes: NI, Central Volume Allocation (CVA) users; large industrial consumers who receive their electricity through high voltage lines of the transmission system and hence have different arrangements with their electricity suppliers; Electricity used by companies that generate their own electricity and consume it without passing over the public distribution network
* The automatic cut-off point for non-domestic consumption is 100,000 kWh. Domestic consumers with consumption of between 50,000 and 100,000 kWh is reallocated to the non-domestic sector following a validation process
## Load MSOA electricity data
We might as well work at MSOA level rather than try to aggreate LSOA upwards.
For now what we want to do is flag the MSOAs where most energy demand is residential - we can then use these for validation against the small area model-based estimates.
At this point it is interesting to note that the various MSOA files have different numbers of rows:
* MSOA look up table from NSPL: `r nrow(dtMsoaLUT)` rows with `r uniqueN(dtMsoaLUT$MSOACode)` unique MSOAs
kableExtra::kable(ht, caption = "First few rows of 'extra' MSOAs") %>%
kable_styling()
```
So it seems that the half-hourly electricity consumption data which comes from "larger non-domestic consumers" has not been allocated to MSOAs. Indeed the guodance states:
>"Half Hourly consumption (consumption by the larger non-domestic customers) totals, at a local authority level are provided in the Middle Layer Super Output Area (MSOA) level datasets"
Note that there are no LA codes in this dataset so these totals cannot be allocated to partoicuaor aggregations of MSOAs using this data.
kableExtra::kable(t, caption = "Sum, mean and % of total non-domestic electricity by source",
digits = 3) %>%
kable_styling()
```
As we can see the mean of the means is considerably smaller than the mean of the individual observations in both cases.
Overall, this is OK for our purposes but might cause problems for someone who thought they had captured _all_ non-domestic consumption at MSOA level - there would be a lot missing!
caption = paste0("All ",n," Solent region MSOAs where % domestic electricity > ",
consRatioThreshold,
" - first few rows")
) %>%
kable_styling()
```
These would be the MSOAs to use if we wanted to get LV network data that _should_ mostly be due to domestic consumption.
But in this case, we don't. We just need to simulate mean annual kWh for each LSOA in the Solent area (as that matches our sample) using the two methods and then validate against the appropriate BEIS data!