low-voltage-loadforecasting.github.io

low-voltage-loadforecasting.github.io

This is the repository for the Github pages site produced with Jekyll to our page hosting an overview of load forecasting data sets as presented in our preprint on arXiv.

Review of Low-Voltage Load Forecasting

This is the repository for the Github pages site produced with Jekyll to our page hosting an overview of load forecasting data sets as presented in our paper available as preprint on arXiv and published in Applied Energy.

How To Contribute a dataset

Submit by opening an issue

The easiest way to contribute is to use the issue functionality of Github. For that switch into the tab "Issues" above. Then click "New Issue". Optionally add the label "dataset". Then provide as much as possible from the following information:

  • Name: A short name that can be referenced
  • Url: Link to where the dataset is hosted or can be requested
  • Type: Should be one or several of "Residential", "Commercial", "Industrial", "Other", "Substation"
  • Customers: Number of entitites in the dataset (e.g., number of households, number of substations)
  • Resolution: Resolution in minutes, seconds or kHz
  • Duration: What is the typical duration of data available in days, weeks or months
  • Intervention: If an intervention like Time-of-Use or dynamic prices was part of the measurement campaign
  • Submetering: Yes/No If there is any hierarchy in the dataset (e.g. households belonging to same substation, or measurements within the household)
  • Weather: Yes/No If there is weather data available
  • Location: Country and possibly city or Region
  • Other: If there is other measurements belonging to the dataset besides load.
  • Licence: The licence (if any)

Many datasets ask for (or require) attribution of a certain paper. If so, provide it's bibliographic information. Ideally, provide a bibtex entry or simply list the information required (see here for example bibtex entries).

See this issue as an example.

Submit via pull request

You can also add a paper yourself and add it to the repository as a pull request. This repository has two branches. The code hosted in branch 'source' and the static page hosted in 'gh-pages' which will automatically be deployed by Github to the page. So to add a dataset, follow the following steps:

  • Checkout the 'source' branch
  • Make your changes locally and test them (requires Jekyll), for testing run: bundle exec jekyll serve
  • Build the page locally, run bundle exec jekyll build, which creates the sources in folder '_site'
  • Commit (and push) the changes to the source branch
  • Now to publish the web site by only pushing the '_site' subfolder to the branch 'gh-pages': git subtree push --prefix _site origin gh-pages

Any Ideas or Feedback?

We know, this is yet another list of datasets out there. We believe it is a useful resource and providing a way to keep it up-to-date beyond the paper adds value. If you have any idea on how this could be improved or have other ideas, either open an issue for discussion or contact us. It serves currently the purpose of manual exploration and there was no focus on properly standardizing and harmonizing that data source in the YAML.

Other lists of datasets