This data repository includes data for WiDS Texas Datathon 2021. While the power load data and common related variables (e.g., weather, COVID-19) are public, we created an automated data pipeline to collect them in this repository for participants' convenience. The data is fetched from their original source into this repository once a day by some GitHub Actions.
This data repository is NOT the minimal dataset that participants must use OR the maximal dataset that they could use. Participants should select the data based on their models. Participants are encouraged to use external public data if they believe it could improve their models.
To download the data to your local, please clone the git repository.
git clone [email protected]:WiDSTexas2021/datathon-code.git
cd datathon-code/data
To update your local data up to the state of the repository, please git pull
.
-
ercot_hourly_load.csv
includes hourly power load in the eight ERCOT weather zones. The most recent few weeks of data is from ERCOT Actual System Load, while earlier data is from ERCOT Load Data Archives. Note that all timestamps include time zone information (UTC offset, which depends on daylight saving time). -
weather_zone_cities.json
lists all Texas cities in each ERCOT weather zone. This could help to align power load data with city-wise data (e.g. weather data). -
weather_zone_counties.json
lists all Texas counties in each ERCOT weather zone. This could help to align power load data with county-wise data (e.g. COVID-19 data).
-
weather_history.csv
includes past weather data of 10 cities cross the 8 ECROT weather zones. The data is from World Weather Online and reported every 3 hours starting from July 1, 2008. See here for details about each column. -
weather_forecast.csv
includes weather forecast of 10 cities cross the 8 ECROT weather zones. The data is from World Weather Online and forecast every 3 hours in the next 13 days (including today). See here for details about each column.
-
texas_covid_confirmed.csv
includes cumulative confirmed COVID-19 cases in Texas by county. The data is updated in JHU COVID-19 data repository in a daily basis. -
texas_covid_deaths.csv
includes cumulative COVID-19 deaths count in Texas by county. The data is updated in JHU COVID-19 data repository in a daily basis.