Who is the hubverse for?#
A modeling hub consists of one or more hub administrators who are responsible for coordinating model submissions from modelers who submit model outputs in a standardized format defined by the hub. Model outputs are validated on submission and at the end of every round, a summary of the individual and ensemble model outputs is presented to stakeholders who may be responsible for informing policy decisions. All of these roles are directly supported by hubverse developers You may fit one, several, or none of these roles. Below are details of what aspects of the hubverse each role are going to interact with.
Hub administrators#
You are responsible for setting up hubs in response to official challenges that define specific modeling targets and timeframes. This involves:
setting up central repository for the model submissions and target data
codifying the submission format for the modeling efforts in
tasks.json
submission cadance (
rounds
)modeling tasks (
model_tasks
) that correspond to specific modeling targets (e.g. incident hospitalizations)combinations of categorical variables for model inputs (
task_ids
)modeling output types and valid ranges (
output_type
)
transforming truth (target) data into standardized modeling targets for downstream validation
validate submissions as they arrive
The quickstart chapter will get you up and running with a new hub. The user guide is a good resource for translating the modeling challenge language into a tasks.json
configuration file.
You can use the {hubAdmin}
R package to aid in the creation of this file.
Modelers#
You will ultimately submit your model outputs in the format specified by the Hub Administrators. Do that freaky modeling magic you do with whatever resources you have on hand. You use R? Great! Python? Go for it! Excel? Nobody’s stopping you! Dark Magick? Uh, maybe try a TI-86.
You will be most interested in knowing how to format your output.
For that, you can use a function from the {hubAdmin}
R package to create an expanded model output grid.
You can then fit your outputs to that format and submit it to the hub.
They will be validated on submission.
As a modeler, you will need to
understand the definitions of and differences between model output types
understand the structure of model output datasets
build ensembles using the
{hubEnsembles}
R package.
Data analysts#
With the standardized data formats and schema in each hub, you can confidently write analysis scripts that will work for all model submissions to a hub.
No data cleaning needed.
To get started, you can read the accessing data from a hub vignette from the {hubData}
R package
Once you have access to the data, you can summarise it in the following ways:
loading data
predictions data from a formal hub repo
predictions data sitting in a folder on your computer
plotting forecasts
evaluating forecasts
ensembling forecasts (via the
{hubEnsembles}
R packageand more!
Stakeholders#
You will be presented with visualizations describing the models’ trajectory and uncertainty. As the modeling effort continues, you will be presented with scores for each model that show how well it predicted trajectories.
Make good decisions.
Hubverse developers#
If you want to contribute to the hubverse, you should adhere to our Code Of Conduct.
We have a contributing guide that describes the process for contributing minor fixes.
If you want to know more about our packaging process and other valuable details, you can find more in the {hubDevs}
R package.