******** XGI-DATA ******** XGI-DATA is a repository of openly available hypergraph datasets in JSON format with corresponding documentation of network statistics, limitations of the data, and methods of collection. They are hosted in the `XGI Community `_ on Zenodo. This is loosely inspired by `Datasheets for Datasets `_ by Gebru et al. Loading datasets ---------------- Loading a dataset using XGI is as simple as the following two lines: .. code-block:: python import xgi H = xgi.load_xgi_data("") XGI-DATA uses an HTTP request to load the hypergraph dataset. See the `load_xgi_data() documentation `_ for a complete description of the function. The directed hypergraph datasets from the Biochemical, Genetic and Genomic (BIGG) database can also be loaded with: .. code-block:: python import xgi H = xgi.load_bigg_data("") See the `load_bigg_data() documentation `_ for a complete description of the function. Dataset format -------------- XGI-DATA provides higher-order datasets in two formats: (1) Hypergraph Interchange Format (HIF)-compliant JSON (2) XGI-specific JSON format. See the `HIF-standard `_ documentation on format (1). All future datasets and updates to current datasets will be stored as format (1). Format (2) is structured as follows: * :code:`hypergraph-data`: This tag accesses the attributes of the entire hypergraph dataset such as the authors or dataset name. * :code:`node-data`: This tag accesses the nodes of the hypergraph and their associated properties as a dictionary where the keys are node IDs and the corresponding values are dictionaries. If a node doesn't have any properties, the associated dictionary is empty. * :code:`name`: This tag accesses the node's name if there is one that is different from the ID specified in the hyperedges. * Other tags are user-specified based on the particular attributes provided by the dataset. * :code:`edge-data`: This tag accesses the hyperedges of the hypergraph and their associated attributes. * :code:`name`: This tag accesses the edge's name if one is provided. * :code:`timestamp`: This is the tag specifying the time associated with the hyperedge if it is given. All times are stored in ISO8601 standard. * Other tags are user-specified based on the particular attributes provided by the dataset. * :code:`edge-dict`: This tag accesses the edge IDs and the corresponding nodes which participate in that hyperedge. Network Statistics ------------------ .. raw:: html

More details on individual datasets are available at the XGI-DATA page.

In this table, |V| denotes the number of nodes, |E| denotes the number of edges, |E*| denotes the number of unique edges, and smax denotes the maximum edge size.