Skip to contents

manydata is the central package in the many packages universe aimed at collecting, connecting, and correcting network data across issue-domains of global governance. To assist users in doing so, manydata contains functions that enable users to download and manipulate data easily.

Why manydata?

manydata offers users access to all of the tested data in the various ‘many packages’ available, for use in analyses of global governance and beyond. A special feature of the ‘many packages’ is that it is not ‘opinionated’ - instead of offering a single, supposedly authoritative version of global governance events, the packages in the many packages universe gather well-regarded datasets in each issue-domain into three-dimensional ‘datacubes’. The chief advantage of this for global governance researchers is that it enables a quick and easy way to check the robustness of their results using different formulations of the study population or concept specification. The ‘datacube’ structure has a specific coding system for the variables across the datasets. For more details, please see the vignette.

Downloading and installing manydata

The easiest way to install manydata is directly from CRAN.

install.packages("manydata")

The development version of the package manydata can also be downloaded from GitHub.

# install.packages("remotes")
remotes::install_github("globalgov/manydata")

Available ‘many’ packages

manydata connects users to other packages that help fill global governance researchers’ data needs. The get_packages() function can be used to discover the ‘many packages’ currently available.

Please see the website for more information about how to use manydata.

Visualising ‘many’ databases

Once ‘many’ data packages are downloaded, manydata helps users visualize the relationship between matched observations across datasets within a database. Database profiling functions return confirmed, unique, missing, conflicting, or majority values in all (non-ID) variables in the datasets for a ‘many’ package database.

db_plot(database = emperors, key = "ID", variable = "all", category = "all")
#> There were 116 matched observations by ID variable across datasets in database.

Consolidating ‘many’ databases

manydata also contains flexible methods for consolidating ‘many’ package database into a single dataset with some combination of the rows, columns, as well as for how to resolve conflicts for observations across datasets.

consolidate(database = emperors, rows = "every", cols = "every",
            resolve = "coalesce", key = "ID")
#> There were 116 matched observations by ID variable across datasets in database.

#> # A tibble: 41 × 3
#>    ID             Beg         End        
#>    <chr>          <mdate>     <mdate>    
#>  1 Aemilian       0253-08-15~ 0253-10-15~
#>  2 Augustus       -0026-01-16 0014-08-19 
#>  3 Aurelian       0270-09-15  0275-09-15 
#>  4 Balbinus       0238-04-22  0238-07-29 
#>  5 Caracalla      0198        0217-04-08 
#>  6 Carinus        0283-08-01~ 0285-08-01~
#>  7 Carus          0282-10-01~ 0283-08-01~
#>  8 Claudius       0041-01-25  0054-10-13 
#>  9 Commodus       0177        0192-12-31 
#> 10 Constantine II 0337-05-22  0340-01-01 
#> # … with 31 more rows

Cheat Sheet

manydata contains several other functions to help global governance researchers. For a quick overview, please also check the package cheat sheet.

Contributing to the many packages universe

For more information for developers and data contributors to ‘many packages’, please see {manypkgs} the website.