The goal of pg is to provide both R and C++ header access to the Polya Gamma distribution sampling routine.
Installation
You can install the development version of pg
from GitHub with:
# install.packages("devtools")
devtools::install_github("tmsalab/pg")
Usage
Let X
be a Polya Gamma Distribution denoted by PG(h, z)
, where h
is the “shape” parameter and z
is the “scale” parameter. Presently, the following sampling cases are enabled:
-
h > 170
: Normal approximation method -
h <= 170
andh > 13
: Saddlepoint method -
h = 1
orh = 2
: Devroye method -
h > 0
: Sum of gammas method. -
h < 0
: Result is automatically set to zero.
Not implemented:
-
h <= 13
andh > 1
: Alternative method (waiting for verification)
The package structure allows for the sampling routines to be accessed either via C++ or through R. The return type can be either a single value or a vector. When repeat sampling is needed with the same b
and c
, please use the vectorized sampler.
Sampling with C++
Using the sampling routine in C++ through a standalone .cpp
file requires either the rpg_scalar_hybrid()
, rpg_vector_hybrid()
, or rpg_hybrid()
function to be accessed in the pg
C++ namespace. Each of these functions will automatically select the appropriate algorithm based on criteria discussed previously.
#include <pg.h>
// [[Rcpp::depends(RcppArmadillo, pg)]]
// [[Rcpp::export]]
double rpg_scalar(const double h, const double z) {
return pg::rpg_scalar_hybrid(h, z);
}
// [[Rcpp::export]]
::vec rpg_hybrid(const arma::vec& h, const arma::vec& z) {
armareturn pg::rpg_hybrid(h, z);
}
// [[Rcpp::export]]
::vec rpg_vector(unsigned int n, const double h, const double z) {
armareturn pg::rpg_vector_hybrid(n, h, z);
}
For use within an R package, include a the pg
package name in the DESCRIPTION
file. From there, include the pg.h
header in a similar manner to the stand-alone C++ example.
:
LinkingTo
Rcpp,
RcppArmadillo pg
Sampling with R
For use within an R file, you can do:
# Number of observations to sample
n = 4
# Select the PG(h, z) values
h = 1; z = 0.5
# Set a seed for reproducibility
set.seed(141)
# Sample a single observation
pg::rpg_scalar(h, z)
#> [1] 0.05752942
# Set a seed for reproducibility
set.seed(141)
# Sample a vector of observations
pg::rpg_vector(n, h, z)
#> [,1]
#> [1,] 0.05752942
#> [2,] 0.38752679
#> [3,] 0.38710433
#> [4,] 0.18847913
See also
The following are useful resources regarding the Polya Gamma distribution.
- Papers
- “Bayesian Inference for Logistic Models Using Pólya–Gamma Latent Variables” by Nicholas G. Polson, James G. Scott, and Jesse Windle (2013) doi:10.1080/01621459.2013.829001. Paper that invented the Polya Gamma
- “Sampling Polya Gamma random variates: alternate and approximate techniques” by Jesse Windle, Nicholas G. Polson, and James G. Scott (2014) https://arxiv.org/abs/1405.0506. Provides an efficiency overview of the different sampling approaches to sampling from a Polya Gamma distribution.
- R Implementations:
-
BayesLogit
R package by Nicholas G. Polson, James G. Scott, and Jesse Windle. Provides the main C++ class samplers contained used by thepg
package. -
pgdraw
by Daniel F. Schmidt and Enes Makalic. This package construction relies heavily on free-floating functions andRcpp
data structures. -
bayesCL
by Rok Cesnovar and Erik Strumbelj. This package boast a sampler that is 100x faster through offloading of the computation onto a GPU. Though, the package is not actively maintained.
-
- Support in other languages:
-
Python
has thepypolyagamma
package by Scott Linderman. -
Stan
lacks an implementation for the Polya Gamma distribution since it relies on joint proposals rather than full conditionals.
-
Author
James Balamuta leaning heavily on work done in BayesLogit
R package by Nicholas G. Polson, James G. Scott, and Jesse Windle.
Citing the pg
package
To ensure future development of the package, please cite pg
package if used during an analysis or simulation study. Citation information for the package may be acquired by using in R:
citation("pg")