Package 'CGGP' reference manual

Title:	Composite Grid Gaussian Processes
Description:	Run computer experiments using the adaptive composite grid algorithm with a Gaussian process model. The algorithm works best when running an experiment that can evaluate thousands of points from a deterministic computer simulation. This package is an implementation of a forthcoming paper by Plumlee, Erickson, Ankenman, et al. For a preprint of the paper, contact the maintainer of this package.
Authors:	Collin Erickson [aut, cre], Matthew Plumlee [aut]
Maintainer:	Collin Erickson <[email protected]>
License:	GPL-3
Version:	1.0.4.9000
Built:	2025-03-20 03:15:45 UTC
Source:	https://github.com/collinerickson/cggp

CGGP: A package for running sparse grid computer experiments

Description

The CGGP package implements the method presented in Plumlee et al. (2019).

CGGP functions

The CGGP functions: CGGPcreate, CGGPfit, CGGPappend, and CGGPpred

Calculate MSE over single dimension

Description

Calculated using grid of integration points. Can be calculated exactly, but not much reason in 1D.

Usage

CGGP_internal_calcMSE(xl, theta, CorrMat)
CGGP_internal_calcMSE(xl, theta, CorrMat)

Arguments

`xl`	Vector of points in 1D
`theta`	Correlation parameters
`CorrMat`	Function that gives correlation matrix for vectors of 1D points.

Value

MSE value

Examples

CGGP_internal_calcMSE(xl=c(0,.5,.9), theta=c(1,2,3),
         CorrMat=CGGP_internal_CorrMatCauchySQT)
CGGP_internal_calcMSE(xl=c(0,.5,.9), theta=c(1,2,3),
         CorrMat=CGGP_internal_CorrMatCauchySQT)

Calculate MSE over blocks

Description

Delta of adding block is product over i=1..d of IMSE(i,j-1) - IMSE(i,j)

Usage

CGGP_internal_calcMSEde(valsinds, MSE_MAP)
CGGP_internal_calcMSEde(valsinds, MSE_MAP)

Arguments

`valsinds`	Block levels to calculate MSEs for
`MSE_MAP`	Matrix of MSE values

Value

All MSE values

Examples

SG <- CGGPcreate(d=3, batchsize=100)
y <- apply(SG$design, 1, function(x){x[1]+x[2]^2})
SG <- CGGPfit(SG, Y=y)
MSE_MAP <- outer(1:SG$d, 1:8, 
     Vectorize(function(dimlcv, lcv1) {
        CGGP_internal_calcMSE(SG$xb[1:SG$sizest[dimlcv]],
        theta=SG$thetaMAP[(dimlcv-1)*SG$numpara+1:SG$numpara],
        CorrMat=SG$CorrMat)
 }))
CGGP_internal_calcMSEde(SG$po[1:SG$poCOUNT, ], MSE_MAP)
SG <- CGGPcreate(d=3, batchsize=100)
y <- apply(SG$design, 1, function(x){x[1]+x[2]^2})
SG <- CGGPfit(SG, Y=y)
MSE_MAP <- outer(1:SG$d, 1:8, 
     Vectorize(function(dimlcv, lcv1) {
        CGGP_internal_calcMSE(SG$xb[1:SG$sizest[dimlcv]],
        theta=SG$thetaMAP[(dimlcv-1)*SG$numpara+1:SG$numpara],
        CorrMat=SG$CorrMat)
 }))
CGGP_internal_calcMSEde(SG$po[1:SG$poCOUNT, ], MSE_MAP)

Calculate predictive weights for CGGP

Description

Predictive weights are Sigma^(-1)*y in standard GP. This calculation is much faster since we don't need to solve the full system of equations.

Usage

CGGP_internal_calcpw(CGGP, y, theta, return_lS = FALSE)
CGGP_internal_calcpw(CGGP, y, theta, return_lS = FALSE)

Arguments

`CGGP`	CGGP object
`y`	Measured values for CGGP$design
`theta`	Correlation parameters
`return_lS`	Should lS be returned?

Value

Vector with predictive weights

Examples

cggp <- CGGPcreate(d=3, batchsize=100)
y <- apply(cggp$design, 1, function(x){x[1]+x[2]^2+rnorm(1,0,.01)})
CGGP_internal_calcpw(CGGP=cggp, y=y, theta=cggp$thetaMAP)
cggp <- CGGPcreate(d=3, batchsize=100)
y <- apply(cggp$design, 1, function(x){x[1]+x[2]^2+rnorm(1,0,.01)})
CGGP_internal_calcpw(CGGP=cggp, y=y, theta=cggp$thetaMAP)

Calculate derivative of pw

Description

Calculate derivative of pw

Usage

CGGP_internal_calcpwanddpw(CGGP, y, theta, return_lS = FALSE)
CGGP_internal_calcpwanddpw(CGGP, y, theta, return_lS = FALSE)

Arguments

`CGGP`	CGGP object
`y`	Measured values for CGGP$design
`theta`	Correlation parameters
`return_lS`	Should lS and dlS be returned?

Value

derivative matrix of pw with respect to logtheta

Examples

cggp <- CGGPcreate(d=3, batchsize=100)
y <- apply(cggp$design, 1, function(x){x[1]+x[2]^2+rnorm(1,0,.01)})
CGGP_internal_calcpwanddpw(CGGP=cggp, y=y, theta=cggp$thetaMAP)
cggp <- CGGPcreate(d=3, batchsize=100)
y <- apply(cggp$design, 1, function(x){x[1]+x[2]^2+rnorm(1,0,.01)})
CGGP_internal_calcpwanddpw(CGGP=cggp, y=y, theta=cggp$thetaMAP)

Cauchy correlation function

Description

Calculate correlation matrix for two sets of points in one dimension. Note that this is not the correlation between two vectors.

Usage

CGGP_internal_CorrMatCauchy(
  x1,
  x2,
  theta,
  return_dCdtheta = FALSE,
  return_numpara = FALSE,
  returnlogs = FALSE
)
CGGP_internal_CorrMatCauchy(
  x1,
  x2,
  theta,
  return_dCdtheta = FALSE,
  return_numpara = FALSE,
  returnlogs = FALSE
)

Arguments

`x1`	Vector of coordinates from same dimension
`x2`	Vector of coordinates from same dimension
`theta`	Correlation parameters: LS Log of parameter that controls lengthscale FD Logit of 0.5*parameter that controls the fractal dimension HE Log of parameter that controls the hurst effect
`return_dCdtheta`	Should dCdtheta be returned?
`return_numpara`	Should it just return the number of parameters?
`returnlogs`	Should log of correlation be returned?

Value

Matrix of correlation values between x1 and x2

Examples

CGGP_internal_CorrMatCauchy(c(0,.2,.4),c(.1,.3,.5), theta=c(-1,.9,.1))
CGGP_internal_CorrMatCauchy(c(0,.2,.4),c(.1,.3,.5), theta=c(-1,.9,.1))

CauchySQ correlation function

Description

Calculate correlation matrix for two sets of points in one dimension Note that this is not the correlation between two vectors.

Usage

CGGP_internal_CorrMatCauchySQ(
  x1,
  x2,
  theta,
  return_dCdtheta = FALSE,
  return_numpara = FALSE,
  returnlogs = FALSE
)
CGGP_internal_CorrMatCauchySQ(
  x1,
  x2,
  theta,
  return_dCdtheta = FALSE,
  return_numpara = FALSE,
  returnlogs = FALSE
)

Arguments

`x1`	Vector of coordinates from same dimension
`x2`	Vector of coordinates from same dimension
`theta`	Correlation parameters: LS Log of parameter that controls lengthscale FD Logit of 0.5*parameter that controls the fractal dimension HE Log of parameter that controls the hurst effect
`return_dCdtheta`	Should dCdtheta be returned?
`return_numpara`	Should it just return the number of parameters?
`returnlogs`	Should log of correlation be returned?

Value

Matrix of correlation values between x1 and x2

Examples

CGGP_internal_CorrMatCauchySQ(c(0,.2,.4),c(.1,.3,.5), theta=c(-.7,-.5))
CGGP_internal_CorrMatCauchySQ(c(0,.2,.4),c(.1,.3,.5), theta=c(-.7,-.5))

CauchySQT correlation function

Description

Calculate correlation matrix for two sets of points in one dimension. Note that this is not the correlation between two vectors.

Usage

CGGP_internal_CorrMatCauchySQT(
  x1,
  x2,
  theta,
  return_dCdtheta = FALSE,
  return_numpara = FALSE,
  returnlogs = FALSE
)
CGGP_internal_CorrMatCauchySQT(
  x1,
  x2,
  theta,
  return_dCdtheta = FALSE,
  return_numpara = FALSE,
  returnlogs = FALSE
)

Arguments

`x1`	Vector of coordinates from same dimension
`x2`	Vector of coordinates from same dimension
`theta`	Correlation parameters: LS Log of parameter that controls lengthscale FD Logit of 0.5*parameter that controls the fractal dimension HE Log of parameter that controls the hurst effect
`return_dCdtheta`	Should dCdtheta be returned?
`return_numpara`	Should it just return the number of parameters?
`returnlogs`	Should log of correlation be returned?

Value

Matrix of correlation values between x1 and x2

Examples

CGGP_internal_CorrMatCauchySQT(c(0,.2,.4),c(.1,.3,.5), theta=c(-.1,.3,-.7))
CGGP_internal_CorrMatCauchySQT(c(0,.2,.4),c(.1,.3,.5), theta=c(-.1,.3,-.7))

Gaussian correlation function

Description

Calculate correlation matrix for two sets of points in one dimension Note that this is not the correlation between two vectors.

Usage

CGGP_internal_CorrMatGaussian(
  x1,
  x2,
  theta,
  return_dCdtheta = FALSE,
  return_numpara = FALSE,
  returnlogs = FALSE
)
CGGP_internal_CorrMatGaussian(
  x1,
  x2,
  theta,
  return_dCdtheta = FALSE,
  return_numpara = FALSE,
  returnlogs = FALSE
)

Arguments

`x1`	Vector of coordinates from same dimension
`x2`	Vector of coordinates from same dimension
`theta`	Correlation parameters: LS Log of parameter that controls lengthscale FD Logit of 0.5*parameter that controls the fractal dimension HE Log of parameter that controls the hurst effect
`return_dCdtheta`	Should dCdtheta be returned?
`return_numpara`	Should it just return the number of parameters?
`returnlogs`	Should log of correlation be returned?

Details

WE HIGHLY ADVISE NOT USING THIS CORRELATION FUNCTION. Try Power Exponential, CauchySQT, Cauchy, or Matern 3/2 instead.

Value

Matrix of correlation values between x1 and x2

Examples

CGGP_internal_CorrMatGaussian(c(0,.2,.4),c(.1,.3,.5), theta=c(-.7))
CGGP_internal_CorrMatGaussian(c(0,.2,.4),c(.1,.3,.5), theta=c(-.7))

Matern 3/2 correlation function

Description

Calculate correlation matrix for two sets of points in one dimension. Note that this is not the correlation between two vectors.

Usage

CGGP_internal_CorrMatMatern32(
  x1,
  x2,
  theta,
  return_dCdtheta = FALSE,
  return_numpara = FALSE,
  returnlogs = FALSE
)
CGGP_internal_CorrMatMatern32(
  x1,
  x2,
  theta,
  return_dCdtheta = FALSE,
  return_numpara = FALSE,
  returnlogs = FALSE
)

Arguments

`x1`	Vector of coordinates from same dimension
`x2`	Vector of coordinates from same dimension
`theta`	Correlation parameters: LS Log of parameter that controls lengthscale FD Logit of 0.5*parameter that controls the fractal dimension HE Log of parameter that controls the hurst effect
`return_dCdtheta`	Should dCdtheta be returned?
`return_numpara`	Should it just return the number of parameters?
`returnlogs`	Should log of correlation be returned?

Value

Matrix of correlation values between x1 and x2

Examples

CGGP_internal_CorrMatMatern32(c(0,.2,.4),c(.1,.3,.5), theta=c(-.7))
CGGP_internal_CorrMatMatern32(c(0,.2,.4),c(.1,.3,.5), theta=c(-.7))

Matern 5/2 correlation function

Description

Calculate correlation matrix for two sets of points in one dimension. Note that this is not the correlation between two vectors.

Usage

CGGP_internal_CorrMatMatern52(
  x1,
  x2,
  theta,
  return_dCdtheta = FALSE,
  return_numpara = FALSE,
  returnlogs = FALSE
)
CGGP_internal_CorrMatMatern52(
  x1,
  x2,
  theta,
  return_dCdtheta = FALSE,
  return_numpara = FALSE,
  returnlogs = FALSE
)

Arguments

`x1`	Vector of coordinates from same dimension
`x2`	Vector of coordinates from same dimension
`theta`	Correlation parameters: LS Log of parameter that controls lengthscale FD Logit of 0.5*parameter that controls the fractal dimension HE Log of parameter that controls the hurst effect
`return_dCdtheta`	Should dCdtheta be returned?
`return_numpara`	Should it just return the number of parameters?
`returnlogs`	Should log of correlation be returned?

Value

Matrix of correlation values between x1 and x2

Examples

CGGP_internal_CorrMatMatern52(c(0,.2,.4),c(.1,.3,.5), theta=c(-.7))
CGGP_internal_CorrMatMatern52(c(0,.2,.4),c(.1,.3,.5), theta=c(-.7))

Power exponential correlation function

Description

Calculate correlation matrix for two sets of points in one dimension. Note that this is not the correlation between two vectors.

Usage

CGGP_internal_CorrMatPowerExp(
  x1,
  x2,
  theta,
  return_dCdtheta = FALSE,
  return_numpara = FALSE,
  returnlogs = FALSE
)
CGGP_internal_CorrMatPowerExp(
  x1,
  x2,
  theta,
  return_dCdtheta = FALSE,
  return_numpara = FALSE,
  returnlogs = FALSE
)

Arguments

`x1`	Vector of coordinates from same dimension
`x2`	Vector of coordinates from same dimension
`theta`	Correlation parameters: LS Log of parameter that controls lengthscale FD Logit of 0.5*parameter that controls the fractal dimension HE Log of parameter that controls the hurst effect
`return_dCdtheta`	Should dCdtheta be returned?
`return_numpara`	Should it just return the number of parameters?
`returnlogs`	Should log of correlation be returned?

Value

Matrix of correlation values between x1 and x2

Examples

CGGP_internal_CorrMatPowerExp(c(0,.2,.4),c(.1,.3,.5), theta=c(-.7,.2))
CGGP_internal_CorrMatPowerExp(c(0,.2,.4),c(.1,.3,.5), theta=c(-.7,.2))

Wendland0 (Triangle) correlation function

Description

Calculate correlation matrix for two sets of points in one dimension. Note that this is not the correlation between two vectors.

Usage

CGGP_internal_CorrMatWendland0(
  x1,
  x2,
  theta,
  return_dCdtheta = FALSE,
  return_numpara = FALSE,
  returnlogs = FALSE
)
CGGP_internal_CorrMatWendland0(
  x1,
  x2,
  theta,
  return_dCdtheta = FALSE,
  return_numpara = FALSE,
  returnlogs = FALSE
)

Arguments

`x1`	Vector of coordinates from same dimension
`x2`	Vector of coordinates from same dimension
`theta`	Correlation parameters: LS Log of parameter that controls lengthscale FD Logit of 0.5*parameter that controls the fractal dimension HE Log of parameter that controls the hurst effect
`return_dCdtheta`	Should dCdtheta be returned?
`return_numpara`	Should it just return the number of parameters?
`returnlogs`	Should log of correlation be returned?

Value

Matrix of correlation values between x1 and x2

Examples

CGGP_internal_CorrMatWendland0(c(0,.2,.4),c(.1,.3,.5), theta=-.7)
CGGP_internal_CorrMatWendland0(c(0,.2,.4),c(.1,.3,.5), theta=-.7)

Wendland1 1 correlation function

Description

Calculate correlation matrix for two sets of points in one dimension. Note that this is not the correlation between two vectors.

Usage

CGGP_internal_CorrMatWendland1(
  x1,
  x2,
  theta,
  return_dCdtheta = FALSE,
  return_numpara = FALSE,
  returnlogs = FALSE
)
CGGP_internal_CorrMatWendland1(
  x1,
  x2,
  theta,
  return_dCdtheta = FALSE,
  return_numpara = FALSE,
  returnlogs = FALSE
)

Arguments

`x1`	Vector of coordinates from same dimension
`x2`	Vector of coordinates from same dimension
`theta`	Correlation parameters: LS Log of parameter that controls lengthscale FD Logit of 0.5*parameter that controls the fractal dimension HE Log of parameter that controls the hurst effect
`return_dCdtheta`	Should dCdtheta be returned?
`return_numpara`	Should it just return the number of parameters?
`returnlogs`	Should log of correlation be returned?

Value

Matrix of correlation values between x1 and x2

Examples

CGGP_internal_CorrMatWendland1(c(0,.2,.4),c(.1,.3,.5), theta=-.7)
CGGP_internal_CorrMatWendland1(c(0,.2,.4),c(.1,.3,.5), theta=-.7)

Wendland2 2 correlation function

Description

Calculate correlation matrix for two sets of points in one dimension. Note that this is not the correlation between two vectors.

Usage

CGGP_internal_CorrMatWendland2(
  x1,
  x2,
  theta,
  return_dCdtheta = FALSE,
  return_numpara = FALSE,
  returnlogs = FALSE
)
CGGP_internal_CorrMatWendland2(
  x1,
  x2,
  theta,
  return_dCdtheta = FALSE,
  return_numpara = FALSE,
  returnlogs = FALSE
)

Arguments

`x1`	Vector of coordinates from same dimension
`x2`	Vector of coordinates from same dimension
`theta`	Correlation parameters: LS Log of parameter that controls lengthscale FD Logit of 0.5*parameter that controls the fractal dimension HE Log of parameter that controls the hurst effect
`return_dCdtheta`	Should dCdtheta be returned?
`return_numpara`	Should it just return the number of parameters?
`returnlogs`	Should log of correlation be returned?

Value

Matrix of correlation values between x1 and x2

Examples

CGGP_internal_CorrMatWendland2(c(0,.2,.4),c(.1,.3,.5), theta=-.7)
CGGP_internal_CorrMatWendland2(c(0,.2,.4),c(.1,.3,.5), theta=-.7)

Gradient of negative log likelihood posterior

Description

Gradient of negative log likelihood posterior

Usage

CGGP_internal_gneglogpost(
  theta,
  CGGP,
  y,
  ...,
  return_lik = FALSE,
  ys = NULL,
  Xs = NULL,
  HandlingSuppData = "Correct"
)
CGGP_internal_gneglogpost(
  theta,
  CGGP,
  y,
  ...,
  return_lik = FALSE,
  ys = NULL,
  Xs = NULL,
  HandlingSuppData = "Correct"
)

Arguments

`theta`	Log of correlation parameters
`CGGP`	CGGP object
`y`	CGGP$design measured values
`...`	Forces you to name remaining arguments
`return_lik`	If yes, it returns a list with lik and glik
`ys`	Supplementary output data
`Xs`	Supplementary input data
`HandlingSuppData`	How should supplementary data be handled? * Correct: full likelihood with grid and supplemental data * Only: only use supplemental data * Ignore: ignore supplemental data

Value

Vector for gradient of likelihood w.r.t. x (theta)

Examples

cg <- CGGPcreate(d=3, batchsize=20)
Y <- apply(cg$design, 1, function(x){x[1]+x[2]^2})
cg <- CGGPfit(cg, Y)
CGGP_internal_gneglogpost(cg$thetaMAP, CGGP=cg, y=cg$y)
cg <- CGGPcreate(d=3, batchsize=20)
Y <- apply(cg$design, 1, function(x){x[1]+x[2]^2})
cg <- CGGPfit(cg, Y)
CGGP_internal_gneglogpost(cg$thetaMAP, CGGP=cg, y=cg$y)

Calculate MSE prediction along a single dimension

Description

Calculate MSE prediction along a single dimension

Usage

CGGP_internal_MSEpredcalc(xp, xl, theta, CorrMat)
CGGP_internal_MSEpredcalc(xp, xl, theta, CorrMat)

Arguments

`xp`	Points at which to calculate MSE
`xl`	Levels along dimension, vector???
`theta`	Correlation parameters
`CorrMat`	Function that gives correlation matrix for vectors of 1D points.

Value

MSE predictions

Examples

CGGP_internal_MSEpredcalc(c(.4,.52), c(0,.25,.5,.75,1), theta=c(.1,.2),
             CorrMat=CGGP_internal_CorrMatCauchySQ)
CGGP_internal_MSEpredcalc(c(.4,.52), c(0,.25,.5,.75,1), theta=c(.1,.2),
             CorrMat=CGGP_internal_CorrMatCauchySQ)

Calculate negative log posterior

Description

Calculate negative log posterior

Usage

CGGP_internal_neglogpost(
  theta,
  CGGP,
  y,
  ...,
  ys = NULL,
  Xs = NULL,
  HandlingSuppData = "Correct"
)
CGGP_internal_neglogpost(
  theta,
  CGGP,
  y,
  ...,
  ys = NULL,
  Xs = NULL,
  HandlingSuppData = "Correct"
)

Arguments

`theta`	Correlation parameters
`CGGP`	CGGP object
`y`	Measured values of CGGP$design
`...`	Forces you to name remaining arguments
`ys`	Supplementary output data
`Xs`	Supplementary input data
`HandlingSuppData`	How should supplementary data be handled? Correct: full likelihood with grid and supplemental data Only: only use supplemental data Ignore: ignore supplemental data

Value

Likelihood

Examples

cg <- CGGPcreate(d=3, batchsize=20)
Y <- apply(cg$design, 1, function(x){x[1]+x[2]^2})
cg <- CGGPfit(cg, Y)
CGGP_internal_neglogpost(cg$thetaMAP, CGGP=cg, y=cg$y)
cg <- CGGPcreate(d=3, batchsize=20)
Y <- apply(cg$design, 1, function(x){x[1]+x[2]^2})
cg <- CGGPfit(cg, Y)
CGGP_internal_neglogpost(cg$thetaMAP, CGGP=cg, y=cg$y)

Set correlation function of CGGP object

Description

Set correlation function of CGGP object

Usage

CGGP_internal_set_corr(CGGP, corr)
CGGP_internal_set_corr(CGGP, corr)

Arguments

`CGGP`	CGGP object
`corr`	Correlation function

Value

CGGP object

Examples

obj <- CGGPcreate(3, 20, corr="matern52")
CGGP_internal_set_corr(obj, "gaussian")
obj <- CGGPcreate(3, 20, corr="matern52")
CGGP_internal_set_corr(obj, "gaussian")

Add points to CGGP

Description

Add 'batchsize' points to 'SG' using 'theta'.

Usage

CGGPappend(CGGP, batchsize, selectionmethod = "MAP")
CGGPappend(CGGP, batchsize, selectionmethod = "MAP")

Arguments

`CGGP`	Sparse grid object
`batchsize`	Number of points to add
`selectionmethod`	How points will be selected: one of 'UCB', 'TS', 'MAP', 'Oldest', 'Random', or 'Lowest'. 'UCB' uses Upper Confidence Bound estimates for the parameters. 'TS' uses Thompson sampling, a random sample from the posterior. 'MAP' uses maximum a posteriori parameter estimates. 'Oldest' adds the block that has been available the longest. 'Random' adds a random block. 'Lowest' adds the block with the lowest sum of index levels. 'UCB' and 'TS' are based on bandit algorithms and account for uncertainty in the parameter estimates, but are the slowest. ‘MAP' is fast but doesn’t account for parameter uncertainty. The other three are naive methods that are not adaptive and won't perform well.

Value

SG with new points added.

Examples

SG <- CGGPcreate(d=3, batchsize=100)
y <- apply(SG$design, 1, function(x){x[1]+x[2]^2})
SG <- CGGPfit(SG, Y=y)
SG <- CGGPappend(CGGP=SG, batchsize=20, selectionmethod="MAP")
SG <- CGGPcreate(d=3, batchsize=100)
y <- apply(SG$design, 1, function(x){x[1]+x[2]^2})
SG <- CGGPfit(SG, Y=y)
SG <- CGGPappend(CGGP=SG, batchsize=20, selectionmethod="MAP")

Create sparse grid GP

Description

Create sparse grid GP

Usage

CGGPcreate(
  d,
  batchsize,
  corr = "PowerExponential",
  grid_sizes = c(1, 2, 4, 4, 8, 12, 20, 28, 32),
  Xs = NULL,
  Ys = NULL,
  HandlingSuppData = "Correct",
  supp_args = list()
)
CGGPcreate(
  d,
  batchsize,
  corr = "PowerExponential",
  grid_sizes = c(1, 2, 4, 4, 8, 12, 20, 28, 32),
  Xs = NULL,
  Ys = NULL,
  HandlingSuppData = "Correct",
  supp_args = list()
)

Arguments

`d`	Input dimension
`batchsize`	Number added to design each batch for now only on predictions
`corr`	Name of correlation function to use. Must be one of "CauchySQT", "CauchySQ", "Cauchy", "Gaussian", "PowerExp", "Matern32", "Matern52".
`grid_sizes`	Size of grid refinements.
`Xs`	Supplemental X data
`Ys`	Supplemental Y data
`HandlingSuppData`	How should supplementary data be handled? * Correct: full likelihood with grid and supplemental data * Only: only use supplemental data * Ignore: ignore supplemental data
`supp_args`	Arguments used to fit if Xs and Ys are given

Value

CGGP

Examples

CGGPcreate(d=8,200)
CGGPcreate(d=8,200)

Update CGGP model given data

Description

This function will update the GP parameters for a CGGP design.

Usage

CGGPfit(
  CGGP,
  Y,
  Xs = NULL,
  Ys = NULL,
  theta0 = pmax(pmin(CGGP$thetaMAP, 0.8), -0.8),
  HandlingSuppData = CGGP$HandlingSuppData,
  separateoutputparameterdimensions = is.matrix(CGGP$thetaMAP),
  set_thetaMAP_to,
  corr,
  Ynew
)
CGGPfit(
  CGGP,
  Y,
  Xs = NULL,
  Ys = NULL,
  theta0 = pmax(pmin(CGGP$thetaMAP, 0.8), -0.8),
  HandlingSuppData = CGGP$HandlingSuppData,
  separateoutputparameterdimensions = is.matrix(CGGP$thetaMAP),
  set_thetaMAP_to,
  corr,
  Ynew
)

Arguments

`CGGP`	Sparse grid objects
`Y`	Output values calculated at CGGP$design
`Xs`	Supplemental X matrix
`Ys`	Supplemental Y values
`theta0`	Initial theta
`HandlingSuppData`	How should supplementary data be handled? * Correct: full likelihood with grid and supplemental data * Only: only use supplemental data * Ignore: ignore supplemental data
`separateoutputparameterdimensions`	If multiple output dimensions, should separate parameters be fit to each dimension?
`set_thetaMAP_to`	Value for thetaMAP to be set to
`corr`	Will update correlation function, if left missing it will be same as last time.
`Ynew`	Values of 'CGGP$design_unevaluated'

Value

Updated CGGP object fit to data given

Examples

cg <- CGGPcreate(d=3, batchsize=100)
y <- apply(cg$design, 1, function(x){x[1]+x[2]^2})
cg <- CGGPfit(CGGP=cg, Y=y)
cg <- CGGPcreate(d=3, batchsize=100)
y <- apply(cg$design, 1, function(x){x[1]+x[2]^2})
cg <- CGGPfit(CGGP=cg, Y=y)

CGGP block plot

Description

Plot the 2D projections of the blocks of an CGGP object.

Usage

CGGPplotblocks(CGGP, singleplot = TRUE)
CGGPplotblocks(CGGP, singleplot = TRUE)

Arguments

`CGGP`	CGGP object
`singleplot`	If only two dimensions, should a single plot be made?

Value

ggplot2 plot

Examples


# The first and fourth dimensions are most active and will have greater depth
ss <- CGGPcreate(d=5, batchsize=50)
f <- function(x) {cos(2*pi*x[1]*3) + x[3]*exp(4*x[4])}
ss <- CGGPfit(ss, Y=apply(ss$design, 1, f))
ss <- CGGPappend(CGGP=ss, batchsize=100)
CGGPplotblocks(ss)

mat <- matrix(c(1,1,1,2,2,1,2,2,1,3), ncol=2, byrow=TRUE)
CGGPplotblocks(mat)

# The first and fourth dimensions are most active and will have greater depth
ss <- CGGPcreate(d=5, batchsize=50)
f <- function(x) {cos(2*pi*x[1]*3) + x[3]*exp(4*x[4])}
ss <- CGGPfit(ss, Y=apply(ss$design, 1, f))
ss <- CGGPappend(CGGP=ss, batchsize=100)
CGGPplotblocks(ss)

mat <- matrix(c(1,1,1,2,2,1,2,2,1,3), ncol=2, byrow=TRUE)
CGGPplotblocks(mat)

Plot CGGP block selection over time

Description

Shows the order in which blocks were selected for each dimension. Gives an idea of how the selections change over time.

Usage

CGGPplotblockselection(CGGP, indims)
CGGPplotblockselection(CGGP, indims)

Arguments

`CGGP`	CGGP object
`indims`	Which input dimensions should be shown?

Value

ggplot2 object

Examples

gs <- CGGPcreate(d=3, batchsize=100)
# All dimensions will look similar
CGGPplotblockselection(gs)

# You need to append with CGGPappend after fitting to see a difference
f <- function(x){x[1]^1.2}
y <- apply(gs$design, 1, f)
gs <- CGGPfit(gs, Y=y)
gs <- CGGPappend(gs, 100)
# Now you will see higher for X1 from 100 to 200 while others remain low.
CGGPplotblockselection(gs)

gs <- CGGPcreate(d=3, batchsize=100)
# All dimensions will look similar
CGGPplotblockselection(gs)

# You need to append with CGGPappend after fitting to see a difference
f <- function(x){x[1]^1.2}
y <- apply(gs$design, 1, f)
gs <- CGGPfit(gs, Y=y)
gs <- CGGPappend(gs, 100)
# Now you will see higher for X1 from 100 to 200 while others remain low.
CGGPplotblockselection(gs)

Plot correlation samples

Description

Plot samples for a given correlation function and parameters. Useful for getting an idea of what the correlation parameters mean in terms of smoothness.

Usage

CGGPplotcorr(
  Corr = CGGP_internal_CorrMatGaussian,
  theta = NULL,
  numlines = 20,
  outdims = NULL,
  zero = TRUE
)
CGGPplotcorr(
  Corr = CGGP_internal_CorrMatGaussian,
  theta = NULL,
  numlines = 20,
  outdims = NULL,
  zero = TRUE
)

Arguments

`Corr`	Correlation function or CGGP object. If CGGP object, it will make plots for thetaMAP, the max a posteriori theta.
`theta`	Parameters for Corr
`numlines`	Number of sample paths to draw
`outdims`	Which output dimensions should be used?
`zero`	Should the sample paths start at y=0?

Value

Plot

Examples


CGGPplotcorr()
CGGPplotcorr(theta=c(-2,-1,0,1))

SG <- CGGPcreate(d=3, batchsize=100)
f <- function(x){x[1]^1.2+sin(2*pi*x[2]*3)}
y <- apply(SG$design, 1, f)
SG <- CGGPfit(SG, Y=y)
CGGPplotcorr(SG)

CGGPplotcorr()
CGGPplotcorr(theta=c(-2,-1,0,1))

SG <- CGGPcreate(d=3, batchsize=100)
f <- function(x){x[1]^1.2+sin(2*pi*x[2]*3)}
y <- apply(SG$design, 1, f)
SG <- CGGPfit(SG, Y=y)
CGGPplotcorr(SG)

The values on the diagonal are largest design depth for that dimension. The off-diagonal values are the largest design depth that both dimensions have been measured at simultaneously. A greater depth means that more points have been measured along that dimension or two-dimensional subspace.

Usage

CGGPplotheat(CGGP)
CGGPplotheat(CGGP)

Arguments

CGGP

CGGP object

Value

A heat map made from ggplot2

References

https://stackoverflow.com/questions/14290364/heatmap-with-values-ggplot2

Examples


# All dimensions should look similar
d <- 8
SG = CGGPcreate(d,201)
CGGPplotheat(SG)

# The first and fourth dimensions are most active and will have greater depth
SG <- CGGPcreate(d=5, batchsize=50)
f <- function(x) {cos(2*pi*x[1]*3) + exp(4*x[4])}
for (i in 1:1) {
  SG <- CGGPfit(SG, Y=apply(SG$design, 1, f))
  SG <- CGGPappend(CGGP=SG, batchsize=200)
}
# SG <- CGGPfit(SG, Y=apply(SG$design, 1, f))
CGGPplotheat(SG)

# All dimensions should look similar
d <- 8
SG = CGGPcreate(d,201)
CGGPplotheat(SG)

# The first and fourth dimensions are most active and will have greater depth
SG <- CGGPcreate(d=5, batchsize=50)
f <- function(x) {cos(2*pi*x[1]*3) + exp(4*x[4])}
for (i in 1:1) {
  SG <- CGGPfit(SG, Y=apply(SG$design, 1, f))
  SG <- CGGPappend(CGGP=SG, batchsize=200)
}
# SG <- CGGPfit(SG, Y=apply(SG$design, 1, f))
CGGPplotheat(SG)

Histogram of measurements at each design depth of each input dimension

Description

A greater design depth signifies a more important dimension. Thus a larger right tail on the histogram are more important variables.

Usage

CGGPplothist(CGGP, ylog = TRUE)
CGGPplothist(CGGP, ylog = TRUE)

Arguments

`CGGP`	CGGP object
`ylog`	Should the y axis be put on a log scale?

Value

Histogram plot made using ggplot2

Examples


# All dimensions should look similar
d <- 8
SG = CGGPcreate(d,201)
CGGPplothist(SG)
CGGPplothist(SG, ylog=FALSE)

# The first dimension is more active and will have greater depth
f <- function(x) {sin(x[1]^.6*5)}
SG <- CGGPcreate(d=5, batchsize=100)
SG <- CGGPfit(SG, apply(SG$design, 1, f))
SG <- CGGPappend(CGGP=SG, batchsize=1000)
CGGPplothist(SG)

# All dimensions should look similar
d <- 8
SG = CGGPcreate(d,201)
CGGPplothist(SG)
CGGPplothist(SG, ylog=FALSE)

# The first dimension is more active and will have greater depth
f <- function(x) {sin(x[1]^.6*5)}
SG <- CGGPcreate(d=5, batchsize=100)
SG <- CGGPfit(SG, apply(SG$design, 1, f))
SG <- CGGPappend(CGGP=SG, batchsize=1000)
CGGPplothist(SG)

Plot negative log posterior likelihood of samples

Description

Plot negative log posterior likelihood of samples

Usage

CGGPplotsamplesneglogpost(CGGP)
CGGPplotsamplesneglogpost(CGGP)

Arguments

CGGP

CGGP object

Value

ggplot2 object

Examples

gs <- CGGPcreate(d=3, batchsize=100)
f <- function(x){x[1]^1.2+x[3]^.4*sin(2*pi*x[2]^2*3) + .1*exp(3*x[3])}
y <- apply(gs$design, 1, f)
gs <- CGGPfit(gs, Y=y)
CGGPplotsamplesneglogpost(gs)
gs <- CGGPcreate(d=3, batchsize=100)
f <- function(x){x[1]^1.2+x[3]^.4*sin(2*pi*x[2]^2*3) + .1*exp(3*x[3])}
y <- apply(gs$design, 1, f)
gs <- CGGPfit(gs, Y=y)
CGGPplotsamplesneglogpost(gs)

CGGP slice plot

Description

Show prediction plots when varying over only one dimension. Most useful when setting all values to 0.5 because it will have the most points.

Usage

CGGPplotslice(
  CGGP,
  proj = 0.5,
  np = 300,
  color = "pink",
  outdims,
  scales = "free_y",
  facet = "grid"
)
CGGPplotslice(
  CGGP,
  proj = 0.5,
  np = 300,
  color = "pink",
  outdims,
  scales = "free_y",
  facet = "grid"
)

Arguments

`CGGP`	CGGP object
`proj`	Point to project onto
`np`	Number of points to use along each dimension
`color`	Color to make error region
`outdims`	If multiple outputs, which of them should be plotted?
`scales`	Parameter passed to ggplot2::facet_grid()
`facet`	If "grid", will use ggplot2::facet_grid(), if "wrap" will use ggplot2::facet_wrap(). Only applicable for a single output dimension.

Value

ggplot2 object

Examples


d <- 5
f1 <- function(x){x[1]+x[2]^2 + cos(x[3]^2*2*pi*4) - 3.3}
s1 <- CGGPcreate(d, 200)
s1 <- CGGPfit(s1, apply(s1$design, 1, f1))
#s1 <- CGGPappend(s1, 200)
#s1 <- CGGPfit(s1, apply(s1$design, 1, f1))
CGGPplotslice(s1)
CGGPplotslice(s1, 0.)
CGGPplotslice(s1, s1$design[nrow(s1$design),])

d <- 5
f1 <- function(x){x[1]+x[2]^2 + cos(x[3]^2*2*pi*4) - 3.3}
s1 <- CGGPcreate(d, 200)
s1 <- CGGPfit(s1, apply(s1$design, 1, f1))
#s1 <- CGGPappend(s1, 200)
#s1 <- CGGPfit(s1, apply(s1$design, 1, f1))
CGGPplotslice(s1)
CGGPplotslice(s1, 0.)
CGGPplotslice(s1, s1$design[nrow(s1$design),])

Plot theta samples

Description

Plot theta samples

Usage

CGGPplottheta(CGGP)
CGGPplottheta(CGGP)

Arguments

CGGP

CGGP object

Value

ggplot2 object

Examples

gs <- CGGPcreate(d=3, batchsize=100)
f <- function(x){x[1]^1.2+x[3]^.4*sin(2*pi*x[2]^2*3) + .1*exp(3*x[3])}
y <- apply(gs$design, 1, f)
gs <- CGGPfit(gs, Y=y)
CGGPplottheta(gs)
gs <- CGGPcreate(d=3, batchsize=100)
f <- function(x){x[1]^1.2+x[3]^.4*sin(2*pi*x[2]^2*3) + .1*exp(3*x[3])}
y <- apply(gs$design, 1, f)
gs <- CGGPfit(gs, Y=y)
CGGPplottheta(gs)

Plot something similar to a semivariogram

Description

It's not actually a variogram or semivariogram. It shows how the correlation function falls off as distance increases.

Usage

CGGPplotvariogram(CGGP, facet = 1, outdims = NULL)
CGGPplotvariogram(CGGP, facet = 1, outdims = NULL)

Arguments

`CGGP`	CGGP object
`facet`	How should the plots be faceted? If 1, in a row, if 2, in a column, if 3, wrapped around.
`outdims`	Which output dimensions should be shown.

Value

ggplot2 object

Examples

SG <- CGGPcreate(d=3, batchsize=100)
f <- function(x){x[1]^1.2+x[3]^.4*sin(2*pi*x[2]^2*3) + .1*exp(3*x[3])}
y <- apply(SG$design, 1, f)
SG <- CGGPfit(SG, Y=y)
CGGPplotvariogram(SG)
SG <- CGGPcreate(d=3, batchsize=100)
f <- function(x){x[1]^1.2+x[3]^.4*sin(2*pi*x[2]^2*3) + .1*exp(3*x[3])}
y <- apply(SG$design, 1, f)
SG <- CGGPfit(SG, Y=y)
CGGPplotvariogram(SG)

Plot validation prediction errors for CGGP object

Description

Plot validation prediction errors for CGGP object

Usage

CGGPvalplot(CGGP, Xval, Yval, d = NULL)
CGGPvalplot(CGGP, Xval, Yval, d = NULL)

Arguments

`CGGP`	CGGP object that has been fitted
`Xval`	X validation data
`Yval`	Y validation data
`d`	If output is multivariate, which column to use. Will do all if left as NULL.

Value

None, makes a plot

Examples

SG <- CGGPcreate(d=3, batchsize=100)
f1 <- function(x){x[1]+x[2]^2}
y <- apply(SG$design, 1, f1)
SG <- CGGPfit(SG, y)
Xval <- matrix(runif(3*100), ncol=3)
Yval <- apply(Xval, 1, f1)
CGGPvalplot(CGGP=SG, Xval=Xval, Yval=Yval)
SG <- CGGPcreate(d=3, batchsize=100)
f1 <- function(x){x[1]+x[2]^2}
y <- apply(SG$design, 1, f1)
SG <- CGGPfit(SG, y)
Xval <- matrix(runif(3*100), ncol=3)
Yval <- apply(Xval, 1, f1)
CGGPvalplot(CGGP=SG, Xval=Xval, Yval=Yval)

Calculate stats for CGGP prediction on validation data

Description

Calculate stats for CGGP prediction on validation data

Usage

CGGPvalstats(CGGP, Xval, Yval, bydim = TRUE, ...)
CGGPvalstats(CGGP, Xval, Yval, bydim = TRUE, ...)

Arguments

`CGGP`	CGGP object
`Xval`	X validation matrix
`Yval`	Y validation data
`bydim`	If multiple outputs, should it be done separately by dimension?
`...`	Passed to valstats, such as which stats to calculate.

Value

data frame

Examples


SG <- CGGPcreate(d=3, batchsize=100)
f1 <- function(x){x[1]+x[2]^2}
y <- apply(SG$design, 1, f1)
SG <- CGGPfit(SG, y)
Xval <- matrix(runif(3*100), ncol=3)
Yval <- apply(Xval, 1, f1)
CGGPvalstats(CGGP=SG, Xval=Xval, Yval=Yval)

# Multiple outputs
SG <- CGGPcreate(d=3, batchsize=100)
f1 <- function(x){x[1]+x[2]^2}
f2 <- function(x){x[1]^1.3+.4*sin(6*x[2])+10}
y1 <- apply(SG$design, 1, f1)#+rnorm(1,0,.01)
y2 <- apply(SG$design, 1, f2)#+rnorm(1,0,.01)
y <- cbind(y1, y2)
SG <- CGGPfit(SG, Y=y)
Xval <- matrix(runif(3*100), ncol=3)
Yval <- cbind(apply(Xval, 1, f1),
              apply(Xval, 1, f2))
CGGPvalstats(SG, Xval, Yval)
CGGPvalstats(SG, Xval, Yval, bydim=FALSE)

SG <- CGGPcreate(d=3, batchsize=100)
f1 <- function(x){x[1]+x[2]^2}
y <- apply(SG$design, 1, f1)
SG <- CGGPfit(SG, y)
Xval <- matrix(runif(3*100), ncol=3)
Yval <- apply(Xval, 1, f1)
CGGPvalstats(CGGP=SG, Xval=Xval, Yval=Yval)

# Multiple outputs
SG <- CGGPcreate(d=3, batchsize=100)
f1 <- function(x){x[1]+x[2]^2}
f2 <- function(x){x[1]^1.3+.4*sin(6*x[2])+10}
y1 <- apply(SG$design, 1, f1)#+rnorm(1,0,.01)
y2 <- apply(SG$design, 1, f2)#+rnorm(1,0,.01)
y <- cbind(y1, y2)
SG <- CGGPfit(SG, Y=y)
Xval <- matrix(runif(3*100), ncol=3)
Yval <- cbind(apply(Xval, 1, f1),
              apply(Xval, 1, f2))
CGGPvalstats(SG, Xval, Yval)
CGGPvalstats(SG, Xval, Yval, bydim=FALSE)

S3 plot method for CGGP

Description

There are a few different plot functions for CGGP objects: 'CGGPplotblocks', 'CGGPplotblockselection', 'CGGPplotcorr', 'CGGPplotheat', 'CGGPplothist', 'CGGPvalplot', 'CGGPplotslice', 'CGGPplotslice', and 'CGGPplotvariogram'. Currently 'CGGPplotblocks' is the default plot object.

Usage

## S3 method for class 'CGGP'
plot(x, y, ...)
## S3 method for class 'CGGP'
plot(x, y, ...)

Arguments

`x`	CGGP object
`y`	Don't use
`...`	Passed to CGGPplotblocks

Value

Either makes plot or returns plot object

Examples

SG = CGGPcreate(3,100)
plot(SG)
SG = CGGPcreate(3,100)
plot(SG)

S3 predict method for CGGP

Description

Passes to CGGPpred

Predict using SG with y values at xp? Shouldn't y values already be stored in SG?

Usage

## S3 method for class 'CGGP'
predict(object, xp, ...)

CGGPpred(CGGP, xp, theta = NULL, outdims = NULL)
## S3 method for class 'CGGP'
predict(object, xp, ...)

CGGPpred(CGGP, xp, theta = NULL, outdims = NULL)

Arguments

`object`	CGGP object
`xp`	x value to predict at
`...`	Other arguments passed to 'CGGPpred'
`CGGP`	SG object
`theta`	Leave as NULL unless you want to use a value other than thetaMAP. Much slower.
`outdims`	If multiple outputs fit without PCA and with separate parameters, you can predict just for certain dimensions to speed it up. Will leave other columns in the output, but they will be wrong.

Value

Predicted mean values

Examples

SG <- CGGPcreate(d=3, batchsize=100)
y <- apply(SG$design, 1, function(x){x[1]+x[2]^2+rnorm(1,0,.01)})
SG <- CGGPfit(SG, Y=y)
CGGPpred(SG, matrix(c(.1,.1,.1),1,3))
cbind(CGGPpred(SG, SG$design)$mean, y) # Should be near equal
SG <- CGGPcreate(d=3, batchsize=100)
y <- apply(SG$design, 1, function(x){x[1]+x[2]^2+rnorm(1,0,.01)})
SG <- CGGPfit(SG, Y=y)
CGGPpred(SG, matrix(c(.1,.1,.1),1,3))
cbind(CGGPpred(SG, SG$design)$mean, y) # Should be near equal

Print CGGP object

Description

Default print as a list is bad since there's a lot of elements.

Usage

## S3 method for class 'CGGP'
print(x, ...)
## S3 method for class 'CGGP'
print(x, ...)

Arguments

`x`	CGGP object
`...`	Passed to print

Value

String to be printed

Examples

SG = CGGPcreate(3,21)
print(SG)
f <- function(x) {x[1]+exp(x[2]) + log(x[3]+4)}
y <- apply(SG$design, 1, f)
SG <- CGGPfit(SG, y)
print(SG)
SG = CGGPcreate(3,21)
print(SG)
f <- function(x) {x[1]+exp(x[2]) + log(x[3]+4)}
y <- apply(SG$design, 1, f)
SG <- CGGPfit(SG, y)
print(SG)

rcpp_fastmatclcr

Description

rcpp_fastmatclcr

Usage

rcpp_fastmatclcr(I, w, MSEmat, S, maxlevel)
rcpp_fastmatclcr(I, w, MSEmat, S, maxlevel)

Arguments

`I`	Matrix
`w`	vector
`MSEmat`	Matrix
`S`	Vector
`maxlevel`	Integer

Value

Nothing, void

rcpp_fastmatclcranddclcr

Description

rcpp_fastmatclcranddclcr

Usage

rcpp_fastmatclcranddclcr(I, w, MSEmat, dMSEmat, S, dS, maxlevel, numpara)
rcpp_fastmatclcranddclcr(I, w, MSEmat, dMSEmat, S, dS, maxlevel, numpara)

Arguments

`I`	Matrix
`w`	vector
`MSEmat`	Matrix
`dMSEmat`	Matrix
`S`	Vector
`dS`	Matrix
`maxlevel`	Integer
`numpara`	Integer

Value

Nothing, void

rcpp_kronDBS

Description

rcpp_kronDBS

Usage

rcpp_gkronDBS(A, dA, B, p)
rcpp_gkronDBS(A, dA, B, p)

Arguments

`A`	Vector
`dA`	Vector
`B`	Vector
`p`	Vector

Value

kronDBS calculation

Examples

rcpp_gkronDBS(c(1,1), c(0,0), c(.75), c(1,1))
rcpp_gkronDBS(c(1,1), c(0,0), c(.75), c(1,1))

rcpp_kronDBS

Description

rcpp_kronDBS

Usage

rcpp_kronDBS(A, B, p)
rcpp_kronDBS(A, B, p)

Arguments

`A`	Vector
`B`	Vector
`p`	Vector

Value

kronDBS calculation

Plot validation prediction errors

Description

Plot validation prediction errors

Usage

valplot(predmean, predvar, Yval, d = NULL)
valplot(predmean, predvar, Yval, d = NULL)

Arguments

`predmean`	Predicted mean
`predvar`	Predicted variance
`Yval`	Y validation data
`d`	If output is multivariate, which column to use. Will do all if left as NULL.

Value

None, makes a plot

Examples

x <- matrix(runif(100*3), ncol=3)
f1 <- function(x){x[1]+x[2]^2}
y <- apply(x, 1, f1)
# Create a linear model on the data
mod <- lm(y ~ ., data.frame(x))
# Predict at validation data
Xval <- matrix(runif(3*100), ncol=3)
mod.pred <- predict.lm(mod, data.frame(Xval), se.fit=TRUE)
# Compare to true results
Yval <- apply(Xval, 1, f1)
valplot(mod.pred$fit, mod.pred$se.fit^2, Yval=Yval)
x <- matrix(runif(100*3), ncol=3)
f1 <- function(x){x[1]+x[2]^2}
y <- apply(x, 1, f1)
# Create a linear model on the data
mod <- lm(y ~ ., data.frame(x))
# Predict at validation data
Xval <- matrix(runif(3*100), ncol=3)
mod.pred <- predict.lm(mod, data.frame(Xval), se.fit=TRUE)
# Compare to true results
Yval <- apply(Xval, 1, f1)
valplot(mod.pred$fit, mod.pred$se.fit^2, Yval=Yval)

Calculate stats for prediction on validation data

Description

Calculate stats for prediction on validation data

Usage

valstats(
  predmean,
  predvar,
  Yval,
  bydim = TRUE,
  RMSE = TRUE,
  score = TRUE,
  CRPscore = TRUE,
  coverage = TRUE,
  corr = TRUE,
  R2 = TRUE,
  MAE = FALSE,
  MIS90 = FALSE,
  metrics,
  min_var = .Machine$double.eps
)
valstats(
  predmean,
  predvar,
  Yval,
  bydim = TRUE,
  RMSE = TRUE,
  score = TRUE,
  CRPscore = TRUE,
  coverage = TRUE,
  corr = TRUE,
  R2 = TRUE,
  MAE = FALSE,
  MIS90 = FALSE,
  metrics,
  min_var = .Machine$double.eps
)

Arguments

`predmean`	Predicted mean
`predvar`	Predicted variance
`Yval`	Y validation data
`bydim`	If multiple outputs, should it be done separately by dimension?
`RMSE`	Should root mean squared error (RMSE) be included?
`score`	Should score be included?
`CRPscore`	Should CRP score be included?
`coverage`	Should coverage be included?
`corr`	Should correlation between predicted and true mean be included?
`R2`	Should R^2 be included?
`MAE`	Should mean absolute error (MAE) be included?
`MIS90`	Should mean interval score for 90% confidence be included? See Gneiting and Raftery (2007).
`metrics`	Optional additional metrics to be calculated. Should have same first three parameters as this function.
`min_var`	Minimum value of the predicted variance. Negative or zero variances can cause errors.

Value

data frame

References

Gneiting, Tilmann, and Adrian E. Raftery. "Strictly proper scoring rules, prediction, and estimation." Journal of the American Statistical Association 102.477 (2007): 359-378.

Examples

valstats(c(0,1,2), c(.01,.01,.01), c(0,1.1,1.9))
valstats(cbind(c(0,1,2), c(1,2,3)),
         cbind(c(.01,.01,.01),c(.1,.1,.1)),
         cbind(c(0,1.1,1.9),c(1,2,3)))
valstats(cbind(c(0,1,2), c(8,12,34)),
         cbind(c(.01,.01,.01),c(1.1,.81,1.1)),
         cbind(c(0,1.1,1.9),c(10,20,30)), bydim=FALSE)
valstats(cbind(c(.8,1.2,3.4), c(8,12,34)),
         cbind(c(.01,.01,.01),c(1.1,.81,1.1)),
         cbind(c(1,2,3),c(10,20,30)), bydim=FALSE)
valstats(c(0,1,2), c(.01,.01,.01), c(0,1.1,1.9))
valstats(cbind(c(0,1,2), c(1,2,3)),
         cbind(c(.01,.01,.01),c(.1,.1,.1)),
         cbind(c(0,1.1,1.9),c(1,2,3)))
valstats(cbind(c(0,1,2), c(8,12,34)),
         cbind(c(.01,.01,.01),c(1.1,.81,1.1)),
         cbind(c(0,1.1,1.9),c(10,20,30)), bydim=FALSE)
valstats(cbind(c(.8,1.2,3.4), c(8,12,34)),
         cbind(c(.01,.01,.01),c(1.1,.81,1.1)),
         cbind(c(1,2,3),c(10,20,30)), bydim=FALSE)

Package 'CGGP'

Help Index

CGGP: A package for running sparse grid computer experiments

Description

CGGP functions

Calculate MSE over single dimension

Description

Usage

Arguments

Value

Examples

Calculate MSE over blocks

Description

Usage

Arguments

Value

Examples

Calculate predictive weights for CGGP

Description

Usage

Arguments

Value

Examples

Calculate derivative of pw

Description

Usage

Arguments

Value

Examples

Cauchy correlation function

Description

Usage

Arguments

Value

See Also

Examples

CauchySQ correlation function

Description

Usage

Arguments

Value

See Also

Examples

CauchySQT correlation function

Description

Usage

Arguments

Value

See Also

Examples

Gaussian correlation function

Description

Usage

Arguments

Details

Value

See Also

Examples

Matern 3/2 correlation function

Description

Usage

Arguments

Value

See Also

Examples

Matern 5/2 correlation function

Description

Usage

Arguments

Value

See Also

Examples

Power exponential correlation function

Description

Usage

Arguments

Value

See Also

Examples

Wendland0 (Triangle) correlation function