Package 'dda' reference manual

Title:	Direction Dependence Analysis
Description:	A collection of tests to analyze the causal direction of dependence in linear models (Wiedermann, W., & von Eye, A., 2025, ISBN: 9781009381390). The package includes functions to perform Direction Dependence Analysis for variable distributions, residual distributions, and independence properties of predictors and residuals in competing causal models. In addition, the package contains functions to test the causal direction of dependence in conditional models (i.e., models with interaction terms) For more information see <https://www.ddaproject.com>.
Authors:	Wolfgang Wiedermann [aut, cre], Megan Hirni [aut]
Maintainer:	Wolfgang Wiedermann <[email protected]>
License:	MIT + file LICENSE
Version:	0.1.0
Built:	2025-03-07 20:27:05 UTC
Source:	https://github.com/wwiedermann/dda

Conditional Direction Dependence Analysis: Independence Properties

Description

cdda.indep computes CDDA test statistics to evaluate asymmetries of predictor-error independence of competing conditional models (y ~ x * m vs. x ~ y * m with m being a continuous or categorical moderator).

print returns the output of standard linear model coefficients for competing target and alternative models.

plot returns graphs for CDDA test statistics obtained from competing conditional models.

summary returns test statistics from the cdda.indep class object.

Usage

cdda.indep(
  formula = NULL,
  pred = NULL,
  mod = NULL,
  modval = NULL,
  data = list(),
  hetero = FALSE,
  diff = FALSE,
  nlfun = NULL,
  hsic.method = "gamma",
  B = 200,
  boot.type = "perc",
  conf.level = 0.95,
  parallelize = FALSE,
  cores = 1,
  ...
)

## S3 method for class 'cdda.indep'
print(x, ...)

## S3 method for class 'cdda.indep'
plot(x = NULL, stat = NULL, ylim = NULL, ...)

## S3 method for class 'cdda.indep'
summary(
  object,
  nlfun = FALSE,
  hetero = FALSE,
  hsic = TRUE,
  hsic.diff = FALSE,
  dcor = TRUE,
  dcor.diff = FALSE,
  mi.diff = FALSE,
  ...
)
cdda.indep(
  formula = NULL,
  pred = NULL,
  mod = NULL,
  modval = NULL,
  data = list(),
  hetero = FALSE,
  diff = FALSE,
  nlfun = NULL,
  hsic.method = "gamma",
  B = 200,
  boot.type = "perc",
  conf.level = 0.95,
  parallelize = FALSE,
  cores = 1,
  ...
)

## S3 method for class 'cdda.indep'
print(x, ...)

## S3 method for class 'cdda.indep'
plot(x = NULL, stat = NULL, ylim = NULL, ...)

## S3 method for class 'cdda.indep'
summary(
  object,
  nlfun = FALSE,
  hetero = FALSE,
  hsic = TRUE,
  hsic.diff = FALSE,
  dcor = TRUE,
  dcor.diff = FALSE,
  mi.diff = FALSE,
  ...
)

Arguments

`formula`	Symbolic formula of the model to be tested or an `lm` object.
`pred`	A character indicating the variable name of the predictor which serves as the outcome in the alternative model.
`mod`	A character indicating the variable name of the moderator.
`modval`	Characters or a numeric sequence specifying the moderator values used in post-hoc probing. Possible characters include `c("mean", "median", "JN")`.`modval = "mean"` tests the interaction effect at the moderator values `M – 1SD`, `M`, and `M + 1SD`; `modval = "median"` uses `Q1`, `Md`, and `Q3`. The Johnson-Neyman approach is applied when `modval = "JN"` with conditional effects being evaluated at the boundary values of the significance regions. When a numeric sequence is specified,the pick-a-point approach is used for the selected numeric values.
`data`	A required data frame containing the variables in the model.
`hetero`	A logical value indicating whether separate homoscedasticity tests should be returned when using `summary`, default is `FALSE`.
`diff`	A logical value indicating whether differences in HSIC, dCor, and MI values should be computed. Bootstrap confidence intervals are computed using B bootstrap samples.
`nlfun`	A logical value indicating whether non-linear correlation tests should be returned when using `summary`, default is `FALSE`.
`hsic.method`	A character indicating the inference method for the Hilbert-Schmidt Independence Criterion. Must be one of the four specifications `c("gamma", "eigenvalue", "boot", "permutation")`.`hsic.method = "gamma"` is the default.
`B`	Number of permutations for separate dCor tests and number of resamples when `hsic.method = c("boot", "permutation")` or `diff = TRUE`.
`boot.type`	A character indicating the type of bootstrap confidence intervals. Must be one of the two specifications `c("perc", "bca")`. `boot.type = "perc"` is the default.
`conf.level`	Confidence level for bootstrap confidence intervals.
`parallelize`	A logical value indicating whether bootstrapping is performed on multiple cores. Only used if `diff = TRUE`.
`cores`	A numeric value indicating the number of cores. Only used if `parallelize = TRUE`.
`...`	Additional arguments to be passed to the function.
`x`	An object of class `cdda.indep` when using `print` or `plot`.
`stat`	A character indicating the CDDA statistic to be plotted with the options `c("hsic.diff", "dcor.diff", "mi.diff")`.
`ylim`	A numeric vector of length 2 indicating the y-axis limits if `NULL`, the function will set the limits automatically.
`object`	An object of class `cdda.indep` when using `summary`.
`hsic`	A logical value indicating whether deparate HSIC tests should be returned when using `summary`, default is `TRUE`.
`hsic.diff`	A logical value indicating whether HSIC difference statistics should be returned when using `summary`, default is `FALSE`.
`dcor`	A logical value indicating whether separate Distance Correlation (dCor) tests should be returned when using `summary`, default is `TRUE`.
`dcor.diff`	A logical value indicating whether dCor difference statistics should be returned when using `summary`, default is `FALSE`.
`mi.diff`	A logical value indicating whether Mutual Information (MI) difference statistics should be returned when using `summary`, default is `FALSE`.

Value

A list of class cdda.indep containing the results of CDDA independence tests for pre-specified moderator values.

An object of class cdda.indep with competing model coefficients.

References

Wiedermann, W., & von Eye, A. (2025). Direction Dependence Analysis: Foundations and Statistical Methods. Cambridge, UK: Cambridge University Press.

Examples

set.seed(321)
n <- 700

## --- generate moderator

z <- sort(rnorm(n))
z1 <- z[z <= 0]
z2 <- z[z > 0]

## --- x -> y when z <= 0

x1 <- rchisq(length(z1), df = 4) - 4
e1 <- rchisq(length(z1), df = 3) - 3
y1 <- 0.5 * x1 + e1

## --- y -> x when m z > 0

y2 <- rchisq(length(z2), df = 4) - 4
e2 <- rchisq(length(z2), df = 3) - 3
x2 <- 0.25 * y2 + e2

y <- c(y1, y2); x <- c(x1, x2)

d <- data.frame(x, y, z)

m <- lm(y ~ x * z, data = d)


result <- cdda.indep(m,
                     pred = "x",
                     mod = "z",
                     modval = c(-1, 1),
                     data = d,
                     hetero = TRUE,
                     diff = TRUE,
                     parallelize = TRUE,
                     cores = 2,
                     nlfun = 2,
                     B = 50)


print(result)

plot(result, stat = "dcor.diff")

summary(result, hetero = FALSE)

set.seed(321)
n <- 700

## --- generate moderator

z <- sort(rnorm(n))
z1 <- z[z <= 0]
z2 <- z[z > 0]

## --- x -> y when z <= 0

x1 <- rchisq(length(z1), df = 4) - 4
e1 <- rchisq(length(z1), df = 3) - 3
y1 <- 0.5 * x1 + e1

## --- y -> x when m z > 0

y2 <- rchisq(length(z2), df = 4) - 4
e2 <- rchisq(length(z2), df = 3) - 3
x2 <- 0.25 * y2 + e2

y <- c(y1, y2); x <- c(x1, x2)

d <- data.frame(x, y, z)

m <- lm(y ~ x * z, data = d)


result <- cdda.indep(m,
                     pred = "x",
                     mod = "z",
                     modval = c(-1, 1),
                     data = d,
                     hetero = TRUE,
                     diff = TRUE,
                     parallelize = TRUE,
                     cores = 2,
                     nlfun = 2,
                     B = 50)


print(result)

plot(result, stat = "dcor.diff")

summary(result, hetero = FALSE)

Conditional Directional Dependence Analysis: Variable Distributions

Description

cdda.vardist computes DDA test statistics for observed variable distributions of competing conditional models (y ~ x * m vs.x ~ y * m with m being a continuous or categorical moderator).

print returns the output of standard linear model coefficients for competing target and alternative models.

plot returns graphs for CDDA test statistics obtained from competing conditional models.

summary returns test statistics from the cdda.vardist class object.

Usage

cdda.vardist(
  formula,
  pred = NULL,
  mod = NULL,
  data = list(),
  modval = NULL,
  B = 200,
  boot.type = "perc",
  conf.level = 0.95
)

## S3 method for class 'cdda.vardist'
print(x, ...)

## S3 method for class 'cdda.vardist'
plot(x, stat = NULL, ylim = NULL, ...)

## S3 method for class 'cdda.vardist'
summary(object, skew = TRUE, coskew = FALSE, kurt = TRUE, cokurt = FALSE, ...)
cdda.vardist(
  formula,
  pred = NULL,
  mod = NULL,
  data = list(),
  modval = NULL,
  B = 200,
  boot.type = "perc",
  conf.level = 0.95
)

## S3 method for class 'cdda.vardist'
print(x, ...)

## S3 method for class 'cdda.vardist'
plot(x, stat = NULL, ylim = NULL, ...)

## S3 method for class 'cdda.vardist'
summary(object, skew = TRUE, coskew = FALSE, kurt = TRUE, cokurt = FALSE, ...)

Arguments

`formula`	Symbolic formula of the model to be tested or a `lm` object
`pred`	A character indicating the variable name of the predictor which serves as the outcome in the alternative model.
`mod`	A character indicating the variable name of the moderator.
`data`	A required data frame containing the variables in the model.
`modval`	Characters or a numeric sequence specifying the moderator values used in post-hoc probing. Possible characters include `c("mean", "median", "JN")`. `modval = "mean"` tests the interaction effect at the moderator values `M - 1SD`, `M`, and `M + 1SD`; `modval = "median"` uses `Q1`, `Md`, and `Q3`. The Johnson-Neyman approach is applied when `modval = "JN"` with conditional effects being evaluated at the boundary values of the significance regions. When a numeric sequence is specified, the pick-a-point approach is used for the selected numeric values.
`B`	Number of bootstrap samples.
`boot.type`	A character indicating the type of bootstrap confidence intervals. Must be one of the two values `c("perc", "bca")`. `boot.type = "bca"` is the default.
`conf.level`	Confidence level for bootstrap confidence intervals.
`x`	An object of class `cdda.vardist` when using `print` or `plot`.
`...`	Additional arguments to be passed to the function.
`stat`	A character indicating the statistic to be plotted, default is `"rhs"`, with options `c("coskew", "cokurt", "rhs", "rcc", "rtanh")`.
`ylim`	A numeric vector of length 2 indicating the y-axis limits. If `NULL`, the function will set the limits automatically.
`object`	An object of class `cdda.vardist` when using `summary`.
`skew`	A logical value indicating whether skewness differences and separate D'Agostino skewness tests should be returned when using `summary`, default is `TRUE`.
`coskew`	A logical value indicating whether co-skewness differences should be returned when using `summary`, default is `FALSE`.
`kurt`	A logical value indicating whether excess kurtosis differences and Anscombe-Glynn kurtosis tests should be returned when using `summary`, default is `TRUE`.
`cokurt`	A logical value indicating whether co-kurtosis differences should be returned when using `summary`, default is `FALSE`.

Value

A list of class cdda.vardist containing the results of CDDA tests to evaluate distributional properties of observed variables for pre-specified moderator values.

References

Wiedermann, W., & von Eye, A. (2025). Direction Dependence Analysis: Foundations and Statistical Methods. Cambridge, UK: Cambridge University Press.

Examples

set.seed(321)
n <- 700

## --- generate moderator

z <- sort(rnorm(n))
z1 <- z[z <= 0]
z2 <- z[z > 0]

## --- x -> y when z <= 0

x1 <- rchisq(length(z1), df = 4) - 4
e1 <- rchisq(length(z1), df = 3) - 3
y1 <- 0.5 * x1 + e1

## --- y -> x when m z > 0

y2 <- rchisq(length(z2), df = 4) - 4
e2 <- rchisq(length(z2), df = 3) - 3
x2 <- 0.25 * y2 + e2

y <- c(y1, y2); x <- c(x1, x2)

d <- data.frame(x, y, z)

m <- lm(y ~ x * z, data = d)

result <- cdda.vardist(m, pred = "x", mod = "z", B = 50,
                      modval = c(-1, 1), data = d)

print(result)

plot(result, stat = "rtanh", ylim = c(-0.05, 0.05))

summary(result, skew = FALSE, kurt = FALSE, coskew = TRUE)

set.seed(321)
n <- 700

## --- generate moderator

z <- sort(rnorm(n))
z1 <- z[z <= 0]
z2 <- z[z > 0]

## --- x -> y when z <= 0

x1 <- rchisq(length(z1), df = 4) - 4
e1 <- rchisq(length(z1), df = 3) - 3
y1 <- 0.5 * x1 + e1

## --- y -> x when m z > 0

y2 <- rchisq(length(z2), df = 4) - 4
e2 <- rchisq(length(z2), df = 3) - 3
x2 <- 0.25 * y2 + e2

y <- c(y1, y2); x <- c(x1, x2)

d <- data.frame(x, y, z)

m <- lm(y ~ x * z, data = d)

result <- cdda.vardist(m, pred = "x", mod = "z", B = 50,
                      modval = c(-1, 1), data = d)

print(result)

plot(result, stat = "rtanh", ylim = c(-0.05, 0.05))

summary(result, skew = FALSE, kurt = FALSE, coskew = TRUE)

Direction Dependence Analysis: Independence Properties

Description

dda.indep computes DDA test statistics to evaluate asymmetries of predictor-error independence of causally competing models (y ~ x vs. x ~ y).

print returns DDA test statistics associated with dda.indep objects.

Usage

dda.indep(
  formula,
  pred = NULL,
  data = list(),
  nlfun = NULL,
  hetero = FALSE,
  hsic.method = "gamma",
  diff = FALSE,
  B = 200,
  boot.type = "perc",
  conf.level = 0.95,
  parallelize = FALSE,
  cores = 1
)

## S3 method for class 'dda.indep'
print(x, ...)
dda.indep(
  formula,
  pred = NULL,
  data = list(),
  nlfun = NULL,
  hetero = FALSE,
  hsic.method = "gamma",
  diff = FALSE,
  B = 200,
  boot.type = "perc",
  conf.level = 0.95,
  parallelize = FALSE,
  cores = 1
)

## S3 method for class 'dda.indep'
print(x, ...)

Arguments

`formula`	Symbolic formula of the model to be tested or a `lm` object.
`pred`	A character indicating the variable name of the predictor which serves as the outcome in the alternative model.
`data`	An optional data frame containing the variables in the model (by default variables are taken from the environment which `dda.indep` is called from).
`nlfun`	Either a numeric value or a function of .Primitive type used for non-linear correlation tests. When `nlfun` is numeric the value is used in a power transformation.
`hetero`	A logical value indicating whether separate homoscedasticity tests (i.e., standard and robust Breusch-Pagan tests) should be computed.
`hsic.method`	A character indicating the inference method for the Hilbert-Schmidt Independence Criterion (HSIC). Must be one of the four specifications `c("gamma", "eigenvalue", "boot", "permutation")`. `hsic.method = "gamma"`is the default.
`diff`	A logical value indicating whether differences in HSIC, Distance Correlation (dCor), and MI values should be computed. Bootstrap confidence intervals are computed using B bootstrap samples.
`B`	Number of permutations for separate dCor tests and number of resamples if `hsic.method = c("boot", "permutation")` or `diff = TRUE`.
`boot.type`	A vector of character strings representing the type of bootstrap confidence intervals. Must be one of the two specifications `c("perc", "bca")`.`boot.type = "perc"` is the default.
`conf.level`	Confidence level for bootstrap confidence intervals.
`parallelize`	A logical value indicating whether bootstrapping is performed on multiple cores. Only used if `diff = TRUE`.
`cores`	A numeric value indicating the number of cores. Only used if `parallelize = TRUE`.
`x`	An object of class `dda.indep` when using `print`.
`...`	Additional arguments to be passed to the function.

Value

An object of class dda.indep containing the results of DDA independence tests.

References

Wiedermann, W., & von Eye, A. (2025). Direction Dependence Analysis: Foundations and Statistical Methods. Cambridge, UK: Cambridge University Press.

Examples


set.seed(123)
n <- 500
x <- rchisq(n, df = 4) - 4
e <- rchisq(n, df = 3) - 3
y <- 0.5 * x + e
d <- data.frame(x, y)

result <- dda.indep(y ~ x, pred = "x", data = d, parallelize = TRUE, cores = 2,
          nlfun = 2, B = 50, hetero = TRUE, diff = TRUE)


print(result)

set.seed(123)
n <- 500
x <- rchisq(n, df = 4) - 4
e <- rchisq(n, df = 3) - 3
y <- 0.5 * x + e
d <- data.frame(x, y)

result <- dda.indep(y ~ x, pred = "x", data = d, parallelize = TRUE, cores = 2,
          nlfun = 2, B = 50, hetero = TRUE, diff = TRUE)


print(result)

Direction Dependence Analysis: Residual Distributions

Description

dda.resdist evaluates patterns of asymmetry of error distributions of causally competing models (y ~ x vs. x ~ y).

print returns DDA test statistics associated with dda.resdist objects.

Usage

dda.resdist(
  formula,
  pred = NULL,
  data = list(),
  B = 200,
  boot.type = "perc",
  prob.trans = FALSE,
  conf.level = 0.95
)

## S3 method for class 'dda.resdist'
print(x, ...)
dda.resdist(
  formula,
  pred = NULL,
  data = list(),
  B = 200,
  boot.type = "perc",
  prob.trans = FALSE,
  conf.level = 0.95
)

## S3 method for class 'dda.resdist'
print(x, ...)

Arguments

`formula`	Symbolic formula of the target model to be tested or a `lm` object.
`pred`	Variable name of the predictor which serves as the outcome in the alternative model.
`data`	An optional data frame containing the variables in the model (by default variables are taken from the environment which `dda.resdist` is called from).
`B`	Number of bootstrap samples.
`boot.type`	A vector of character strings representing the type of bootstrap confidence intervals required. Must be one of the two values `c("perc", "bca")`; `boot.type = "perc"` is the default.
`prob.trans`	A logical value indicating whether a probability integral transformation should be performed prior computation of skewness and kurtosis difference tests.
`conf.level`	Confidence level for bootstrap confidence intervals.
`x`	An object of class `dda.resdist` when using `print`.
`...`	Additional arguments to be passed to the method.

Value

An object of class ddaresdist containing the results of DDA tests of asymmetry patterns of error distributions obtained from the causally competing models.

References

Wiedermann, W., & von Eye, A. (2025). Direction Dependence Analysis: Foundations and Statistical Methods. Cambridge, UK: Cambridge University Press.

Examples

set.seed(123)
n <- 500
x <- rchisq(n, df = 4) - 4
e <- rchisq(n, df = 3) - 3
y <- 0.5 * x + e
d <- data.frame(x, y)

result <- dda.resdist(y ~ x, pred = "x", data = d,
            B = 50, conf.level = 0.90, prob.trans = TRUE)

print(result)

set.seed(123)
n <- 500
x <- rchisq(n, df = 4) - 4
e <- rchisq(n, df = 3) - 3
y <- 0.5 * x + e
d <- data.frame(x, y)

result <- dda.resdist(y ~ x, pred = "x", data = d,
            B = 50, conf.level = 0.90, prob.trans = TRUE)

print(result)

Direction Dependence Analysis: Variable Distributions

Description

dda.vardist evaluates patterns of asymmetry of variable distributions for causally competing models (y ~ x vs. x ~ y).

print returns DDA test statistics associated with dda.vardist objects.

Usage

dda.vardist(
  formula,
  pred = NULL,
  data = list(),
  B = 200,
  boot.type = "perc",
  conf.level = 0.95,
  ...
)

## S3 method for class 'dda.vardist'
print(x, ...)
dda.vardist(
  formula,
  pred = NULL,
  data = list(),
  B = 200,
  boot.type = "perc",
  conf.level = 0.95,
  ...
)

## S3 method for class 'dda.vardist'
print(x, ...)

Arguments

`formula`	Symbolic formula of the model to be tested or a `lm`object.
`pred`	Variable name of the predictor which serves as the outcome in the alternative model.
`data`	An optional data frame containing the variables in the model (by default variables are taken from the environment which `dda.vardist` is called from).
`B`	Number of bootstrap samples.
`boot.type`	A character indicating the type of bootstrap confidence intervals. Must be one of the two specifications c("perc", "bca"). boot.type = "perc" is the default.
`conf.level`	Confidence level for bootstrap confidence intervals.
`...`	Additional arguments to be passed to the function.
`x`	An object of class `dda.vardist` when using `print`.

Value

An object of class dda.vardist containing the results of DDA tests of asymmetry patterns of variable distributions.

An object of class dda.vardist.

References

Wiedermann, W., & von Eye, A. (2025). Direction Dependence Analysis: Foundations and Statistical Methods. Cambridge, UK: Cambridge University Press.

Examples

set.seed(123)
n <- 500

x <- rchisq(n, df = 4) - 4
e <- rchisq(n, df = 3) - 3
y <- 0.5 * x + e
d <- data.frame(x, y)

result <- dda.vardist(y ~ x, pred = "x", data = d, B = 50)

print(result)
set.seed(123)
n <- 500

x <- rchisq(n, df = 4) - 4
e <- rchisq(n, df = 3) - 3
y <- 0.5 * x + e
d <- data.frame(x, y)

result <- dda.vardist(y ~ x, pred = "x", data = d, B = 50)

print(result)

Package 'dda'

Help Index

Conditional Direction Dependence Analysis: Independence Properties

Description

Usage

Arguments

Value

References

See Also

Examples

Conditional Directional Dependence Analysis: Variable Distributions

Description

Usage

Arguments

Value

References

See Also

Examples

Direction Dependence Analysis: Independence Properties

Description

Usage

Arguments

Value

References

See Also

Examples

Direction Dependence Analysis: Residual Distributions

Description

Usage

Arguments

Value

References

Examples

Direction Dependence Analysis: Variable Distributions

Description

Usage

Arguments

Value

References

See Also

Examples