# For plotting
library(ggplot2)
# For extracting SDT parameters
library(neuropsychology)
I recently came across a paper that explained the equality between the parameters of signal detection theory (SDT) and the parameters of logistic regression in which the state (“absent”/“present”) is used to predict the response (“yes”/“no”, but also applicable in scale-rating designs) (DeCarlo, 1998).
Here is a short simulation-proof for this equality.
Setup
For this simulations we will need the following packages:
We will also need to make sure, for the logistic regression analysis, that our factors’ dummy coding is set to effects-coding - otherwise the intercept’s meaning will not correspond to the criterion (aka the overall response bias):
options(contrasts = c('contr.sum', 'contr.poly'))
The Simulations
<- 100L
n <- 100L B
We’ll run 100 simulations with 100 trials each.
Simulation Code
set.seed(1)
<- function(state,resp) {
SDT_params <- table(state,resp)
tab
<- dprime(
sdt_res n_hit = tab[2,2],
n_miss = tab[2,1],
n_fa = tab[1,2],
n_cr = tab[1,1]
)
c(sdt_res$dprime , sdt_res$c)
}
<- function(state,resp){
logistic_reg_params <- glm(resp ~ state, family = binomial())
fit
coef(fit)
}
# initialize
<- data.frame(d_ = numeric(B),
res c_ = numeric(B),
int = numeric(B),
slope = numeric(B))
# Loop
for (b in seq_len(B)) {
<- rexp(1,10) # random
true_sensitivity <- runif(1,-1,1) # random
true_criterion
# true state vector
<- rep(c(F,T), each = n/2)
state_i
# response vector
<- rnorm(n/2) # noise dist
Xn <- rnorm(n/2, mean = true_sensitivity) # signal + noise dist
Xs <- c(Xn,Xs)
X <- X > true_criterion
resp_i
# SDT params
1:2] <- SDT_params(state_i,resp_i)
res[b,
# logistic regression params
3:4] <- logistic_reg_params(state_i,resp_i)
res[b, }
Results
SDT parameters are on a standardized normal scale, meaning they are scaled to
- The slope must be also scaled by
due to R’s default effects coding.
- The intercept must also be scaled by
- see paper for the full rationale.
The red-dashed line represents the expected regression line predicting the SDT parameters from their logistic counterparts:
(The blue line is the empirical regression line.)
Conclusions
It is also possible to extend this equality to multi-level designs with generalized linear mixed models (GLMM; see chapters 3 and 9 in Modeling Psychophysical Data in R)2, but I see no reason this wouldn’t be possible… One could model random effects per subject, and the moderating effect of some