Hyperbolic Discounting Outperforms a Dual Response-Bias Model for Intertemporal Choices

0
62

Abstract

Delay discounting is defined as the tendency to devalue rewards as their delay increases, and it is a fundamental component of decision-making that enables cognitive scientists to investigate human behavior. In this study, we investigate whether augmenting classic-value based models with a response-bias parameter (\alpha) improves descriptive adequacy or aligns with independent inhibitory control. We combined simulations and participant fits across three models: 1) hyperbolic, 2) hyperboloid, 3) dual-variant that adds (\alpha) to choice rule. Simulations over a k \times \alpha on the Monetary Choice Questionnaire (MCQ) by Kirby et al. with canonical parameters (\tau = 0.5, \sigma = 1.0) showed that delayed choices concentrate only when both the discount rate k and \alpha are low; increasing alpha shifts choices toward the immediate option, which is consistent with immediate gains. In participant data, hyperbolic discounting provided the best Akaike information criterion (AIC) for every converged case (N = 76/76). The dual model fit worse (median \Delta\text{AIC}{\text{dual-hyperbolic}} = +11.63; \Delta\text{AIC}{\text{dual-hyperboloid}} = +9.70) and exhibited boundary solutions (\alpha at the lower bound in 93.4%; \tau at the upper bound in 51.3%). Fitted \alpha correlated negatively with the proportion of delayed choices (r = -0.38, p = 7.2 \times 10^{-4}) and showed no reliable associated with stop-signal reaction time (SSRT; r = 0.07, p = 0.49). First-order Sobol indices were small for all parameters. These results indicate that a single decision-stage gain neither improves model fit nor tracks inhibitory control in these data; instead, alpha acts as an immediacy bias, while simple hyperbolic valuation remains sufficient for standard intertemporal-choice tasks.

Introduction

Delay discounting refers to the systematic decline in the subjective value of a reward as the delay to its receipt increases. It is a foundational concept in behavioral economics and cognitive neuroscience, widely used to model impulsive decision-making1,2,3,4. In delay discounting tasks, agents choose between a smaller immediate reward and a larger delayed one. Choosing the immediate option is interpreted as a more impulsive or short-sighted decision. This paradigm has become a gold standard for quantifying individual differences in impulsivity and self-control.

Impulsive discounting behavior has strong predictive power in clinical and everyday contexts. Steeper discounting (valuing future rewards significantly less than immediate ones) has been robustly linked to a range of maladaptive outcomes. For instance, individuals with substance use disorders, gambling addiction, and obesity typically show higher delay discounting rates compared to healthy controls5,6,7,8,9. These individuals tend to prefer short-term gratification (i.e., a drug high or snack) despite long-term costs (i.e., health, financial stability). Steeper discounting is also predictive of earlier relapse in addiction treatment and poorer treatment adherence10,11. Given its clinical and theoretical relevance, delay discounting has emerged as a key behavioral marker of impulsivity across populations.

However, impulsivity is a multifaceted construct. Temporal discounting captures one dimension, but executive function and inhibitory control play additional roles in regulating decision-making. The ability to inhibit prepotent or reflexive responses (motor impulsivity) is a distinct but related trait. Importantly, empirical studies show that individuals with weaker inhibitory control often exhibit steeper delay discounting12,13. For example, adolescents with attention-deficit/hyperactivity disorder (ADHD) who perform poorly on response inhibition tasks also tend to favor immediate over delayed rewards more often14,15. These findings suggest that the ability to value delayed rewards is not solely a function of subjective valuation, but also of self-regulatory capacity. Some theorists have argued that temporal discounting itself should be considered part of the broader executive function system12,16.

This study addresses that gap by augmenting classical value-based discounting with an explicit response-bias parameter, α, embedded in the choice rule17. Conceptually, k governs how sharply delayed rewards are devalued, whereas α scales the effective weight of the immediate option at decision time. In simulations, we use a deterministic threshold rule to cleanly map how k and α jointly shape choice patterns over a standard set of intertemporal offers. For empirical data we couple valuation with a logistic choice rule that includes a temperature parameter τ; this allows us to fit individual participants’ choices by maximum likelihood and to compare competing models with Akaike’s information criteria (AIC). We further evaluate the construct validity of α by testing whether it aligns with (i) observed choice behavior (proportion of delayed choices) and (ii) an independent inhibitory-control measure, stop-signal reaction time (SSRT)18,19,20.

Methodologically, the approach is two-pronged. First, we perform large-scale simulations on the canonical Monetary Choice Questionnaire (MCQ) by Kirby et al. to produce a k×α landscape of delayed-choice behavior under a fixed noise/temperature setting21. We also run a global sensitivity analysis (first-order Sobol indices) to quantify the marginal influence of each parameter on delayed-choice probability across the grid. Second, we fit three models to individual participants: (i) hyperbolic (Mazur)17, (ii) hyperboloid (Myerson–Green)17,22, and (iii) a dual variant that adds α to the choice rule. We conduct model selection by AIC, examine parameter distributions and correlations, and assess whether fitted α tracks either behavior or SSRT19,20,18.

Lines of previous work explain intertemporal choice with valuation alone (e.g., hyperbolic/hyperboloid)17,23, but dual-process perspectives argue for separable influences of valuation and control (e.g., “hot–cool” or competing-systems accounts)24,25,26,27. Motivated by these views, we formalized α as a decision-stage immediacy gain: at choice time, the immediate option is effectively up-weighted, shifting selections toward smaller–sooner outcomes as α increases. This operationalization complements classic valuation by isolating response bias from subjective value. Our formulation is further expanded upon in the Methods section.

Our specific objectives are as follows: (1) formalize an explicit, decision-stage response-bias parameter α within standard discounting frameworks17; (2) map the joint effects of discount rate k and response bias α on intertemporal choice via grid-based simulations on a widely used trial set (MCQ)21; (3) fit hyperbolic, hyperboloid, and dual models to participant data and compare them using AIC, including checks for parameter identifiability and boundary solutions17,22; (4) evaluate the behavioral and external validity of α by correlating it with the proportion of delayed choices and with SSRT19,20,18; and (5) quantify the marginal contribution of each parameter to delayed-choice behavior using Sobol sensitivity analysis. This design enables a direct test of whether adding a control-like gain improves descriptive adequacy beyond value-only models and clarifies what aspect of behavior the α term actually captures.

Results

We report simulation results that sweep the discount rate k and the response-bias gain \alpha on the Kirby trial set, together with empirical model fits to participant data. Unless noted, the canonical simulation setting uses \tau = 0.5 and stimulus noise \sigma = 1.0. Across the k \times \alpha grid, delayed-choice behavior was high only when both k and \alpha were low (Fig. 1). Increasing either parameter drove agents toward immediate choices, and increasing \alpha in particular reduced the proportion of delayed choices to near zero over much of the grid. In this implementation, \alpha therefore functions as an urgency or immediacy gain rather than a patience-promoting control, concentrating delayed responding in the low-k, low-\alpha corner. The qualitative patterns was unchanged across \tau \in {0.2, 0.5, 1.0} and \sigma \in {0, 1, 3} (Supplementary Figures S1–S3).

In the participant fits, hyperbolic discounting was preferred by AIC for every individual with converged estimates (N = 76/76; Fig. 2). The dual model that augments the choice rule performed worse than both comparators: median \Delta\text{AIC}{\text{dual-hyperbolic}} = +11.63 and \Delta\text{AIC}{\text{dual-hyperboloid}} = +9.70. Only 7.9% (dual-hyperbolic) and 9.2% (dual-hyperboloid) of cases showed negative \DeltaAIC, aligning with the overall log-likelihood distributions and indicating that adding a decision-stage gain did not improve descriptive adequacy for these data.

Parameter behavior within the dual model showed pronounced boundary solutions (Fig. 3, 4). Specifically, \alpha piled at its lower bound for 71/76 (93.4%) participants, \tau hit the upper bound (= 10.0) for 39/76 (51.3%), and k took its minimum value for 72/76 (94.7%). The k\alpha correlation was positive (Pearson r = 0.382, 95% CI [0.171, 0.559]), whereas correlations of \tau with k or \alpha were near zero (CIs include 0). Together with the AIC results, this pattern indicates weak identifiability of the added control terms given the present task and data.

Fig 4: Positive k\alpha correlation.

We next asked whether \alpha relates to observed behavior and to an independent inhibitory-control measure. Across the AIC sample, \alpha correlated negatively with the proportion of delayed choices (Pearson r = -0.380, 95% CI [-0.563, -0.171], p = 7.2 \times 10^{-4}; Fig. 5). In an independent cohort with stop-signal reaction time (SSRT; N = 101), \alpha showed no reliable association with SSRT (Pearson r = 0.070, 95% CI [-0.121, 0.254], p = 0.49; Fig. 5). These results caution against interpreting \alpha as a trait-like inhibitory-control parameter without external validation.

Finally, first-order Sobol indices (S_1) computed on the Kirby trial set were small for all parameters (\alpha \approx 0.012, k \approx 0.006, \tau \approx 0.005; Fig. 6). This sensitivity profile indicates weak marginal influence of any single parameter on delayed-choice probability when averaged across the grid and is consistent with the empirical boundary solutions and with the AIC preference for the simpler hyperbolic model (see also Supplementary Figures S4–S5 for alternative sensitivity visualizations). Together, the simulations and fits converge on the same conclusion: variation in k explains the data well, whereas adding the dual-control terms (\alpha, \tau) does not improve descriptive adequacy; moreover, \alpha behaves as an immediate-choice bias rather than a validated inhibitory control proxy in this context.

Fig 6: First-order Sobol indices (S_1) for k, \alpha, and \tau on the Kirby trial set. Bars are small for all parameters.

Discussion

Our study presents a computational framework that integrates valuation and executive control in delay discounting behavior24,25,23. The key findings demonstrate that (1) higher discount rates k lead to more impulsive, immediate-reward choices and lower total earnings17,23; (2) we originally hypothesized that higher inhibitory control α would promote patient decision-making, but in the present implementation, α behaved as an immediacy/response-bias gain; (3) the hyperbolic model, not the hyperboloid, provided the best descriptive fit for all particiants with converged estimates, with the dual model that augments the choice rule with α fitting worse by AIC17,22; and (4) agents showed a robust magnitude effect, choosing delayed rewards more frequently when the rewards were larger23. In addition, dual-model parameters frequently hit bounds (α at its lower bound for 93.4% of participants, τ at its upper bound for 51.3%, and k at its minimum for 94.7%), and first-order Sobol indices were small across parameters, indicating limited marginal influence when averaged over the grid.

These findings are significant for several reasons. First, they support the empirical literature suggesting that impulsive choice is not solely driven by reward valuation but also moderated by decision-stage factors24,25,26,27. However, the absence of an associated between α and an independent inhibitory control measure (SSRT; r=0.07,p=0.49) cautions against interpreting α as a traitlike inhibitory-control construct without external validation28,29,30. Our simulations reinforce that steep delay discounting can stem from two sources: a high discount rate or a response-bias/urgency tendency at choice26,27. Second, this dual-source account helps clarify why individuals with similar k values may behave differently under uncertainty: differences in immediacy bias (α) can tilt decisions toward smaller-sooner options even when value differences are modest. Third, our modeling advances traditional delay discounting frameworks by explicitly integrating a decision-stage term, showing that a one-parameter value function (hyperbolic) often suffices once a reasonable stochastic choice rule is in place, and that adding a control gain can introduce identifiability challenges when the task afford limited leverage to separate valuation from response bias17,22.

In relation to our objectives, all four goals were successfully met. We implemented and compared three major discounting models, systematically varied k and \alpha, and captured known behavioral effects. Empirically, the hyperbolic model outperformed both the hyperboloid and dual-models by AIC; the dual model exhibited boundary solutions and weak identifiability; \alpha related to delayed-choice behavior but not to SSRT; and global sensitivity analysis (first-order Sobol indices) indicated limited marginal influence of individual parameters when averaged across the k \times \alpha grid. Simulations mirrored the empirical patterns: delayed responses were concentrated in the low-k, low-\alpha region under the primary setting \tau = 0.5, \sigma = 1.0.

Limitations of this work include the notion that agents do not iteratively learn from feedback or update their strategies, meaning that the current framework may not be as dynamic as human decision-making is31,32. In addition, while the deterministic nature of the virtual agents makes it ideal for understanding the discounting model behaviors, it is unlike a real-world scenario where noisy disruptions of behavior are typically observed. Our use of fixed-frame monetary choices (Kirby MCQ) offers limited leverage to dissociate valuation from response bias, which likely contributed to boundary solutions and small Sobol indices33. We also did not include out-of-sample predictive tests due to modest trial counts, nor did we benchmark against learning-based agents given the static nature of the MCQ31,32. Finally, α’s lack of association with SSRT in our dataset argues against treating it as a trait-level inhibitory control measure without external validation28,29,30. Together, these limitations bound our claims to the present task and implementation.

Our results indicate that α functions as an immediacy/urgency gain rather than a traitlike inhibitory-control parameter, and that orthogonal manipulations of immediacy should increase power to identify it and enable contrasts with inhibition-focused measures28,26,27. This simulation-fit-validation pipeline clarifies that the control-like term adds value not by improving fit here or serving as a proxy for inhibitory control, but as a manipulable response-bias knob whose behavioral signature is separable from valuation under appropriately designed conditions. Future work should look to extend this framework with reinforcement-learning and attention models31,32.

Methods

We evaluated three valuation rules for delayed rewards that vary in how they map delay D to subjective value V. The hyperbolic (Mazur) rule is

(1)   \begin{equation*}V(A, D \mid k) = \frac{A}{1 + kD}\end{equation*}

with discount rate k > 0\cite{2}. The hyperboloid (Myerson–Green) generalizes this with a curvature parameter s > 0:

(2)   \begin{equation*}V(A, D \mid k, s) = \frac{A}{(1 + kD)^s}.\end{equation*}

We also considered a dual variant that retains hyperbolic valuation but augments the decision stage with a response-bias parameter \alpha > 0 (defined below). Intuitively, k controls the steepness of temporal devaluation, s controls the curvature of the discount function, and \alpha controls the effective weight placed on the immediate option at choice time.

For simulations, choices were deterministic to isolate model mechanics: the agent chose the delayed option if and only if:

(3)   \begin{equation*}V(A, D) \geq \alpha I,\end{equation*}

where I denotes the immediate amount. In this formulation, a larger \alpha makes immediate choices more likely (an urgency/response-bias gain). For empirical fits to participant data, we used a probabilistic choice rule with temperature \tau > 0,

(4)   \begin{equation*}P(\text{delayed}) = \sigma\left(\frac{V(A, D) - \alpha I}{\tau}\right), \quad \sigma(z) = \frac{1}{1 + e^{-z}}\end{equation*}

so that larger \tau yields noisier choices. All parameters were constrained to positive ranges with broad upper bounds during optimization.

We considered alternative control parameterizations, including an additive decision threshold \beta on the value difference,

(5)   \begin{equation*}P(\text{delayed}) = \sigma\left(\frac{V(A, D) - I - \beta}{\tau}\right).\end{equation*}

At fixed immediate amounts I, the multiplicative form of \alpha I and the additive \beta are linearly related (\beta = (\alpha - 1)I). We adopt the multiplicative form to (i) keep the bias expressed in units of the immediate stake, which scales naturally with trial magnitude; (ii) avoid introducing a free bias that can drift independently of task stakes; and (iii) make the implied threshold transparent in the deterministic limit used for simulations. With limited leverage to dissociate valuation and bias, these forms can be weakly identifiable.

Our primary criterion is AIC (\text{AIC} = 2k_m - 2\ell_m) as reported in the main text. Simulations and sensitivity analyses used the canonical 27-item Kirby Monetary Choice Questionnaire (MCQ) trial set to define the immediate/delayed offers. We generated heatmaps by sweeping a log-spaced grid over the parameter ranges

(6)   \begin{equation*}k \in [10^{-3}, 2], \quad \alpha \in [10^{-2}, 3]\end{equation*}

and computed the proportion of delayed choices at each grid point. Unless otherwise noted, the canonical setting fixed the choice temperature at \tau = 0.5 and stimulus noise at \sigma = 1.0; additional settings \tau \in {0.2, 0.5, 1.0} and \sigma \in {0, 1, 3} are reported in the Supplement.

For participant analyses, we fit the hyperbolic, hyperboloid, and dual models to individual choice data and compared them using AIC. Parameters were estimated by bounded maximum likelihood (minimizing negative log-likelihood under the logistic rule). For a model m with k_m free parameters and maximum log-likelihood \ell_m, we computed AIC.

Degrees of freedom corresponded to the parameters estimated under the logistic choice rule: hyperbolic (k, \tau) has k_m = 2; hyperboloid (k, s, \tau) has k_m = 3; dual-model (k, \alpha, \tau) has k_m = 3. We identified the lowest-AIC model per participant and summarized participant-wise \DeltaAIC. The converged sample comprised N = 76 participants. To assess external validity, we analyzed stop-signal reaction time (SSRT) in an independent cohort from OpenNeuro dataset ds004636 (“Cognitive tasks, anatomical MRI, and functional MRI data evaluating the construct of self-regulation”)29,34. SSRT was computed with a standard integration method following consensus recommendations28.

We quantified the marginal influence of parameters on delayed-choice probability using first-order Sobol indices (S1), evaluated over the same parameter ranges as the heatmaps on the Kirby trial set. This provides a global sensitivity summary of how variation in each parameter, considered independently, contributes to the variance in the delayed-choice outcome; extended sensitivity summaries are provided in the Supplement.

To relate the fitted control-like parameter to behavior and to an independent measure of inhibition, we correlated \alpha with (i) the observed proportion of delayed choices in the AIC sample (N = 76) and (ii) SSRT in the independent cohort (N = 101)28,29,30. We report Pearson and Spearman coefficients for robustness.

All analyses were conducted in Python (NumPy, pandas, SciPy, SALib for Sobol indices, and Matplotlib/Seaborn for figures). AIC was derived from negative log-likelihoods. Simulations used deterministic choice rules (no learning), so results are seed-invariant; any stochastic resample uses a fixed NumPy seed in code.

Supplementary Material

Figure S4: Sensitivity analysis for mean discounted value. First-order effects across parameters (complements the main-text S1 on proportion delayed).
Figure S5: Sensitivity analysis for proportion delayed (alternative view). Complementary visualization to the main-text Sobol bar chart.
ModelNPercent
hyperbolic76100.0%
hyperboloid00.0%
dual00.0%
Table S1: Model selection by AIC. Counts and proportions (N = 76).
ComparisonMeanMedian% (< 0)Note
dual hyperbolic+17.55+11.637.9%Positive = dual fits worse
dual hyperboloid+15.13+9.709.2%Positive = dual fits worse
Table S2: Summary statistics for \DeltaAIC (dual \to comparator), per participant (N = 76).
PID\text{AIC}_{\text{hyp}}\text{AIC}_{\text{hypbd}}\text{AIC}_{\text{dual}}\Delta\text{AIC}_{\text{dual}\to\text{hyp}}\Delta\text{AIC}_{\text{dual}\to\text{hypbd}}k_{\text{dual}}\alpha_{\text{dual}}\tau_{\text{dual}}
K000024.4226.3348.1323.7221.800.000.0110.00
K000130.6532.6349.1318.4816.500.000.0110.00
K000259.0860.7096.0536.9735.360.000.0110.00
K000339.7441.4071.6131.8730.210.000.0110.00
K000439.2241.1070.0930.8728.990.000.0110.00
K000516.7718.3542.0925.3223.740.000.018.35
K000622.8224.0426.263.442.210.000.014.46
K000733.5895.9435.592.01-60.350.000.016.83
PID\text{AIC}_{\text{hyp}}\text{AIC}_{\text{hypbd}}AICdual\Delta\text{AIC}_{\text{dual}\to\text{hyp}}\Delta\text{AIC}_{\text{dual}\to\text{hypbd}}k_{\text{dual}}\alpha_{\text{dual}}\tau_{\text{dual}}
K000854.3055.0856.812.511.730.000.0110.00
K000947.3148.5753.316.014.750.000.0110.00
K001042.7044.66105.4062.7060.740.000.0110.00
K001148.9949.9951.022.031.030.000.0110.00
K001217.8518.9022.774.923.870.000.013.75
K001333.4835.3647.6414.1712.280.000.0110.00
K001437.7338.5140.502.781.990.000.017.90
K001534.1236.1244.6710.558.550.000.019.12
K001622.8023.7826.123.322.340.000.014.43
K001738.0039.9260.9922.9921.070.000.0110.00
K001822.9524.2224.952.000.730.000.014.30
K001919.7020.6821.732.031.060.000.013.52
K002037.4539.3044.477.025.160.000.019.06
K002135.6537.65107.2271.5669.560.000.0110.00
K002229.6931.2033.323.632.120.000.016.09
K002327.4029.3163.8336.4334.520.000.0110.00
K002441.3343.2152.4911.169.280.000.0110.00
K002512.1312.4821.078.958.590.000.013.39
K002625.7327.6849.2323.4921.550.000.0110.00
K002736.3338.3745.979.647.600.000.019.53
K002832.8534.7244.3111.469.590.000.019.01
K002969.7070.1496.5226.8226.380.000.0110.00
K003033.5235.4435.501.980.060.000.027.70
K003124.7526.7157.1132.3730.400.000.0110.00
K003233.9535.4136.672.721.270.000.016.89
PID\text{AIC}_{\text{hyp}}\text{AIC}_{\text{hypbd}}\text{AIC}_{\text{dual}}\Delta\text{AIC}_{\text{dual}\to\text{hyp}}\Delta\text{AIC}_{\text{dual}\to\text{hypbd}}k_{\text{dual}}\alpha_{\text{dual}}\tau_{\text{dual}}
K003325.7627.1928.072.310.880.000.014.85
K003411.2413.2426.8315.5913.590.000.014.58
K003576.0577.57115.1039.0537.530.000.0110.00
K003628.2430.1534.296.064.140.000.016.30
K003723.6425.4828.414.772.940.000.014.93
K003849.9251.6987.0937.1835.400.000.0110.00
K003936.7038.6748.4911.799.810.000.0110.00
K004023.3125.0425.302.000.270.000.014.31
K004126.6428.0542.3015.6714.260.000.018.41
K004211.2413.2426.8315.5913.590.000.014.58
K004330.1631.2935.255.093.960.000.016.53
K004436.3638.1845.829.467.640.000.019.48
K004525.7327.6849.2323.4921.550.000.0110.00
K004634.9737.0936.911.94-0.180.010.018.17
K004739.4240.2341.632.211.400.000.018.22
K004840.1642.1153.6613.5111.550.000.0110.00
K004933.5335.4750.2316.6914.760.000.0110.00
K005027.7929.8358.1030.3128.270.000.0110.00
K005140.1441.8649.329.197.460.000.0110.00
K005259.5461.0595.2135.6734.160.000.0110.00
K005333.0534.8646.1613.1111.300.000.019.59
K005433.5935.5947.8814.2812.280.000.0110.00
K005550.3752.1295.4745.0943.340.000.0110.00
K005629.7730.9637.107.336.140.000.017.00
K005717.3619.3165.0847.7245.770.000.0110.00
Table S3: Per-participant AICs, \DeltaAICs, and dual parameters.
PID\text{AIC}_{\text{hyp}}\text{AIC}_{\text{hypbd}}\text{AIC}_{\text{dual}}\Delta\text{AIC}_{\text{dual}\to\text{hyp}}\Delta\text{AIC}_{\text{dual}\to\text{hypbd}}k_{\text{dual}}\alpha_{\text{dual}}\tau_{\text{dual}}
K005817.8818.9222.794.913.870.000.013.73
K005945.1546.3762.9817.8316.610.000.0110.00
K006044.3146.0353.989.677.950.000.0110.00
K006133.4535.4040.196.754.800.000.017.82
K006233.6735.67102.6969.0267.020.000.0110.00
K006343.9844.9546.172.191.210.000.019.60
K006421.7823.7640.6318.8616.880.000.017.94
K006529.4231.3868.0238.6136.640.000.0110.00
K006642.7744.7253.3610.598.640.000.0110.00
K006723.6225.2029.766.144.560.000.015.23
K006829.8930.8531.892.001.040.000.015.76
K006931.1933.1352.9021.7119.770.000.0110.00
K007030.3432.2640.129.787.860.000.017.80
K0071105.80107.03143.5037.7036.470.000.0110.00
K007241.6843.6876.8635.1833.180.000.0110.00
K007323.7125.6447.9624.2622.320.000.0110.00
K007434.7835.5936.782.001.190.000.027.52
K007521.8723.8257.5235.6533.700.000.0110.00
Comparison           NPearson rp95% CI
ϑ vs. proportion delayed76-0.3807.2×10-4[approx. -0.56, -0.17]
ϑ vs. SSRT1010.0700.487[approx. -0.12, 0.25]
Table S4: Correlation summaries for validity checks. Pearson coefficients are reported; CIs use Fisher-z.
ParameterS1 (mean)Note
ϑ0.0115primary outcome
k0.0060
ε         0.0051
Table S5: First-order Sobol indices (S1) for proportion delayed on the Kirby set. Values are small for all parameters.
alphakp delayed
0.010.0010.439846
0.010.001277860.440833
0.010.001632940.439991
0.010.002086670.447554
0.010.002666490.418967
0.010.003407410.427309
0.010.004354210.399345
0.010.005564090.361362
0.010.007110150.353195
0.010.009085810.346185
0.010.01161040.333354
0.010.01483660.332723
0.010.01895910.327622
0.010.02422720.318327
0.010.03095910.269225
Table S6: Canonical heatmap data (preview of first 15 rows).
MetricValueCount / Proportion
ϑ floor value0.01071/76 (93.4%)
ε ceiling value10.039/76 (51.3%)
k minimum value0.000172/76 (94.7%)
kϑ Pearson r (95% CI)0.382[0.171, 0.559]
Table S7: Parameter boundary summaries and kϑ correlation in the dual model (N = 76).

References

  1. G. Ainslie. Picoeconomics: The strategic interaction of successive motivational states within the person. Cambridge University Press (1992). []
  2. L. Green, J. Myerson. A discounting framework for choice with delayed and probabilistic rewards. Psychological Bulletin, 130(5), 769 (2004). []
  3. S. M. McClure, D. I. Laibson, G. Loewenstein, J. D. Cohen. Separate neural systems value immediate and delayed monetary rewards. Science, 306(5695), 503–507 (2004). []
  4. A. L. Odum. Delay discounting: I’m ak, you’re ak. Journal of the Experimental Analysis of Behavior, 96(3), 427–439 (2011). []
  5. S. Weinsztok, S. Brassard, I. Balodis, L. E. Martin, M. Amlung. Delay discounting in established and proposed behavioral addictions: A systematic review and meta-analysis. Frontiers in Behavioral Neuroscience, 15, 786358 (2021). []
  6. R. E. Weller, E. W. Cook III, K. B. Avsar, J. E. Cox. Obese women show greater delay discounting than healthy-weight women. Appetite, 51(3), 563–569 (2008). []
  7. P. Ring, C. C. Probst, L. Neyse, S. Wolff, C. Kaernbach, T. van Eimeren, U. Schmidt. Discounting behavior in problem gambling. Journal of Gambling Studies, 2021, 1–15 (2021). []
  8. M. Amlung, L. Vedelago, J. Acker, I. Balodis, J. MacKillop. Steep delay discounting and addictive behavior: A meta-analysis of continuous associations. Addiction, 112(1), 51–62 (2017). []
  9. D. C. Lee, C. Stanger, A. J. Budney. A comparison of delay discounting in adolescents and adults in treatment for cannabis use disorders. Experimental and Clinical Psychopharmacology, 23(2), 130 (2015). []
  10. T. Mena-Moreno, G. Testa, G. Mestre-Bach, R. Miranda-Olivos, R. Granero, F. Fernández-Aranda, J. M. Menchón, S. Jiménez-Murcia. Delay discounting in gambling disorder: Implications in treatment outcome. Journal of Clinical Medicine, 11(6), 1611 (2022). []
  11. A. C. Exum, C. A. Sutton, J. S. Bellitti, R. Yi, T. L. Fazzino. Delay discounting and substance use treatment outcomes: A systematic review focused on treatment outcomes and discounting methodology. Journal of Substance Use and Addiction Treatment, 149, 209037 (2023). []
  12. C. H. G. Patros, K. L. Sweeney, E. M. Mahone, S. H. Mostofsky, K. S. Rosch. Greater delay discounting among girls, but not boys, with ADHD correlates with cognitive control. Child Neuropsychology, 24(8), 1026–1046 (2018). [] []
  13. G. J. Aranovich, S. M. McClure, S. Fryer, D. H. Mathalon. The effect of cognitive challenge on delay discounting. NeuroImage, 124(Part A), 733–739 (2016). []
  14. M. Yu, T. Liu, F. Shangguan, J. Sui, J. Shi. The neurodevelopment of delay discounting for monetary rewards in pre-adolescent children. Scientific Reports, 11(1), 8337 (2021). []
  15. P. Fernández-Martín, D. Tovar-Suárez, R. Rodríguez-Herrera, J. J. León, R. Cánovas, P. Flores. Behavioral and neurofunctional profiles of delay aversion in children with attention-deficit hyperactivity disorder. Translational Psychiatry, 15(1), 125 (2025). []
  16. W. K. Bickel, R. Yi. Temporal discounting as a measure of executive function: Insights from the competing neuro-behavioral decision system hypothesis of addiction. In Neuroeconomics, 20, 289–309. Emerald Group Publishing Limited (2008). []
  17. L. Green, J. Myerson. A discounting framework for choice with delayed and probabilistic rewards. Psychological Bulletin, 130(5), 769 (2004). [] [] [] [] [] [] [] [] []
  18. F. Verbruggen, A.R. Aron, G.P.H. Band, C. Beste, P.G. Bissett, A.T. Brockett, J.W. Brown. A consensus guide to capturing the ability to inhibit actions and impulsive behaviors in the stop-signal task. eLife, 8, e46323 (2019). [] [] []
  19. P.G. Bissett, I.W. Eisenberg, S. Shim, J.A.H. Rios, H.M. Jones, M.P. Hagan, A.Z. Enkavi, J.K. Li, J.A. Mumford, D.P. MacKinnon, L.A. Marsch, R.A. Poldrack. Cognitive tasks, anatomical MRI, and functional MRI data evaluating the construct of self-regulation. OpenNeuro, 2024. Dataset accession: ds004636. [] [] []
  20. P.G. Bissett, I.W. Eisenberg, S. Shim, J.A.H. Rios, H.M. Jones, M.P. Hagan, A.Z. Enkavi, J.K. Li, J.A. Mumford, D.P. MacKinnon, L.A. Marsch, R.A. Poldrack. Cognitive tasks, anatomical MRI, and functional MRI data evaluating the construct of self-regulation. Scientific Data, 2024. Data descriptor for OpenNeuro ds004636. [] [] []
  21. K. N. Kirby, N. M. Petry, W.K. Bickel. Heroin addicts have higher discount rates for delayed rewards than non-drug-using controls. Journal of Experimental Psychology: General, 128(1), 78–87 (1999). [] []
  22. T. L. McKerchar, L. Green, J. Myerson. On the scaling interpretation of exponents in hyperboloid models of delay and probability discounting. Behavioural Processes, 84(1), 440–444 (2010). [] [] [] []
  23. S. Frederick, G. Loewenstein, T. O’Donoghue. Time discounting and time preference: A critical review. Journal of Economic Literature, 40(2), 351–401 (2002). [] [] [] []
  24. S. M. McClure, D. I. Laibson, G. Loewenstein, J. D. Cohen. Separate neural systems value immediate and delayed monetary rewards. Science, 306(5695), 503–507 (2004). [] [] []
  25. W. K. Bickel, R. Yi. Temporal discounting as a measure of executive function: Insights from the competing neuro-behavioral decision system hypothesis of addiction. In Neuroeconomics, 20, 289–309. Emerald Group Publishing Limited (2008). [] [] []
  26. Z. Kurth-Nelson, W. K. Bickel, A. D. Redish. A theoretical account of cognitive effects in delay discounting. European Journal of Neuroscience, 35(7), 1052–1064 (2012). [] [] [] []
  27. T. V. Wiecki, M. J. Frank. A computational model of inhibitory control in frontal cortex and basal ganglia. Psychological Review, 120(2), 329 (2013). [] [] [] []
  28. F. Verbruggen, A. R. Aron, G. P. H. Band, C. Beste, P. G. Bissett, A. T. Brockett, J. W. Brown. A consensus guide to capturing the ability to inhibit actions and impulsive behaviors in the stop-signal task. eLife, 8, e46323 (2019). [] [] [] [] []
  29. P. G. Bissett, I. W. Eisenberg, S. Shim, J. A. H. Rios, H. M. Jones, M. P. Hagan, A. Z. Enkavi, J. K. Li, J. A. Mumford, D. P. MacKinnon, L. A. Marsch, R. A. Poldrack. Cognitive tasks, anatomical MRI, and functional MRI data evaluating the construct of self-regulation. OpenNeuro, 2024. Dataset accession: ds004636. [] [] [] []
  30. P. G. Bissett, I. W. Eisenberg, S. Shim, J. A. H. Rios, H. M. Jones, M. P. Hagan, A. Z. Enkavi, J. K. Li, J. A. Mumford, D. P. MacKinnon, L. A. Marsch, R. A. Poldrack. Cognitive tasks, anatomical MRI, and functional MRI data evaluating the construct of self-regulation. Scientific Data, 2024. Data descriptor for OpenNeuro ds004636. [] [] []
  31. Z. Song, Y. Wang, P. Qian, S. Song, F. Coenen, Z. Jiang, J. Su. From deterministic to stochastic: An interpretable stochastic model-free reinforcement learning framework for portfolio optimization. Applied Intelligence, 53(12), 15188–15203 (2023). [] [] []
  32. D. Silver, G. Lever, N. Heess, T. Degris, D. Wierstra, M. Riedmiller. Deterministic policy gradient algorithms. In International Conference on Machine Learning, 387–395. PMLR (2014). [] [] []
  33. K. N. Kirby, N. M. Petry, W. K. Bickel. Heroin addicts have higher discount rates for delayed rewards than non-drug-using controls. Journal of Experimental Psychology: General, 128(1), 78–87 (1999). []
  34. P. G. Bissett, I. W. Eisenberg, S. Shim, J. A. H. Rios, H. M. Jones, M. P. H. Hagan, A. Z. Enkavi, J. K. Li, J. A. Mumford, D. P. MacKinnon, L. A. Marsch, R. A. Poldrack. Cognitive tasks, anatomical MRI, and functional MRI data evaluating the construct of self-regulation. Scientific Data, 2024. Data descriptor for OpenNeuro ds004636. []

LEAVE A REPLY

Please enter your comment!
Please enter your name here