Abstract

Background:

The brain’s valuation network, including the ventral striatum and ventromedial prefrontal cortex (VMPFC), represents the value of rewards and punishments and underpins decision behavior. These neural signals are not fully characterized in individuals recovering from prescription opioid use disorder (POUD).

Objectives:

We tested the hypothesis that neural responses to monetary gain and loss differ in individuals recovering from POUD relative to individuals without prior substance use.

Methods:

Twenty-three individuals in an early stage of recovery from POUD (abstinent 2–3 weeks after admission to an inpatient treatment facility, no other substance use disorder), and 21 neurotypical controls group individuals without prior history of substance use completed a card guessing task during functional magnetic resonance imaging (fMRI), gaining or losing small monetary amounts after each guess. Whole-brain and valuation network regions of interest (ROI) analyses compared POUD and control group fMRI signal responses to monetary gain and loss outcomes.

Results:

Ventral striatum signal change following gain and loss outcomes differed between the POUD and control groups. Specifically, time series analysis suggested that left ventral striatum responses following monetary losses remained elevated for a longer duration in POUD compared with control group participants.

Conclusions:

This exploratory, small sample study suggests brain responses to non-drug incentives may differ in POUD compared with neurotypical controls, which has implications for understanding affective responses in individuals recovering from POUD.

1. Introduction

When individuals with opioid use disorder (OUD) undergo recovery, it is important to consider how they respond to non-drug incentives. Prior research suggests that responses to non-drug rewards and punishments differ in individuals with substance use disorder compared to individuals without a history of substance use. These differences have been examined in individuals with substance use disorders involving opioids, nicotine, cannabis, and stimulants [1, 2]. Theoretical explanations of drug-seeking behavior suggest that increased motivation for drug rewards is partly in compensation for diminished positive experience of non-drug rewards in OUD [3]. Similarly, theory suggests that avoidance of non-drug punishments and negative affect are a key motivator for substance use [4]. Indeed, both diminished positive emotion (linked to positive non-drug rewards) and enhanced negative emotion (linked to aversive non-drug punishments) are characteristics of individuals with OUD [5]. The current study uses small monetary gains and losses to examine brain responses to non-drug incentives in individuals recovering from OUD, relative to a control group that has no history of substance use.

Prior research demonstrates that positive and aversive non-drug-related incentives elicit affective experiences that differ in OUD compared to individuals with no substance use history. For example, current and abstinent heroin users exhibit blunted responses to positive, pleasant affective (i.e., rewarding) non-drug stimuli (e.g., erotic images) and increased responses to negative affective stimuli (e.g., images of body injury/mutilation) [2, 6, 7]. Similarly, substance users including heroin, heroin and cocaine mixed, cocaine, and alcohol users showed reduced arousal responses to positive non-drug images relative to individuals with no history of drug use [7]. Specifically, heroin users showed reduced arousal responses to positive images and increased arousal responses to negative images in comparison to the control group [7]. This previous research has identified important differences in affective experience during recovery from OUD, but the potential neural basis of these differences is not fully studied in this population.

Neural responses to positive non-drug reward and aversive non-drug punishments are one avenue to more fully understand the distinct affective experience of non-drug stimuli in individuals with OUD. The perceived value of these stimuli is theorized to be represented by neural activity in regions including the ventral striatum and ventromedial prefontal cortex (VMPFC), which are targets of midbrain dopaminergic neurons and part of the mesocorticolimbic pathway [8, 9, 10]. However, drugs of abuse can alter responses in this circuit leading to disturbed valuation of drug-related stimuli [11]. While valuation of drug-related stimuli is well-studied, less attention has been paid to neural responses that underlie valuation of non-drug incentives. Natural reward cues elicit increased activity throughout the mesocorticolimbic circuit in healthy individuals [12, 13, 14] and a decrease in activity in this circuit among drug users, including nicotine, alcohol, cocaine, and cannabis users [15, 16, 17, 18], indicating reduced valuation of rewards. Due to dysregulation of this circuitry that underlies valuation of both positive and aversive stimuli, individuals with substance use disorders may exhibit reduced sensitivity to natural rewards that originate from healthy behaviors and social connections. Instead, they may rely on substance use to preserve a sense of wellbeing [19]. This dysregulation of the brain’s valuation network that may result from chronic substance use then also contributes to the maintenance of drug-use behavior [20, 21].

Though dysregulation of non-drug incentive responses is a key issue in several types of substance use disorders, previous research examining recovery from OUD has not focused on responses to positive and negative non-drug incentives in the brain’s valuation system. In the present study, we examine the neural responses to monetary reward in individuals with prescription opioid user disorder (POUD) who are in the first month of recovery, in contrast to a control group without any history of drug use. We hypothesize that neural responses to monetary gain and loss differ in individuals recovering from POUD compared to individuals who have no history of drug use.

2. Methods
2.1 Participants

The findings reported here are from a task included in a larger study protocol that examined brain structure and functional connectivity in POUD and control group participants [22]. For transparency, selection criteria and portions of procedural descriptions here are directly quoted here from a prior report based on the same sample of participants [23]. Twenty-three POUD group individuals (Sex: 18 male, 5 female; Age mean (M) = 31.0, SD = 7.3) and 21 control group individuals (12 male, 9 female; age M = 33.6, SD = 7.5) were included in the final sample. POUD group individuals were in the first 3 weeks of a 6-month inpatient residential treatment program. Control group individuals were matched for age, education, and race/ethnic background (Table 1). Participants were included only if they were between 21 and 54 years of age, English was their primary language, they were right-handed, and they had near 20/20 vision (or corrected). Exclusion criteria were any serious physical illness, history of childhood learning disability or current special education, presence of any serious psychiatric illness, MRI contraindications, claustrophobia, abnormal hearing, history of loss of consciousness for more than 30 minutes, alcohol abuse and dependence including past dependence, and for women, pregnancy. Control group participants had no current or past substance use history. Inclusion in the POUD group required a history of using prescription opioid (PO) pills for at least the past 1 year and the POUD participants had to meet criteria in the Diagnostic and Statistical Manual of Mental Disorders5th ed. (DSM-5) [24] for moderate-to-severe POUD, according to a structured clinical interview (SCID-5). Participants were excluded if they had co-occurring POUD and any other substance use disorder (including tobacco use) at a moderate-to-severe level [23]. Participants were also excluded if they failed to respond to 25% or more trials of the card guessing task (1 control participant excluded), or if head motion estimates exceeded 5mm in framewise displacement between any volumes (1 control participant excluded). The 44 participants (23 POUD, 21 control) who met all criteria were from an original pool of 46 (2 controls excluded as noted previously). Sample size was determined by practical constraints (availability of POUD participants during the data collection window).

Table 1. Demographic and substance use information for individuals with prescription opioid use disorder (POUD) and healthy controls.
POUD Control t-stat p
(n = 23) (n = 21)
Mean, Range (SD) Mean, Range (SD)
Age (yrs.) 31, 22–48 (7.27) 33.62, 24–54 (7.48) –1.18 0.24
Education (yrs.) 11, 2–18 (2.3) 12, 3–19 (2.2) –1.07 0.28
Race/Ethnicity
Caucasian 12 7
African American 8 11
Hispanic 2 2
Not stated 1 0
Female (n) 5 6
Prescription Opioid Use
Frequency (days/week) 6.5, 3–7 (0.28) na
Duration of use (yrs.) 5.8, 2–18 (3.45) na
Money spent ($/week) $715, $70–2000 (413) na
DSM-5 criteria for POUD
Number of criteria met 8.75, 4–11 (2.10) 0
Cigarette use among participants who reported any smoking in the week before scanning:
Number reporting any smoking 20 1
Frequency (days/week) 6.95, 6–7 (0.22) 7
Quantity (cigarettes/day) 8.2, 2–21 (5.40) 7
Cigarette use (means including participants who reported no smoking):
Frequency (days/week) 6.04, 0–7 (2.40) 0.14, 0–3 (0.65) 11.33 <0.001
Quantity (cigarettes/day) 7.13, 0–21 (5.75) 0.33, 0–7 (1.53) 5.46 <0.001

No subjects met tobacco use disorder criteria (it was an exclusion criterion), smoking measure is based on reported behavior in the week before scanning (after screening). DSM-5, Diagnostic and Statistical Manual of Mental Disorders; SD, standard deviation; na, not applicable.

On the day of scanning, participants provided written informed consent approved by the Rutgers University Institutional Review Board and were administered a urine screen to rule out pregnancy in women, and to ensure negative urine toxicology for cocaine, methamphetamine, THC, opiate, and benzodiazepines (One Step Multi-Drug Screen Test Panel). They were assessed for recent alcohol use with a breathalyzer. After the session, participants received a $100 gift certificate for participation [23] which included the pre-determined monetary bonus earned in the card guessing task.

2.2 Card Guessing Task

Participants performed the card guessing task [14] following structural scans and a resting state functional scan, approximately 20 minutes after entering the scanner. Stimuli were presented and response times were collected using E-Prime version 2.0 (Psychological Software Tools, Pittsburgh, PA, USA). In each trial of the task, participants saw a card (white rectangle with a “?” in the center) and pressed a button to guess whether the value of the card was lower (second digit of the right hand) or higher (third digit of the right hand) than the number 5, with knowledge that the number would be between 1 and 10 but never 5, and that correct guesses would add $0.50 to their bonus while incorrect guesses would subtract $0.25 from their bonus. A trial consisted of the guess window (2 s, screen showed “?” card), followed by the outcome (1 s, screen showed the card with a number in the center and a green check or red “X” above the number), followed by a randomly jittered inter-trial fixation screen (4/6/8 s on 50%/25%/25% of trials respectively, screen showed a white “+” on black background, see Fig. 1). Participants performed 64 trials that were randomly ordered but predetermined to consist of 24 gain trials (outcome indicated a correct guess with a green check), 24 loss trials (outcome indicated an incorrect guess with a red “X”), and 16 no outcome trials (outcome showed a “#” on the card with no number and no check or “X”). If a participant failed to respond during the 2 s guessing window (a missed response), the outcome screen showed an empty card with “no response” in the center.

Fig. 1.

Card guessing task. On each trial, participants had 2 s to guess whether the number on a card was low or high (“Response”). The outcome of the trial was then displayed (“Outcome”), with gain trials (green check) and loss trials (red X) equally probable. An inter-trial interval followed each outcome. Analysis was focused on BOLD signal elicited by the display of each outcome.

2.3 Behavioral Analysis

Mean response times were calculated for each participant (mean across all trials where the participant responded within the 2 s window) and used to calculated group means, which were entered into an independent samples t-test to examine potential differences between groups. No group differences were predicted.

2.4 Neuroimaging Acquisition and Analysis

Images were collected on a 3.0-T Siemens TRIO scanner (Siemens Medical Solutions USA, Malvern, PA, USA) as reported in reference [23]: Structural images were acquired with a T1-weighted MPRAGE sequence (256 × 256 matrix, FOV 256 mm, 176 1-mm sagittal slices). Blood oxygen level dependent (BOLD) functional images were acquired with a multi-band echo-planar imaging sequence (repetition time (TR) = 600 ms, echo time (TE) = 28.2 ms, field of view (FOV) = 208 mm, flip angle 30°, echo spacing = 0.41 ms, multiband acceleration factor 5, 35 axial slices, voxel size 1.53 × 1.53 × 3.99 mm). A field map sequence acquired prior to functional imaging was used to correct geometrical distortion in the functional images. The fMRIPrep [25] pipeline was used for geometrical distortion correction, head motion and slice-timing correction, and normalization of images to Montreal Neurological Institute (MNI) standard space (details in Supplementary Material). For each subject a general linear model (GLM) was fit to the functional data [23] to estimate signal change related to the experience of gain, loss, and no outcome event types (using FMRIB’s Software Library (FSL) version 5.0, http://www.fmrib.ox.ac.uk/fsl/) [26, 27] (GLM specification is below). Contrast estimates for each subject (gain versus loss) based on parameter estimates from the GLM were tested for group level effects with a nonparametric permutation-based test of the group difference (10,000 permutations with FSL’s randomise tool) using a 1-sample test for a group level effect within each group for the gain versus loss contrast and a 2-sample group difference test. Results are reported at a whole-brain threshold-free cluster (TFCE) corrected threshold of p < 0.05 [28], and with a small-volume correction (SVC) for regions of interest (ROIs) in the striatum and VMPFC. The ROIs were defined from a meta-analytic contrast of brain responses to the subjective value of stimuli [29]. Voxelwise masks of regions that showed significant relation to positive subjective value versus negative subjective value in the meta-analysis [29] were intersected with a structural definition of the striatum (Harvard-Oxford Probabilistic Atlas) [30] and VMPFC [31], left and right regions separately. This restricted set of regions was selected to focus analyses on mesocorticolimbic pathway targets strongly associated with reward and punishment value representations [8, 9, 29]. Analyses focus on contrasts involving gain and loss events (gain versus null, loss versus null, gain versus loss). “No outcome” events were included in the design for exploration and are not included in analyses reported here.

2.4.1 Subject-level General Linear Model (GLM) Specification for Voxel-wise Analysis

The GLM consisted of three regressors to describe outcome events with 1 s duration (gain, loss, no outcome). These regressors were convolved with a canonical double-gamma response function in FSL’s FEAT analysis package [26]. Contrasts were computed to compare gain versus loss outcomes (POUD and control group separately), POUD gain vs control group gain outcomes, and POUD loss versus control group loss outcomes. Full model details are included in the Supplementary Materials.

2.4.2 Analysis of Signal Change Over Time Following Outcomes

For analysis of the time course of ROI event-related responses, the 0.6 s time to acquire a full BOLD volume was used as the time step to average signal values following the gain or loss outcome types. That is, the 20 volumes collected 0–12 s after each outcome screen presentation were averaged into 0.6 s time bins according to time elapsed following each type of outcome (20 bins), which yielded a description of average signal change in the 12 s following each outcome type. Signal change was calculated for each time bin as the average BOLD signal across voxels in an ROI, minus the mean ROI BOLD signal value across the whole functional scan. The values for gain and loss outcomes in each ROI were then submitted to group (POUD, control) by time (0–12 s) ANOVAS to examine potential group differences in the time course of neural responses to gain and loss (applying the Greenhouse-Geissler correction for non-sphericity for within-subject effects of time). Any analysis suggesting differences between groups were then repeated with covariates for age and sex. Post-hoc comparisons between groups were conducted at each time point in the case that there was a significant group or group by time interaction effect (uncorrected for multiple comparisons due to the exploratory nature of this analysis). For completeness in this exploratory analysis, a larger set of regions linked to reward or punishment valuation was defined from same meta-analysis that defined ventral striatum and VMPFC regions [29]. Voxelwise masks of regions that showed significant relation to positive or negative subjective value were used to define the regions examined. The regions were pre-supplementary motor area (pre-SMA), anterior insula, amygdala, and posterior cingulate (left and right-sided regions were defined for each). The time course of signal from these regions is analyzed and described in the Supplementary Materials.

3. Results
3.1 Behavioral Responses

Response times in the Card Guessing Task did not significantly differ for the POUD (M = 0.670 s, SD = 0.176) compared to the control group (M = 0.615 s, SD = 0.161; t(42) = 1.065, p = 0.293). Similarly, the number of missed responses did not significantly differ for the POUD (M = 1.261, SD = 2.115) compared to the control group (M = 1.143, SD = 1.493; t(42) = 0.215, p = 0.831).

3.2 Whole Brain GLM Analysis: Contrast of Neural Responses to Monetary Gain Versus Loss

The comparison of outcomes from monetary gain and loss showed that both POUD and control group participants exhibited a typical reward response. Bilateral regions of ventral striatum, VMPFC, and dorsal anterior cingulate cortex (dACC) showed significantly increased signal in response to gain compared to loss outcomes (see Fig. 2, Table 2; p < 0.05, corrected). Control group participants showed significantly increased posterior cingulate signal (PCC) in response to gain compared to loss outcomes (Fig. 2; p < 0.05, corrected). POUD participants showed a similar pattern in PCC that was below the corrected statistical threshold (peak at –2, –46, 34, t(20) = 3.80, p < 0.001 uncorrected). There was no region showing greater signal in response to loss compared to gain outcomes. There was no significant difference detected between groups for the gain versus loss contrast either at the whole-brain corrected threshold or at SVC thresholds in striatum or VMPFC.

Fig. 2.

Monetary gain versus loss responses in whole brain voxelwise GLM analysis. (A) Statistical map of monetary gain versus loss in POUD (blue) and control (red) participants in the ventral striatum (colors indicate voxelwise p < 0.05, corrected, coronal slices at y = 9). (B) The same contrasts also showed VMPFC responses for gain versus loss in both groups (axial slices at z = –8, sagittal slices at x = 4). VMPFC clusters are circled in yellow. Though PCC (circled in green) signal did not reach the significance threshold for the POUD group, this region showed similar effects in both groups (below the threshold in POUD). No group difference was detected in voxelwise comparisons. GLM, General Linear Model; POUD, prescription opioid use disorder; VMPFC, ventromedial prefrontal cortex; PCC, posterior cingulate signal; L, left; R, right.

Table 2. Whole brain contrasts of monetary gain versus loss outcomes and between group comparison.
Contrast, Group, Region Peak coordinate (x, y, z) Peak t-statistic Voxels in cluster
Monetary Gain > Loss
Control Group
Left/Right VMPFC 4, 52, –8 5.07 488
Left/Right dACC 12, 42, 18 4.32 46
Left/Right Ventral Striatum –14, 12, –6 8.16 1173
Left/Right Posterior Cingulate 0, –36, 38 5.45 1214
Right Precuneous Cortex 26, –60, 28 3.81 8
Right Lateral Occipital Cortex 54, –70, 2 4.74 485
Left Lateral Occipital Cortex –46, –74, 4 6.55 210
Left Lateral Occipital Cortex –46, –82, 26 4.94 3
POUD Group
Left/Right VMPFC and dACC 10, 52, –4 5.75 785
Right Superior, Middle Frontal Gyrus 18, 26, 50 6.17 392
Left Superior, Middle Frontal Gyrus –18, 42, 42 4.43 23
Right Ventral Striatum 14, 12, –6 9.24 248
Left Ventral Striatum –12, 8, –10 8.31 140
Left Lateral Occipital Cortex –46, 74, 2 5.36 115
Left Lateral Occipital Cortex –50, 78, 16 5.02 4
Control > POUD Group
No regions above threshold
POUD > Control Group
No regions above threshold

Non-parametric permutation test, p < 0.05 corrected.

VMPFC, ventromedial prefrontal cortex; dACC, dorsal anterior cingulate cortex; POUD, prescription opioid use disorder.

3.3 Region of Interest Analysis

This analysis examined responses in a priori regions of interest in ventral striatum and VMPFC, based on gain versus null and loss versus null contrasts from the GLM. Contrast estimates for gain (gain versus null) and loss (loss versus null) outcomes were entered into a 2 (group: POUD, control) by 2 (outcome valence: gain, loss) mixed ANOVA for each region. There was a main effect of outcome valence (gain > loss) in each region, but no significant effects of group or group by outcome valence in any region (Table 3 (Ref. [26]) summarizes results).

Table 3. Results of ROI parameter estimate 2 (group: POUD, control) by 2 (outcome valence: gain, loss) mixed ANOVA.
Effect, Region F(1,42) p value Effect size η2
Main effect of Group
Left ventral striatum 3.51 0.068 0.077
Right ventral striatum 1.38 0.246 0.032
Left VMPFC 2.22 0.144 0.050
Right VMPFC 1.38 0.247 0.031
Main effect of Outcome Valence
Left ventral striatum 98.85*** <0.001 0.702 (gain > loss)
Right ventral striatum 93.33*** <0.001 0.690 (gain > loss)
Left VMPFC 24.81*** <0.001 0.371 (gain > loss)
Right VMPFC 38.95*** <0.001 0.481 (gain > loss)
Interaction of Group and Outcome Valence
Left ventral striatum 3.29 0.077 0.073
Right ventral striatum <0.01 0.951 <0.001
Left VMPFC 0.02 0.891 <0.001
Right VMPFC 0.06 0.808 0.001

Estimates are from the analysis that assumed a canonical hemodynamic response function (double-gamma function [26]. Effect size is calculated as partial η2. *** indicates p < 0.001 (uncorrected).

3.4 Time Course of Neural Response to Monetary Gain and Loss

The above analyses rely on signal estimates from a GLM analysis where each regressor is convolved with an assumed fixed hemodynamic response which aims to model the expected shape of the hemodynamic response over time for all events. This fixed hemodynamic response approach is insensitive to potential differences in the duration of a response to reward or punishment, which may be critical for understanding mental disorder [32]. Thus, an exploratory analysis was conducted to examine potential group differences in the time course of a response to monetary gains or losses, based on signal values over a brief window following each type of outcome. Region of interest average signal values 0–12 s (20 TRs) after each outcome event were averaged by outcome type (gain, loss) into 0.6 s time bins for each participant. These ROI event-related signal time course values for gain and loss outcomes were then submitted to group (POUD, control) by time (20 TRs from 0–12 s following an outcome) ANOVA (gain and loss separately for each region of interest). Signal time courses following gain outcomes showed a significant group by time interaction in right ventral striatum (F(8.21, 344.74) = 2.63, p = 0.008, η2 = 0.051, after controlling for age and sex: F(8.22, 328.79) = 2.798, p = 0.005) but effects of group or group by time interactions following gain outcomes were nonsignificant in other regions (all p > 0.11). Signal time courses following loss outcomes showed a significant main effect of group in left ventral striatum (F(1, 42) = 6.744, p = 0.013, η2 = 0.014, after controlling for age and sex: F(1, 40) = 6.689, p = 0.013) but effects of group or group by time interactions were nonsignificant in other regions (all p > 0.11). Comparisons at each time point between POUD and control group signal time courses showed that in right ventral striatum, responses to monetary gain were greater for control compared to POUD participants at 1.2 s (t(42) = 2.47, p = 0.018) but greater for POUD compared to control participants at 2.4 s (t(42) = –2.40, p = 0.021) following a gain outcome. In left ventral striatum, responses to monetary loss were greater for POUD than control participants from 3.6–4.2 s following a loss outcome (3.6 s: t(42) = –2.52, p = 0.016; 4.2 s: t(42) = –2.41, p = 0.016; see Fig. 3, Ref. [29]). Comparisons at each time point were not conducted in other regions of interest due to nonsignificant group or group by time effects in the ANOVA results.

Fig. 3.

Time course of the ventral striatum response to gain and loss. The left image shows ventral striatum a priori regions of interest defined by meta-analysis [29] (slice at y = 12). Plots show signal change in these regions over the 12 s period following each outcome (top: gain outcomes; bottom: loss outcomes). Points represent mean signal change across participants in each group, error bands indicate 95% confidence interval around the mean. “*” indicates p < 0.05 in the group comparison at each time point. VS, ventral striatum; A.U., arbitrary units (signal change from the mean value in a region over time).

The time course of neural responses in the broader set of ROIs (pre-SMA, anterior insula, amygdala, posterior cingulate) is described in the Supplementary Materials with group or group by time effects noted here if they met an exploratory threshold of p < 0.05, uncorrected. The left anterior insula and right pre-SMA showed a main effect of group in responses to gain such that the POUD response to gain was greater than in the control group (p < 0.05, uncorrected). The right amygdala showed a main effect of group in responses to loss such that the POUD response to loss was greater than in the control group (p < 0.05, uncorrected).

4. Discussion

This study addresses a gap in prior research by exploring the temporal dynamics of brain responses to monetary gain and loss among individuals recovering from POUD. While we did not observe a difference in magnitude of responses to reward or punishment in analyses assuming a fixed hemodynamic response over time, significant differences emerged in the time course of brain responses to monetary loss in the left ventral striatum, with POUD individuals displaying greater magnitude and prolonged duration of response to negative non-drug stimuli (monetary loss) compared to controls. In contrast, while controls initially showed a stronger response to monetary gain in the right ventral striatum compared to POUD, the POUD group demonstrated an increased response to monetary gain after 2.4 s relative to controls. These findings suggest that individuals with POUD exhibit slower response to positive stimuli and increased sensitivity to negative stimuli, consistent with previous research showing impaired emotional regulation among individuals with OUD [5, 6, 7].

Substance use is known to alter responses in neural systems that process reward and punishment, often resulting in reduced sensitivity to natural rewards and heightened sensitivity to stress or adverse outcomes. These alterations may persist even during abstinence, contributing to difficulties in early recovery. For example, maladaptive reward processing has been linked to the challenges individuals face in finding motivation and enjoyment in non-drug-related activities, which can undermine efforts to sustain abstinence [11, 16, 33, 34]. The heightened punishment response may make individuals more susceptible to the emotional toll of negative experiences and psychological stress during recovery, a factor that has been associated with relapse risk [35].

These findings underscore potentially significant barriers to recovery from OUD. A diminished or delayed reward response may hinder the ability to evaluate, engage with, and seek out positive non-drug reinforcers, including monetary rewards but this effect may extend to other non-drug rewards such as social experiences and support that are known to guard against relapse [36, 37, 38]. Increased sensitivity and temporally extended responses to punishment may amplify the impact of negative outcomes during early recovery, and negative affect and stress are known precipitators of substance use and relapse [38, 39, 40]. Although psychological and behavioral data describing participants’ continued recovery is not available, it is known that opioid use relapse can occur after long periods of abstinence [41]. Changes in the dynamics of the brain’s response to non-drug rewards and punishment may pose important challenges for individuals in recovery — greater negative affect and stress may motivate drug use and diminished or delayed positive affect simultaneously impairs individuals’ ability to cope with that negative affect and stress. Understanding these mechanisms is critical for developing treatment approaches that address these barriers. For instance, therapies that aim to bolster natural reward responses and negative affect regulation resources may help individuals in early recovery by providing structured strategies to cope with these challenges [5, 40, 41]. Mindfulness-oriented recovery enhancement is one example of a treatment approach that focuses on the positive experience of natural non-drug rewards and strengthens strategies for coping with negative affect and drug-related cues [5, 40, 41]. The current study provides a framework for understanding the dynamics of non-drug incentive processing. Brain responses to non-drug incentives differ in POUD and control participants here but it is unclear if differences are caused by opioid use. Mu-opioid receptors are prominent in the striatum and VMPFC [42, 43], thus the pathway exists for opioid use to alter incentive responses in these regions. The findings suggest value in future work that aims to link opioid systems in the brain to incentive processing and affective function (in the striatum and other regions) [42, 43] and work that tracks longitudinal changes in incentive processing with potential links to treatment outcomes.

The voxelwise whole-brain analysis showed that, in both the POUD and control participants, BOLD signal in ventral striatum and VMPFC regions was increased in response to monetary gain compared to loss outcomes. This finding is consistent with a large neuroimaging literature examining non-drug reward and punishment responses in neurotypical humans [9, 14, 29]. This voxelwise whole-brain analysis, which assumed a fixed hemodynamic response over time and tested for differences in the magnitude of the response (corrected for multiple comparisons), was insensitive to differences between the POUD and control participants. Therefore, follow up analysis focused on regions of interest to provide a potentially more sensitive test of differences in between the two groups, without correction for multiple comparisons and with an examination of the time course of signal change following monetary gain and loss. The primary ROI analysis specifically focused on ventral striatum and VMPFC because these regions have been identified as the key components of “valuation network” in a in meta-analysis of neuroimaging studies of reward and punishment [29]. Further exploration of regions that consistently show non-drug value-related signal (pre-SMA, anterior insula, amygdala, and posterior cingulate) [29] showed differences suggesting that responses to monetary gain may be greater in POUD compared to control participants in right pre-SMA and left anterior insula, and that right amygdala responses to monetary loss may be greater in POUD compared to control participants. Effect sizes for these comparisons are provided in the Supplementary Materials to benefit the design of future work on this topic.

While the ROI analysis provides a focused examination of brain responses to non-drug reward and punishment, it remains an important limitation of the study that examination of potential differences in other regions linked to emotion regulation could not be conducted, partly due to the limited sample size and partly due to the experimental focus on non-drug reward and punishment responses rather than emotion regulation function. Neural circuitry underlying the ability to up-regulate positive and down-regulate negative non-drug-related emotional experiences is certainly important for understanding potentially disordered affective responding in substance use disorders [2, 3, 6, 7, 44, 45]. For example, medial prefrontal cortex (MPFC) bidirectional connections with amygdala are hypothesized to be underlie emotion regulation and this circuit may be dysregulated in anxiety disorder [46]. Research using Magnetic Resonance Spectroscopy and dynamic causal modeling shows that gamma-aminobutyric acid (GABA) concentration in MPFC is related to state anxiety [47] and reduced MPFC to amygdala effective connectivity (thought to represent top-down control from MPFC to amygdala) [48]. The pre-SMA region examined in the current study is partly overlapping but somewhat posterior to the MPFC region implicated in the broader emotion regulation literature [29, 44, 45, 46, 47, 48], and the monetary reward and punishment experiences examined here do not tend to recruit the same MPFC region implicated in emotion regulation [29]. Our exploratory analysis showed potentially elevated amygdala non-drug punishment responses in the OUD group. Thus, the MPFC-amygdala circuit represents an important target for further work to understand non-drug positive and negative experiences in recovery from OUD.

The small sample size limited the statistical power and consequently results do not survive correction for multiple comparisons. The absence of an a priori hypothesis about the specific timing of the observed differences means that findings should be viewed as exploratory and targets of further research to replicate findings in larger and less restricted samples of individuals recovering from opioid use disorder. The focus on the first month of abstinence, while relevant for early recovery, may not capture the evolution of reward and punishment processing as neural circuits adapt to sustained abstinence. The selection criteria for the study restricted the sample and thus additional research is needed to characterize neural responses to reward and punishment over longer periods of abstinence, how they may change over time, and how they may vary with factors such as addiction severity, psychiatric disorder comorbidity, and treatment. An additional limitation of this study is that, despite not meeting criteria for moderate or severe tobacco use disorder at screening, participants reported smoking more in the week prior to scanning (Table 1). The data are unable to distinguish between participants who recently increased smoking upon entering the inpatient treatment facility and those who continued smoking behavior that was below the threshold for moderate tobacco use disorder. While we note this as a limitation it is also true that this behavior is representative of substance use populations [49].

In summary, this study contributes to a growing body of literature on the neurobiological underpinnings of substance use disorders, particularly during the critical early stages of recovery.

5. Conclusion

Findings from this small sample study show that brain responses to monetary rewards and punishments may differ in POUD compared to neurotypical controls. POUD participants showed a slower response to positive stimuli and increased sensitivity to negative stimuli in the ventral striatum. Findings underscore the importance of understanding the temporal characteristics of reward processing in substance use disorders and have implications for understanding affective experiences in individuals recovering from POUD.

Availability of Data and Materials

The data and materials will be available on request. Analysis code and unthresholded statistical maps are available at https://github.com/jamilfelipe/CardTask-POUDpublic-2025/.

Author Contributions

JPB and JS conducted data preprocessing and analysis. JPB, MRD, and SR designed and conducted the research. JPB, BH, JS, MRD, and SR interpreted the data and wrote the manuscript. All authors contributed to the editorial changes in the manuscript and read and approved the final manuscript. All authors participated sufficiently in the work and agreed to be accountable for all aspects of the work.

Ethics Approval and Consent to Participate

All subjects gave their informed consent for inclusion before they participated in the study. The study was conducted in accordance with the Declaration of Helsinki, and the protocol was approved by the Rutgers University Institutional Review Board (approval number: CR00025642).

Acknowledgment

We acknowledge the individuals with prescription opioid use disorder who volunteered for this study. We also acknowledge the CEO and staff members at Integrity House in Newark, New Jersey, for their support in this project.

Funding

This research was supported by the National Institute on Drug Abuse Grants (R03DA044496 and R61DA059906) to Suchismita Ray as a Contact Principal Investigator.

Conflict of Interest

The authors declare no conflict of interest.

Supplementary Material

Supplementary material associated with this article can be found, in the online version, at https://doi.org/10.31083/JIN36969.

References

Publisher’s Note: IMR Press stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Cite

Share