ReinforcementLearningIsFun

Context-Dependent Reinforcement-Learning Task

This repository replicates, in PsychoPy, the task described in Gueguen et al. (2024). Different variants of this task are also used and described in Bavard et al. (2021).

Details

Conditions

Conditions (i.e., trial information) are specified in stim/conditions.csv.
Note that the task currently does not automatically create rewards or manage reward probabilities. For now, all of this logic needs to be provided in the conditions.csv file. While this is somewhat tedious, it allows a great degree of control over what is shown. In the future, a script that creates the conditions.csv file can be used to implement different task variants.

Columns in conditions.csv

The conditions file should have the following columns (see the example rows after this list):

  • phase: This specifies the task phase. Accepted values are:
    • "training": Trials in this phase don't count towards total points tally and can be repeated up to training_n_repeats_max times. They use a separate set of symbols (identified by symbol1 and symbol2 values - see below - starting with "T")
    • "learning": Trials in this phase count towards total points tally. They use symbol values "A", "B", etc.
    • "transfer": Technically, can be identical to the learning phase, but will accept a different set of instructions, and, for example, different symbol pairings or feedback conditions (see below)
    • "explicit": In this phase, no symbols are shown. Instead probability and outcome are shown explicitly for each option.
  • block: This allows for splitting phases into blocks. If show_block_dividers is set to True, break screens will be included. Block values can re-start within every phase. Note that the temporal_arrangement setting that shuffles trials or trial blocks (based on trial_type, see below) applies within each block.
  • trial_id: A running ID for trials. It is written into the data but not used for anything else.
  • trial_type: This variable is used by the temporal_arrangement setting. Within each block, if temporal_arrangement is set to "interleaved", all trial types are shuffled randomly. In contrast, if temporal_arrangement is set to "blocked", trials with the same trial_type remain chunked together, but the chunk order is shuffled (see the shuffling sketch after this list).
  • symbol1, symbol2: These specify the symbols shown in each trial. Training symbols are denoted with T (e.g., T1). Task symbols are denoted with uppercase letters A, B, etc. Note that the task maps different symbols (i.e., image files) to these symbol IDs for each run.
  • option1pos: Denotes the position (left or right) of option 1. Option 2 will take the other position.
  • feedback: Sets the feedback condition. This lets you change behavior in transfer and explicit phases. Accepted values are:
    • "complete": Outcomes of both options (chosen and unchosen) are shown.
    • "partial": Outcome of the chosen option is shown. Unchosen outcome is shown as "?"
    • "none": Both outcomes are shown as "?"
    • "skip": Feedback phase is skipped completely
  • outcome_randomness: Determines whether option outcomes are realized truly randomly, according to the probabilities specified in each trial (see below), or pseudorandomly, where you predefine the outcomes in the actual_outcome columns (see below and the outcome-realization sketch after this list).
    • "random": Each option's realized outcome will be drawn truly randomly. For example, the outcome in column potential_outcome1 will be realized with probability given in the probability1 column, or 0 otherwise.
    • "pseudorandom": Each option's realized outcome in this trial is predefined in its actual_outcome column.
  • potential_outcome1, potential_outcome2: Potential outcomes of the two options in this trial. If outcome_randomness is "random", the realized outcome in each trial will be this value with the given probability, or 0 otherwise. Values in these columns are also displayed in the explicit phase.
  • actual_outcome1, actual_outcome2: Values in these columns are the realized outcomes. If outcome_randomness is "random", you don't need to specify these values, as actual outcomes will be stochastically determined (according to the probability values). If outcome_randomness is "pseudorandom", you need to specify the actual outcomes of each option in each trial. Across multiple rows of the conditions file, you can implicitly define the options' outcome probabilities.
  • probability1, probability2: If outcome_randomness is "random", these are the probabilities with which outcomes are realized in the trial. In the explicit phase they are also used to display reward probabilities of the two options explicitly. If outcome_randomness is "pseudorandom", the probabilities do not influence realized outcomes in each trial, but are only used for display in the explicit phase.
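
For illustration, here is what the first few rows of such a file could look like. Column order, symbol assignments, and values are made up for this example; actual_outcome cells are left empty where outcome_randomness is "random":

```csv
phase,block,trial_id,trial_type,symbol1,symbol2,option1pos,feedback,outcome_randomness,potential_outcome1,potential_outcome2,actual_outcome1,actual_outcome2,probability1,probability2
training,1,1,1,T1,T2,left,complete,random,1,1,,,0.75,0.25
learning,1,2,1,A,B,right,partial,random,1,1,,,0.75,0.25
learning,1,3,2,C,D,left,partial,pseudorandom,1,1,1,0,0.75,0.25
transfer,2,4,1,A,C,left,none,pseudorandom,1,1,1,1,0.75,0.25
explicit,3,5,1,A,B,right,skip,random,1,1,,,0.75,0.25
```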
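
To make the two temporal_arrangement modes concrete, here is a minimal Python sketch of the shuffling logic described above. This illustrates the documented behavior only; it is not the repository's actual implementation:

```python
import random

def arrange_trials(trials, temporal_arrangement):
    """Order one block's trials according to the temporal_arrangement setting.

    trials: list of dicts, each with a "trial_type" key.
    """
    if temporal_arrangement == "interleaved":
        # All trial types are mixed: shuffle every trial in the block.
        shuffled = trials.copy()
        random.shuffle(shuffled)
        return shuffled
    if temporal_arrangement == "blocked":
        # Trials sharing a trial_type stay chunked together;
        # only the order of the chunks is shuffled.
        chunks = {}
        for trial in trials:
            chunks.setdefault(trial["trial_type"], []).append(trial)
        chunk_order = list(chunks.values())
        random.shuffle(chunk_order)
        return [trial for chunk in chunk_order for trial in chunk]
    raise ValueError(f"Unknown temporal_arrangement: {temporal_arrangement!r}")
```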
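
Similarly, the outcome_randomness rules above can be summarized in a short sketch. Again, this is an illustration of the documented column semantics, not the code in task.py:

```python
import random

def realize_outcome(row, option):
    """Return the realized outcome for option "1" or "2" of one conditions row."""
    if row["outcome_randomness"] == "random":
        # The potential outcome is realized with the given probability, else 0.
        if random.random() < float(row[f"probability{option}"]):
            return float(row[f"potential_outcome{option}"])
        return 0.0
    if row["outcome_randomness"] == "pseudorandom":
        # The realized outcome is predefined in the conditions file.
        return float(row[f"actual_outcome{option}"])
    raise ValueError(f"Unknown outcome_randomness: {row['outcome_randomness']!r}")
```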

Additional Task Settings

Additional task settings (e.g., timing variables and colors) can be set in the settings.py file. You should not need to make changes to task.py.
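
As a rough illustration, settings.py could contain entries along these lines. Only training_n_repeats_max, show_block_dividers, and temporal_arrangement are setting names mentioned in this README; the timing and color names below are hypothetical placeholders:

```python
# settings.py — illustrative sketch; timing/color names are hypothetical.
training_n_repeats_max = 2            # max repetitions of the training phase
show_block_dividers = True            # insert break screens between blocks
temporal_arrangement = "interleaved"  # or "blocked"

# Hypothetical timing and appearance settings:
fixation_duration = 0.5               # seconds
feedback_duration = 1.0               # seconds
background_color = "black"
text_color = "white"
```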

Instructions

Task instructions for the different phases can be defined in instructions.py.
Instructions are implemented as src.slideshow.SlideShow objects, allowing forward and backward navigation. Instruction content can be provided as plain-text slides (src.slideshow.TextSlide), image slides (src.slideshow.ImageSlide), or a mix of the two.
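
A sketch of what a phase's instructions could look like in instructions.py. The class names come from this repository, but the constructor arguments are assumptions about the interface:

```python
# instructions.py — illustrative sketch; the exact SlideShow/TextSlide/
# ImageSlide signatures are assumed, not taken from the repository.
from src.slideshow import SlideShow, TextSlide, ImageSlide

learning_instructions = SlideShow(
    slides=[
        TextSlide("In this task, you will repeatedly choose between two symbols."),
        ImageSlide("stim/instructions/example_trial.png"),  # hypothetical path
        TextSlide("Try to earn as many points as possible. Good luck!"),
    ]
)
```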

Output

Unless specified otherwise, task data are saved to the data directory. In addition to the experimental data, the task settings (contained in the exp_info dictionary) and PsychoPy's own .psydat file are saved for every run.

Stimulus Images

Stimulus images are made with the Identicon generator.

References

  • Bavard, S., Rustichini, A., & Palminteri, S. (2021). Two sides of the same coin: Beneficial and detrimental consequences of range adaptation in human reinforcement learning. Science Advances, 7(14), eabe0340. https://doi.org/10.1126/sciadv.abe0340
  • Gueguen, M. C. M., Anlló, H., Bonagura, D., Kong, J., Hafezi, S., Palminteri, S., & Konova, A. B. (2024). Recent Opioid Use Impedes Range Adaptation in Reinforcement Learning in Human Addiction. Biological Psychiatry, 95(10), 974–984. https://doi.org/10.1016/j.biopsych.2023.12.005

Todo

  • Include serial port triggers
  • Allow for counterbalancing of trial_type / block-orders and/or disabling random shuffling
  • Perform thorough check of the task. Is everything on time? Is everything shown properly? Is everything recorded? Does the random stimulus mapping work as expected?
  • Document output file
  • Write script to create conditions.csv mirroring literature
  • (low priority) What if we want to skip phases? Problems: RNG, points counting.
  • (low priority) Check if we can read the settings.json for a rerun
  • Integrate Tobii Eyetracker using Titta
  • Clarify: Is feedback ("?") shown if no response given? -> No
  • (Bug): Explicit phase trials with "pseudorandom" mode do not make sense, because probabilities are shown but not used; shown outcomes are always realized.
  • Implement actual stochastic outcomes that use probability columns of the conditions file
  • Research casino animation during choice phase: could be done by prerendering a movie for each symbol and showing a movie
  • Revise feedback phases. "none" becomes "skip". "partial" and "none" need to show question marks.
  • Make timing physiology compatible: Allow for fixed (choice+feedback) duration. In this case, what happens to missed responses?
  • see if we can record screen resolution into data or experiment_settings
  • variable ITIs
  • debriefing slides should be slides not slideshow
  • Add support for multiple stimulus sets (e.g., to use for different visits)
  • Check if trials are randomized within blocks if temporal_arrangement == 'blocked'
  • Double check temporal arrangement options. I think there should also be randomization if this is "blocked".
  • Fix bug with random seed: currently, changing it in the GUI doesn't do anything
  • Fix ImageSlide instructions
  • End screen message can be SlideShow, too
  • Remove unnecessary "exp_info" in main script. only necessary within functions
  • Include intermediate points tally
  • Ensure all data are recorded
  • Keep track of rewards over trials (for each phase)
  • Check if image size is specified properly
  • Specify text color
  • Make trials look nicer: Include background and feedback rectangles
  • Make counterfactual outcome have different color
  • Implement explicit phase
  • Record all data (ideally, just include all condition data)
  • Is feedback an option in the explicit phase?
  • Refactor code to reduce duplication across phases
  • Prepare serial port triggers (via exp_info?)
  • Implement ITI
  • Move timing variables into exp_info (currently hardcoded in trial.run())
  • Implement random assignment of icons to choice options
  • Implement temporal arrangement (temporal_arrangement; "blocked" vs. "interleaved")
  • Fix instruction slideshow
  • Implement left/right shuffle (currently called stim1pos)
