Rachel and Kenta would like to modify the standard foraging task slightly, as shown below, based on our "data-driven-question-driven demand". The idea is to add intermediate reinforcers (CS+/CS-).
Essentially, a secondary reinforcer for the reward
A 100% deterministic sound (CS+/CS-) to let the mouse know that reward (or no reward) is coming
To start, a fixed 1 s interval between CS+/CS- and reward. We will implement the delay as a distribution: begin with a fixed delay, then introduce variability as we get more information
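A minimal sketch of the cue and delay rules described above, assuming a uniform jitter around the mean as the eventual variable-delay distribution (the issue does not specify which distribution will be used). The function and parameter names are hypothetical and do not come from the existing code base.

```python
import random


def cue_for_outcome(rewarded):
    """Deterministic mapping: a rewarded trial always plays CS+, otherwise CS-."""
    return "CS+" if rewarded else "CS-"


def sample_cs_reward_delay(mean_s=1.0, jitter_s=0.0, rng=random):
    """Return the cue-to-reward delay in seconds.

    jitter_s == 0 reproduces the fixed 1 s delay proposed for the pilot.
    A positive jitter_s draws uniformly from [mean_s - jitter_s, mean_s + jitter_s];
    the uniform shape is an assumption made for illustration only.
    """
    if jitter_s < 0 or jitter_s > mean_s:
        raise ValueError("jitter_s must be between 0 and mean_s")
    if jitter_s == 0:
        return mean_s
    return rng.uniform(mean_s - jitter_s, mean_s + jitter_s)
```

Treating the fixed delay as the zero-jitter case of the distribution means variability could later be introduced by changing one parameter rather than the task logic.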
Motivation
When Kenta recorded phasic DA in dynamic foraging, we could not get a clear RPE signal until we time-locked to the ‘last lick’
It’s difficult to disentangle ‘lick response’ and ‘reward response’
Without this, we will not be able to tell clearly how learning is happening wrt NMs, especially if NMs already correlate with movement.
Recording designs:
FIP DA x3 — VTA-DA-SomaCa / NAc-DA-AxonCa / NAc-DA release
FIP DA in NAc + NE in Ctx
Ephys DA soma / NAc MSNs
Plan
Need to gather more information with @micahwoodard, @alexpiet, @XX-Yin, @bruno-f-cruz to figure out how we can implement this (with minimal disruption to other experiments, and with efforts aligned as much as possible with the code refactoring here)
Rachel will scope/drive this project with feedback from @hagikent and @ZhixiaoSu
Once we have buy-in, present at Tuesday morning behavior meeting.
Then, Kenta will pilot this in test boxes in 428 so that it does not affect the 446 and 447 experiments.
Expected change in code base
Python GUI: very minor, two new delay parameters.
Bonsai workflow: minor to intermediate (but not sure how to make it compatible with the current foraging task, -KH)
Change in task logic will be something like:
In addition, some minor back-end updates to play two additional tones.
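As a rough illustration of how two delay parameters could shape a trial, the sketch below lays out event times for one trial. It assumes that the first delay runs from the choice lick to cue onset and the second from cue onset to reward delivery; the issue does not define the second delay explicitly, and all names here are hypothetical rather than taken from the Python GUI or the Bonsai workflow.

```python
from dataclasses import dataclass


@dataclass
class CueDelays:
    # Hypothetical names for the two new delay parameters.
    delay1_s: float = 0.0  # choice lick -> cue onset (assumed meaning)
    delay2_s: float = 1.0  # cue onset -> reward delivery (assumed meaning)


def trial_events(choice_time_s, rewarded, delays):
    """Return (time_s, event) tuples for one trial, in chronological order."""
    cue_time_s = choice_time_s + delays.delay1_s
    events = [
        (choice_time_s, "choice"),
        (cue_time_s, "CS+" if rewarded else "CS-"),
    ]
    if rewarded:
        # Reward follows the cue only on CS+ trials.
        events.append((cue_time_s + delays.delay2_s, "reward"))
    return events
```

For example, `trial_events(2.0, True, CueDelays())` places the cue at the moment of the choice and the reward one second later, matching the initial fixed-delay plan.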
Good question.... let's talk about it next Thursday. @hagikent do you have thoughts?
As a point of reference:
In Parker et al.: "Reward outcomes were accompanied by different auditory stimuli: 0.5 seconds of white noise for CS- and 0.5 seconds of 5 kHz pure tone for CS+. Every trial ended with a 3 seconds inter-trial delay (after the CS- auditory stimulus or the mice exit the reward port)."
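If the task were to adopt stimuli like those quoted from Parker et al., the two waveforms could be synthesized roughly as follows. This is only a sketch: the 44.1 kHz sample rate, the unit amplitude, and the function names are assumptions, and the actual sound back-end in the rig may generate tones differently.

```python
import math
import random

SAMPLE_RATE_HZ = 44100  # assumed; not specified in the issue


def pure_tone(freq_hz=5000.0, duration_s=0.5, sample_rate_hz=SAMPLE_RATE_HZ):
    """CS+ candidate: a sine wave at freq_hz, with samples in [-1, 1]."""
    n_samples = int(round(duration_s * sample_rate_hz))
    return [
        math.sin(2 * math.pi * freq_hz * i / sample_rate_hz)
        for i in range(n_samples)
    ]


def white_noise(duration_s=0.5, sample_rate_hz=SAMPLE_RATE_HZ, seed=None):
    """CS- candidate: uniformly distributed noise, with samples in [-1, 1]."""
    rng = random.Random(seed)
    n_samples = int(round(duration_s * sample_rate_hz))
    return [rng.uniform(-1.0, 1.0) for _ in range(n_samples)]
```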
@XX-Yin
Initially, immediately after the choice, to make credit assignment easier; eventually, with a short variable delay so that the action signal and the reinforcement signal can be better isolated.
Please see the mock Bonsai code; I intended Delay1 to correspond to this delay.
More detailed information here: https://alleninstitute-my.sharepoint.com/:p:/g/personal/rachel_lee_alleninstitute_org/Edi4EJV9YptFpQRpq3d59DwBo0GvP02s38wC5CFsXWbPmQ?e=en4zNa