3  Current and Planned Studies

We list potential study ideas below, organized by project/data set. When projects have been tentatively claimed, that is indicated as well

3.1 RISK

  • Compare insight only to full EMA for lapse risk prediction [Gaylen (lead), Kendra, Ariela, Sarah]
  • GPS features solo for lapse risk prediction [Claire FYP (lead)]
  • Less burdensome EMA for lapse risk prediction. Maybe only morning. Maybe only some items. Maybe only some items depending on responses to other items [Coco?]
  • Use text message content & meta data for lapse prediction [Coco?]
  • Replicate stressor use with RISK data. MLM analyses [unassigned]
  • Full model using GPS, EMA, communications for lapse risk prediction [unassigned]
  • Lagged week level models for lapse risk prediction. Maybe 3 days, 1 week, two weeks lag [unassigned]
  • Model duration of lapse rather than binary outcome to allow move toward harm reduction approaches
  • Model lapses that are labeled (or quantified) on severity based on chaining, or perhaps the respone to them (what the say about future ab goal, EMA affect after, etc)
  • Establish individual lapse as a clinically meaningful event - use lapse/no lapse as feature to predict variety of negative outcomes from (next? subsequent period?) EMAs, e.g., craving, risky situations, stressful events, negative affect, self-efficacy/change of abstinence goal (abstinence violation effect). Account for previous negative outcome level (either include as covariate [feature]? or make outcome a change score) [unassigned]
  • Look at model performance in subsets of data. This will include traditional disparities based on race/ethnicity, sex, age, income. But also include model performance based on lapse frequency, time on system, AUD severity, etc. [Coco submitted a poster abstract on this topic. She is planning to lead effort to write paper on this topic]
  • Explore methods to reduce model performance bias among groups that are poorly represented in the training data. This could be done for participants of color in RISK1 data. We could consider resampling training data to increase relative representation of these participants. We can also consider methods to weight errors for these participants. [This was Coco’s idea. She plans to lead a paper on this topic after we first publish the previous paper about disparities in model performance] ## RISK2

3.2 NRT1

  • Build machine learning model for smoking lapse risk prediction (e.g., replicate RISK ema paper) [unassigned]

3.3 FACE