Post by pzelchenko on Dec 25, 2021 7:31:59 GMT
Merry Christmas! Our experiments duplicate a protocol frequently used in studies of contextual cueing (CC) (see Chun & Jiang, 1998). We run each subject through 720 trials, broken into 30 blocks of 24 trials each. Every trial presents what CC experiments call an "old" or a "new" display. The "old" condition draws from only 12 repeating display layouts, while the "new" condition draws from 12 unique layouts chosen for that block from a pool of 360 random layouts. Hence each block contains 24 trials: the 12 "old" displays and that block's 12 "new" displays, all interspersed randomly (e.g., ONNONONONNNOONONOONNONOO). There are 372 possible layout arrangements in the experiment (12 old + 30 × 12 new = 372); these are all pre-generated as bitmaps. The task is to find the target among distractors, and we measure RT.
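For reference, the intended per-block randomization can be sketched in Python (bitmap names here are illustrative stand-ins for our pre-generated files; the e-numbering follows the pattern implied by our data, e.g. e025-e036 as block 2's "new" set):

```python
import random

# "Old" layouts e001-e012 repeat in every block. Block b's 12 "new"
# layouts are the next 12 bitmaps in the pre-generated pool:
# e013-e024 for block 1, e025-e036 for block 2, ..., e361-e372 for block 30.
OLD = [f"e{i:03d}" for i in range(1, 13)]

def make_block(b, rng=random):
    start = 12 * b + 1                       # first "new" bitmap for block b
    new = [f"e{i:03d}" for i in range(start, start + 12)]
    trials = OLD + new
    rng.shuffle(trials)                      # interleave old/new at chance
    return trials

blocks = [make_block(b) for b in range(1, 31)]   # 30 blocks, 720 trials total
```

A sound shuffle of 12 "old" plus 12 "new" per block is what should make old/new hover near chance at every trial position.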
We're generating trials using the following task and block code for each of the 30 blocks (task 2 shown; all sets from 1-30 are otherwise identical):
task T2
  table all
  part fixation
  show bitmap @2           # column 2: stimulus bitmap for task set 2
  readkey @32 5000         # column 32: correct key option (1-2); 5000 ms timeout
  part feedback
  save RT STATUS KEY @2 @61 BLOCKNAME &trial   # column 61: "old"/"new" label

block B2
  message info3
  set &trial 0
  tasklist
    T2 24 all_before_repeat   # 24 trials; use every table row before repeating
  end
Table "all" is from the file "all_tables.txt" (61 columns; attached). The first 30 columns correspond to the 30 task sets, each column holding the 24 stimulus bitmap names for that block (referenced as, e.g., -show bitmap @2- in the code above). Within each column, rows 1-12 hold the 12 "old" bitmaps and rows 13-24 hold that block's 12 "new" bitmaps, which are simply the next 12 in the pre-generated set; all 24 should be shuffled together to form the block's trials. The next block then shuffles the same 12 "old" with the subsequent range of 12 "new" layouts. Columns 31-60 hold the response keys (1-2) for each corresponding bitmap, and column 61 simply says "old" in rows 1-12 and "new" in rows 13-24 and is used only for data collection.
This code all seems to work fine. The problem is the randomized order in which the rows of each column are presented. Whether any given trial N presents an "old" (e001-e012) or a "new" (e025-e036) display should hover near chance; at most we might normally see two or perhaps three consecutive "old" or "new" trials in a row, like heads or tails of a fair coin. And yet, in many blocks there are highly improbable runs of presentation: too frequently we are seeing runs of 10 and higher.
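Even under a plain fair-coin model (which is more permissive than our actual shuffle of exactly 12 old and 12 new without replacement), a run of 10 within 24 trials is rare. The exact probability can be computed by counting run-length compositions:

```python
from functools import lru_cache

def p_run_at_least(n=24, L=10):
    """Exact probability that n fair coin tosses contain a run of at
    least L identical outcomes."""
    @lru_cache(maxsize=None)
    def comp(k):
        # compositions of k into parts shorter than L; each one is a
        # run-length profile in which no run reaches length L
        if k == 0:
            return 1
        return sum(comp(k - j) for j in range(1, min(L, k + 1)))
    return 1 - 2 * comp(n) / 2 ** n          # factor 2: first symbol H or T
```

For n = 24 and L = 10 this comes out on the order of 1-2% per block, and the 12-old/12-new shuffle makes long runs rarer still, so seeing such runs "too frequently" across blocks is a genuine red flag.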
From the 10 subjects collected so far, some patterns offer clues. The skew appears in only 2 of the 10 subjects. Within those subjects, the pattern is very long consecutive runs of "old" or "new" screens (despite the fact that both are taken from a single table column!). Furthermore, inside those subjects' long runs of "new" or "old", we also see sub-runs of two and sometimes three consecutive items, in either forward or reverse row order.
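To flag suspicious blocks automatically, the trial-by-trial condition sequence can be collapsed into run lengths (a small helper sketch, not part of our PsyToolkit script):

```python
from itertools import groupby

# Collapse a sequence of condition labels into (label, length) runs,
# e.g. "ONNONO" -> [('O', 1), ('N', 2), ('O', 1), ('N', 1), ('O', 1)].
def run_lengths(labels):
    return [(k, sum(1 for _ in g)) for k, g in groupby(labels)]

def longest_run(labels):
    return max(n for _, n in run_lengths(labels))
```

Running this over each subject's saved @61 column makes the outlier blocks easy to spot.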
It appears that most subjects show no randomization artifacts, but subjects 1001 and 1008 are definitely skewed and are wildly inflating the SD. As a baseline, two simulated samples of 7,200 fair coin tosses (720 trials for each of 10 fictitious subjects) show a mean run length of 2 +/- 0.1. In other words, HH or TT is about as common as HT or TH, but longer runs become increasingly rare; the runs of 10 to 20 that we are regularly seeing should be vanishingly uncommon.
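That baseline is easy to reproduce with any sound RNG (a minimal sketch using Python's Mersenne Twister, seeded for reproducibility; our actual check used two independent samples):

```python
import random
from itertools import groupby

# With a sound RNG the mean run length of fair coin tosses sits near 2.
def mean_run_length(n_tosses, seed=0):
    rng = random.Random(seed)                # seeded for reproducibility
    tosses = [rng.randint(0, 1) for _ in range(n_tosses)]
    runs = [sum(1 for _ in g) for _, g in groupby(tosses)]
    return sum(runs) / len(runs)
```

Any per-subject mean drifting far from 2, or frequent runs of 10+, points to a broken shuffle rather than bad luck.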
Nearly all subjects fall within this normal range, with a couple of slight outliers. None of them exhibits unusually long runs or the within-condition forward and reverse sub-runs of 2-3 consecutive items. Subjects 1001 and 1008, by contrast, show much longer runs as well as frequent within-condition sub-runs.
Unless we're doing something wrong in our code, I suspect an implementation-specific issue with PsyToolkit's random-number generation in a particular browser. I'm trying to track down which browsers were used by which subjects; the two possibilities are Chrome and 360, probably both on Windows. See also the three attached files.