Channel order

#16
by magicleap - opened

Hi, thank you for sharing this pretrained model!

Could you clarify the channel order used during training? Is it the same as the order specified in this repository, https://github.com/jump-cellpainting/2024_Chandrasekaran_NatureMethods/tree/main?

ch01 - Alexa 647
ch02 - Alexa 568
ch03 - Alexa 488 long
ch04 - Alexa 488
ch05 - Hoechst 33342
ch06-8 - three brighfield z planes.

Thanks

Recursion Pharmaceuticals org

Hi,
Here are the details. The raw JUMP filenames have a set of characters indicating channel order, i.e.:

cpg0016-jump/source_13/images/20220914_Run1/images/CP-CC9-R1-01/CP-CC9-R1-01_A02_T0001F001L01A01Z01C01.tif,
cpg0016-jump/source_13/images/20220914_Run1/images/CP-CC9-R1-01/CP-CC9-R1-01_A02_T0001F001L01A01Z01C02.tif,
cpg0016-jump/source_13/images/20220914_Run1/images/CP-CC9-R1-01/CP-CC9-R1-01_A02_T0001F001L01A02Z01C03.tif,
cpg0016-jump/source_13/images/20220914_Run1/images/CP-CC9-R1-01/CP-CC9-R1-01_A02_T0001F001L01A03Z01C04.tif,
cpg0016-jump/source_13/images/20220914_Run1/images/CP-CC9-R1-01/CP-CC9-R1-01_A02_T0001F001L01A04Z01C05.tif

correspond to the five channels in field F001 of well A02 in plate CP-CC9-R1-01. Leveraging this ordering, we re-index the channels during training with the following order:

      rxrx3: [1, 2, 3, 4, 5, 6] (rxrx3 channels correspond to Nuclei, Endoplasmic Reticulum, F-Actin, Nucleoli & Cytoplasmic RNA, Mitochondria, and Golgi)
      jump-orf: [5, 4, 2, 3, 1, 2] (cpg0016 - source 4)
      jump-ko: [1, 3, 5, 3, 2, 1] (cpg0016 - source 13)

For inference the channel ordering doesn't matter as the model is channel agnostic. Passing the same set of channels with different permutations of the ordering will calculate the same embeddings. During training, the decoder order needs to be maintained, so we reordered the channels from different datasets to make them match visually.

Sign up or log in to comment