Batch alignment for whole blood studies

Hi everyone,
I would like to initiate a discussion concerning the use of reference control samples for batch alignment in whole blood studies.
BACKGROUND:
When performing cytometry analysis on samples stained in multiple batches, batch effects can occur. A number of approaches existing for performing ‘batch alignment’, which seeks to remove the technical (batch-derived) variance. A few of these (such as CytoNorm & CytofBatchAdjust) use ‘reference’ samples for this purpose. The classic example here would be a large study using cryopreserved PBMCs -- samples are frozen down when they are arrive at the lab, and then thawed/stained/run together in batches. In addition to this, a single donor may provide a large number of aliquots of cryopreserved PBMCs to serve as reference controls, where an aliquot can be thawed/stained/acquired alongside each batch of samples. Because the reference control aliquots are all biologically identical, any differences between then are due to the thawing/staining/acquisition that occur in each batch. In this way, the relative changes in expression patterns/ranges can be modelled and adjusted, and this same model can be applied to the samples -- removing batch-derived variance, while preserving biological variance. There are a few key requirements for reference controls, such as the need to span the full range of data that is to be aligned (e.g. if the reference control cells don’t express high enough of a certain marker, that marker can’t be aligned in the actual samples). There is a much longer discussion about the considerations required here, but this is enough information for the discussion below.
PROBLEM:
In a number of studies, whole blood samples may be prepared, stained, and run as they are arrive at the lab (i.e. not fixed or cryopreserved to processed in batches at a later stage). This essentially means each sample (or couple of samples) is a separate batch. This somewhat precludes the inclusion of reference controls, as, in theory, the same donor would have to provide fresh whole blood on each occasion a new sample arrives.
POTENTIAL APPROACH:
In some studies, an aliquot of cryopreserved PBMCs (from a single donor) may be thawed and stained/run alongside each sample as a form of qualitative control (i.e. it would reveal any major differences between batches due to staining conditions -- ab concentration, time, temperature, antibody lot, missing antibodies etc). It is feasible that such a control could serve as a reference control for alignment purposes. This would not be an ideal solution by any means, as the reference and actual samples are treated differently, leading to differences in marker expression patterns and population proportions (and the exclusion of whole cell types), as well as any biological changes induced by the isolation/cryopreservation process. However, differences due to shifts in staining conditions (as above) could still be modelled (for the markers and cell types that are common to both the reference PBMC sample and the whole blood samples), and this might be sufficient to adjust for staining artefacts from day to day.
There are other approaches that have been published previously, including aligning samples to each other (which assumes that most major phenotyping markers are stably expressed between patients), though this approach really only works to get samples ‘similar’ to each other to simplify gating/clustering, and is sensitive to changes in some key markers (e.g. CD3 might be downregulated due to stimulation etc).
QUESTIONS:
I have a few questions for the community, and I would be very interested in a variety of responses:
1. In a whole-blood staining situation as described above, what would be your preferred method of dealing with batch effects? I have deliberately excluded any discussion of careful attention to ensuring reproducible staining procedures above, but if your answer is ‘we just stain really carefully and consistently’ then I would love to hear that as well.
2. For whole blood, do you fix the samples (using smartube buffer etc) so that they can be stained and run in batches? Does the advantage of fixing/preserving the samples to run them together in batches outweigh the potential impact on marker expression?
3. For those of you who work with fresh whole blood samples, do you ever use a cryopreserved PBMC sample as a type of qualitative control? The approach described above somewhat assumes that these reference PBMC samples might be commonly collected anyway.
4. Has anyone ever tried performing batch alignment using an approach like this?
5. Do you find it easier/better/preferable to just accept the batch differences, and rely on adjusting the gates in each sample to accomodate these differences?
Thanks for your consideration, and I’m looking forward to hearing different perspectives!
Tom
I would like to initiate a discussion concerning the use of reference control samples for batch alignment in whole blood studies.
BACKGROUND:
When performing cytometry analysis on samples stained in multiple batches, batch effects can occur. A number of approaches existing for performing ‘batch alignment’, which seeks to remove the technical (batch-derived) variance. A few of these (such as CytoNorm & CytofBatchAdjust) use ‘reference’ samples for this purpose. The classic example here would be a large study using cryopreserved PBMCs -- samples are frozen down when they are arrive at the lab, and then thawed/stained/run together in batches. In addition to this, a single donor may provide a large number of aliquots of cryopreserved PBMCs to serve as reference controls, where an aliquot can be thawed/stained/acquired alongside each batch of samples. Because the reference control aliquots are all biologically identical, any differences between then are due to the thawing/staining/acquisition that occur in each batch. In this way, the relative changes in expression patterns/ranges can be modelled and adjusted, and this same model can be applied to the samples -- removing batch-derived variance, while preserving biological variance. There are a few key requirements for reference controls, such as the need to span the full range of data that is to be aligned (e.g. if the reference control cells don’t express high enough of a certain marker, that marker can’t be aligned in the actual samples). There is a much longer discussion about the considerations required here, but this is enough information for the discussion below.
PROBLEM:
In a number of studies, whole blood samples may be prepared, stained, and run as they are arrive at the lab (i.e. not fixed or cryopreserved to processed in batches at a later stage). This essentially means each sample (or couple of samples) is a separate batch. This somewhat precludes the inclusion of reference controls, as, in theory, the same donor would have to provide fresh whole blood on each occasion a new sample arrives.
POTENTIAL APPROACH:
In some studies, an aliquot of cryopreserved PBMCs (from a single donor) may be thawed and stained/run alongside each sample as a form of qualitative control (i.e. it would reveal any major differences between batches due to staining conditions -- ab concentration, time, temperature, antibody lot, missing antibodies etc). It is feasible that such a control could serve as a reference control for alignment purposes. This would not be an ideal solution by any means, as the reference and actual samples are treated differently, leading to differences in marker expression patterns and population proportions (and the exclusion of whole cell types), as well as any biological changes induced by the isolation/cryopreservation process. However, differences due to shifts in staining conditions (as above) could still be modelled (for the markers and cell types that are common to both the reference PBMC sample and the whole blood samples), and this might be sufficient to adjust for staining artefacts from day to day.
There are other approaches that have been published previously, including aligning samples to each other (which assumes that most major phenotyping markers are stably expressed between patients), though this approach really only works to get samples ‘similar’ to each other to simplify gating/clustering, and is sensitive to changes in some key markers (e.g. CD3 might be downregulated due to stimulation etc).
QUESTIONS:
I have a few questions for the community, and I would be very interested in a variety of responses:
1. In a whole-blood staining situation as described above, what would be your preferred method of dealing with batch effects? I have deliberately excluded any discussion of careful attention to ensuring reproducible staining procedures above, but if your answer is ‘we just stain really carefully and consistently’ then I would love to hear that as well.
2. For whole blood, do you fix the samples (using smartube buffer etc) so that they can be stained and run in batches? Does the advantage of fixing/preserving the samples to run them together in batches outweigh the potential impact on marker expression?
3. For those of you who work with fresh whole blood samples, do you ever use a cryopreserved PBMC sample as a type of qualitative control? The approach described above somewhat assumes that these reference PBMC samples might be commonly collected anyway.
4. Has anyone ever tried performing batch alignment using an approach like this?
5. Do you find it easier/better/preferable to just accept the batch differences, and rely on adjusting the gates in each sample to accomodate these differences?
Thanks for your consideration, and I’m looking forward to hearing different perspectives!
Tom