
Cytoprofiling Run Output Files

The following is a representative run output directory of a successful cytoprofiling run:

example-storage-location
└── DemoInstrumentName
    └── 20240506_DemoInstrumentName_ExampleRunName
        ├── BaseCalling
        │   ├── BaseCalls
        │   │   └── BXX_CXXX.zip ... (for n cycles)
        │   ├── CellXform
        │   │   └── BXX ... (for n batches)
        │   │       └── LXRXXCXXS1.xform ... (for n tiles)
        │   ├── Location
        │   │   └── BXX ... (for n batches)
        │   │       └── LXRXXCXXS1ZXX.loc ... (for n tiles)
        │   └── Filter
        │       └── BXX ... (for n batches)
        │           └── LXRXXCXXS1ZXX.filter ... (for n tiles)
        ├── Cell Segmentation
        │   └── WellXX ... (for n wells)
        │       ├── LXRXXCXXS1_Cell.tif ... (for n tiles)
        │       └── LXRXXCXXS1_Nuclear.tif ... (for n tiles)
        ├── Thumbnails
        │   └── WellXX_Thumbnail.png ... (for n wells)
        ├── Projection
        │   └── WellXX ... (for n wells)
        │       └── BXX_LXRXXCXXS1_Target.tif ... (for each target and n tiles)
        ├── Cytoprofiling
        │   ├── Instrument
        │   │   ├── AverageNormWellStats.csv
        │   │   ├── RawCellStats.csv
        │   │   ├── RawCellStats.parquet
        │   │   ├── RunStats.json
        │   │   ├── Versions.json
        │   │   └── Wells
        │   │       └── WellXX ... (for n wells)
        │   │           └── BXX ... (for n batches)
        │   │               └── LXRXXCXXS1_barcodes.parquet ... (for n tiles)
        ├── RunManifest.csv
        ├── RunManifest.json
        ├── RunParameters.json
        ├── Panel.json
        ├── RunStats.bin
        ├── RunAnalysisFilesUploaded.json
        └── RunUploaded.json

Output Files

The following table defines the key cytoprofiling run output files from an AVITI24 System. Parquet files are column-based files that store data efficiently. For more information, see the Apache Parquet Documentation.

| Directory and File Name | File Format | Description | Quantity |
| --- | --- | --- | --- |
| {root}/BaseCalling/BaseCalls/{batch}_C{cycle}.zip | Binary | Reports the raw cytoprofiling base call data | One per batch per cycle |
| {root}/BaseCalling/Filter/{batch}/{tile}.filter | Binary | For use with future applications | One per tile per batch |
| {root}/BaseCalling/CellXform/{batch}/{tile}.xform | Binary | Transforms polony locations onto cells | One per tile per batch |
| {root}/BaseCalling/Location/{batch}/{tile}.loc | Binary | Identifies polony locations on the flow cell | One per tile per batch |
| {root}/BaseCalling/RunStats.bin | Log | Log for internal troubleshooting of offline runs | One per run |
| {root}/CellSegmentation/{well}/{tile}_Cell.tif | TIF | Cell segmentation masks for a well, where the value for a pixel in a cell is the cell ID | One per tile per well |
| {root}/CellSegmentation/{well}/{tile}_Nuclear.tif | TIF | Nuclear segmentation masks for a well, where the value for a pixel in a nucleus is 1 | One per tile per well |
| {root}/Cytoprofiling/Instrument/AverageNormWellStats.csv | CSV | Reports filtered and average metrics for each well in the run | One per run |
| {root}/Cytoprofiling/Instrument/Versions.json | JSON | Reports the version number for CSV output files and bundled software programs | One per run |
| {root}/Cytoprofiling/Instrument/RawCellStats.csv | CSV | Reports values per cell for all morphology features and raw target counts in a run | One per run |
| {root}/Cytoprofiling/Instrument/RawCellStats.parquet | Parquet | Reports values per cell for all morphology features and raw target counts in a run | One per run |
| {root}/Cytoprofiling/Instrument/RunStats.json | JSON | Reports run metrics | One per run |
| {root}/Cytoprofiling/Instrument/Wells/{well}/{batch}/{tile}_barcodes.parquet | Parquet | Barcoding information for each polony in a tile | One per tile per batch per well |
| {root}/Panel.json | JSON | Records target detection information for the run | One per run |
| {root}/Projection/{well}/{batch}_{tile}_{target}.tif | TIF | Z-projected images of cell paint targets | One per target per tile |
| {root}/RunManifest.csv | CSV | Manifest that records biological sample information and well mapping | One per run |
| {root}/RunManifest.json | JSON | Version of the run manifest reserved for Element processes | One per run |
| {root}/RunParameters.json | JSON | Records information about the run configuration | One per run |
| {root}/RunAnalysisFilesUploaded.json | JSON | Written once the last analysis file is transferred and post-run analysis may begin | One per run |
| {root}/RunUploaded.json | JSON | The last file transferred; marks run completion | One per run |
| {root}/Thumbnails/{well}_Thumbnail.png | PNG | Thumbnail image for a well | One per well |
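
The path patterns in the table can be filled in programmatically when scripting against a run folder. The following is a minimal sketch, not an Element-provided tool: the `TEMPLATES` dictionary and `output_path` helper are hypothetical names, and the example well, batch, and tile values are placeholders. Folder names follow the table, so confirm them against your own run folder.

```python
from pathlib import PurePosixPath

# Path templates copied from the output file table; {root} is the run folder.
TEMPLATES = {
    "base_calls": "{root}/BaseCalling/BaseCalls/{batch}_C{cycle}.zip",
    "cell_mask": "{root}/CellSegmentation/{well}/{tile}_Cell.tif",
    "nuclear_mask": "{root}/CellSegmentation/{well}/{tile}_Nuclear.tif",
    "barcodes": (
        "{root}/Cytoprofiling/Instrument/Wells/"
        "{well}/{batch}/{tile}_barcodes.parquet"
    ),
    "thumbnail": "{root}/Thumbnails/{well}_Thumbnail.png",
}


def output_path(kind, **parts):
    """Fill one template; raises KeyError if a placeholder value is missing."""
    return PurePosixPath(TEMPLATES[kind].format(**parts))


# Placeholder values for illustration only.
example = output_path(
    "barcodes", root="/data/run", well="WellA1", batch="B01", tile="L1R02C01S1"
)
print(example)
```

Keeping the templates in one dictionary means a folder-name difference only has to be corrected in one place.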

Metrics

The output files contain a variety of metrics, including tile-specific and average metrics.

  • RawCellStats.csv and RawCellStats.parquet contain a full set of morphology and quantification metrics for each target and batch.
  • AverageNormWellStats.csv provides averages of these metrics for each well. Metrics that end with .std provide the standard deviation for the metric.
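
As an illustration of the .std naming convention, the sketch below pairs each metric column with its standard deviation column using only the Python standard library. The column names (`Well`, `Area`) and the values are hypothetical and stand in for the real header of AverageNormWellStats.csv.

```python
import csv
import io


def pair_metrics_with_std(header):
    """Map each metric column to its '.std' companion column, when both exist."""
    names = set(header)
    suffix = ".std"
    return {
        name[: -len(suffix)]: name
        for name in header
        if name.endswith(suffix) and name[: -len(suffix)] in names
    }


def load_well_stats(text):
    """Parse CSV text into (header, rows); rows are dicts keyed by column name."""
    reader = csv.DictReader(io.StringIO(text))
    rows = list(reader)
    return list(reader.fieldnames or []), rows


# Hypothetical two-well excerpt; real files contain many more columns.
SAMPLE = (
    "Well,Area,Area.std\n"
    "WellA1,410.5,35.2\n"
    "WellA2,398.0,41.7\n"
)

header, rows = load_well_stats(SAMPLE)
pairs = pair_metrics_with_std(header)
for row in rows:
    for metric, std_col in pairs.items():
        print(row["Well"], metric, float(row[metric]), "+/-", float(row[std_col]))
```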

The files report metrics from the following CellProfiler modules:

  • MeasureObjectSizeShape
  • MeasureGranularity
  • MeasureObjectIntensity
  • MeasureObjectIntensityDistribution
  • MeasureTexture

Certain metrics are not available in these files. For example, the output files do not report Zernike metrics. In some files, columns for Z-axis metrics appear with values of 0. Z-axis metrics are not available in the RawCellStats.csv and RawCellStats.parquet files because they are not relevant to the analysis output.

For more information on the cytoprofiling metrics, see the CellProfiler Manual measurement information.

Barcodes Parquet Files

The {tile}_barcodes.parquet files provide the raw barcode data for each tile in a batch. The files indicate the location of targets in different cells.

The following table defines the columns in the parquet files.

| Field | Description | Data Type |
| --- | --- | --- |
| BarcodeIndex | A barcode ID number that corresponds to the order of targets for a batch as listed in the Panel.json file. A value of 0 indicates an unassigned barcode. | Int16 |
| Cell | A tile-specific ID associated with a cell. The Cell ID in barcode parquet files differs from the Cell ID for the run in other parquet files. | Int16 |
| IsNuclear | An indicator for whether or not the barcode is in the nucleus of the cell. | Boolean |
| X | The position of the barcode on the X-axis of the tile. | UInt16 |
| Y | The position of the barcode on the Y-axis of the tile. | UInt16 |
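
The relationship between BarcodeIndex and the target order in Panel.json can be sketched as follows. This is an illustration under stated assumptions, not the instrument's own logic: it assumes BarcodeIndex 1 refers to the first target listed for the batch, that barcoding targets are available as a list of records with BatchName and Target fields, and that barcode rows have already been loaded from the parquet file into dictionaries. The target and batch names are hypothetical.

```python
from collections import Counter

UNASSIGNED = 0  # per the table, BarcodeIndex 0 means no target was assigned


def targets_for_batch(barcoding_targets, batch_name):
    """Return target names for one batch, in the order they are listed."""
    return [t["Target"] for t in barcoding_targets if t["BatchName"] == batch_name]


def count_targets_per_cell(rows, targets):
    """Count assigned barcodes per (Cell, target name) for one tile."""
    counts = Counter()
    for row in rows:
        index = row["BarcodeIndex"]
        if index == UNASSIGNED:
            continue
        # Assumption: index 1 is the first target listed for the batch.
        counts[(row["Cell"], targets[index - 1])] += 1
    return counts


# Hypothetical panel excerpt and barcode rows for illustration.
panel_targets = [
    {"BatchName": "B01", "Target": "TargetA"},
    {"BatchName": "B01", "Target": "TargetB"},
    {"BatchName": "B02", "Target": "TargetC"},
]
barcode_rows = [
    {"BarcodeIndex": 1, "Cell": 5},
    {"BarcodeIndex": 0, "Cell": 5},
    {"BarcodeIndex": 2, "Cell": 5},
    {"BarcodeIndex": 1, "Cell": 5},
]

per_cell = count_targets_per_cell(barcode_rows, targets_for_batch(panel_targets, "B01"))
print(per_cell)
```

Because the Cell ID is tile-specific, counts from different tiles should be kept separate rather than summed by Cell value.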

Panel File

The Panel.json file contains target information for each batch in the run. Sections of the file define information for batches and targets, including ImagingPrimerTubes, BarcodingPrimerTubes, ImagingTargets, and BarcodingTargets.

The following tables describe the information in the file.

| Field | Description |
| --- | --- |
| AnalysisSettings | Identifies an array of settings for the run analysis |
| Application | The type of application for the run, Counting |
| CellBoundaryTarget | Identifies the target used to generate cell segmentation masks |
| FileVersion | The version of the Panel.json file for the run |
| ID | A unique ID assigned to the panel |
| Name | The name of the cytoprofiling kit associated with the panel |
| NuclearTarget | Identifies the target used to generate nuclear segmentation masks |
| PanelCartridgePartNumbers | Identifies the set of information that lists the cartridge part numbers for the panel |
| SupplementaryCellBoundaryTarget | Identifies an optional additional target used to generate cell segmentation masks |

Primer Tube Sections

The following table defines information in the ImagingPrimerTubes and BarcodingPrimerTubes sections of the Panel.json file. The sections provide information about the settings for different batches.

| Field | Description |
| --- | --- |
| BarcodeMask | The mask used to support barcoding for target analysis |
| BarcodingPrimerTubes | Identifies the set of information about the settings for batches with targets for analysis |
| BatchName | The name for a particular batch |
| DefaultMismatch | The number of base mismatches permitted to assign a barcode to a target, typically 2 |
| ImagingPrimerTubes | Identifies the set of information about the settings for batches related to cell paint |
| MinCycles | The minimum number of cycles for a particular batch |
| PMGMask | A base mask used to generate the map of polonies |
| RunOrder | Identifies the ordinal position for a batch in the run |
| Type | The type of batch for amplification, PreAmp or PostAmp |

Target Sections

The following table defines information in the ImagingTargets and BarcodingTargets sections of the Panel.json file. The sections provide information about the targets in each batch.

| Field | Description |
| --- | --- |
| Analyses | Identifies an array with the types of analysis for a batch |
| Barcode | The barcode of bases for a particular target in a batch |
| BarcodingTargets | Identifies the set of information for targets in each batch for analysis |
| Base | The base for a particular cycle in a batch |
| BatchName | The name for a particular batch |
| ControlType | Identifies the type of control for a target in a batch, if applicable |
| Cycle | The cycle that a particular base uses in a batch |
| CycleBases | Identifies a set of information that defines the base to identify in a particular cycle for a batch |
| ImagingTargets | Identifies the set of information for targets in each batch related to amplification |
| ProbeConcentration | The concentration of the probe for a particular target in a batch |
| Target | The name of a target for a batch |
| TargetType | The type of target for a batch as CellPaint, Protein, or Transcript |

Run Parameters File

The RunParameters.json file contains a record of the input information for a run. When evaluating run performance or troubleshooting, you can review this file to confirm the run used the correct parameters.

The following table describes the information in the RunParameters.json file.

| Field | Description |
| --- | --- |
| AdvancedSettings | Includes information about advanced run settings such as custom recipes |
| AnalysisLanes | The lanes that the run uses as 1, 2, or 1+2 |
| ApplicationName | The type of application for the run, Counting |
| BarcodeStr | The barcode number for a consumable, which the instrument scans and identifies during consumable loading |
| BaseForChannels | Identifies the set of bases for the channels |
| Batches | Identifies the set of batches for the run |
| Buffer | Identifies a set of information that describes the buffer for the run |
| Channels | Identifies the colors to associate with channels |
| ColorForChannels | Identifies the channels to associate with bases and colors |
| Consumables | Identifies the set of information for run consumables |
| CustomRecipeName | The name of the custom recipe file uploaded in advanced run settings |
| Cycles | Identifies the number of cycles in a batch |
| CycleFormat | Defines the format in which cycles appear |
| Date | Timestamp for the run |
| Expiration | The expiration timestamp for a consumable, which the instrument identifies or a user inputs during consumable loading |
| ExpirationStr | An expiration ID number for a consumable, associated with the expiration date |
| FileVersion | The version of the RunParameters.json file for the run |
| Flowcell | Identifies a set of information that describes the flow cell for the run |
| ImageHeight | The height of the image in pixels |
| ImageInfo | Identifies the set of information that defines characteristics of the cytoprofiling image |
| ImageWidth | The width of the image in pixels |
| Lanes | Defines an array that lists the lanes of the flow cell for the run |
| InstrumentName | The name of the instrument for the run |
| LotNumber | The lot number for a consumable, which the instrument identifies during consumable loading before the run |
| Name | The name of a tile in a set of Tiles |
| OperatorName | The name of the person who set up the run on the instrument |
| PanelCartridge | Identifies a set of information that describes the sequencing cartridge for the run |
| PanelName | The name of the panel used for the run |
| PartNumber | The part number for a consumable, which the instrument scans or a user inputs during consumable loading |
| PlatformVersion | The version of AVITI OS for the run |
| PMGMask | A base mask used to generate the map of polonies |
| RecipeExecutionID | A UUID for the run recipe, which governs the stages of a sequencing run |
| RecipeValues | Contains additional values for the recipe that the run uses, such as a filterMask value |
| RunDescription | An optional description for the run |
| RunFolderName | The name of the output folder that AVITI OS creates for the run |
| RunID | A UUID assigned to the run |
| RunName | A text-based run identifier entered by the operator |
| RunOrder | Identifies the ordinal position for a batch in the run |
| RunType | The type of AVITI24 System run, Cytoprofiling |
| SerialNumber | The serial number for a consumable, which the instrument scans or a user inputs during consumable loading |
| Side | The side of the instrument that the run uses, SideA or SideB |
| StorageConnectionID | A UUID for the storage connection that the run uses |
| Tags | The tags that the Operator applies to the run, as applicable |
| ThroughputSelection | Identifies the path for the text file used to determine throughput selection for the run |
| Tiles | Identifies a set of tiles on the flow cell for a well |
| Type | Identifies the type of batch: BarcodingBatch, PreAmpImagingBatch, or PostAmpImagingBatch |
| WellLayout | Identifies the well layout, such as twelve-well or single-well |
| Wells | Identifies a set of information for the wells in the run |
| WellLocation | Identifies the well for the following Tiles listed |
| XMillimeters | The well position in millimeters on the X-axis of the image |
| YMillimeters | The well position in millimeters on the Y-axis of the image |
| ZPositions | Identifies the Z-positions associated with each batch in the run |
| Zs | Identifies the order of batches for the run to associate them with Z-positions |
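
When confirming run parameters in a script, one approach is to search the parsed JSON for a field name wherever it occurs, which avoids depending on how the fields are nested. The sketch below is a hypothetical helper, not part of AVITI OS, and the sample document only imitates the file: its nesting and values are assumptions made for illustration.

```python
import json


def find_values(node, key):
    """Yield every value stored under `key` anywhere in a parsed JSON tree."""
    if isinstance(node, dict):
        for name, value in node.items():
            if name == key:
                yield value
            yield from find_values(value, key)
    elif isinstance(node, list):
        for item in node:
            yield from find_values(item, key)


def check_parameters(params, expected):
    """Return {field: found_values} for fields that lack the expected value."""
    mismatches = {}
    for field, wanted in expected.items():
        found = list(find_values(params, field))
        if wanted not in found:
            mismatches[field] = found
    return mismatches


# Hypothetical excerpt; the real file has more fields and may nest differently.
SAMPLE = """
{
  "RunName": "ExampleRunName",
  "RunType": "Cytoprofiling",
  "Side": "SideA",
  "Consumables": {"Flowcell": {"LotNumber": "LOT-PLACEHOLDER"}}
}
"""

params = json.loads(SAMPLE)
problems = check_parameters(params, {"RunType": "Cytoprofiling", "Side": "SideB"})
print(problems)
```

An empty result means every expected value was found; any entry in the result lists the values that were present instead.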

Run Uploaded File

The RunUploaded.json file indicates the completion of the run. The file contains high-level information about the run and an outcome field that confirms the run outcome. AVITI OS always transfers this file last, allowing its creation to serve as a trigger to start automated downstream analysis.

The following table defines the fields in the RunUploaded.json file.

| Field | Description |
| --- | --- |
| version | The version of the RunUploaded.json file |
| instrument | The name of the instrument for the run |
| instrumentId | A UUID for the instrument |
| outcome | The final outcome of the run, OutcomeCompleted, OutcomeStopped, or OutcomeFailed |
| runType | The type of AVITI System run, Cytoprofiling |
| recipeExecutionId | A UUID for the run recipe, sourced from RunParameters.json |
| runID | A UUID assigned to the run, sourced from RunParameters.json |

Run Analysis Files Uploaded

The RunAnalysisFilesUploaded.json file indicates that all data required to begin downstream analysis has been transferred. The file contains high-level information about the run and an outcome field that confirms the run outcome. AVITI OS transfers this file only after confirming the transfer of all files required to begin analysis, allowing its creation to serve as a trigger to start automated downstream analysis.
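
A downstream pipeline can use either marker file as its trigger. The following is a minimal sketch of such a check, assuming the marker sits at the root of the run folder and carries the outcome field described above. The `read_marker` and `ready_for_analysis` names are hypothetical, and a production watcher would add polling and logging.

```python
import json
from pathlib import Path


def read_marker(run_dir, marker_name="RunUploaded.json"):
    """Return the parsed marker file, or None if it is absent or incomplete."""
    marker = Path(run_dir) / marker_name
    if not marker.is_file():
        return None
    try:
        with marker.open(encoding="utf-8") as handle:
            return json.load(handle)
    except json.JSONDecodeError:
        # The file may still be mid-transfer; treat it as not yet available.
        return None


def ready_for_analysis(run_dir, marker_name="RunUploaded.json"):
    """True only when the marker exists and reports a completed run."""
    info = read_marker(run_dir, marker_name)
    return info is not None and info.get("outcome") == "OutcomeCompleted"
```

Passing `marker_name="RunAnalysisFilesUploaded.json"` checks the earlier marker instead, for workflows that start analysis as soon as the required files arrive.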