Output Files

The following table lists the files and folders that Cells2Stats outputs. Parquet files are column-based files that efficiently store data. For more information, see the Apache Parquet Documentation.

File | Directory and File Name | Description | Quantity
Run manifest | {root}/RunManifest.json | JSON file that is reserved for Element processes | One per run
Run parameters | {root}/RunParameters.json | JSON file that records information about the run configuration | One per run
Run Stats | {root}/RunStats.json | JSON file that records overall statistics about the run | One per run
Panel | {root}/Panel.json | JSON file that records information about the targets for the run | One per run
Cell Segmentation Mask | {root}/CellSegmentation/{well}/{tile}_Cell.tif | Cell segmentation masks for a well, where the value for a pixel in a cell is the cell ID | One per tile per well
Nuclear Segmentation Mask | {root}/CellSegmentation/{well}/{tile}_Nuclear.tif | Nuclear segmentation masks for a well, where the value for a pixel in a nucleus is 1 | One per tile per well
Average Normalized Well Statistics | {root}/AverageNormWellStats.csv | Filtered and average metrics for each well in the run | One per run
Versions, JSON | {root}/Versions.json | File that reports the version number for CSV output files and bundled software programs | One per run
Raw Cell Statistics, CSV | {root}/RawCellStats.csv | CSV file that reports values per cell for all morphology features and raw target counts in a run | One per run
Raw Cell Statistics, Parquet | {root}/RawCellStats.parquet | Parquet file that reports values per cell for all morphology features and raw target counts in a run | One per run
Barcodes | {root}/Wells/{well}/{batch}/{tile}_barcodes.parquet | Parquet files that provide barcoding information for each polony in a tile | One per tile per batch per well
Cells2Stats Log | {root}/Log/Cells2Stats.log | File recording logs for Cells2Stats execution | One per run
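
As an illustration of the path templates in the table, the following minimal Python sketch fills in the {root}, {well}, {batch}, and {tile} placeholders for the per-tile outputs. The function name and the identifier values (A1, B01, T0001) are hypothetical and are not taken from Cells2Stats; actual well, batch, and tile names depend on the run.

```python
from pathlib import Path


def per_tile_outputs(root, well, tile, batch):
    """Build the per-tile output paths by filling in the documented templates."""
    root = Path(root)
    return {
        "cell_mask": root / "CellSegmentation" / well / f"{tile}_Cell.tif",
        "nuclear_mask": root / "CellSegmentation" / well / f"{tile}_Nuclear.tif",
        "barcodes": root / "Wells" / well / batch / f"{tile}_barcodes.parquet",
    }


# Hypothetical identifiers, for illustration only.
paths = per_tile_outputs("output", "A1", "T0001", "B01")
for label, path in paths.items():
    print(label, path.as_posix())
```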

Visualization Files

The visualization folder and visualization files are generated when Cells2Stats runs with the --visualization or --visualization-only flag. An additional log file is also generated. The following table lists the additional folders and files that are generated when visualization is run:

File | Directory and File Name | Description | Quantity
cyto.viz | {root}/visualization/cyto.viz | File for use by the Element Biosciences CytoCanvas tool | One per execution, when visualization is executed
Visualization.log | {root}/Log/Visualization.log | File recording logs for the visualization execution | One per execution, when visualization is executed
cells | {root}/visualization/cells | Additional folder that is used by CytoCanvas for cell boundaries | One per execution, when visualization is executed
targets | {root}/visualization/targets | Additional folder that is used by CytoCanvas to specify targets and locations | One per execution, when visualization is executed
locations | {root}/visualization/locations | Additional folder that is used by CytoCanvas to specify targets and locations | One per execution, when visualization is executed
Multiscale_flowcell.zarr | {root}/visualization/multiscale_flowcell.zarr | Additional folder that contains imaging channels in PNG in an OME-Zarr format | One per execution, when visualization is executed
WellInformation.json | {root}/visualization/WellInformation.json | Additional file that contains information on well location, label, and color in JSON format | One per execution, when visualization is executed
TileInformation.json | {root}/visualization/TileInformation.json | Additional file that contains information on tile location, label, and color in JSON format | One per execution, when visualization is executed
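
To confirm that a visualization run produced the items in the table, a short check such as the following Python sketch can help. The relative paths come from the table; the function name is hypothetical, and the check only tests that each path exists, not that its contents are valid.

```python
from pathlib import Path

# Relative paths taken from the visualization table.
VISUALIZATION_OUTPUTS = [
    "visualization/cyto.viz",
    "visualization/cells",
    "visualization/targets",
    "visualization/locations",
    "visualization/multiscale_flowcell.zarr",
    "visualization/WellInformation.json",
    "visualization/TileInformation.json",
    "Log/Visualization.log",
]


def missing_visualization_outputs(root):
    """Return the documented visualization paths that do not exist under root."""
    root = Path(root)
    return [rel for rel in VISUALIZATION_OUTPUTS if not (root / rel).exists()]
```

An empty list from missing_visualization_outputs means that every listed file and folder is present under the output root.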

Metrics

The output files contain a variety of metrics such as tile-specific and average metrics.

  • The RawCellStats.csv and RawCellStats.parquet files contain a full set of morphology and quantification metrics for each target and batch.
  • The AverageNormWellStats.csv file provides the averages of these metrics for each well. Metrics that end with .std provide the standard deviation for the metric.
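
The .std naming convention makes it possible to separate standard deviation columns from the other columns in AverageNormWellStats.csv. The following Python sketch shows one way to do so with the standard csv module. The sample rows and the column names Well, Area, and Area.std are made up for illustration; real column names come from the file itself.

```python
import csv
import io


def split_std_columns(header):
    """Separate columns that end in .std from all other columns."""
    std_cols = [name for name in header if name.endswith(".std")]
    other_cols = [name for name in header if not name.endswith(".std")]
    return other_cols, std_cols


# Made-up sample with hypothetical column names.
sample = io.StringIO("Well,Area,Area.std\nA1,120.5,14.2\nB1,98.0,11.7\n")
reader = csv.DictReader(sample)
other_cols, std_cols = split_std_columns(reader.fieldnames)
print(other_cols)
print(std_cols)
```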

Metrics files report metrics from the following CellProfiler modules, unless the user runs with the --skip-cellprofiler flag:

  • MeasureObjectSizeShape
  • MeasureGranularity
  • MeasureObjectIntensity
  • MeasureObjectIntensityDistribution
  • MeasureTexture

Certain metrics from these modules are not available in these files. For example, the output files do not report Zernike metrics. In some files, columns for Z-axis metrics appear with values of 0. Z-axis metrics are not available in the RawCellStats.csv and RawCellStats.parquet files because they are not relevant to the analysis output.
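
Because Z-axis columns can appear with values of 0, it can be useful to identify columns that are zero in every row before downstream analysis. The following Python sketch is one possible approach that uses only the standard library. The sample data and the column names Cell, Area, and Center_Z are hypothetical, not actual Cells2Stats column names.

```python
import csv
import io


def all_zero_columns(rows, columns):
    """Return the columns whose value parses to 0 in every row."""
    if not rows:
        return []
    zero_cols = []
    for col in columns:
        try:
            if all(float(row[col]) == 0.0 for row in rows):
                zero_cols.append(col)
        except (TypeError, ValueError):
            # Non-numeric or missing values: not an all-zero column.
            pass
    return zero_cols


# Made-up sample with hypothetical column names.
sample = io.StringIO("Cell,Area,Center_Z\n1,120,0\n2,98,0\n")
reader = csv.DictReader(sample)
rows = list(reader)
zero_cols = all_zero_columns(rows, reader.fieldnames)
print(zero_cols)
```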

For more information on the cytoprofiling metrics, see the measurement information in the CellProfiler Manuals.

Run Statistics

The RunStats.json file provides overall run statistics. The following table describes the run statistics fields:

Field | Description | Data Type
AnalysisID | Unique ID for the analysis | String
AnalysisVersion | Version of the analysis software | String
FileVersion | Version of the file format | String
FlowCellID | ID of the flow cell that is used | String
RunID | Unique ID for the run | String
RunName | Name of the sequencing run | String
DemuxStats | Statistics related to demultiplexing | Object
NucleatedRate | The fraction of cells that contain a segmented nucleus | Float
NumPolonies | Number of polonies that are detected | Integer
PercentAssignedReads | Percentage of reads that are assigned to targets | Float
PercentMismatch | Percentage of reads with at least one base pair mismatch | Float
Batches | List of batch-specific statistics | Array of Objects
BatchName | Name of the batch | String
Targets | List of target-specific statistics | Array of Objects
TargetName | Name of the sequencing target | String
ExpectedSequence | Expected DNA sequence for the target | String
NumPolonies | Number of polonies that are assigned to this target | Integer
PercentMismatch | Percentage of reads that are assigned to this target with at least one base pair mismatch | Float
Wells | List of well-specific statistics | Array of Objects
WellLocation | Location of the well (for example, A1-F2) | String
NumPolonies | Number of polonies that are in the well | Integer
PercentAssignedReads | Percentage of reads that are assigned to targets in the well | Float
PercentMismatch | Percentage of reads in the well with at least one base pair mismatch | Float
UnassignedSequences | List of unassigned sequence details | Array of Objects
Count | Count of how many times a particular unassigned sequence appears | Integer
Sequence | DNA sequence of unassigned reads | String
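
Field names such as NumPolonies and PercentMismatch appear at several levels of the file. The following Python sketch collects every occurrence of a field together with its location, without depending on the exact nesting. The sample document is made up: its values and the way it nests Wells and DemuxStats are assumptions for illustration, not the documented layout of RunStats.json.

```python
import json


def find_field(node, field, path=""):
    """Yield (path, value) for every occurrence of field in nested JSON data."""
    if isinstance(node, dict):
        for key, value in node.items():
            child = f"{path}.{key}" if path else key
            if key == field:
                yield child, value
            yield from find_field(value, field, child)
    elif isinstance(node, list):
        for index, item in enumerate(node):
            yield from find_field(item, field, f"{path}[{index}]")


# Made-up sample; the nesting shown here is an assumption.
sample = json.loads("""
{
  "RunName": "example-run",
  "DemuxStats": {"NumPolonies": 1000, "PercentMismatch": 2.5},
  "Wells": [
    {"WellLocation": "A1", "NumPolonies": 600},
    {"WellLocation": "B1", "NumPolonies": 400}
  ]
}
""")
hits = list(find_field(sample, "NumPolonies"))
for location, value in hits:
    print(location, value)
```

For a real run, replace the sample with the result of json.load on the opened RunStats.json file.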

Versions

The Versions.json file reports the version number for CSV output files and bundled software programs. The following table describes the Versions.json fields:

Field | Description | Data Type
FileVersion | Overall version of the file | String
FileVersions | Versions of individual files in the dataset | Object
AverageNormWellStats.csv | Version of AverageNormWellStats.csv file | String
RawCellStats.csv | Version of RawCellStats.csv file | String
ProgramVersions | Versions of programs that are used in the analysis | Object
CellProfiler | Version of the CellProfiler program | String
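
The following Python sketch shows one way to summarize the version fields. It assumes that the file names are keys inside FileVersions and that CellProfiler is a key inside ProgramVersions, which is how the table reads but is not confirmed here. The function name and all version numbers in the sample are made up.

```python
import json


def summarize_versions(versions):
    """Return one readable line per version entry."""
    lines = [f"FileVersion: {versions.get('FileVersion', 'unknown')}"]
    for section in ("FileVersions", "ProgramVersions"):
        for name, version in sorted(versions.get(section, {}).items()):
            lines.append(f"{section}/{name}: {version}")
    return lines


# Made-up sample; version numbers are placeholders.
sample = json.loads("""
{
  "FileVersion": "1.0.0",
  "FileVersions": {
    "AverageNormWellStats.csv": "1.0.0",
    "RawCellStats.csv": "1.0.0"
  },
  "ProgramVersions": {"CellProfiler": "0.0.0"}
}
""")
summary = summarize_versions(sample)
print("\n".join(summary))
```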

Other Files

Some files are copied from the input directory, such as RunManifest.json, RunParameters.json, Panel.json, Cell Segmentation Masks, Nuclear Segmentation Masks, and Barcodes. For more information on these files, see Cytoprofiling Run Output Files.