Skip to main content

Output Files

The following table lists the files and folders that Cells2Stats outputs. Parquet files are column-based files that efficiently store data. For more information, see the Apache Parquet Documentation.

FileDirectory and File NameDescriptionQuantity
Run Manifest, CSV{root}/RunManifest.csvRun manifest for the Cells2Stats executionOne per run
Run Manifest, JSON{root}/RunManifest.jsonJSON file that is reserved for Element Biosciences™ processesOne per run
Run Parameters{root}/RunParameters.jsonJSON file that records information about the run configurationOne per run
Run Stats{root}/RunStats.jsonJSON file that records overall statistics about the runOne per run
Panel{root}/Panel.jsonJSON file that records information about the targets for the runOne per run
Cell Segmentation Mask{root}/CellSegmentation/{well}/{tile}_Cell.tifCell segmentation masks for a well, where the Cell ID is the value for a pixel in a cellOne per tile per well
Nuclear Segmentation Mask{root}/CellSegmentation/{well}/{tile}_Nuclear.tifNuclear segmentation masks for a well, where 1 is the value for a pixel in a nucleusOne per tile per well
Average Normalized Well Statistics{root}/AverageNormWellStats.csvFiltered and average metrics for each well in the runOne per run
Versions, JSON{root}/Versions.jsonFile that reports the version number for CSV output files and bundled software programsOne per run
Raw Cell Statistics, CSV{root}/RawCellStats.csvCSV file that reports values per cell for all morphology features and raw target counts in a runOne per run
Raw Cell Statistics, Parquet (Viz){root}/visualization/RawCellStats.parquetParquet file that reports values per cell for all morphology features and raw target counts in a runOne per run
Barcodes{root}/Wells/{well}/{batch}/{tile}_barcodes.parquetParquet files that provide barcoding information for each polony in a tileOne per tile per batch per well
Cells2Stats Log{root}/Log/Cells2Stats.logFile that records logs for the Cells2Stats executionOne per run
Target Counts, JSON{root}/TargetCounts.json(For Teton Atlas™ runs with Target Cell Assignment enabled) JSON file that records information about the target countsOne per run
Target Cell Assignment Manifest, CSV{root}/TargetCellAssignmentManifest.csvManifest for Teton Atlas runs with Target Cell Assignment enabledOne per run
Target Cell Assignment Manifest, JSON{root}/TargetCellAssignmentManifest.jsonJSON file that is reserved for Element processesOne per run
MultiQC Report, HTML{root}/multiqc_report.htmlInteractive HTML quality control (QC) summary that is generated from the RunStats.json summary. Not present if --skip-html-report is usedOne per run
Antibody Screen Kit Report, CSV{root}/AntibodyScreenKitReport.csv(For Teton™ Custom Screen runs) Report with cell, quality control, and custom protein informationOne per run

Visualization Files

The visualization folder and visualization files are generated when the --visualization or --visualization-only arguments are run. Also, an additional log file is generated. The following table lists the additional folders and files that are generated when visualization is run:

FileDirectory and File NameDescriptionQuantity
cyto.viz{root}/visualization/cyto.vizFile for use by the Element CytoCanvas™ toolOne per execution, when visualization is executed
visualization.log{root}/Log/Visualization.logFile that records logs for the visualiation executionOne per execution, when visualization is executed
cells{root}/visualization/cellsAdditional folder that is used by CytoCanvas for cell boundariesOne per execution, when visualization is executed
targets{root}/visualization/targetsAdditional folder that is used by CytoCanvas to specify targets and locationsOne per execution, when visualization is executed
locations{root}/visualization/locationsAdditional folder that is used by CytoCanvas to specify targets and locationsOne per execution, when visualization is executed
multiscale_flowcell.zarr{root}/visualization/multiscale_flowcell.zarrAdditional folder that contains imaging channels in PNG in a ome-zarr format.One per execution, when visualization is executed
WellInformation.json{root}/visualization/WellInformation.jsonAdditional folder that contains information on well location, label, and color in json formatOne per execution, when visualization is executed
TileInformation.json{root}/visualization/TileInformation.jsonAdditional folder that contains information on tile location, label, and color in json formatOne per execution, when visualization is executed
RawCellStats.parquet{root}/visualization/RawCellStats.parquetParquet file that reports values per cell for all morphology features and raw target counts in a runOne per run

MultiQC Reports

MultiQC reports, which are generated by Seqera and designed by Element Biosciences, are available for use with Cells2Stats. MultiQC reports analyze results and statistics from bioinformatics tool outputs, such as log files and console outputs. These reports help summarize experiments that contain multiple samples and multiple analysis steps and are designed to be placed at the end of pipelines or to be run manually when you finish running your tools. Furthermore, MultiQC reports contain the parsed data in a nice friendly format, ready for any further downstream analysis. For more information, see MultiQC Documentation.

Metrics

The output files contain a variety of metrics such as tile-specific and average metrics.

  • The RawCellStats.csv and RawCellStats.parquet contain a full set of morphology and quantification metrics for each target and batch.
  • The AverageNormWellStats.csv file provides the averages of these metrics for each well. Metrics that end with .std provide the standard deviation for the metric.

Metrics files report metrics from the following CellProfiler modules, unless the user runs the `--skip-cellprofiler' argument:

  • MeasureObjectSizeShape
  • MeasureGranularity
  • MeasureObjectIntensity
  • MeasureObjectIntensityDistribution
  • MeasureTexture

Certain metrics from these modules are not available in these files. For example, the output files do not report Zernike metrics. In some files, columns for Z-axis metrics appear with values of 0. Z-axis metrics are not available in the RawCellStats.csv and RawCellStats.parquet files because they are not relevant to the analysis output.

For more information on cytoprofiling metrics, see the measurement information in the CellProfiler Manuals.

Run Statistics

The RunStats.json provides overall run statistics. The following table describes the run statistics fields:

FieldDescriptionData Type
AnalysisIDUnique ID for the analysisString
AnalysisVersionVersion of the analysis softwareString
AssignedCountPerMM2Target counts per mm² of the cell area of a quality control target. Teton and Teton Custom Screen with quality control targets.Float
AssignedCountsPerMM2Target counts per mm² of the cell area of all barcoding target counts in the CytoStats and Teton Atlas target counts.Float
AverageAssignedCountsPerMM2Target counts per mm² of the cell area of all barcoding targets, averaged across wells.Float
BatchNameName of the batchString
BatchesList of batch-specific statisticsArray of Objects
CountCount of how many times a particular unassigned sequence appearsInteger
DemuxStatsStatistics related to demultiplexingObject
ExpectedSequenceExpected DNA sequence for the targetString
ExtraCellularRatioThe ratio of extracellular to intracellular assigned target densityFloat
FileVersionVersion of the file formatString
FlowCellIDID of the flow cell that is usedString
ManifestLabelLabel of the custom protein in the manifest when a custom protein is included. Teton Custom Screen with antibody targets.String
MeanAssignedBarcodingCountPerCellMean number of polonies assigned to a target sequence in a cell per wellFloat
MeanUniqueTargetsPerCellThe mean number of unique targets found in a cell. Teton Atlas with Target Cell Assignment enabled.Float
MedianAbundantTargetCountThe median count of the target that is assigned to cells. Teton Atlas with Target Cell Assignment enabled.Float
MedianCellDiameterMedian diameter of a cell after cell segmentation.Float
NuclearFractionFraction of the cellular assigned reads of the target that are in the nucleus. Teton Custom Screen with antibody targets.Float
NucleatedRateThe fraction of cells that contain a segmented nucleusFloat
NumPolonies (Total)Number of polonies that are detectedInteger
NumPolonies (Targets)Number of polonies that are assigned to this targetInteger
NumPolonies (Wells)Number of polonies that are in the wellInteger
PercentAssignedPercentage of reads that are assigned to a target within the target siteFloat
PercentAssignedMixedCellsPercentage of cells that are assigned and mixed with a confidence score above 50%Float
PercentAssignedPureCellsPercentage of cells that are assigned that are pure with a confidence score above 50%Float
PercentAssignedReads (Targets)Percentage of reads that are assigned to targetsFloat
PercentAssignedReads (Wells)Percentage of reads that are assigned to targets in the wellFloat
PercentMismatch (Total)Percentage of reads with at least one base pair mismatchFloat
PercentMismatch (Targets)Percentage of reads that are assigned to a target with at least one base pair mismatchFloat
PercentMismatch (Wells)Percentage of reads in the well with at least one base pair mismatchFloat
PercentTargetDropoutPercent of targets not assigned to any cell. Teton Atlas with Target Cell Assignment enabled.Float
PercentUnassignedLowCountCellsPercentage of cells that are unassigned that are pure but are below the confidence threshold of 50%.Float
PercentUnassignedMixedCellsPercentage of cells that are unassigned and mixed with a confidence score below 50%.Float
RunIDUnique ID for the runString
RunNameName of the sequencing runString
SequenceDNA sequence of unassigned readsString
TargetNameName of the sequencing targetString
TargetSiteNameName of the target site as provided in the Panel.json. Teton Atlas with Target Cell Assignment enabled.String
TargetsList of target-specific statisticsArray of Objects
UnassignedSequencesList of unassigned sequence detailsArray of Objects
WellLocationLocation of the well (for example, A1-F2)String
WellsList of well-specific statisticsArray of Objects

Versions

The Versions.json reports the version number for CSV output files and bundled software programs. The following table describes the run statistics fields:

FieldDescriptionData Type
FileVersionOverall version of the fileString
FileVersionsVersions of individual files in the datasetObject
AverageNormWellStats.csvVersion of AverageNormWellStats.csv fileString
RawCellStats.csvVersion of RawCellStats.csv fileString
ProgramVersionsVersions of programs that are used in the analysisObject
CellProfilerVersion of the CellProfiler programString

Other Files

Some files are copied from the input directory, such as RunManifest.json, RunParameters.json, Panel.json, Cell Segmentation Masks, Nuclear Segmentation Masks, and Barcodes. For more information on these files, see Cytoprofiling Run Ouput Files.