Skip to main content

Output files

The following table lists the files and folders that Cells2Stats outputs. Parquet files are column-based files that efficiently store data. For more information, see the Apache Parquet Documentation.

FileDirectory and File NameDescriptionQuantity
Spatial Data Object{root}/SpatialData/RunID.zarr.zipZarr-based structured object containing all modalities from a run for downstream analysis and visualization. For more information, see Spatial Data Object.One per run
CytoCanvas Visualization File{root}/SpatialData/cyto.vizFile for use by the Element CytoCanvas™ toolOne per run
Run Manifest, JSON{root}/RunManifest.jsonJSON file that is reserved for Element Biosciences™ processesOne per run
Run Parameters{root}/RunParameters.jsonJSON file that records information about the run configurationOne per run
Panel{root}/Panel.jsonJSON file that records information about the targets for the runOne per run
Run Stats{root}/RunStats.jsonJSON file that records overall statistics about the runOne per run
Average Normalized Well Statistics{root}/AverageNormWellStats.csvFiltered and average metrics for each well in the runOne per run
Raw Cell Statistics, CSV{root}/RawCellStats.csvCSV file that reports values per cell for all morphology features and raw target counts in a runOne per run
Versions, JSON{root}/Versions.jsonFile that reports the version number for CSV output files and bundled software programsOne per run
Barcodes{root}/Wells/{well}/{batch}/{tile}_barcodes.parquetParquet files that provide barcoding information for each polony in a tileOne per tile per batch per well
MultiQC Report, HTML{root}/multiqc_report.htmlInteractive HTML quality control (QC) summary that is generated from the RunStats.json summary. Not present if --skip-html-report is usedOne per run
Run Manifest, CSV{root}/RunManifest.csvRun manifest for the Cells2Stats executionOne per run
Cell Segmentation Mask{run}/CellSegmentation/{well}/{tile}_Cell.tifCell segmentation masks for a well, where the Cell ID is the value for a pixel in a cellOne per tile per well
Nuclear Segmentation Mask{run}/CellSegmentation/{well}/{tile}_Nuclear.tifNuclear segmentation masks for a well, where 1 is the value for a pixel in a nucleusOne per tile per well
Target Counts, JSON{root}/TargetCounts.json(For Teton Atlas™ runs with Target Cell Assignment enabled) JSON file that records read counts per spacer target across the run. Use this to assess library representation and identify low-count or absent spacers.One per run
Target Cell Assignment Manifest, CSV{root}/TargetCellAssignmentManifest.csvCopy of the target cell assignment manifest used in this run. Confirms which spacer sequences and settings were applied.One per run
Target Cell Assignment Manifest, JSON{root}/TargetCellAssignmentManifest.jsonJSON file that is reserved for Element processesOne per run
Antibody Screen Kit Report, CSV{root}/AntibodyScreenKitReport.csv(For Teton™ Custom Screen runs) Report with cell, quality control, and custom protein informationOne per run

Target cell assignment output details

When target cell assignment is enabled for at least one target site in a run, Cells2Stats populates target cell assignment-specific fields in RunStats.json and generates the following additional output files. For more information on how cell assignment works, see Cell assignment logic.

Cells are classified as pure (single spacer detected), mixed (multiple spacers, one dominant), or unassigned based on a confidence score. For a full explanation of these categories and how cell assignment is calculated, see Pure cells vs. mixed cells.

Key target cell assignment output files

FileWhat to use it for
TargetCounts.jsonSee total reads detected per spacer. High dropout (many spacers with zero reads) may indicate a manifest or TargetMask issue.
TargetCellAssignmentManifest.csvConfirm which sequences and settings were applied. Compare against your source library file to catch discrepancies.
Panel.jsonContains the TargetMask that was applied per target site. Check here if assignment rates are unexpectedly low.
RunStats.jsonContains all target cell assignment-specific quality metrics. See Target cell assignment metrics.
RawCellStats.csvPer-cell data including raw target read counts at each target site. Use for downstream per-cell analysis.

Target cell assignment metrics in RunStats.json

FieldDescriptionWhat to watch for
PercentAssignedPureCellsCells assigned to a single dominant spacer (confidence >0.5, meets minimum read count)Primary metric — aim as high as possible for your MOI
PercentAssignedMixedCellsCells with multiple spacers detected, but one is dominant (confidence >0.5)Expected to increase with higher MOI; still valid data
PercentUnassignedMixedCellsCells with multiple spacers; no spacer exceeds 50% confidenceHigh values suggest MOI may be too high for reliable single-guide assignment
PercentUnassignedLowCountCellsSpacer was matched but read count is below the minimum thresholdHigh values suggest low expression or insufficient sequencing depth
PercentTargetDropoutPercentage of spacers in the manifest not assigned to any cellHigh dropout may indicate library representation problems
MedianAbundantTargetCountMedian read count of dominant spacer in assigned cellsIndicator of per-cell sequencing depth at target site
MeanUniqueTargetsPerCellMean number of unique spacers detected per cellValues substantially above 1 suggest high MOI
UnassignedSequencesList of sequences detected at target site but not matching any manifest entryPrimary troubleshooting resource — see Using UnassignedSequences for troubleshooting

Using UnassignedSequences for troubleshooting

The UnassignedSequences field lists actual nucleotide sequences of reads detected at the target locus that did not match any spacer in the manifest. Inspect this field when assignment rates are low.

What you seeLikely causeResolution
Sequences that look like your spacers but reversedOrientation error in manifestRe-enter spacers in 5' to 3' direction
Spacer-like sequences of a different lengthTargetMask or sequence length mismatchConfirm the Y cycle count in TargetMask equals spacer length
Recognizable spacers from your librarySpacers missing from manifestAdd missing entries to [TARGET INFORMATION]
Unrecognizable sequencesOff-target reads or artifactsContact Element Technical Support

For more information on assignment thresholds and additional troubleshooting, see Cell assignment logic.

MultiQC Reports

MultiQC reports, which are generated by Seqera and designed by Element Biosciences, are available for use with Bases2Fastq™ and Cells2Stats. MultiQC reports analyze results and statistics from bioinformatics tool outputs, such as log files and console outputs. These reports help summarize experiments that contain multiple samples and multiple analysis steps and are designed to be placed at the end of pipelines or to be run manually when you finish running your tools. Furthermore, MultiQC reports contain the parsed data in a nice friendly format, ready for any further downstream analysis. For more information, see MultiQC Documentation.

Metrics

The output files contain a variety of metrics such as tile-specific and average metrics.

  • The RawCellStats.csv and RawCellStats.parquet contain a full set of morphology and quantification metrics for each target and batch.
  • The AverageNormWellStats.csv file provides the averages of these metrics for each well. Metrics that end with .std provide the standard deviation for the metric.

Metrics files report metrics from the following CellProfiler modules, unless the user runs the --skip-cellprofiler argument:

  • MeasureObjectSizeShape
  • MeasureGranularity
  • MeasureObjectIntensity
  • MeasureObjectIntensityDistribution
  • MeasureTexture

Certain metrics from these modules are not available in these files. For example, the output files do not report Zernike metrics. In some files, columns for Z-axis metrics appear with values of 0. Z-axis metrics are not available in the RawCellStats.csv and RawCellStats.parquet files because they are not relevant to the analysis output.

For more information on cytoprofiling metrics, see the measurement information in the CellProfiler Manuals.

Run statistics

The RunStats.json provides overall run statistics. The following table describes the run statistics fields:

FieldDescriptionData Type
AnalysisIDUnique ID for the analysisString
AnalysisVersionVersion of the analysis softwareString
AssignedCountPerMM2Target counts per mm² of the cell area of a quality control target. Teton and Teton Custom Screen with quality control targets.Float
AssignedCountsPerMM2Target counts per mm² of the cell area of all barcoding targets.Float
AverageAssignedCountsPerMM2Target counts per mm² of the cell area of all barcoding targets, averaged across wells.Float
BatchNameName of the batchString
BatchesList of batch-specific statisticsArray of Objects
CountCount of how many times a particular unassigned sequence appearsInteger
DemuxStatsStatistics related to demultiplexingObject
ExpectedSequenceExpected DNA sequence for the targetString
ExtraCellularRatioThe ratio of extracellular to intracellular assigned target densityFloat
FileVersionVersion of the file formatString
FlowCellIDID of the flow cell that is usedString
ManifestLabelLabel of the custom protein in the manifest when a custom protein is included. Teton Custom Screen with antibody targets.String
MeanAssignedBarcodingCountPerCellMean number of polonies assigned to a target sequence in a cell per wellFloat
MeanUniqueTargetsPerCellThe mean number of unique targets found in a cell. Teton Atlas with Target Cell Assignment enabled.Float
MedianAbundantTargetCountThe median count of the target that is assigned to cells. Teton Atlas with Target Cell Assignment enabled.Float
MedianCellDiameterMedian diameter of a cell after cell segmentation.Float
NuclearFractionFraction of the cellular assigned reads of the target that are in the nucleus. Teton Custom Screen with antibody targets.Float
NucleatedRateThe fraction of cells that contain a segmented nucleusFloat
NumPolonies (Total)Number of polonies that are detectedInteger
NumPolonies (Targets)Number of polonies that are assigned to this targetInteger
NumPolonies (Wells)Number of polonies that are in the wellInteger
PercentAssignedPercentage of reads that are assigned to a target within the target siteFloat
PercentAssignedMixedCellsPercentage of cells that are assigned and mixed with a confidence score above 50%Float
PercentAssignedPureCellsPercentage of cells that are assigned that are pure with a confidence score above 50%Float
PercentAssignedReads (Targets)Percentage of reads that are assigned to targetsFloat
PercentAssignedReads (Wells)Percentage of reads that are assigned to targets in the wellFloat
PercentCellularAssignedPercentage of reads that are assigned to cellsFloat
PercentMismatch (Total)Percentage of reads with at least one base pair mismatchFloat
PercentMismatch (Targets)Percentage of reads that are assigned to a target with at least one base pair mismatchFloat
PercentMismatch (Wells)Percentage of reads in the well with at least one base pair mismatchFloat
PercentTargetDropoutPercent of targets not assigned to any cell. Teton Atlas with Target Cell Assignment enabled.Float
PercentUnassignedLowCountCellsPercentage of cells that are unassigned that are pure but are below the confidence threshold of 50%.Float
PercentUnassignedMixedCellsPercentage of cells that are unassigned and mixed with a confidence score below 50%.Float
RunIDUnique ID for the runString
RunNameName of the sequencing runString
SequenceDNA sequence of unassigned readsString
TargetNameName of the sequencing targetString
TargetSiteNameName of the target site as provided in the Panel.json. Teton Atlas with Target Cell Assignment enabled.String
TargetsList of target-specific statisticsArray of Objects
UnassignedSequencesList of unassigned sequence detailsArray of Objects
WellLocationLocation of the well (for example, A1-F2)String
WellsList of well-specific statisticsArray of Objects

Versions

The Versions.json reports the version number for CSV output files and bundled software programs. The following table describes the fields:

FieldDescriptionData Type
FileVersionOverall version of the fileString
FileVersionsVersions of individual files in the datasetObject
AverageNormWellStats.csvVersion of AverageNormWellStats.csv fileString
RawCellStats.csvVersion of RawCellStats.csv fileString
ProgramVersionsVersions of programs that are used in the analysisObject
CellProfilerVersion of the CellProfiler programString

Other files

Some files are copied from the input directory, such as RunManifest.json, RunParameters.json, Panel.json, Cell Segmentation Masks, Nuclear Segmentation Masks, and Barcodes. For more information on these files, see Cytoprofiling Run Output Files.