Glossary
Last updated
Last updated
In order to understand the use of STOmics Tech' s analysis tools more clearly and conveniently, we have compiled common STOmics vocabulary descriptions.
Stereo-seq Chip T
Used in generating a spatially-resolved 3’ RNA library from biological tissue sections, only for fresh frozen (FF) samples currently.
Stereo-seq Chip N
Used in generating a spatially-resolved 3’ RNA library from biological tissue sections, only for formalin-fixed paraffin-embedded (FFPE) samples currently.
DNB
DNA NanoBall. DNA nanoball is the product of rolling-circle amplification (RCA) that is linearly amplified from the original circular single-stranded DNA template. DNB is the smallest capture unit on the Stereo-seq Chip.
Resolution/Pitch
Also called the chip resolution/pitch. A center-to-center distance of two adjacent DNBs.
CID
Coordinate ID. Spatial position identifier, the artificially synthesized barcode sequence unique to each spot on the Stereo-seq Chip.
MID
Molecular ID. A molecular identifier is an artificially synthesized sequence that is unique to each mRNA molecule captured from the sample. It helps differentiate the number of reads contributed by mRNA expression levels due to amplification. Two copies of native transcripts from the same molecule captured on one DNA nanoball will result in two independent reads with the same CID but different MID. In contrast, two reads with identical CID and MID originated from the same transcript but were amplified.
GEF
is a data management and storage format designed to support multidimensional datasets and high computational efficiency. Stereo-seq analysis workflow generates bin GEF and cellbin GEF files.
GEM
stores gene spatial expression data. SAW generates multiple gene expression matrix files in the workflow, the basic format requires four columns with a header row that shows the column names.
Bin
&
Bin size
Bin (or Square Bin) is the analysis unit on the expression heatmap. A bin is a fixed-sized square in which the expression value in this square is accumulated. Bins are not overlapped.
The value followed by "Bin" represents the square side length. For bin1
, each DNB on the Stereo-seq Chip is shown as a spot, which means one spot only contains the data from one DNB. BinN
means one spot that aggregates data from N×N
DNBs. For example, a spot of bin5 covers data from 5*5 DNBs.
CellBin
A cell bin stands for a region of cell on the expression heatmap recognized by the algorithm (either from image or heat map). Expression within a cell bin region is accumulated, and neighbor cell bins are not overlapped.