12 Summary

- Loaded raw data outputs from a cosmx machine into a Seurat object for processing
- Performed QC and filtering to carefully remove ‘unwanted’ cells of poor quality that might interfere with downstream analyses(low counts, or high negative background)
- Normalised and scaled data to enable quanitfication of expression information and cell-cell trancriptional similarity
- Produced a UMAP plot, and visualised our experimenal information and QC metrics
- Identified and corrected for a sample-level batch effect
- Clustered cells into clusters based on their transcriptional similarity
- Identified specific gene markers for each cluster
- Classified cells using a publically available reference dataset
- Labelled celltypes
- Identified spatially restricted genes
- Copious visualisations
At this point in the analysis we could go in many directions, depending on our experimental aims. Some potential options are linked in the next steps section.
Where we started:
| File name | Description | 
|---|---|
| SLIDE-polygons.csv.gz | Cell borders | 
| SLIDE_exprMat_file.csv.gz | Counts of genes per cell (counts matrix) | 
| SLIDE_fov_positions_file.csv.gz | Location of FOVs on slide | 
| SLIDE_metadata_file.csv.gz | Cell-level QC metadata | 
| SLIDE_tx_file.csv.gz | Location of individual transcripts | 
Our processed object:
