Fast Mode
CASSIA's Fast Mode offers a streamlined, one-line solution for running the complete analysis pipeline. This mode combines annotation, scoring, and annotation boost for correcting low quality annotations in a single function call, using optimized default parameters.
Basic Usage
runCASSIA_pipeline( output_file_name = "my_analysis", tissue = "brain", species = "human", marker = marker_data, max_workers = 4 )R
Add CASSIA results back to the seurat object
seurat_corrected <- add_cassia_to_seurat( seurat_obj = seurat_corrected, # The seurat object you want to add the CASSIA results to cassia_results_path = "/FastAnalysisResults_scored.csv", #where the scored results saved, specify the path cluster_col = "celltype", # Column in Seurat object with cell types cassia_cluster_col="True Cell Type" # Column in the scored results with the true cell types ) # This will add six new columns to the seurat object:the genearl celltype, all three subcelltypes, the mostly likely celltype, the second likely celltype, the third likely celltype, and mixed celltype,and the quality score of each cell type.R
Full Parameter Options
runCASSIA_pipeline( # Required parameters output_file_name, # Base name for output files tissue, # Tissue type (e.g., "brain", "blood") species, # Species (e.g., "human", "mouse") marker, # Marker file from findallmarker, path or the data obejct # Optional parameters with defaults max_workers = 4, # Number of parallel workers # Model configurations annotation_model = "gpt-4o", # Model for annotation annotation_provider = "openai", # Provider for annotation score_model = "anthropic/claude-3.5-sonnet", # Model for scoring, it is highly recommended to use a different and more powerful model for scoring score_provider = "openrouter", # Provider for scoring annotationboost_model="anthropic/claude-3.5-sonnet", # Model for annotation boost annotationboost_provider="openrouter", # Provider for annotation boost # Analysis parameters score_threshold = 75, # Minimum acceptable score additional_info = NULL # Additional context information )R
Output Files
The pipeline generates a folder which contains the following files:
- Annotation results csv files
- Scored results csv files
- Basic CASSIA report
- Annotation boost report
Performance Tips
- For optimal performance, adjust max_workersbased on your system's CPU cores
- Use additional_infoto provide relevant experimental context
- Monitor score_thresholdto balance stringency with throughput
Next we introduce each function in detail...