Tutorials Overview¶
This section provides comprehensive tutorials for using renalprog to analyze kidney cancer progression. Each tutorial is designed to be self-contained yet builds upon previous steps.
Tutorial Structure¶
🚀 Quick Start¶
New to renalprog? Start here!
- Quick Start: 10-minute introduction to core functionality
- Using Pretrained Models: Fastest way to reproduce paper results
- Data Requirements: Understanding input data formats
📋 Complete Pipeline¶
Step-by-step walkthrough of the entire analysis pipeline:
- Data Processing: Download, filter, and preprocess TCGA data
- VAE Training: Train variational autoencoders
- Reconstruction Validation: Assess model quality
- Trajectory Generation: Create synthetic progression paths
- Classification: Stage prediction and biomarker discovery
- Enrichment Analysis: Pathway analysis with GSEA
🎨 Visualization¶
- Visualization Guide: Create publication-quality figures
Learning Paths¶
For Biologists¶
If you're primarily interested in biological insights:
- Start with Quick Start
- Read Data Requirements to understand the data
- Follow Complete Pipeline end-to-end
- Focus on Enrichment Analysis for pathway interpretation
- Use Visualization Guide for publication figures
For Computational Scientists¶
If you want to customize models or develop new methods:
- Complete Quick Start
- Study VAE Training in detail
- Explore API Reference for implementation details
- Read Custom Models for extending functionality
- Review Architecture for design principles
For Reproducing Published Results¶
If you want to reproduce the paper:
Option 1: Using Pretrained Models (Recommended)
- Follow Using Pretrained Models tutorial
- This is the fastest and most accurate way to reproduce results
- Uses the exact models from the paper
Option 2: Training from Scratch
- Follow System Requirements
- Complete Data Preparation
- Execute Running the Pipeline
- Validate using Expected Results
Tutorial Conventions¶
Code Blocks¶
Python code to execute:
Shell commands:
Callouts¶
Note
Informational notes provide additional context.
Tip
Tips offer helpful suggestions and best practices.
Warning
Warnings highlight potential issues or common pitfalls.
Danger
Critical warnings about data loss or major errors.
Example
Example outputs or usage patterns.
File Paths¶
All file paths are relative to the repository root unless otherwise specified:
renalprog/
├── data/
│ ├── raw/ # Downloaded TCGA data
│ ├── interim/ # Intermediate processing outputs
│ └── processed/ # Final processed data
├── models/ # Trained models
├── reports/ # Analysis results
└── scripts/ # Pipeline scripts
Prerequisites¶
Before starting these tutorials, ensure you have:
- ✅ Installed
renalprog(Installation Guide) - ✅ Python 3.9+ and R 4.0+ available
- ✅ At least 16 GB RAM (8 GB minimum)
- ✅ 50+ GB free disk space
- ✅ (Optional) CUDA-capable GPU for faster training
Time Estimates¶
| Tutorial | Reading Time | Execution Time | Difficulty |
|---|---|---|---|
| Quick Start | 10 min | 15 min | ⭐ Easy |
| Data Processing | 15 min | 30 min | ⭐ Easy |
| VAE Training | 20 min | 2-4 hours* | ⭐⭐ Moderate |
| Reconstruction | 10 min | 10 min | ⭐ Easy |
| Trajectories | 15 min | 30 min | ⭐⭐ Moderate |
| Classification | 20 min | 1 hour | ⭐⭐ Moderate |
| Enrichment | 25 min | 2-6 hours* | ⭐⭐⭐ Advanced |
| Visualization | 15 min | 30 min | ⭐⭐ Moderate |
* With GPU acceleration and 8+ CPU cores. Times may be significantly longer on older hardware.
Getting Help¶
If you encounter issues while following these tutorials:
- Check the Troubleshooting Guide
- Review the API Reference for function details
- Search GitHub Issues
- Ask in GitHub Discussions
Next Steps¶
Ready to begin? Start with the Quick Start Tutorial!