Hello from OSRE 2026: CellQuery-ST
Cell-Aware Query Grounding for Single-Cell and Neighborhood Retrieval from Histology

Hi everyone! My name is Tong Wu, and I’m excited to be part of the OSRE 2026 cohort. I’m a master’s student at the University of Melbourne, with a strong interest in machine learning and its applications to biology and medicine. This summer I’ll be working on CellQuery-ST under the mentorship of Xi Li.
CellQuery-ST tackles a gap in computational pathology. Today’s models can predict gene expression or answer broad, slide-level questions, but they can’t yet answer cell-aware questions about a histology image โ things like “Where are the B-cell follicles?”, “Which regions show inflammatory myeloid activity?”, or “Which neighborhoods resemble a vascular niche?” My goal is to make this kind of biologically grounded querying possible on new slides.
To do this, I’ll build a cell-aware query grounding framework that learns from spatial omics data during training but only needs the image itself at inference time. Each slide is preprocessed into a spatial index of cells, patches, and neighborhoods, and a natural-language query is matched against that index to retrieve or score the relevant spatial evidence. The system pairs spatial pathology data with CellNet โ an existing paired single-cell and language resource โ to connect text queries with cell identities, cell states, and higher-level biological concepts.
The main deliverables will be:
- A benchmark covering four task families: cell-type grounding, cell-state/programme grounding, spatial-niche grounding, and communication-hotspot grounding.
- A reusable slide indexing and retrieval pipeline for histology images.
- Reproducible baseline models and evaluation utilities for seen/unseen query generalization.
- Documentation and tutorial notebooks showing how to preprocess a new slide, run queries, and evaluate results.
I’ll be sharing updates here throughout the summer โ thanks for following along!
- Proposal: Google Drive