Mapping the transcriptional regulatory circuit is central to understanding disease mechanisms.
Traditionally, ChIP-seq has been the go-to technology for mapping protein-DNA interactions.
But the harsh crosslinking requirement of ChIP-seq sometimes interferes with the binding of the antibody of interest. We have successfully employed CUT&RUN 1 2 in several papers to elucidate binding of BCL11A, GATA1, and NFY during globin gene switch.
Motif footprinting is a central element of CUT&RUN analysis. We can identify at a nucleotide resolution the protection of DNA from pA-MNase enzyme digestion; applied genome-wide, motif footprinting quantitatively assigns a binding score to each motif site.
A useful software CUT&RUNTools 3 also resulted from our study.
In addition to identifying genomic locations of TF via CUT&RUN, I also use a systems-biology approach to understand how the transcription factor network is dysregulated in a subtype dependent manner in breast cancer 4.
Using ENCODE ChIP-seq data of surrogate cell lines and motif analysis we observed that these subtypes are determined by a distinct repertoire of lineage-specific transcription factors.
Finally, similar ideas about combining coexpressed genes and motif analysis allow us to identify the critical regulators of cell identity in the Mouse Cell Atlas single cell transcriptomic dataset 5.
Each TF-targets group, collectively termed as a regulon, establishes the essential motifs potentially driving the cellular differentiation process.