Identification of macromolecular assemblies
Complete understanding of the cell requires knowledge of the underlying molecular network (i.e. social network of biomolecules). How and when molecules interact ultimately determines a cell’s behavior. Unfortunately, we lack an accurate and comprehensive map of all biomolecular interactions. My lab builds machine learning and data mining tools to discover macromolecular assemblies in high throughput proteomic experiments. Towards this I have built the most accurate and comprehensive protein complex map available, which I call hu.MAP. Hu.MAP has been central to 1) identifying novel disease genes including developmental diseases such as ciliopathies, 2) functionally annotating completely uncharacterized genes, and 3) discovering altogether novel protein assemblies suggesting new uncovered cellular functions.
See:
- Drew et al. hu.MAP 2.0: Integration of over 15,000 proteomic experiments builds a global compendium of human multiprotein assemblies. BioRxiv 2020
- Drew et al. Integration of over 9,000 mass spectrometry experiments builds a global map of human protein complexes. Molecular Systems Biology 2017
- Wan, Borgeson et al. Panorama of ancient metazoan macromolecular complexes. Nature 2015
- McWhite, Papoulas et al. A pan-plant protein complex map reveals deep conservation and novel assemblies. Cell 2020
Characterizing macromolecular assemblies
Although, we know the identity of many protein assemblies, we still know very little about what factors govern their different structural states. This lack of characterization severely inhibits our understanding of 1) how protein assemblies function, 2) how they assemble/disassemble, and 3) ultimately how they affect the global state of the cell. My lab develops high throughput proteomic methods to identify the cellular and biochemical factors (e.g. RNA/DNA/ligand content, post-translational modifications, genomic modifications, developmental stage) that govern macromolecular assemblies. Using these methods, we attempt to uncover general principles of assembly regulation and function.
See:
- Mallam et al. Drew Systematic discovery of endogenous human ribonucleoprotein complexes. Cell Reports, 2019
- Drew, Lee et al. A systematic, label-free method for identifying RNA-associated proteins in vivo provides insights into vertebrate ciliary beating machinery. Developmental Biology, 2020
- Floyd et al. Systematic Identification of Protein Phosphorylation-Mediated Interactions BioRxiv 2020
Determining the structure of assemblies
A general principle of biomolecular function is that structure defines function. With knowledge of the 3D structure of biomolecules we can fully understand their mechanistic function and begin to investigate ways of controlling biomolecules for therapeutic purposes. Unfortunately, we lack 3D structure for thousands of protein assemblies. My lab develops methods to generate structural models for macromolecular assemblies providing testable hypotheses as to their mechanistic function.
See:
- Drew et al. Identifying direct contacts between protein complex subunits from their conditional dependence in proteomics datasets. PLOS Computational Biology, 2017
- Wan, Borgeson et al. Panorama of ancient metazoan macromolecular complexes. Nature 2015
Modulation of molecular interactions
Improperly regulated protein interactions often cause disease but are difficult to target with traditional drug therapeutic approaches. My lab develops computational algorithms within the Rosetta Molecular Modeling Software Suite to design inhibitors that disrupt disease-causing interactions. My previous work in this area developed nanomolar binders to cancer drug targets (e.g. MDM2-P53 and HIF1α-P300). This work provides a foundation for modeling diverse non-protein based molecules in Rosetta, which greatly opens avenues to model a large swath of the chemical landscape such as peptidomimetics and oligosaccharides.
See: