Understanding and Utilizing Protein Interactions in Diverse Environments
Document
Description
Transient protein-protein and protein-molecule interactions fluctuate between associated and dissociated states. They are widespread in nature and mediate most biological processes. These interactions are complex and are strongly influenced by factors such as concentration, structure, and environment. Understanding and utilizing these types of interactions is useful from both a fundamental and design perspective. In this dissertation, transient protein interactions are used as the sensing element of a biosensor for small molecule detection. This is done by using a transcription factor-small molecule pair that mediates the activation of a CRISPR/Cas12a complex. Activation of the Cas12a enzyme results in an amplified readout mechanism that is either fluorescence or paper based. This biosensor can successfully detect 9 different small molecules including antibiotics with a tuneable detection limit ranging from low µM to low nM. By combining protein and nucleic acid-based systems, this biosensor has the potential to report on almost any protein-molecule interaction, linking this to the intrinsic amplification that is possible when working with nucleic acid-based technologies. The second part of this dissertation focuses on understanding protein-molecule interactions at a more fundamental level, and, in so doing, exploring design rules required to generalize sensors like the ones described above. This is done by training a neural network algorithm with binding data from high density peptide micro arrays incubated with specific protein targets. Because the peptide sequences were chosen simply to evenly, though sparsely, represent all sequence space, the resulting network provides a comprehensive sequence/binding relationship for a given target protein. While past work had shown that this works well on the arrays, here I have explored how well the neural networks thus trained, predict sequence-dependent binding in the context of protein-protein and peptide-protein interactions. Amino acid sequences, either free in solution or embedded in protein structure, will display somewhat different binding properties than sequences affixed to the surface of a high-density array. However, the neural network trained on array sequences was able to both identify binding regions in between proteins and predict surface plasmon resonance-based binding propensities for peptides with statistically significant levels of accuracy.