doi.bio/brian_hie

Brian Hie

Early Life and Education

Brian Hie is an Assistant Professor of Chemical Engineering and Data Science at Stanford University and an Innovation Investigator at Arc Institute. Hie was born and raised in the United States. During his college years, Hie was passionate about poetry, particularly the work of 17th-century poet John Donne. When it came time to choose a graduate school, Hie was torn between pursuing English at Harvard and computer science at MIT. Ultimately, he chose computer science.

Career

Hie is currently an Assistant Professor of Chemical Engineering and Data Science at Stanford University and an Innovation Investigator at Arc Institute. He supervises the Laboratory of Evolutionary Design, which conducts research at the intersection of biology and machine learning.

Previously, Hie was a Stanford Science Fellow in the Stanford University School of Medicine and a Visiting Researcher at Meta AI. He completed his Ph.D. at MIT CSAIL and was an undergraduate at Stanford University. He has also worked at Google X, Illumina, and Salesforce.

Research and Publications

Hie's research focuses on the intersection of biology and machine learning, specifically using machine learning techniques derived from natural language processing to understand protein evolution. He has published extensively in this field, with notable publications including:

"Efficient evolution of human antibodies from general protein language models" (2023)
"Machine Learning for Protein Engineering" (2023)
"Evolutionary-scale prediction of atomic-level protein structure with a language model" (2023)
"Learning the language of viral evolution and escape" (2021)
"Leveraging Uncertainty in Machine Learning Accelerates Biological Discovery and Design" (2020)

- "Efficient integration of heterogeneous single-cell transcriptomes using Scanorama" (2019)

Brian Hie

Brian Hie is an Assistant Professor of Chemical Engineering and Data Science at Stanford University and an Innovation Investigator at the Arc Institute. He supervises the Laboratory of Evolutionary Design, which conducts research at the intersection of biology and machine learning.

Education

Hie completed his undergraduate degree at Stanford University and went on to obtain a Ph.D. from the Massachusetts Institute of Technology (MIT) in Electrical Engineering and Computer Science.

Career

Before joining Stanford University as a faculty member, Hie worked at Google X, Illumina, and Salesforce. He also held positions as a Stanford Science Fellow at the Stanford University School of Medicine and a Visiting Researcher at Meta AI.

Research

Hie's research focuses on the application of machine learning techniques, particularly natural language processing, to the field of biology. He uses "protein language models" trained on large repositories of protein sequences to understand and predict protein evolution. His work has demonstrated that algorithms can predict protein evolution over extended periods, and he is now exploring the use of these models to design new proteins artificially.

Notable Works

Efficient evolution of human antibodies from general protein language models – This work demonstrates how general protein language models can efficiently evolve human antibodies by suggesting evolutionarily plausible mutations, even without information about the target antigen or protein structure.
Evolutionary velocity with protein language models predicts evolutionary dynamics of diverse proteins – In this paper, Hie and his co-authors introduce the concept of "evo-velocity," a dynamic vector field of protein evolution. This approach allows for predicting the evolutionary dynamics of proteins over vastly different timescales and provides insights into viral-host immune escape and the evolution of specific protein families.
Learning the language of viral evolution and escape – Here, Hie and his colleagues use machine learning algorithms originally developed for human natural language to model viral escape, i.e., viral mutations that evade the immune system and impede vaccine development.
Leveraging Uncertainty in Machine Learning Accelerates Biological Discovery and Design – This paper discusses the benefits of quantifying prediction uncertainty in machine learning algorithms to handle novel phenomena and facilitate biological discovery.
Geometric Sketching Compactly Summarizes the Single-Cell Transcriptomic Landscape – Hie and his co-authors propose a method to enhance and accelerate single-cell data analysis by summarizing transcriptomic heterogeneity within a dataset, enabling more comprehensive visualization and sensitive detection of rare cell types.
Efficient integration of heterogeneous single-cell transcriptomes using Scanorama – Scanorama is an algorithm presented by Hie and colleagues to integrate single-cell RNA sequencing (scRNA-seq) data from multiple experiments, laboratories, and technologies. It enables the merging of shared cell types and the removal of batch effects, facilitating the discovery of biological insights.
Realizing private and practical pharmacological collaboration – In this work, Hie et al. introduce a computational protocol that ensures data confidentiality while allowing for secure and collaborative training of predictive models in pharmacological research, thus enhancing the potential for life-saving breakthroughs.
Pooled ChIP-Seq Links Variation in Transcription Factor Binding to Complex Disease Risk – The paper introduces a pooling-based approach to mapping quantitative trait loci (QTLs) for molecular-level traits, specifically focusing on transcription factor (TF) binding sites and a histone modification. This method reduces costs significantly and provides insights into genetic variants affecting TF binding and chromosomal architecture.
Schema: metric learning enables interpretable synthesis of heterogeneous single-cell modalities – Schema is a framework that uses metric learning to synthesize information across heterogeneous modalities, such as age, disease status, gene expression, and chromatin accessibility, to gain a comprehensive understanding of biological processes.
Predicting the mutational drivers of future SARS-CoV-2 variants of concern – In this study, Hie and his co-authors aim to predict Spike amino acid changes that could contribute to future variants of concern in SARS-CoV-2. They test the importance of features such as epidemiology, evolution, immunology, and neural network-based protein structure predictions.

Youtube Videos

Youtube Title: AIRR-C Seminar Series, February 22nd, 2024 - Brian Hie, Stanford University, US

Youtube Link: link

Youtube Channel Name: AIRR Community

Youtube Channel Link: https://www.youtube.com/@AIRRCommunity

AIRR-C Seminar Series, February 22nd, 2024 - Brian Hie, Stanford University, US

Youtube Title: Scientist Stories: Brian Hie, Evolution of Human Antibodies From General Protein Language Models

Youtube Link: link

Youtube Channel Name: Axial

Youtube Channel Link: https://www.youtube.com/@axialxyz

Scientist Stories: Brian Hie, Evolution of Human Antibodies From General Protein Language Models

Youtube Title: Evo: DNA foundation modeling from molecular to genome scale | Brian Hie

Youtube Link: link

Youtube Channel Name: VantAI

Youtube Channel Link: https://www.youtube.com/@Vant_AI

Evo: DNA foundation modeling from molecular to genome scale | Brian Hie

Youtube Title: MIA: Brian Hie, Learning to read and write protein evolution

Youtube Link: link

Youtube Channel Name: Broad Institute

Youtube Channel Link: https://www.youtube.com/@broadinstitute

MIA: Brian Hie, Learning to read and write protein evolution

Youtube Title: Matthew Moments | Episode 3 | Brian Hie

Youtube Link: link

Youtube Channel Name: FirstSF

Youtube Channel Link: https://www.youtube.com/@FirstSF

Matthew Moments | Episode 3 | Brian Hie

Youtube Title: Scientist Stories: Brian Hie, Learning to Read & Write Protein Evolution

Youtube Link: link

Youtube Channel Name: Axial

Youtube Channel Link: https://www.youtube.com/@axialxyz

Scientist Stories: Brian Hie, Learning to Read & Write Protein Evolution

Youtube Title: Efficient Evolution of Human Antibodies From General Protein Language Models and Sequence Info Alone

Youtube Link: link

Youtube Channel Name: ML for protein engineering seminar series

Youtube Channel Link: https://www.youtube.com/@mlforproteinengineeringsem6420

Efficient Evolution of Human Antibodies From General Protein Language Models and Sequence Info Alone

Youtube Title: VBS is too short - Brian Hie (Parody of "Baby")

Youtube Link: link

Youtube Channel Name: Darren Tung

Youtube Channel Link: https://www.youtube.com/@dtung118

VBS is too short - Brian Hie (Parody of "Baby")

Youtube Title: Biological modeling from molecular to genome scale | AI and the Molecular World | Brian Hie

Youtube Link: link

Youtube Channel Name: Applied Machine Learning Days

Youtube Channel Link: https://www.youtube.com/@AppliedMachineLearningDays

Biological modeling from molecular to genome scale | AI and the Molecular World | Brian Hie

Youtube Title: kgml2021: Translational Biology, Brian Hie, Stanford University

Youtube Link: link

Youtube Channel Name: KGML Workshop

Youtube Channel Link: https://www.youtube.com/@kgmlworkshop6368

kgml2021: Translational Biology, Brian Hie, Stanford University

Youtube Title: HIE Research Update with Dr. Brian Kalish

Youtube Link: link

Youtube Channel Name: Hope for HIE

Youtube Channel Link: https://www.youtube.com/@HopeforHIE

HIE Research Update with Dr. Brian Kalish

Youtube Title: Scientist Stories: Eric Nguyen, Using AI to Design from the molecular to genome scale

Youtube Link: link

Youtube Channel Name: Axial

Youtube Channel Link: https://www.youtube.com/@axialxyz

Scientist Stories: Eric Nguyen, Using AI to Design from the molecular to genome scale

Youtube Title: HIE Awareness Month Member stories

Youtube Link: link

Youtube Channel Name: Hope for HIE

Youtube Channel Link: https://www.youtube.com/@HopeforHIE

HIE Awareness Month Member stories

Youtube Title: [Summary] The Gamechangers Webinar 6 | Disruptive Methodologies: AI, Machine Learning, and AMR

Youtube Link: link

Youtube Channel Name: RADAAR Team

Youtube Channel Link: https://www.youtube.com/@radaarteam513

![Summary] The Gamechangers Webinar 6 | Disruptive Methodologies: AI, Machine Learning, and AMR

Youtube Title: Studies underway to improve brain health in oxygen-deprived newborns

Youtube Link: link

Youtube Channel Name: UT Southwestern Medical Center

Youtube Channel Link: https://www.youtube.com/@UTSWMed

Studies underway to improve brain health in oxygen-deprived newborns

Youtube Title: EVO: DNA Foundation Models - Eric Nguyen | Stanford MLSys #96

Youtube Link: link

Youtube Channel Name: Stanford MLSys Seminars

Youtube Channel Link: https://www.youtube.com/@StanfordMLSysSeminars

EVO: DNA Foundation Models - Eric Nguyen | Stanford MLSys #96

Youtube Title: The Gamechangers | RADAAR AMR Policy Webinar-6

Youtube Link: link

Youtube Channel Name: RADAAR Team

Youtube Channel Link: https://www.youtube.com/@radaarteam513

The Gamechangers | RADAAR AMR Policy Webinar-6

Youtube Title: If You Could Hie to Kolob

Youtube Link: link

Youtube Channel Name: Brian Daw

Youtube Channel Link: https://www.youtube.com/channel/UClm6XGOmS9X2vmfXwoZ-LNw

If You Could Hie to Kolob

Youtube Title: The Increasing Value of HIE in Colorado

Youtube Link: link

Youtube Channel Name: Contexture

Youtube Channel Link: https://www.youtube.com/@contexture2065

The Increasing Value of HIE in Colorado