Immune Signatures of Vaccination and Infection Responses
October 28, 2024Information
- ID
- 12268
- To Cite
- DCA Citation Guide
Transcript
- 00:00So, kicking us off in
- 00:02session two, it's,
- 00:03Steve, Kleinstein,
- 00:05to who's the Anthony Brady
- 00:06professor of pathology here at
- 00:08the Yale School of Medicine.
- 00:10He also co directs the
- 00:11grad program in computational biology
- 00:11and
- 00:13biomedical informatics,
- 00:15with secondary appointments in immunobiology
- 00:18and the new department of
- 00:19biomedical informatics and data science.
- 00:22And so Steve, came to
- 00:23immunology from computer science, and
- 00:25he's been a leader working
- 00:27on
- 00:29various aspects of the immune
- 00:30system for the last, two
- 00:31decades. So welcome, Steve.
- 00:36Thank you.
- 00:39It's, re really nice to
- 00:41see all all the energy
- 00:42and all the all the,
- 00:43collaborations that are starting up,
- 00:45here at Yale in this
- 00:46area of, computational systems, immunology,
- 00:49AI and engineering.
- 00:50So I am going to,
- 00:52hit on a lot of
- 00:53the themes that so a
- 00:54a bunch of the previous
- 00:55speakers already, already touched upon.
- 00:58And broadly speaking, we are,
- 00:59we are interested in how
- 01:01to read an individual's immunological
- 01:03state. So there's lots of
- 01:04new and emerging experimental methods
- 01:06that allow one to profile
- 01:07somebody's immune state,
- 01:09at at very high throughput
- 01:10ranging from things like transcriptomics,
- 01:13single cell transcriptomics, metabolomics,
- 01:16proteomics,
- 01:17and also profiling of the
- 01:18adaptive immune receptor repertoire, b
- 01:20and t cell receptor repertoires.
- 01:23My lab both, develops novel
- 01:25computational methods to, analyze,
- 01:28and, deal with a lot
- 01:29of these new, and emerging
- 01:30data types,
- 01:31as well as working with
- 01:32clinical and experimental groups to
- 01:34apply those,
- 01:35those techno those techniques,
- 01:37to real to real data
- 01:38to understand immunological
- 01:40state. And the kinds of
- 01:41questions we're broadly interested in
- 01:42are some of the kind
- 01:43some of the questions that
- 01:44John already already touched on.
- 01:46So how do we you
- 01:47know, can we take these
- 01:48immunological measurements,
- 01:49and say something about the
- 01:51exposure history,
- 01:52of the individual? Right? The
- 01:53immune system encodes to some
- 01:55degree our history of prior
- 01:56exposure. So can we say
- 01:57something about whether an individual
- 01:58has been exposed to, say,
- 01:59SARS COV two? Does the
- 02:01individual carry protective memory from
- 02:02this, from this infection?
- 02:05Talking about current immunological state,
- 02:06can we profile somebody's immune
- 02:08system, and make an inference
- 02:10about whether they have are
- 02:11are currently undergoing
- 02:13an acute infection? With what
- 02:14virus are they currently infected
- 02:15with? Can we say something
- 02:17about the future? Will the
- 02:18outcome of that, that infection
- 02:19be sort of a mild
- 02:20infection that resolves quickly, or
- 02:22will it be more more
- 02:23severe?
- 02:24And we can we also
- 02:25wanna be able to ask
- 02:26questions about, the the the
- 02:28outcome of different,
- 02:29interventions or things like vaccinations.
- 02:31We give someone a vaccine,
- 02:33how does their immunological state
- 02:34can we predict what the
- 02:35outcome of that vaccine will
- 02:37be? Will they generate good
- 02:38protective, immunological
- 02:39memory? So I'm gonna talk
- 02:41at, somewhat high level about
- 02:43a few of the different
- 02:44projects going on in my
- 02:45lab to touch on on
- 02:46these different aspects.
- 02:47So first, let's talk about
- 02:48identifying immunological exposures from somebody's
- 02:51host, host response profile.
- 02:53So the idea here is
- 02:55that we're going from a,
- 02:56a population of people. We
- 02:57take some kind of blood
- 02:58or tissue sample, and we
- 03:00profile that sample. So here,
- 03:01for example, we're looking at
- 03:03transcriptional profiles that we take,
- 03:05so we measure different gene
- 03:07expression levels, genes on the
- 03:08x axis, samples from those
- 03:09individuals on the y axis.
- 03:11So we get these different
- 03:11gene expression profiles, and we
- 03:13wanna make some inferences. So,
- 03:14for example, can we look
- 03:15at those gene expression profiles
- 03:17and discern a pattern that
- 03:18might tell us, hey. This
- 03:19group of individuals
- 03:21is currently undergoing an acute
- 03:22response to West Nile virus
- 03:23infection.
- 03:24So there are a lot
- 03:25of publications, a lot of
- 03:26studies that try to develop
- 03:28these types of signatures. Right?
- 03:29You recruiting a cohort of
- 03:30people with an infection,
- 03:32some kind of control populations,
- 03:34profile them, do some machine
- 03:35learning, some differential expression analysis,
- 03:37come up with some set
- 03:38of genes or metabolites or
- 03:39proteins that are characteristic of
- 03:41that, of that infection.
- 03:43So we set out a
- 03:44few years ago to evaluate
- 03:45some of these signatures that
- 03:46have been proposed in literature.
- 03:48CBB graduate student, Jan Chawla,
- 03:51wanted to ask a couple
- 03:52of questions.
- 03:53Particularly, he wanted to look
- 03:54at the the signatures that
- 03:56were in the literature and
- 03:57assessed how, robust they were.
- 03:59So to what extent you
- 04:00could take a signature from
- 04:01one study and, apply that
- 04:03signature to an independent,
- 04:04cohort and have it be
- 04:05predictive and also cross reactivity.
- 04:07So to what extent could
- 04:08you take a signature of,
- 04:09say, for some particular virus
- 04:11infection,
- 04:12like a like an influenza
- 04:13infection?
- 04:14And to what extent was
- 04:15that signature specific to that
- 04:17infection or would it what
- 04:18we call cross react? Would
- 04:20that signature also come up,
- 04:21in a lot of other
- 04:22viral infections or bacteria or
- 04:24or other types of infections
- 04:25like a bacterial?
- 04:26So we put together a
- 04:27cohort of about, at the
- 04:29time of the publication, it
- 04:29was a hundred and fifty
- 04:30datasets and a a a
- 04:32bit over seventeen thousand,
- 04:34transcriptional profiles
- 04:35encompassing a whole variety of
- 04:37viral, viral infections, bacterial infections,
- 04:40as well as some noninfections,
- 04:42conditions that we thought might,
- 04:43might be similar, might look
- 04:45similar transcriptionally to a,
- 04:47to an infection response.
- 04:48And he evaluated those signatures,
- 04:50for their robustness and cross
- 04:52reactivity
- 04:53and brought you know, high
- 04:54level what we found from,
- 04:55from the study is one,
- 04:56a lot of the signatures
- 04:57that were,
- 04:59proposed in literature, in fact,
- 05:00were quite robust. Generally speaking,
- 05:02if you developed a signature
- 05:04that was supposed to be
- 05:04predictive of some virus response,
- 05:07it was reproducible in in,
- 05:09other studies of the of
- 05:10that virus response. But there
- 05:11was also a large amount
- 05:12of cross reactivity, and maybe
- 05:14this is something,
- 05:15that's not too surprising.
- 05:17A lot of these a
- 05:18lot of these studies, you
- 05:19will get, you know, for
- 05:20example, like an interferon signature
- 05:22that will come up in
- 05:23almost every virus,
- 05:24virus response that you study.
- 05:26And you might expect if
- 05:27you don't account for that,
- 05:28that signature is also gonna
- 05:29be predictive of multiple other
- 05:30viruses.
- 05:31We also found those signatures
- 05:33cross react with a lot
- 05:34of noninfectious conditions. So here
- 05:35you see, for example, we
- 05:36we evaluated a bunch of
- 05:37different, signatures for virus infections
- 05:39along the x axis and
- 05:41evaluated their ability to predict,
- 05:43aging. So just comparing young
- 05:44versus older older individuals. And
- 05:46you can see a lot
- 05:47of these signatures,
- 05:48as measured by the area
- 05:50under the curve for predicting
- 05:51old versus young are highly
- 05:52predictive of age, and only
- 05:54a subset of signatures is
- 05:55not. So for so we
- 05:56are getting a lot of
- 05:57cross reactivity, and you have
- 05:58to account for this if
- 05:59you wanna use these signatures,
- 06:01for, predictive responses.
- 06:03And in general, we are
- 06:04interest often interested in what
- 06:05makes a response unique. Right?
- 06:07In general, we don't necessarily
- 06:08wanna pick out, oh, we're
- 06:09getting an interferon response,
- 06:11in in in coming up
- 06:12in response to a viral
- 06:13infection. We wanna know what's
- 06:14specific about a SARS COV
- 06:16two infection that makes it
- 06:17different from a typical antiviral
- 06:19response, or what's different about
- 06:20this,
- 06:21vaccine response, this influenza vaccine
- 06:23response,
- 06:24when it doesn't generate protective
- 06:25memory that makes it different
- 06:27from other types of, responses?
- 06:28So we've also worked on
- 06:29methods in collaboration with, Elena
- 06:31Daslavsky's group at Mount Sinai
- 06:33to as methods to generate
- 06:35robust signatures that are not
- 06:36cross reactive. And this takes
- 06:38advantage of the massive public
- 06:40data that's available,
- 06:41profiling different types of responses.
- 06:43So if we have our
- 06:44response, our our our dataset
- 06:46of interest, so for example,
- 06:47disease versus a control, we
- 06:49can also integrate all sorts
- 06:51of control samples from other
- 06:52diseases that are in the
- 06:53public domain to come up
- 06:55with our signature that's specific.
- 06:56So for example, if we're
- 06:57profiling the influenza,
- 06:59an influenza infection response, we
- 07:00might pull from the literature
- 07:02lots of other
- 07:12specificities, so gene expression,
- 07:14protein expression signatures are are
- 07:16nice, but the immune system,
- 07:17we can take advantage already
- 07:19has a a sort of
- 07:20component built in that's highly
- 07:22specific,
- 07:22for the for the response
- 07:24being generated, and that is
- 07:25the the the adaptive immune
- 07:26response, the b cells and
- 07:28the t cells, that compose
- 07:29the adaptive arms of the
- 07:30immune response. I'm gonna focus
- 07:32on b cells. My lab
- 07:33focuses works a lot on
- 07:34b cell receptor, responses.
- 07:36So b cells have antibody
- 07:38receptors on their surface that
- 07:39give them specificity,
- 07:41to recognize different pathogens.
- 07:43The naive repertoire coming out
- 07:44of the bone marrow, so
- 07:45our our immune system constantly
- 07:47produces naive b cells with,
- 07:49pretty much a unique receptor
- 07:50on every cell that's generated
- 07:52from the bone marrow, and
- 07:53that gives those cells their,
- 07:54a unique ability to recognize
- 07:56different pathogens. So some cells
- 07:58coming out of the bone
- 07:59marrow might, have a have
- 08:00a,
- 08:01innate specificity for the influenza
- 08:03virus. Others might have an
- 08:04innate specificity for,
- 08:06for SARS CoV two. Once
- 08:08you get once you encounter
- 08:10a a pathogen, you're confronted
- 08:11with pathogenic challenge, whether it
- 08:13be a natural infection, or
- 08:14a vaccine.
- 08:15Some of those naive b
- 08:16cells are stimulated, activated into
- 08:18the response, and they undergo
- 08:20a process of, rapid tonal
- 08:22expansion and somatic hypermutation.
- 08:25So these cells introduce point
- 08:26mutations into the DNA that
- 08:28codes for their receptor. This
- 08:29is a really scary process
- 08:30when I first learned about
- 08:31it,
- 08:32but it's actually quite,
- 08:34quite good because it allows
- 08:35our bodies to learn about
- 08:36the pathogenic environment.
- 08:38And through a process of
- 08:39mutation and selection, we can
- 08:41generate higher and higher affinity
- 08:42receptors for the pathogen that
- 08:44we're confronted with. So we
- 08:45can learn about the pathogenic
- 08:46environment through our, throughout our
- 08:48lifetime.
- 08:49And this occurs in specialized
- 08:51structures called, germinal centers in
- 08:53our secondary lymphoid organs.
- 08:55And here you see actually
- 08:56a a germinal center. The
- 08:58b cells are ingrained from
- 08:59a live anesthetized mouse, undergoing
- 09:01an immune response. This is
- 09:02work we did with Anne
- 09:03Haberman,
- 09:04with two photon microscopy over,
- 09:05over a decade ago. So
- 09:07this process results in affinity
- 09:09maturation of the b cell
- 09:11repertoire, and you can imagine
- 09:12that we can learn a
- 09:13lot about the history of
- 09:14immunological
- 09:15exposures by studying these, the,
- 09:17the set of b cell
- 09:18receptors carried by an individual.
- 09:20Since two thousand nine, we've
- 09:21had the ability to sequence
- 09:23the b cell receptors at
- 09:24a high throughput,
- 09:26both with,
- 09:27both with bulk sequencing technologies
- 09:29where we can take a
- 09:30tissue sample,
- 09:31sequence the b cell receptor
- 09:32using PCR amplification and high
- 09:34throughput sequencing. That gives us
- 09:35anywhere from hundreds of thousands
- 09:37to millions of b cell
- 09:39receptors in a sample,
- 09:40and more recently through single
- 09:42cell technologies where we can
- 09:43get individual from a single
- 09:45cell, the b cell receptor
- 09:46along with the pair transcriptome
- 09:48for tens of thousands of
- 09:49cells, at a time.
- 09:51And, of course, the challenge
- 09:52is now once we're once
- 09:53we have these receptors, how
- 09:55do we
- 09:56how do we interpret them?
- 09:57So the typical data we
- 09:58get kinda looks like this.
- 10:00Right? We get the b
- 10:01cell receptor sequence. Each row
- 10:02is a b cell receptor
- 10:03sequence,
- 10:05associated with the cell. And
- 10:06now we wanna we wanna
- 10:07know things like, well, which
- 10:08one of these b cells
- 10:09which one of these b
- 10:09cell receptors,
- 10:11is is specific or binds
- 10:13to the influenza hemagglutinin,
- 10:14molecule, or which one of
- 10:15these are specific for the
- 10:16SARS CoV two spike protein.
- 10:18And I wanna thank Deepa
- 10:20for for introducing these,
- 10:22representation learning approaches because that's
- 10:24one of the things we
- 10:25do with b cell receptors
- 10:26is we take advantage of
- 10:28these, language based language based
- 10:30models
- 10:30to, develop,
- 10:32embedding approaches where we can
- 10:34learn we can take those
- 10:35b cell receptors,
- 10:36map them into a lower
- 10:37dimensional space where we can
- 10:39then,
- 10:40develop classification models and do
- 10:41analysis and learn, learn all
- 10:43sorts of interesting things about
- 10:44the b cell receptor.
- 10:46So one of the things
- 10:47we wanted to learn about
- 10:48and one of the, sort
- 10:49of important key challenges for
- 10:50the community is learning specificity.
- 10:52So we would like to
- 10:53be able to take a
- 10:54b cell receptor sequence and
- 10:55predict that this b cell
- 10:57receptor is going to bind,
- 10:58for example, to SARS CoV
- 10:59two spike protein. A graduate
- 11:01student, Mimi Wang, from the
- 11:03CBB program,
- 11:04did a did a study
- 11:05a couple of years ago,
- 11:07looking actually, just just was
- 11:09published this, this past year
- 11:10early early this year, actually.
- 11:12Looking at multiple embedding methods.
- 11:14Some of these are general
- 11:15protein embedding methods that were
- 11:17generated not for antibody receptors,
- 11:18but to embed general protein
- 11:19sequences,
- 11:21and others, such as this
- 11:22AnteBERTy were developed specifically to
- 11:24embed antibody
- 11:25receptors.
- 11:26And she evaluated the ability
- 11:28of these, of these different
- 11:29embedding methods to predict SARS
- 11:31CoV two spike binding.
- 11:33There's a background control where
- 11:34we scrambled up the sequences
- 11:35shown in gray here, that
- 11:37has the expected
- 11:39f ones. And you can
- 11:40see on the real data
- 11:42shown in the colored, in
- 11:43the colored bar too, we
- 11:44actually get a pretty good
- 11:45ability,
- 11:46a very good ability to
- 11:47predict SARS CoV two spike
- 11:48binding,
- 11:49from these, from the, from
- 11:50these embedded b cell receptors.
- 11:52And in fact, some of
- 11:53the, you know, the the
- 11:54generic method, the general methods
- 11:56actually work pretty well. We
- 11:57do get a little bit
- 11:58of a performance boost, from
- 11:59the models that were developed
- 12:01specifically on antibody receptors.
- 12:03We have a newer study
- 12:04that's on bioRxiv now where
- 12:05we've shown that if you
- 12:06take these, general methods and
- 12:08you actually do a little
- 12:08bit of fine tuning on
- 12:09them specific to the problem,
- 12:11we can actually boost, that
- 12:12power to predict specificity,
- 12:14even further.
- 12:16So that's the power to
- 12:17predict specificity on a single
- 12:18sequence. What about whole repertoires?
- 12:20So when we take a
- 12:21blood sample or a tissue
- 12:22sample from a person, we
- 12:23don't get a single sequence.
- 12:25We actually get tens of
- 12:26thousands to to potentially millions
- 12:28of sequences.
- 12:29It looks something like this.
- 12:30This is actually a blood
- 12:31repertoire from somebody undergoing, an
- 12:32actual vaccine response seven days
- 12:34post the response,
- 12:36where each of the points
- 12:36in these trees a leaf
- 12:37and each of those,
- 12:39and each of those,
- 12:40sort of circle diagrams represent
- 12:42a clonal expansion starting from
- 12:44a starting from a naive
- 12:45b cell. So you can
- 12:46see there is a bunch
- 12:47of clonal expansions in the
- 12:48blood.
- 12:49So that's pretty indicative of
- 12:51of, an actual ongoing infection.
- 12:53Generally speaking, in a in
- 12:54a healthy human, at rest,
- 12:56you don't have large clonal
- 12:57expansions,
- 12:58in your blood. But now
- 12:59can we tell what infection
- 13:01or what vaccination this person
- 13:02is actually responding to? So
- 13:04one simple thing we can
- 13:05do is take all of
- 13:06the b cell receptor sequences
- 13:08in these clones, calculate the
- 13:09probability of it being a
- 13:10SARS CoV-two specific clone, and
- 13:12just average over the repertoire.
- 13:14And you can see that
- 13:15actually gives us a pretty
- 13:16good power to predict, individuals
- 13:17who are twenty eight days
- 13:18post SARS COV two vaccine,
- 13:20shown here on the right,
- 13:21versus a control population at
- 13:23rest pre vac pre vaccination.
- 13:26We do a lot of
- 13:27work like this with b
- 13:28cell receptors. As I mentioned,
- 13:29we we do all sorts
- 13:30of methods for preprocessing, population
- 13:32structure inference, and repertoire analysis.
- 13:34And BILab makes available a
- 13:36a computational framework called called
- 13:38incantation,
- 13:39to deal with all sorts
- 13:40of analysis of these kinds
- 13:41of data,
- 13:42which is pretty widely used
- 13:44by the by the community.
- 13:45And I'll just take a
- 13:46moment to plug. We are
- 13:47having our twenty twenty five
- 13:48users group meeting on January
- 13:50thirtieth. If you are interested
- 13:51in joining us for that,
- 13:52please do please do sign
- 13:53up. It's free. If you've
- 13:55ever used our tools and
- 13:56are interested in submitting an
- 13:57abstract, the abstract deadline is
- 13:59just about a week from
- 13:59now.
- 14:01So now let me move
- 14:02to an, beyond just sort
- 14:04of looking at the current
- 14:05state of an individual,
- 14:07into the problem of can
- 14:08we predict how an individual
- 14:09respond to a to a
- 14:10certain immunological exposure? John already
- 14:13introduced this idea of of
- 14:14variable you know, human variability
- 14:16to the same immunological exposure.
- 14:18So for example, in a
- 14:19vaccine response, if we take
- 14:21a bunch of young adults,
- 14:23shown along the x axis
- 14:24here, we give them the
- 14:25seasonal flu vaccine and we
- 14:26measure their antibody response on
- 14:28the y axis. You can
- 14:29see there's a whole,
- 14:31wide variety of different, responses.
- 14:33Some people barely generate any,
- 14:36or actually generate no boost
- 14:37in their antibody titers, while
- 14:39other individuals generate, you know,
- 14:41orders of magnitude increase in
- 14:42their, in their titers. So
- 14:43we wanna know, can we
- 14:44predict the response to this
- 14:46vax to to influenza vaccination?
- 14:48And more broadly, can we
- 14:50predict the response to other
- 14:51vaccinations? Is there a universal
- 14:52signature? Is this are the
- 14:53same biological mechanisms, the same
- 14:56signatures that are associated with
- 14:57good flu vaccine responses? Do
- 14:59those also hold for things
- 15:00like yellow fever, for smallpox,
- 15:02for SARS CoV two?
- 15:04And to do that, we
- 15:05leveraged the power of a
- 15:06consortium that we're involved in.
- 15:07John already mentioned, HIPSI, the
- 15:09human human immunologic human immunology
- 15:11project consortium.
- 15:13This is some a consortium
- 15:14that's been funded since two
- 15:16thousand and ten. We've had
- 15:17one of the centers here
- 15:18at Yale.
- 15:18It's currently co directed by
- 15:20Ruth Montgomery and and David
- 15:21Hafler, and we've been part
- 15:22of it since the beginning
- 15:24in twenty ten.
- 15:26HiPSI does a lot of
- 15:27high throughput profiling of of,
- 15:29humans in diverse immunological states.
- 15:32All of the data is
- 15:33deposited into a repository called
- 15:35import.
- 15:36There's over a hundred different
- 15:37studies,
- 15:38over, almost eight thousand different
- 15:40individuals and lots and lots
- 15:41of measurements.
- 15:43As part of this consortium,
- 15:44we put together a data
- 15:45resource of all the studies
- 15:47that were done, profiling human
- 15:49vaccine responses.
- 15:50In the end, we profiles
- 15:52of thirteen different vaccines where
- 15:53we could relate transcriptional profiles,
- 15:56to antibody tighter, tighter responses.
- 15:58From the analysis of those
- 16:00data, we did two different
- 16:01things. One is we looked
- 16:02for baseline immune states.
- 16:04So could you take somebody's
- 16:06immunological state prior to vaccination
- 16:08and predict how they were
- 16:09gonna respond to the to
- 16:10the to the vaccine?
- 16:12So we could, in fact,
- 16:14generate identify those kinds of
- 16:15profiles as well as temporal
- 16:17profiles. So could you look
- 16:18a day or a week
- 16:19post vaccination
- 16:21and predict the longer term
- 16:22antibody response to vaccine? It
- 16:24turns out you can do
- 16:24that, you can do that
- 16:26as well.
- 16:27And just to show you
- 16:28one,
- 16:29one aspect
- 16:30of that and the just
- 16:31to emphasize the importance of
- 16:33kinetics and and and sort
- 16:34of timing of the response.
- 16:35So here you see a
- 16:37a bunch of those different
- 16:38vaccines that we profiled. Each
- 16:39one is shown by a
- 16:40different line, days post vaccination
- 16:42on the x axis, and
- 16:43the y axis shows the
- 16:45ability of a plasma cell
- 16:46signature to predict the antibody
- 16:48titers of the response at
- 16:49each at different days post
- 16:50vaccination.
- 16:52And you can see for
- 16:52the flu vaccine, if you
- 16:54look seven days post vaccination,
- 16:56we know that there's a
- 16:57burst of plasma blasts that
- 16:58are generally observed in the
- 16:59blood, post vaccination in the
- 17:01flu response. And in fact,
- 17:02those that plasma blast is
- 17:04just highly predictive of the
- 17:05ultimate antibody response, to flu
- 17:08vaccine.
- 17:08A lot of other vaccines
- 17:10share those characteristics,
- 17:11but notably not the yellow
- 17:12fever vaccine. So in yellow
- 17:14fever, if you look at
- 17:15seven days and you look
- 17:15for that signature, you don't
- 17:17see it. But if you
- 17:18look a little bit later,
- 17:19say say,
- 17:20ten days or or a
- 17:21couple weeks after the vaccine
- 17:23response, then in fact, you
- 17:24actually do see that signature
- 17:26and coming out as predictive.
- 17:27So in fact, it's the
- 17:28same biology in both those
- 17:29cases that drives a successful
- 17:31response, but it's very important
- 17:33sort of when during the
- 17:34course of the response,
- 17:35that we, that we look.
- 17:37If you're interested in those
- 17:39kinds of of predictions,
- 17:40I'll make one other plug
- 17:41is that we are part
- 17:42of a center,
- 17:44center for modeling immunity, along
- 17:46with Bjorn Peters at the
- 17:47La Jolla Institute,
- 17:49and we run an annual
- 17:50challenge contest for predicting the,
- 17:52vaccine response. So the CMI
- 17:54PB challenge.
- 17:55And so the idea is
- 17:57that, we provide you data,
- 17:59multiomics,
- 18:00profiles,
- 18:01at baseline, so prior to
- 18:03getting a vaccine,
- 18:04and it's your job to
- 18:05try to come up with
- 18:06a model to predict the
- 18:07vaccine response. We provide a
- 18:08couple of cohorts of data
- 18:09where we have the,
- 18:11profiles and also the endpoints,
- 18:12and there's some data that's
- 18:14held out, and there are
- 18:15there are prizes for the
- 18:16teams that, that do the
- 18:18best. So if you're interested
- 18:19in that kind of work,
- 18:20please do, check it out.
- 18:21The submission deadline is, for
- 18:23on November twenty second. You've
- 18:24got just about a month
- 18:25to develop, to develop your
- 18:27models.
- 18:29Okay. And so now I
- 18:30wanna touch, for a minute
- 18:31on multiomics.
- 18:33So previously, I was just
- 18:35talking, for for HiPSI. We
- 18:36looked a lot of transcriptional
- 18:37signatures, and those are, those
- 18:39are very powerful. But it
- 18:40was mentioned a couple times
- 18:41already. Increasingly,
- 18:43a lot of our experimental
- 18:44data encompasses multiple different types
- 18:46of measurements, transcriptional profiles, met
- 18:49metabolomic profiles, proteomic profiles,
- 18:52all together. And one one
- 18:53of the challenges, how do
- 18:55we integrate those data together
- 18:57in order to come up
- 18:58with, predictive
- 18:59responses and understand the the
- 19:01immunological
- 19:02mechanisms that are driving those
- 19:03differential,
- 19:04differential outcomes.
- 19:06And here I wanna highlight
- 19:07a a method that, we've
- 19:08been developed in collaboration, with
- 19:10Liang who spoke spoke earlier,
- 19:12and a graduate student, Jeremy
- 19:14Geege, another CBB graduate student.
- 19:16And the idea here is
- 19:17a lot of times where
- 19:18we have these multi omics
- 19:19data, the common way to
- 19:20analyze those data is to
- 19:22take the data and do
- 19:23some dimensionality reduction. Right? So
- 19:25you have those, you know,
- 19:27potentially hundreds of thousands of
- 19:28measurements of genes and proteins
- 19:30and and and metabolites
- 19:31to reduce the dimensionality of
- 19:33those into some smaller, some
- 19:35smaller space and then do
- 19:36machine learning. And one of
- 19:37the things we found is
- 19:39that doing doing it that
- 19:40way actually could be suboptimal
- 19:41because you would actually you
- 19:42could actually miss variation
- 19:44that was important for the
- 19:45response,
- 19:46of interest. And so we
- 19:47developed a method called, SPEAR,
- 19:49which is a sparse supervised
- 19:50Bayesian factor model,
- 19:52where in generating the lower
- 19:53dimensional representation,
- 19:55we took into account both
- 19:56the variation in the multi
- 19:57omic data as well as
- 19:59the response of interest. And
- 20:00the the method learns to
- 20:02adaptively balance how much to
- 20:04try and explain the variability
- 20:05in the data and how
- 20:06much to try and explain
- 20:07the response of interest in
- 20:09coming up with those lower
- 20:10dimensional,
- 20:11lower dimensional factors.
- 20:13And finally, what I'll end
- 20:14with is the the importance
- 20:15of genetics.
- 20:16So so all of these,
- 20:18responses and all of these
- 20:19signatures are often modulated by
- 20:21the genetic background,
- 20:22of of the individual. And
- 20:24in particularly, it's an emerging
- 20:26area to understand to what
- 20:28extent the genetics of the
- 20:29b cell receptor,
- 20:31repertoire
- 20:32and the components of the
- 20:33b cell receptor
- 20:34influence the ability of people
- 20:36to respond to infection,
- 20:37and vaccination.
- 20:39And I'll show one example
- 20:40here,
- 20:41from, showing the influence of
- 20:43a b cell receptor variable
- 20:45gene, a gene called, IGH
- 20:46v one one sixty nine,
- 20:48on the flu vaccine response.
- 20:50So here you see,
- 20:52there's a position, in this,
- 20:54in this gene, which can
- 20:55either be an f or
- 20:56an l. And depending on
- 20:57the genotype of the individual,
- 21:00that individual will generate different
- 21:01titers of broadly neutralizing antibody
- 21:04in response to, in response
- 21:06to the flu vaccine. You
- 21:06see the micro
- 21:08titers, on the y on
- 21:09the y axis here.
- 21:11And so it's really an
- 21:12emerging area to try and
- 21:13understand how the genetics of
- 21:15the antibody repertoire
- 21:16influence the response to vaccination,
- 21:18and infection.
- 21:19And there's not been a
- 21:20lot of associations,
- 21:22that have been discovered so
- 21:23far. In part, I think
- 21:24this is because in all
- 21:25of the genome wide association
- 21:27studies that are done, the
- 21:28immunoglobulin locus is generally not
- 21:30included in any of those
- 21:32studies. So it's not there's
- 21:33very low representation
- 21:34on the, SNP arrays that
- 21:36are that are used.
- 21:37And in general, one can't
- 21:38get this from the short
- 21:39read sequencing
- 21:42data because the region is
- 21:43highly, highly repetitive.
- 21:46And so with the postdoc
- 21:47in the lab,
- 21:48Dylan, who's one of the,
- 21:50Gale BI data science fellows.
- 21:52He's actually developed a,
- 21:54an approach now where we
- 21:55can go in from short
- 21:57read DNA sequencing data and
- 21:59genotype the immunoglobulin
- 22:00locus. And the way he's
- 22:02able to do this is
- 22:03by leveraging,
- 22:04a pangeneome graph. So this
- 22:05is a graphical representation of
- 22:07a genome where instead of
- 22:08having a single linear reference
- 22:10as is commonly used today,
- 22:11we can actually represent and
- 22:13encode the diversity,
- 22:14of the population into a
- 22:15single graphical representation of the
- 22:17genome.
- 22:18And by and by leveraging
- 22:20that structure, we can take
- 22:21those short read, DNA sequencing
- 22:23data,
- 22:24map it to this pan
- 22:25genome graph, and discover variability
- 22:27in the, in the immunoglobulin
- 22:28locus. And what we're gearing
- 22:30up to do now is
- 22:31to actually use this method
- 22:32on biobank data because we
- 22:33now have public data sources
- 22:36of massive numbers of individuals
- 22:37with, short read DNA sequencing
- 22:39data, and a lot of
- 22:40interesting clinical and other, and
- 22:42other phenotypes.
- 22:43And so we're we're planning
- 22:45to use this approach to
- 22:46mine those biobanks and find
- 22:47interesting,
- 22:48connections between the immunoglobulin locus,
- 22:51and and those outcomes.
- 22:53And so I think I
- 22:53will, stop there because I'm,
- 22:56out of time. And,
- 22:57thank you and take any
- 22:58questions.