Immune Signatures of Vaccination and Infection Responses

Name: Immune Signatures of Vaccination and Infection Responses
Uploaded: 2024-10-28T14:13:29.3333333Z
Duration: 22 min 59 s

October 28, 2024

Information

ID: 12268
To Cite: DCA Citation Guide

Download Transcript

00:00So, kicking us off in
00:02session two, it's,
00:03Steve, Kleinstein,
00:05to who's the Anthony Brady
00:06professor of pathology here at
00:08the Yale School of Medicine.
00:10He also co directs the
00:11grad program in computational biology
00:11and
00:13biomedical informatics,
00:15with secondary appointments in immunobiology
00:18and the new department of
00:19biomedical informatics and data science.
00:22And so Steve, came to
00:23immunology from computer science, and
00:25he's been a leader working
00:27on
00:29various aspects of the immune
00:30system for the last, two
00:31decades. So welcome, Steve.
00:36Thank you.
00:39It's, re really nice to
00:41see all all the energy
00:42and all the all the,
00:43collaborations that are starting up,
00:45here at Yale in this
00:46area of, computational systems, immunology,
00:49AI and engineering.
00:50So I am going to,
00:52hit on a lot of
00:53the themes that so a
00:54a bunch of the previous
00:55speakers already, already touched upon.
00:58And broadly speaking, we are,
00:59we are interested in how
01:01to read an individual's immunological
01:03state. So there's lots of
01:04new and emerging experimental methods
01:06that allow one to profile
01:07somebody's immune state,
01:09at at very high throughput
01:10ranging from things like transcriptomics,
01:13single cell transcriptomics, metabolomics,
01:16proteomics,
01:17and also profiling of the
01:18adaptive immune receptor repertoire, b
01:20and t cell receptor repertoires.
01:23My lab both, develops novel
01:25computational methods to, analyze,
01:28and, deal with a lot
01:29of these new, and emerging
01:30data types,
01:31as well as working with
01:32clinical and experimental groups to
01:34apply those,
01:35those techno those techniques,
01:37to real to real data
01:38to understand immunological
01:40state. And the kinds of
01:41questions we're broadly interested in
01:42are some of the kind
01:43some of the questions that
01:44John already already touched on.
01:46So how do we you
01:47know, can we take these
01:48immunological measurements,
01:49and say something about the
01:51exposure history,
01:52of the individual? Right? The
01:53immune system encodes to some
01:55degree our history of prior
01:56exposure. So can we say
01:57something about whether an individual
01:58has been exposed to, say,
01:59SARS COV two? Does the
02:01individual carry protective memory from
02:02this, from this infection?
02:05Talking about current immunological state,
02:06can we profile somebody's immune
02:08system, and make an inference
02:10about whether they have are
02:11are currently undergoing
02:13an acute infection? With what
02:14virus are they currently infected
02:15with? Can we say something
02:17about the future? Will the
02:18outcome of that, that infection
02:19be sort of a mild
02:20infection that resolves quickly, or
02:22will it be more more
02:23severe?
02:24And we can we also
02:25wanna be able to ask
02:26questions about, the the the
02:28outcome of different,
02:29interventions or things like vaccinations.
02:31We give someone a vaccine,
02:33how does their immunological state
02:34can we predict what the
02:35outcome of that vaccine will
02:37be? Will they generate good
02:38protective, immunological
02:39memory? So I'm gonna talk
02:41at, somewhat high level about
02:43a few of the different
02:44projects going on in my
02:45lab to touch on on
02:46these different aspects.
02:47So first, let's talk about
02:48identifying immunological exposures from somebody's
02:51host, host response profile.
02:53So the idea here is
02:55that we're going from a,
02:56a population of people. We
02:57take some kind of blood
02:58or tissue sample, and we
03:00profile that sample. So here,
03:01for example, we're looking at
03:03transcriptional profiles that we take,
03:05so we measure different gene
03:07expression levels, genes on the
03:08x axis, samples from those
03:09individuals on the y axis.
03:11So we get these different
03:11gene expression profiles, and we
03:13wanna make some inferences. So,
03:14for example, can we look
03:15at those gene expression profiles
03:17and discern a pattern that
03:18might tell us, hey. This
03:19group of individuals
03:21is currently undergoing an acute
03:22response to West Nile virus
03:23infection.
03:24So there are a lot
03:25of publications, a lot of
03:26studies that try to develop
03:28these types of signatures. Right?
03:29You recruiting a cohort of
03:30people with an infection,
03:32some kind of control populations,
03:34profile them, do some machine
03:35learning, some differential expression analysis,
03:37come up with some set
03:38of genes or metabolites or
03:39proteins that are characteristic of
03:41that, of that infection.
03:43So we set out a
03:44few years ago to evaluate
03:45some of these signatures that
03:46have been proposed in literature.
03:48CBB graduate student, Jan Chawla,
03:51wanted to ask a couple
03:52of questions.
03:53Particularly, he wanted to look
03:54at the the signatures that
03:56were in the literature and
03:57assessed how, robust they were.
03:59So to what extent you
04:00could take a signature from
04:01one study and, apply that
04:03signature to an independent,
04:04cohort and have it be
04:05predictive and also cross reactivity.
04:07So to what extent could
04:08you take a signature of,
04:09say, for some particular virus
04:11infection,
04:12like a like an influenza
04:13infection?
04:14And to what extent was
04:15that signature specific to that
04:17infection or would it what
04:18we call cross react? Would
04:20that signature also come up,
04:21in a lot of other
04:22viral infections or bacteria or
04:24or other types of infections
04:25like a bacterial?
04:26So we put together a
04:27cohort of about, at the
04:29time of the publication, it
04:29was a hundred and fifty
04:30datasets and a a a
04:32bit over seventeen thousand,
04:34transcriptional profiles
04:35encompassing a whole variety of
04:37viral, viral infections, bacterial infections,
04:40as well as some noninfections,
04:42conditions that we thought might,
04:43might be similar, might look
04:45similar transcriptionally to a,
04:47to an infection response.
04:48And he evaluated those signatures,
04:50for their robustness and cross
04:52reactivity
04:53and brought you know, high
04:54level what we found from,
04:55from the study is one,
04:56a lot of the signatures
04:57that were,
04:59proposed in literature, in fact,
05:00were quite robust. Generally speaking,
05:02if you developed a signature
05:04that was supposed to be
05:04predictive of some virus response,
05:07it was reproducible in in,
05:09other studies of the of
05:10that virus response. But there
05:11was also a large amount
05:12of cross reactivity, and maybe
05:14this is something,
05:15that's not too surprising.
05:17A lot of these a
05:18lot of these studies, you
05:19will get, you know, for
05:20example, like an interferon signature
05:22that will come up in
05:23almost every virus,
05:24virus response that you study.
05:26And you might expect if
05:27you don't account for that,
05:28that signature is also gonna
05:29be predictive of multiple other
05:30viruses.
05:31We also found those signatures
05:33cross react with a lot
05:34of noninfectious conditions. So here
05:35you see, for example, we
05:36we evaluated a bunch of
05:37different, signatures for virus infections
05:39along the x axis and
05:41evaluated their ability to predict,
05:43aging. So just comparing young
05:44versus older older individuals. And
05:46you can see a lot
05:47of these signatures,
05:48as measured by the area
05:50under the curve for predicting
05:51old versus young are highly
05:52predictive of age, and only
05:54a subset of signatures is
05:55not. So for so we
05:56are getting a lot of
05:57cross reactivity, and you have
05:58to account for this if
05:59you wanna use these signatures,
06:01for, predictive responses.
06:03And in general, we are
06:04interest often interested in what
06:05makes a response unique. Right?
06:07In general, we don't necessarily
06:08wanna pick out, oh, we're
06:09getting an interferon response,
06:11in in in coming up
06:12in response to a viral
06:13infection. We wanna know what's
06:14specific about a SARS COV
06:16two infection that makes it
06:17different from a typical antiviral
06:19response, or what's different about
06:20this,
06:21vaccine response, this influenza vaccine
06:23response,
06:24when it doesn't generate protective
06:25memory that makes it different
06:27from other types of, responses?
06:28So we've also worked on
06:29methods in collaboration with, Elena
06:31Daslavsky's group at Mount Sinai
06:33to as methods to generate
06:35robust signatures that are not
06:36cross reactive. And this takes
06:38advantage of the massive public
06:40data that's available,
06:41profiling different types of responses.
06:43So if we have our
06:44response, our our our dataset
06:46of interest, so for example,
06:47disease versus a control, we
06:49can also integrate all sorts
06:51of control samples from other
06:52diseases that are in the
06:53public domain to come up
06:55with our signature that's specific.
06:56So for example, if we're
06:57profiling the influenza,
06:59an influenza infection response, we
07:00might pull from the literature
07:02lots of other
07:12specificities, so gene expression,
07:14protein expression signatures are are
07:16nice, but the immune system,
07:17we can take advantage already
07:19has a a sort of
07:20component built in that's highly
07:22specific,
07:22for the for the response
07:24being generated, and that is
07:25the the the adaptive immune
07:26response, the b cells and
07:28the t cells, that compose
07:29the adaptive arms of the
07:30immune response. I'm gonna focus
07:32on b cells. My lab
07:33focuses works a lot on
07:34b cell receptor, responses.
07:36So b cells have antibody
07:38receptors on their surface that
07:39give them specificity,
07:41to recognize different pathogens.
07:43The naive repertoire coming out
07:44of the bone marrow, so
07:45our our immune system constantly
07:47produces naive b cells with,
07:49pretty much a unique receptor
07:50on every cell that's generated
07:52from the bone marrow, and
07:53that gives those cells their,
07:54a unique ability to recognize
07:56different pathogens. So some cells
07:58coming out of the bone
07:59marrow might, have a have
08:00a,
08:01innate specificity for the influenza
08:03virus. Others might have an
08:04innate specificity for,
08:06for SARS CoV two. Once
08:08you get once you encounter
08:10a a pathogen, you're confronted
08:11with pathogenic challenge, whether it
08:13be a natural infection, or
08:14a vaccine.
08:15Some of those naive b
08:16cells are stimulated, activated into
08:18the response, and they undergo
08:20a process of, rapid tonal
08:22expansion and somatic hypermutation.
08:25So these cells introduce point
08:26mutations into the DNA that
08:28codes for their receptor. This
08:29is a really scary process
08:30when I first learned about
08:31it,
08:32but it's actually quite,
08:34quite good because it allows
08:35our bodies to learn about
08:36the pathogenic environment.
08:38And through a process of
08:39mutation and selection, we can
08:41generate higher and higher affinity
08:42receptors for the pathogen that
08:44we're confronted with. So we
08:45can learn about the pathogenic
08:46environment through our, throughout our
08:48lifetime.
08:49And this occurs in specialized
08:51structures called, germinal centers in
08:53our secondary lymphoid organs.
08:55And here you see actually
08:56a a germinal center. The
08:58b cells are ingrained from
08:59a live anesthetized mouse, undergoing
09:01an immune response. This is
09:02work we did with Anne
09:03Haberman,
09:04with two photon microscopy over,
09:05over a decade ago. So
09:07this process results in affinity
09:09maturation of the b cell
09:11repertoire, and you can imagine
09:12that we can learn a
09:13lot about the history of
09:14immunological
09:15exposures by studying these, the,
09:17the set of b cell
09:18receptors carried by an individual.
09:20Since two thousand nine, we've
09:21had the ability to sequence
09:23the b cell receptors at
09:24a high throughput,
09:26both with,
09:27both with bulk sequencing technologies
09:29where we can take a
09:30tissue sample,
09:31sequence the b cell receptor
09:32using PCR amplification and high
09:34throughput sequencing. That gives us
09:35anywhere from hundreds of thousands
09:37to millions of b cell
09:39receptors in a sample,
09:40and more recently through single
09:42cell technologies where we can
09:43get individual from a single
09:45cell, the b cell receptor
09:46along with the pair transcriptome
09:48for tens of thousands of
09:49cells, at a time.
09:51And, of course, the challenge
09:52is now once we're once
09:53we have these receptors, how
09:55do we
09:56how do we interpret them?
09:57So the typical data we
09:58get kinda looks like this.
10:00Right? We get the b
10:01cell receptor sequence. Each row
10:02is a b cell receptor
10:03sequence,
10:05associated with the cell. And
10:06now we wanna we wanna
10:07know things like, well, which
10:08one of these b cells
10:09which one of these b
10:09cell receptors,
10:11is is specific or binds
10:13to the influenza hemagglutinin,
10:14molecule, or which one of
10:15these are specific for the
10:16SARS CoV two spike protein.
10:18And I wanna thank Deepa
10:20for for introducing these,
10:22representation learning approaches because that's
10:24one of the things we
10:25do with b cell receptors
10:26is we take advantage of
10:28these, language based language based
10:30models
10:30to, develop,
10:32embedding approaches where we can
10:34learn we can take those
10:35b cell receptors,
10:36map them into a lower
10:37dimensional space where we can
10:39then,
10:40develop classification models and do
10:41analysis and learn, learn all
10:43sorts of interesting things about
10:44the b cell receptor.
10:46So one of the things
10:47we wanted to learn about
10:48and one of the, sort
10:49of important key challenges for
10:50the community is learning specificity.
10:52So we would like to
10:53be able to take a
10:54b cell receptor sequence and
10:55predict that this b cell
10:57receptor is going to bind,
10:58for example, to SARS CoV
10:59two spike protein. A graduate
11:01student, Mimi Wang, from the
11:03CBB program,
11:04did a did a study
11:05a couple of years ago,
11:07looking actually, just just was
11:09published this, this past year
11:10early early this year, actually.
11:12Looking at multiple embedding methods.
11:14Some of these are general
11:15protein embedding methods that were
11:17generated not for antibody receptors,
11:18but to embed general protein
11:19sequences,
11:21and others, such as this
11:22AnteBERTy were developed specifically to
11:24embed antibody
11:25receptors.
11:26And she evaluated the ability
11:28of these, of these different
11:29embedding methods to predict SARS
11:31CoV two spike binding.
11:33There's a background control where
11:34we scrambled up the sequences
11:35shown in gray here, that
11:37has the expected
11:39f ones. And you can
11:40see on the real data
11:42shown in the colored, in
11:43the colored bar too, we
11:44actually get a pretty good
11:45ability,
11:46a very good ability to
11:47predict SARS CoV two spike
11:48binding,
11:49from these, from the, from
11:50these embedded b cell receptors.
11:52And in fact, some of
11:53the, you know, the the
11:54generic method, the general methods
11:56actually work pretty well. We
11:57do get a little bit
11:58of a performance boost, from
11:59the models that were developed
12:01specifically on antibody receptors.
12:03We have a newer study
12:04that's on bioRxiv now where
12:05we've shown that if you
12:06take these, general methods and
12:08you actually do a little
12:08bit of fine tuning on
12:09them specific to the problem,
12:11we can actually boost, that
12:12power to predict specificity,
12:14even further.
12:16So that's the power to
12:17predict specificity on a single
12:18sequence. What about whole repertoires?
12:20So when we take a
12:21blood sample or a tissue
12:22sample from a person, we
12:23don't get a single sequence.
12:25We actually get tens of
12:26thousands to to potentially millions
12:28of sequences.
12:29It looks something like this.
12:30This is actually a blood
12:31repertoire from somebody undergoing, an
12:32actual vaccine response seven days
12:34post the response,
12:36where each of the points
12:36in these trees a leaf
12:37and each of those,
12:39and each of those,
12:40sort of circle diagrams represent
12:42a clonal expansion starting from
12:44a starting from a naive
12:45b cell. So you can
12:46see there is a bunch
12:47of clonal expansions in the
12:48blood.
12:49So that's pretty indicative of
12:51of, an actual ongoing infection.
12:53Generally speaking, in a in
12:54a healthy human, at rest,
12:56you don't have large clonal
12:57expansions,
12:58in your blood. But now
12:59can we tell what infection
13:01or what vaccination this person
13:02is actually responding to? So
13:04one simple thing we can
13:05do is take all of
13:06the b cell receptor sequences
13:08in these clones, calculate the
13:09probability of it being a
13:10SARS CoV-two specific clone, and
13:12just average over the repertoire.
13:14And you can see that
13:15actually gives us a pretty
13:16good power to predict, individuals
13:17who are twenty eight days
13:18post SARS COV two vaccine,
13:20shown here on the right,
13:21versus a control population at
13:23rest pre vac pre vaccination.
13:26We do a lot of
13:27work like this with b
13:28cell receptors. As I mentioned,
13:29we we do all sorts
13:30of methods for preprocessing, population
13:32structure inference, and repertoire analysis.
13:34And BILab makes available a
13:36a computational framework called called
13:38incantation,
13:39to deal with all sorts
13:40of analysis of these kinds
13:41of data,
13:42which is pretty widely used
13:44by the by the community.
13:45And I'll just take a
13:46moment to plug. We are
13:47having our twenty twenty five
13:48users group meeting on January
13:50thirtieth. If you are interested
13:51in joining us for that,
13:52please do please do sign
13:53up. It's free. If you've
13:55ever used our tools and
13:56are interested in submitting an
13:57abstract, the abstract deadline is
13:59just about a week from
13:59now.
14:01So now let me move
14:02to an, beyond just sort
14:04of looking at the current
14:05state of an individual,
14:07into the problem of can
14:08we predict how an individual
14:09respond to a to a
14:10certain immunological exposure? John already
14:13introduced this idea of of
14:14variable you know, human variability
14:16to the same immunological exposure.
14:18So for example, in a
14:19vaccine response, if we take
14:21a bunch of young adults,
14:23shown along the x axis
14:24here, we give them the
14:25seasonal flu vaccine and we
14:26measure their antibody response on
14:28the y axis. You can
14:29see there's a whole,
14:31wide variety of different, responses.
14:33Some people barely generate any,
14:36or actually generate no boost
14:37in their antibody titers, while
14:39other individuals generate, you know,
14:41orders of magnitude increase in
14:42their, in their titers. So
14:43we wanna know, can we
14:44predict the response to this
14:46vax to to influenza vaccination?
14:48And more broadly, can we
14:50predict the response to other
14:51vaccinations? Is there a universal
14:52signature? Is this are the
14:53same biological mechanisms, the same
14:56signatures that are associated with
14:57good flu vaccine responses? Do
14:59those also hold for things
15:00like yellow fever, for smallpox,
15:02for SARS CoV two?
15:04And to do that, we
15:05leveraged the power of a
15:06consortium that we're involved in.
15:07John already mentioned, HIPSI, the
15:09human human immunologic human immunology
15:11project consortium.
15:13This is some a consortium
15:14that's been funded since two
15:16thousand and ten. We've had
15:17one of the centers here
15:18at Yale.
15:18It's currently co directed by
15:20Ruth Montgomery and and David
15:21Hafler, and we've been part
15:22of it since the beginning
15:24in twenty ten.
15:26HiPSI does a lot of
15:27high throughput profiling of of,
15:29humans in diverse immunological states.
15:32All of the data is
15:33deposited into a repository called
15:35import.
15:36There's over a hundred different
15:37studies,
15:38over, almost eight thousand different
15:40individuals and lots and lots
15:41of measurements.
15:43As part of this consortium,
15:44we put together a data
15:45resource of all the studies
15:47that were done, profiling human
15:49vaccine responses.
15:50In the end, we profiles
15:52of thirteen different vaccines where
15:53we could relate transcriptional profiles,
15:56to antibody tighter, tighter responses.
15:58From the analysis of those
16:00data, we did two different
16:01things. One is we looked
16:02for baseline immune states.
16:04So could you take somebody's
16:06immunological state prior to vaccination
16:08and predict how they were
16:09gonna respond to the to
16:10the to the vaccine?
16:12So we could, in fact,
16:14generate identify those kinds of
16:15profiles as well as temporal
16:17profiles. So could you look
16:18a day or a week
16:19post vaccination
16:21and predict the longer term
16:22antibody response to vaccine? It
16:24turns out you can do
16:24that, you can do that
16:26as well.
16:27And just to show you
16:28one,
16:29one aspect
16:30of that and the just
16:31to emphasize the importance of
16:33kinetics and and and sort
16:34of timing of the response.
16:35So here you see a
16:37a bunch of those different
16:38vaccines that we profiled. Each
16:39one is shown by a
16:40different line, days post vaccination
16:42on the x axis, and
16:43the y axis shows the
16:45ability of a plasma cell
16:46signature to predict the antibody
16:48titers of the response at
16:49each at different days post
16:50vaccination.
16:52And you can see for
16:52the flu vaccine, if you
16:54look seven days post vaccination,
16:56we know that there's a
16:57burst of plasma blasts that
16:58are generally observed in the
16:59blood, post vaccination in the
17:01flu response. And in fact,
17:02those that plasma blast is
17:04just highly predictive of the
17:05ultimate antibody response, to flu
17:08vaccine.
17:08A lot of other vaccines
17:10share those characteristics,
17:11but notably not the yellow
17:12fever vaccine. So in yellow
17:14fever, if you look at
17:15seven days and you look
17:15for that signature, you don't
17:17see it. But if you
17:18look a little bit later,
17:19say say,
17:20ten days or or a
17:21couple weeks after the vaccine
17:23response, then in fact, you
17:24actually do see that signature
17:26and coming out as predictive.
17:27So in fact, it's the
17:28same biology in both those
17:29cases that drives a successful
17:31response, but it's very important
17:33sort of when during the
17:34course of the response,
17:35that we, that we look.
17:37If you're interested in those
17:39kinds of of predictions,
17:40I'll make one other plug
17:41is that we are part
17:42of a center,
17:44center for modeling immunity, along
17:46with Bjorn Peters at the
17:47La Jolla Institute,
17:49and we run an annual
17:50challenge contest for predicting the,
17:52vaccine response. So the CMI
17:54PB challenge.
17:55And so the idea is
17:57that, we provide you data,
17:59multiomics,
18:00profiles,
18:01at baseline, so prior to
18:03getting a vaccine,
18:04and it's your job to
18:05try to come up with
18:06a model to predict the
18:07vaccine response. We provide a
18:08couple of cohorts of data
18:09where we have the,
18:11profiles and also the endpoints,
18:12and there's some data that's
18:14held out, and there are
18:15there are prizes for the
18:16teams that, that do the
18:18best. So if you're interested
18:19in that kind of work,
18:20please do, check it out.
18:21The submission deadline is, for
18:23on November twenty second. You've
18:24got just about a month
18:25to develop, to develop your
18:27models.
18:29Okay. And so now I
18:30wanna touch, for a minute
18:31on multiomics.
18:33So previously, I was just
18:35talking, for for HiPSI. We
18:36looked a lot of transcriptional
18:37signatures, and those are, those
18:39are very powerful. But it
18:40was mentioned a couple times
18:41already. Increasingly,
18:43a lot of our experimental
18:44data encompasses multiple different types
18:46of measurements, transcriptional profiles, met
18:49metabolomic profiles, proteomic profiles,
18:52all together. And one one
18:53of the challenges, how do
18:55we integrate those data together
18:57in order to come up
18:58with, predictive
18:59responses and understand the the
19:01immunological
19:02mechanisms that are driving those
19:03differential,
19:04differential outcomes.
19:06And here I wanna highlight
19:07a a method that, we've
19:08been developed in collaboration, with
19:10Liang who spoke spoke earlier,
19:12and a graduate student, Jeremy
19:14Geege, another CBB graduate student.
19:16And the idea here is
19:17a lot of times where
19:18we have these multi omics
19:19data, the common way to
19:20analyze those data is to
19:22take the data and do
19:23some dimensionality reduction. Right? So
19:25you have those, you know,
19:27potentially hundreds of thousands of
19:28measurements of genes and proteins
19:30and and and metabolites
19:31to reduce the dimensionality of
19:33those into some smaller, some
19:35smaller space and then do
19:36machine learning. And one of
19:37the things we found is
19:39that doing doing it that
19:40way actually could be suboptimal
19:41because you would actually you
19:42could actually miss variation
19:44that was important for the
19:45response,
19:46of interest. And so we
19:47developed a method called, SPEAR,
19:49which is a sparse supervised
19:50Bayesian factor model,
19:52where in generating the lower
19:53dimensional representation,
19:55we took into account both
19:56the variation in the multi
19:57omic data as well as
19:59the response of interest. And
20:00the the method learns to
20:02adaptively balance how much to
20:04try and explain the variability
20:05in the data and how
20:06much to try and explain
20:07the response of interest in
20:09coming up with those lower
20:10dimensional,
20:11lower dimensional factors.
20:13And finally, what I'll end
20:14with is the the importance
20:15of genetics.
20:16So so all of these,
20:18responses and all of these
20:19signatures are often modulated by
20:21the genetic background,
20:22of of the individual. And
20:24in particularly, it's an emerging
20:26area to understand to what
20:28extent the genetics of the
20:29b cell receptor,
20:31repertoire
20:32and the components of the
20:33b cell receptor
20:34influence the ability of people
20:36to respond to infection,
20:37and vaccination.
20:39And I'll show one example
20:40here,
20:41from, showing the influence of
20:43a b cell receptor variable
20:45gene, a gene called, IGH
20:46v one one sixty nine,
20:48on the flu vaccine response.
20:50So here you see,
20:52there's a position, in this,
20:54in this gene, which can
20:55either be an f or
20:56an l. And depending on
20:57the genotype of the individual,
21:00that individual will generate different
21:01titers of broadly neutralizing antibody
21:04in response to, in response
21:06to the flu vaccine. You
21:06see the micro
21:08titers, on the y on
21:09the y axis here.
21:11And so it's really an
21:12emerging area to try and
21:13understand how the genetics of
21:15the antibody repertoire
21:16influence the response to vaccination,
21:18and infection.
21:19And there's not been a
21:20lot of associations,
21:22that have been discovered so
21:23far. In part, I think
21:24this is because in all
21:25of the genome wide association
21:27studies that are done, the
21:28immunoglobulin locus is generally not
21:30included in any of those
21:32studies. So it's not there's
21:33very low representation
21:34on the, SNP arrays that
21:36are that are used.
21:37And in general, one can't
21:38get this from the short
21:39read sequencing
21:42data because the region is
21:43highly, highly repetitive.
21:46And so with the postdoc
21:47in the lab,
21:48Dylan, who's one of the,
21:50Gale BI data science fellows.
21:52He's actually developed a,
21:54an approach now where we
21:55can go in from short
21:57read DNA sequencing data and
21:59genotype the immunoglobulin
22:00locus. And the way he's
22:02able to do this is
22:03by leveraging,
22:04a pangeneome graph. So this
22:05is a graphical representation of
22:07a genome where instead of
22:08having a single linear reference
22:10as is commonly used today,
22:11we can actually represent and
22:13encode the diversity,
22:14of the population into a
22:15single graphical representation of the
22:17genome.
22:18And by and by leveraging
22:20that structure, we can take
22:21those short read, DNA sequencing
22:23data,
22:24map it to this pan
22:25genome graph, and discover variability
22:27in the, in the immunoglobulin
22:28locus. And what we're gearing
22:30up to do now is
22:31to actually use this method
22:32on biobank data because we
22:33now have public data sources
22:36of massive numbers of individuals
22:37with, short read DNA sequencing
22:39data, and a lot of
22:40interesting clinical and other, and
22:42other phenotypes.
22:43And so we're we're planning
22:45to use this approach to
22:46mine those biobanks and find
22:47interesting,
22:48connections between the immunoglobulin locus,
22:51and and those outcomes.
22:53And so I think I
22:53will, stop there because I'm,
22:56out of time. And,
22:57thank you and take any
22:58questions.