TCR-Seq

TCR-Seq (T-cell receptor sequencing) is a method used to identify and track specific T cells and their clones. TCR-Seq uses the unique nature of a T-cell receptor (TCR) as a ready-made molecular barcode. The technology can be applied to both single-cell sequencing and high-throughput screens.

T-cell Receptor (TCR)
T cells are a part of the adaptive immune system and play a critical role in protecting the body from foreign pathogens. T-cell receptors (TCRs) are a group of membrane proteins found on the surface of T cells which can bind to foreign antigens. TCRs interact with major histocompatibility complexes (MHC) on cell surfaces to recognize antigens. They are heterodimers made up predominantly of α and β chains (or, more rarely, γ and δ chains), and each chain consists of a variable region and a constant region. Variable regions are produced through a process called VDJ recombination, which results in unique amino acid sequences for the α, β, γ, and δ chains. The result is that each TCR is unique and recognizes a specific antigen.

Complementarity Determining Regions (CDRs)
Complementarity determining regions (CDRs) are a part of the TCR and play an essential role in TCR-MHC interactions. CDR1 and CDR2 are encoded by V genes, while CDR3 is made from the region between V and J genes or between D and J genes (termed "VDJ genes" when referred to together). CDR3 is the most variable of the CDRs, and is in direct contact with the antigen. As such, CDR3 is used as the “barcode region” to identify unique T cell populations, as it is highly unlikely for two T cells to have the same CDR3 sequence unless they came from the same parental T cell.

Clonality
VDJ recombination produces such a vast number of unique TCRs that many receptors never encounter the antigen they are best suited for. When a foreign antigen is present in the body, the few T cells that recognize that antigen are positively selected so that the body has an adequate number of T cells to mount an effective immune response. The selected T cells rapidly divide and differentiate into effector T cells through a process called clonal expansion, which retains the TCR sequence (including the CDR3 sequence) that originally recognized the antigen.

TCR-Seq uses the unique nature of the TCR - in particular CDR3 - as a molecular barcode to track T cells through processes such as differentiation and proliferation.
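
The barcode idea can be illustrated with a minimal sketch: if every cell is reduced to its CDR3 sequence, cells that share a sequence are treated as one clone, and counting identical sequences gives the size of each clone. The function name and the CDR3 strings below are hypothetical and used only for illustration; real pipelines first extract CDR3 from sequencing reads by alignment.

```python
from collections import Counter

def clonotype_frequencies(cdr3_sequences):
    """Return (cdr3, count, fraction) tuples, most abundant clonotype first."""
    counts = Counter(cdr3_sequences)
    total = sum(counts.values())
    return [(cdr3, n, n / total) for cdr3, n in counts.most_common()]

# Hypothetical CDR3 amino acid sequences, one entry per sequenced T cell.
cells = [
    "CASSLGQGYEQYF",
    "CASSLGQGYEQYF",
    "CASSPDRGNTEAFF",
    "CASSLGQGYEQYF",
    "CASRTGESYEQYF",
]

table = clonotype_frequencies(cells)
for cdr3, n, fraction in table:
    print(f"{cdr3}\t{n}\t{fraction:.2f}")
```

In this toy input, the sequence seen three times would be read as an expanded clone, while the two singletons would be read as unexpanded cells.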

Bulk vs Single-Cell Sequencing
TCR sequencing can be performed on pooled cell populations (“bulk sequencing”) or on single cells (“single cell sequencing”). Bulk sequencing is useful to explore entire TCR repertoires - all the TCRs within an individual or a sample - and to generate comparisons between repertoires of different individuals. This method can sequence millions of cells in a single experiment. However, one major disadvantage is that bulk sequencing cannot determine which TCR chains pair together, only the frequency of each chain within the repertoire. The large number of TCRs sampled also means that lower-abundance TCRs may not be detected. Single cell sequencing can determine TCR chain pairs, making it more useful for identifying specific TCRs. Some major disadvantages of this technique are its high cost, its limited capacity of a few thousand cells, and the need for live cells, which may be more challenging to obtain.
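
The pairing difference can be sketched in code. In single cell data each chain is tagged with the barcode of the cell it came from, so α and β chains can be joined on that barcode; bulk data has no such tag. The record layout, the barcodes, and the sequences below are assumptions made for illustration ("TRA" and "TRB" denote the α and β chain loci). Keeping only cells with exactly one chain of each type is a simplification of this sketch, not a rule from the text.

```python
from collections import defaultdict

def pair_chains(records):
    """Return {cell_barcode: (alpha_cdr3, beta_cdr3)} for cells that have
    exactly one alpha (TRA) and one beta (TRB) chain in the records."""
    by_cell = defaultdict(lambda: {"TRA": [], "TRB": []})
    for barcode, chain, cdr3 in records:
        if chain in ("TRA", "TRB"):
            by_cell[barcode][chain].append(cdr3)
    return {
        barcode: (chains["TRA"][0], chains["TRB"][0])
        for barcode, chains in by_cell.items()
        if len(chains["TRA"]) == 1 and len(chains["TRB"]) == 1
    }

# Hypothetical single cell records: (cell barcode, chain, CDR3).
records = [
    ("cell_1", "TRA", "CAVRDSNYQLIW"),
    ("cell_1", "TRB", "CASSLGQGYEQYF"),
    ("cell_2", "TRB", "CASSPDRGNTEAFF"),  # alpha chain not recovered
    ("cell_3", "TRA", "CAASGGSYIPTF"),
    ("cell_3", "TRB", "CASSLGQGYEQYF"),
]

pairs = pair_chains(records)
for barcode, (alpha, beta) in pairs.items():
    print(barcode, alpha, beta)
```

With the barcode column removed, the same records would only support per-chain frequency counts, which is the situation in bulk sequencing.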

Target Sequences
Any TCR chain can be sequenced, although the α and β chains are more commonly chosen due to their abundance in the T cell population. In particular, the β chain is of interest due to its higher diversity and specificity compared to other chains. The presence of a D gene component in the β chain, which is not present in the α chain, allows more diverse combinations. In addition, β chains are unique to each T cell, which can be used to identify distinct T cell populations within a sample.

To perform TCR sequencing, polymerase chain reaction (PCR) amplification is performed on the CDR3 region as a measure of the unique T cells within a population. The CDR3 region is chosen over CDR1 and CDR2 as it is directly responsible for antigen interactions and is generally unique to TCRs from the same lineage, which allows identification of distinct populations.

Library Preparation
The goal of this step is to generate a library of transcripts to be sequenced. There are three major ways of generating a library for TCR sequencing.

Multiplex PCR
Multiplex PCR can be employed on either genomic DNA (gDNA) or RNA that has been converted to double-stranded complementary DNA (cDNA). Primer pools with primer pairs targeting J and V alleles are used to amplify the CDR3 region of the TCR transcript. The transcript goes through two or more rounds of PCR to amplify the region of interest, then adaptors are ligated onto either end of the resulting product. This method is among the most widely used for generating TCR-seq libraries, as the primer pool can capture a great deal of TCR diversity. However, as it is near-impossible to optimize PCR conditions for all the primers in the pool, multiplex PCR can result in amplification bias, where CDR3 regions whose primers bind poorly may not be amplified. This means the abundance of amplified segments may not correspond to their actual abundance in the sample.
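
A rough feel for the size of this bias comes from an idealized model of PCR in which a template with per-cycle efficiency e (between 0 and 1) grows by a factor of (1 + e) each cycle, so copies after n cycles equal the starting copies times (1 + e)^n. The sketch below applies that model to two clonotypes. The clonotype names, efficiencies, and cycle count are hypothetical values chosen for illustration, and real reactions plateau rather than grow exponentially throughout.

```python
def amplified_fractions(start_copies, efficiencies, cycles):
    """Fraction of each clonotype after PCR under an idealized model:
    copies_after = start_copies * (1 + efficiency) ** cycles."""
    amplified = {
        clonotype: copies * (1 + efficiencies[clonotype]) ** cycles
        for clonotype, copies in start_copies.items()
    }
    total = sum(amplified.values())
    return {clonotype: a / total for clonotype, a in amplified.items()}

# Two hypothetical clonotypes that start at equal abundance.
start = {"clonotype_A": 100, "clonotype_B": 100}
# Assumed per-cycle efficiencies: the primers for B bind slightly worse.
efficiency = {"clonotype_A": 0.95, "clonotype_B": 0.80}

before = amplified_fractions(start, efficiency, cycles=0)
after = amplified_fractions(start, efficiency, cycles=30)
print("before PCR:", before)
print("after 30 cycles:", after)
```

Under these assumptions a modest efficiency gap compounds over the cycles, so two clonotypes that began at equal abundance end up heavily skewed toward the one that amplifies better.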

Target Enrichment In-Solution
This method can use genomic DNA (gDNA) or RNA converted to cDNA. The starting material is first processed to generate DNA or cDNA fragments with indexed adaptors on the 5’ and 3’ ends. These fragments are then incubated with RNA baits designed to bind to regions of interest, generally the CDR3 region. The baits, which are normally bound to magnetic beads, can be isolated using a magnet. This allows the isolation of fragments containing the CDR3 region, which can then be amplified using PCR. Target enrichment using RNA baits requires fewer PCR amplification steps, which may decrease amplification bias. However, the efficiency of the magnetic capture may affect the diversity of the amplified fragments.

5’-RACE
Rapid Amplification of cDNA Ends (RACE) is a method that uses RNA transcripts to generate the library. Although RACE can be applied to either the 3' or the 5' end, the 5' end is more commonly used for TCR-seq. This method revolves around the addition of a common 5' adaptor sequence to the transcript, which can be done in a few different ways. One method is to add the adaptor following reverse transcription. During the generation of the reverse DNA strand from the RNA template, a forward primer adds a sequence complementary to the 5' adaptor, leading to template switching. This allows a 5' adaptor to be incorporated into the cDNA when the complementary sequence is generated. Primers can be designed to amplify the entire region from the adaptor to the constant region, then adaptor ligation can be performed in a second PCR reaction. As all the different transcripts now share an identical adaptor, they can be amplified using a single primer pair. As such, this method decreases amplification bias and improves the ability to detect uncommon TCR populations with greater certainty. However, as TCR transcription levels differ between cells, this method cannot provide an accurate measurement of the number of different T cell types in the sample based on the level of RNA transcripts alone.

Sequencing
Following generation of the library, the products can be sequenced, generally via Next Generation Sequencing (NGS). Machines that are capable of longer reads and that maintain read quality at the 3’ end are important, as the CDR3 region is at the 3’ end of an approximately 500 base pair transcript.

The error rate of NGS presents a challenge for the analysis of TCR repertoires. Small variations in a TCR can change its specificity towards antigens, and as such may be of interest to researchers. However, sequencing errors can generate a minor change that may be interpreted as a low-frequency, distinct TCR population, which is a problem when analyzing changes in TCR repertoires. Efforts have been made to establish thresholds to remove low-abundance reads from analysis, as well as to develop algorithms to correct these errors.
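
One simple way to picture such a correction is sketched below: a low-abundance sequence that differs by a single base from a much more abundant sequence of the same length is assumed to be a sequencing error and is merged into it. This is an illustrative heuristic, not the algorithm of any particular tool. The function names, the `max_ratio` cutoff of 0.05, and the nucleotide sequences are all assumptions.

```python
from collections import Counter

def hamming(a, b):
    """Number of mismatched positions between two equal-length sequences."""
    return sum(x != y for x, y in zip(a, b))

def collapse_errors(counts, max_ratio=0.05):
    """Merge a sequence into an abundant neighbour one mismatch away when
    its count is at most max_ratio times the neighbour's count."""
    corrected = Counter()
    # Visit sequences from most to least abundant so likely parents come first.
    for seq, n in sorted(counts.items(), key=lambda item: -item[1]):
        parent = next(
            (
                p
                for p in corrected
                if len(p) == len(seq)
                and hamming(p, seq) == 1
                and n <= max_ratio * corrected[p]
            ),
            None,
        )
        corrected[parent if parent is not None else seq] += n
    return corrected

# Hypothetical read counts per CDR3 nucleotide sequence.
raw_counts = {
    "TGTGCCAGCAGCTTAGGA": 1000,
    "TGTGCCAGCAGCTTAGGC": 3,   # one mismatch from the sequence above
    "TGTGCCAGCAGTCCAGAC": 40,  # several mismatches: kept as its own clonotype
}

cleaned = collapse_errors(raw_counts)
print(dict(cleaned))
```

The trade-off noted in the text applies directly: a cutoff that is too permissive merges genuine rare clonotypes into their neighbours, while one that is too strict leaves error-derived sequences in the repertoire.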

Applications
Generally, the data collected from TCR-seq is used to compare TCR repertoires, either within the same patient at different timepoints or between different patients. Recent studies examined the characteristics of a healthy repertoire and found a high degree of variation in TCR β chain levels and types, though a subset is shared across different individuals. However, this diversity has yet to be shown to correlate strongly with any conditions of interest, such as rates of infection or chance of cancer relapse, suggesting that further research is necessary.
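
As a sketch of what such a comparison might compute, the example below uses Shannon entropy to summarize diversity within a repertoire, a clonality score (1 minus normalized entropy) to indicate how dominated it is by a few clones, and Jaccard overlap to measure how many clonotypes two repertoires share. The choice of these metrics is an assumption of this example rather than something specified in the text, and the clonotype labels and counts are hypothetical.

```python
import math

def shannon_entropy(counts):
    """Shannon entropy (natural log) of clonotype frequencies."""
    total = sum(counts.values())
    return -sum((n / total) * math.log(n / total) for n in counts.values() if n > 0)

def clonality(counts):
    """1 - normalized entropy: near 0 for an even repertoire, approaching 1
    when a few clonotypes dominate."""
    distinct = sum(1 for n in counts.values() if n > 0)
    if distinct < 2:
        return 1.0  # convention chosen for this sketch
    return 1.0 - shannon_entropy(counts) / math.log(distinct)

def jaccard_overlap(counts_a, counts_b):
    """Shared clonotypes divided by all clonotypes seen in either repertoire."""
    a, b = set(counts_a), set(counts_b)
    union = a | b
    return len(a & b) / len(union) if union else 0.0

# Hypothetical repertoires from one patient at two timepoints.
timepoint_1 = {"clone_A": 50, "clone_B": 50, "clone_C": 50, "clone_D": 50}
timepoint_2 = {"clone_A": 170, "clone_B": 10, "clone_C": 10, "clone_E": 10}

print("clonality t1:", round(clonality(timepoint_1), 3))
print("clonality t2:", round(clonality(timepoint_2), 3))
print("overlap:", jaccard_overlap(timepoint_1, timepoint_2))
```

In this toy data the second timepoint is more clonal because one clone has expanded, and three of the five clonotypes seen overall are shared between the timepoints.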

Infectious Diseases
Clonal expansion of T cells allows the immune system to deal with a variety of infectious diseases with high specificity. Thus, understanding the changes that occur in the T cell repertoire following infection can aid early diagnosis, disease monitoring, and therapeutic development.

Acquired Immunodeficiency Syndrome (AIDS) is a devastating disease caused by Human Immunodeficiency Virus (HIV) infection, which results in the death of CD4+ T cells and the dysfunction of CD8+ T cells. Recent studies have suggested that increased TCR diversity may decrease HIV diversity and limit disease progression. Sequencing of the TCR could also improve understanding of the progression of AIDS and help predict morbidity. Additionally, sequencing the TCR repertoire of individuals with natural defense against HIV infection could help the development of a vaccine to limit further spread of the disease.

Cancer
Cancer is the uncontrolled proliferation of malignant cells which can spread throughout the body. This is caused by mutations within the cancer cell, which often lead to expression of mutant proteins termed neoantigens. Identification of these neoantigens has great therapeutic benefit, as they can be exploited to target cancer cells without harming normal cells. As CD8+ T cells can recognize some neoantigens through their TCRs, sequencing of TCR repertoires can help identify potential cancer biomarkers. In addition to biomarker identification, sequencing of the TCR repertoire can also track changes in cancer progression, assess responses to immunotherapy, and evaluate the tumour microenvironment for conditions that may make it permissive to cancer growth.