Calculate TMB Score

Calculate TMB Score takes a variant track and a matching target regions track, and calculates a TMB score, i.e. the number of somatic mutations per megabase (Mb).

The Calculate TMB Score tool considers only SNVs - and discards variants of any other type. First, it filters variants, keeping only variants that lie within exons, within target regions, and outside the masking regions. It then successively applies various quality, germline, and non-synonymous filters before calculating the TMB score as the number of somatic variants divided by the length of the assessed regions in Mb.

Calculate TMB Score is available from the Tools menu at:

        Tools | Biomedical Genomics Analysis (Image biomedical_folder_closed_16_n_p) | Oncology Score Estimation (Image oncology_tools_folder_closed_16_h_p) | Calculate TMB Score (Image calculate_tmb_16_h_p)

The tool takes a variant track as input.

In the next dialog, the following tracks are specified (figure 8.1):

Only variants inside target regions and exons, and not within regions annotated on the masking track, are considered when calculating the TMB score.

Image calculate_tmb_wizard_1
Figure 8.1: Specifying tracks and parameters for calculating a TMB score.

In addition, it is possible to enable the calculation of a TMB status based on a low and a high threshold. The TMB status will appear as an additional item on the TMB report. The default values of 10 and 15, respectively, have been chosen based on internal benchmark analyses of lung cancer cell lines and different tissue cancer samples. Given the lack of standardization of methods and the heterogeneity of tumor mutation burden across many tumor types, it is difficult to establish cutoff values. Thresholds should be set according to the samples analyzed.

In the next dialog, quality filters, germline filters, and a non-synonymous filter can be configured (figure 8.2):

Image calculate_tmb_wizard_2
Figure 8.2: Specifying filters for calculating a TMB score.

Note that the default TMB filtering parameters are set conservatively. This is because for panels of 1Mb size, a single false positive variant may increase the TMB score substantially.

Generate target regions with sufficient coverage

Since the TMB score is calculated as the number of mutations per Mb, the minimum coverage used to filter somatic variants must also be used for calculating the length of assessed regions. By default, the minimum coverage threshold is set to 100x.

We recommend running Calculate TMB Score from a workflow including the tools Create Mapping Graph and Identify Graph Threshold Areas to generate target regions with sufficient coverage (figure 8.3). The same coverage threshold should then be used for Identify Graph Threshold Area and for Calculate TMB Score.

Image tmb_small_workflow
Figure 8.3: Workflow for identifying target regions with sufficient coverage and calculating a TMB score.

Output from Calculate TMB Score

The Calculate TMB Score tool generates two outputs:

The TMB report includes the TMB status when enabled in the Specify settings step (figure 8.1). By default, the TMB status is considered low if the TMB score is lower than 10; intermediate if the TMB score is between 10 and 15; and high if the TMB score is larger than 15.

Image calculate_tmb_report
Figure 8.4: A TMB report where the option to detect TMB status was enabled with default threshold values.

The Length of assessed regions (bp) is the length of the provided target regions overlapping exons and not included in the masking regions. The TMB status is assessed with a confidence level based on the length of the assessed regions. This is illustrated by the color of the TMB status table cell in the report. If the length of the assessed regions is below 900,000bp the cell will be colored in red, if it is between 900,000bp and 1,000,000bp it will be colored in yellow and if it is above 1,000,000bp it will not be colored. Note that if low coverage regions are not excluded from the target regions before TMB score calculation (as shown in figure 8.3), the TMB status confidence level may wrongly be displayed as high.

Additionally, the report lists the number of various types of variants based on germline, non-coding, and non-synonymous filter.

The Quality filters statistics section recapitulates how many variants were removed by the various filters applied by the tool, along with the frequency distributions of input and somatic variants.