AutEx: Regions of autozygosity

Description   Prerequisites   Dialog   Practical tips   Algorithm

Description

Identification of autozygous regions in an individual with consanguineous parents.

Prerequisites

The AutEx algorithm can be applied to any variant file loadable in FILTUS, as long as the genotype format is correctly given in the input settings. However, the following points should be kept in mind to maximize the power of AutEx:

Dialog

Autex dialog

To open the AutEx dialog, first select a single sample in the Loaded samples window (at the top left of the main view of FILTUS) and then choose AutEx: Regions of autozygosity in the Analysis menu. Any filters (e.g. PASS) should be applied before opening the dialog.

Model parameters
Use the parental relation parameter to indicate the degree of inbreeding of the sample under analysis. The algorithm uses this as prior information on the expected number and length of autozygous regions.

In our experience, the results are not sensitive to this choice, in particular when the number of variants is large. This is not surprising: Any true autozygous segment covered by the sequencing is likely to contain many variants. The collective evidence of these variants quickly drowns the impact of the prior.

Allele frequencies
Indicate a column containing frequencies for the ALT alleles. The Missing entry value is substituted whenever the column does not contain a number. If your variant file does not have frequency data, the Missing entry value will be used for all variants.

Output
These parameters impose filters on the reported regions. If a Posterior threshold is given, only regions where the posterior probability exceeds this are included. One can also put lower limits on the region length and/or the number of variants in each region.

Summary
A summary of the findings is printed here. The identified autozygous regions are shown in the main FILTUS window; to inspect them you must close the AutEx dialog.

Plot
This functionality plots a single chromosome, showing all the variants, the posterior probability curve, and the identified autozygous regions.

Practical tips

Tip 1: Before running AutEx it is recommended to apply filters removing variants of low quality (base/genotype/mapping).

Tip 2: The algorithm only works on the autosomes. You don't have to filter out X and Y yourself, this is taken care of internally.

Tip 3: To save the regions to a file, first close the AutEx dialog and then select Save main window content in the File menu.

Tip 4: After saving the regions to a file (previous tip) this file can be directly used as a Region filter. This two-step approach is the common way to perform homozygosity/autozygosity mapping in FILTUS.

Algorithm

The AutEx algorithm implemented in FILTUS is designed for detecting autozygous regions directly from HTS variant files. It is based on the hidden Markov model introduced by Leutenegger et al. in Estimation of the inbreeding coefficient through use of genomic data which we refer to for mathematical details. In brief, the IBD process along an inbred chromosome is approximated as a Markov model with two states, IBD and not-IBD, with transition probabilities depending on the genetic map distances and also the relationship between the parents. The observed variables are the variant genotype calls, whose emission probabilities depend on allele frequencies and genotyping error rates. A standard forward-backward algorithm is applied to compute the posterior IBD probability at each variant locus. Regions where these probabilities exceed a given threshold are reported as autozygous. The implementation uses the Decode recombination map (Kong et al., 2010) to compute genetic distances.