Variants (VCF)

VCF, which stands for Variant Call Format, is a standardized text file format used for representing SNP, indel, and structural variation calls. The full specification of the format can be found at https://samtools.github.io/hts-specs.

Variant calls and genotypes

Each variant call in the VCF file is displayed as a bar at the variant's locus.

In addition to variant calls, VCF files can optionally include genotypes. If they are included in the file, genotypes are displayed below the variant calls. Each row represents the genotypes of a single sample, and the sample names are displayed in the name panel to the left of the data.
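To make the relationship between a variant call and its per-sample genotypes concrete, here is a minimal Python sketch that splits one VCF data line into sample genotypes. The header and record are made up for illustration, and the labels (HOM_REF, HET, HOM_VAR, NO_CALL) and function names are assumptions of this sketch, not names taken from the viewer.

```python
# Illustrative, made-up VCF header line and data line (tab-separated).
VCF_HEADER = "#CHROM\tPOS\tID\tREF\tALT\tQUAL\tFILTER\tINFO\tFORMAT\tNA001\tNA002\tNA003"
VCF_RECORD = "chr1\t10177\t.\tA\tC\t50\tPASS\tAF=0.5\tGT\t0/0\t0/1\t1|1"


def classify_genotype(gt: str) -> str:
    """Label a GT value; the label names are hypothetical."""
    # GT alleles are separated by "/" (unphased) or "|" (phased).
    alleles = gt.replace("|", "/").split("/")
    if any(a == "." for a in alleles):
        return "NO_CALL"
    if all(a == "0" for a in alleles):
        return "HOM_REF"
    if len(set(alleles)) == 1:
        return "HOM_VAR"
    return "HET"


def genotypes_by_sample(header: str, record: str) -> dict:
    """Map each sample name to a genotype label for one variant record."""
    # Columns 1-8 are fixed, column 9 is FORMAT, samples start at column 10.
    sample_names = header.split("\t")[9:]
    fields = record.split("\t")
    gt_index = fields[8].split(":").index("GT")
    result = {}
    for name, sample_field in zip(sample_names, fields[9:]):
        result[name] = classify_genotype(sample_field.split(":")[gt_index])
    return result


if __name__ == "__main__":
    for sample, label in genotypes_by_sample(VCF_HEADER, VCF_RECORD).items():
        print(sample, label)
```

Each key of the returned dictionary corresponds to one sample row in the track, and each value to the genotype drawn in that row at the variant's locus.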

Zooming in to basepair level gives a better view of individual variants. In the example below, we see the sites of several SNPs and below them some of the samples and their genotypes found in the VCF file. In this view, most of the samples are scrolled off the bottom of the data panel. The scrollbar is not visible in this screenshot.

Sorting and grouping samples

Right-clicking on a genotype column adds entries to the pop-up menu that let you sort the sample rows.

Selecting the same sort option again will reverse the sort order.
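The toggle behavior can be pictured with a small Python sketch. This is not the viewer's implementation: the class name `SampleSorter`, the row dictionaries, and the assumption that the first selection sorts in ascending order are all hypothetical.

```python
class SampleSorter:
    """Hypothetical sorter that reverses order when the same key is chosen twice."""

    def __init__(self):
        self.last_key = None
        self.descending = False

    def sort(self, rows, key):
        if key == self.last_key:
            # Same option selected again: flip the direction.
            self.descending = not self.descending
        else:
            # New option: start from ascending order (an assumption).
            self.last_key = key
            self.descending = False
        return sorted(rows, key=lambda row: row[key], reverse=self.descending)


if __name__ == "__main__":
    rows = [
        {"sample": "NA002", "depth": 30},
        {"sample": "NA001", "depth": 12},
        {"sample": "NA003", "depth": 21},
    ]
    sorter = SampleSorter()
    print([r["sample"] for r in sorter.sort(rows, "depth")])
    print([r["sample"] for r in sorter.sort(rows, "depth")])
```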

Loading a sample attribute file alongside the VCF file provides further options to sort and group the sample rows.

Display mode

VCF tracks have three display mode options: Collapsed, Expanded, and Squished.

To change the display mode, select a new option from the right-click pop-up menu.

To hide the genotypes in Expanded or Squished mode, unselect Show Genotypes in the track's right-click pop-up menu. To show them again, reselect Show Genotypes in the menu.

Color schemes

The variant calls can be colored by Allele Frequency, which is an annotation provided in the VCF file, or by Allele Fraction, which is similar to allele frequency but is computed from the genotypes of the samples in the VCF file rather than read from an annotation.

The variant call bars are rendered using two colors, one for the reference allele and one for the alternate allele, and the height of each section represents the frequency or fraction. By default, the variant call colors are red and blue.
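The difference between the two color modes can be sketched in Python. The first function reads the AF key from a record's INFO column; the second derives a fraction by counting alleles in the sample genotypes. The function names are hypothetical, and the exact counting rule the viewer applies (for example, how missing calls are handled) is an assumption of this sketch.

```python
def allele_frequency_from_info(info: str):
    """Return the first AF value annotated in an INFO string, or None if absent."""
    for entry in info.split(";"):
        if entry.startswith("AF="):
            # AF may list one value per alternate allele; take the first.
            return float(entry[3:].split(",")[0])
    return None


def allele_fraction_from_genotypes(genotypes):
    """Fraction of called alleles that are non-reference across the samples.

    Assumption: missing alleles (".") are excluded from the denominator.
    """
    alt = 0
    called = 0
    for gt in genotypes:
        for allele in gt.replace("|", "/").split("/"):
            if allele == ".":
                continue
            called += 1
            if allele != "0":
                alt += 1
    return alt / called if called else None


if __name__ == "__main__":
    print(allele_frequency_from_info("DP=100;AF=0.25"))
    print(allele_fraction_from_genotypes(["0/0", "0/1", "1|1", "./."]))
```

Under this reading, the two values can differ for the same variant: the annotation may describe a larger population, while the fraction reflects only the samples present in the file.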

By default, variant calls are colored by Allele Frequency. To change the default, uncheck the box labeled Color by allele frequency in View > Preferences > Variants. To change the color mode in selected tracks only, use the right-click pop-up menu.

For the genotypes, the default colors are:

To change any of the colors, select View > Preferences > Variants and click on a color swatch to bring up a color picker.

Visibility window

If the VCF file is indexed, data is loaded only when the region in view is smaller than the current visibility window. If the region in view is larger than the window, the track displays the message: Zoom in to see features. The default value for the visibility window is proportional to the number of samples. To change the value, select Set Feature Visibility Window... from the right-click pop-up menu and enter the new window size, in kilobases, in the dialog that appears.
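The rule amounts to a simple size comparison, sketched below in Python. The function names are hypothetical and the sketch only illustrates the comparison described above; it does not reproduce how the default window is derived from the number of samples.

```python
ZOOM_MESSAGE = "Zoom in to see features"


def should_load_features(start: int, end: int, visibility_window_kb: float) -> bool:
    """True when the viewed region (in bp) is smaller than the window (in kb)."""
    region_bp = end - start
    return region_bp < visibility_window_kb * 1000


def track_status(start: int, end: int, visibility_window_kb: float) -> str:
    """Return what the track would show for the given view (illustrative only)."""
    if should_load_features(start, end, visibility_window_kb):
        return "load features"
    return ZOOM_MESSAGE


if __name__ == "__main__":
    # 50 kb view with a 100 kb window: features are loaded.
    print(track_status(1_000, 51_000, 100))
    # 2 Mb view with a 100 kb window: the zoom message is shown.
    print(track_status(0, 2_000_000, 100))
```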