Skip to main content
Version: 3.25 (unreleased)

Illumina Connected Annotations provides translational research-grade annotation of genomic variants (SNVs, MNVs, insertions, deletions, indels, STRs, gene fusions, and SVs (including CNVs). It can be run as a stand-alone package, or integrated into larger software tools that require variant annotation.

The input to Illumina Connected Annotations are VCFs and the output is a structured JSON representation of all annotation and sample information (as extracted from the VCF). Illumina Connected Annotations handles multiple alternate alleles and multiple samples with ease.

The software is being developed under a rigorous SDLC and testing process to ensure accuracy of the results and enable embedding in other software. Illumina Connected Annotations uses a continuous integration pipeline where millions of variant annotations are monitored against baseline values daily.

What does Illumina Connected Annotations annotate?

We use Sequence Ontology consequences to describe how each variant impacts a given transcript:

Reference genome

For GRCh37, For GRCh38, our reference genome is using version GRCh38.p14. The list of chromosome and contigs in this version can be checked from this file

caution

There was a bug in our previous genome reference. Previously, we generate genome reference based on GRCh38.p13 version 109.20190607 for chromosomes and contigs name. Despite that, we use GRCh38.p12 version 109 for the FASTA sequence. This resulted in mismatch between some alt contigs name and its FASTA sequence. This mismatch can cause some minor issue on the variant ID and HGVS g. notation if the variant is on the alt contigs that were affected. This issue is not present for variant that occurs on main chromosome (chromosome 1-22, X, and Y).

With release GRCh38.p14, we have fix this issue and the GRCh38.p14 release will also backward compatible with previous release. Old transcript and gene model files and all of supplementary annotation data will not have any issue with the new genome reference version. It is advisable to update to the current genome reference version.

Transcript and Gene Models

The transcript and gene models are obtained from RefSeq and Ensembl. The current officially supported versions for GRCh38 are:

Data SourceVersionRelease Date
RefSeqGCF_000001405.40-RS_2023_102023-10-07
Ensembl1122024-05-14

For GRCh37:

Data SourceVersionRelease Date
RefSeq105.202203072022-03-10
Ensembl1102023-02-08
note

For GRCh37 Ensembl release, despite the version is 110 (and they also have newer release version 111 and 112), the annotation data is effectively the same as version 87 which was release back in 2017.

For GRCh37 Refseq release, NCBI has stopped releasing new gene annotation for and version 105.20220307 is the last release version for GRCh37.

For gene symbols that we supported for the transcript model above, we download the data from HGNC website as of 2024-06-03.

Supplementary Annotation

In addition, it uses external data sources to provide additional context for each variant. Illumina Connected Annotations provides annotations from the following sources divided into 2 tiers: Professional and basic. The basic tier can be accessed free of charge. The professional tier requires a license. Please see Licensed Content for details. For access, please contact annotation_support@illumina.com.

Data SourceAvailabilityLatest Supported Version
COSMICProfessional99
OMIMProfessional20240807
Primate AI-3DProfessional1.0
Promoter AIProfessional1.0
Splice AIProfessional1.3
1000 Genomes ProjectBasicPhase 3 v3plus
Cancer HotspotsBasic2017
ClinGenBasic20240807
ClinVarBasic20240730
DANNBasic20200205
dbSNPBasic156
DECIPHERBasic201509
FusionCatcherBasic1.33
GERPBasic20110522
GME VariomeBasic20160618
gnomAD (GRCh37)Basic2.1
gnomAD (GRCh38)Basic4.1
MITOMAPBasic20200819
MultiZ 100 wayBasic20171006
REVELBasic20200205
TOPMedBasicfreeze 5

Download manager

To effectively download all of data, we have provided download manager. Please go to download manager page to read more.

Download

Please visit Illumina Connected Annotations.