The VarSome API is an incredibly simple and powerful tool allowing a developer to instantly access the 140+ databases integrated in VarSome along with automated ACMG or AMP classifications.
It is possible to annotate whole exomes or even whole genomes extremely efficiently and at low cost using the VarSome API, with options to add VarSome’s automated ACMG & AMP classifications, or pull in additional data from specific databases of interest.
The most cost effective results will be achieved by only annotating a subset of variants: coding variants or those close to a canonical splice-point (flanking within 10bp for example). You can use tools such as bedtools combined with a GFF3 file for your genome of interest (https://www.gencodegenes.org/human/).
The VarSome API includes a filter to automatically remove high frequency variants.
In our tests we annotated a sample exome, then restricted the annotation to only coding/splicing variants and then finally filtered the remaining variants using the ACMG BA1 benign frequency threshold of 5%.
Variants | ACMG | ACMG +AMP | All Data | |
Coding or flanking | 24 954 | 13.6 KB/variant | 21.1 KB/variant | 131 KB/variant |
Frequency < 5% | 4 205 | 11.5 KB/Variant | 17.9 KB/variant | 161 KB/Variant |
Non-coding variants usually have substantially less data each, and intergenic variants even less.
Results will vary depending on your pipeline and specific use-case, which additional databases you require, annotating tumour samples or multiple family members. You can of course cache previous results to avoid re-annotating common variants in a family group or cohort.
Please refer to your Saphetor sales consultant for exact pricing details which may vary over time. As of March 2021, annotating a whole exome using ACMG would incur costs of approximately $15 (€12, CHF 13). Furthermore API pricing is degressive and reduces the more you use it.
Currently, the API is priced based on the amount of data exchanged. All data is returned in the JSON format. This document aims to explain some of the intricacies and how to keep the costs extremely low.
The following gives some recommendations on how to use the API efficiently, the full reference guide is available at https://api.varsome.com/.
The API gives full control to the user: in order to keep costs reasonable or very low, you need to decide which data you actually need. This is controlled by the following options:
Only use “expand-pubmed-articles” on a very small set of variants, the large amount of text in abstracts will rapidly make this option unviable. We recommend using the API call “pubmed_info” instead on a case-by-case basis and then caching the results in your own database - for example: https://api.varsome.com/pubmed_info/12345:45678.
This is a very useful tool, set it at 0.05 for example to remove all BA1 variants. We find that on average this reduces the amount of data by a factor of 4x to 6x.
We annotated a whole exome to measure the current data-sizes returned by the API:
Variants | ACMG | ACMG +AMP | All Data | |
Coding or flanking | 24 954 | 13.6 KB/variant | 21.1 KB/variant | 131 KB/variant |
Frequency < 5% | 4 205 | 11.5 KB/Variant | 17.9 KB/variant | 161 KB/Variant |
We recommend that you cache frequently used data:
We hope these guidelines are helpful to extract maximum value from the hugely powerful VarSome API. Do let us know if there’s any further information we can provide, please address any questions to support@varsome.com.
Kind regards,
The VarSome Team.