Updates:
- vSampler now supports sampling on X chromosome.
- vSampler now supports input file to be separated by commas.
vSampler is both a web-based tool and a local program to support matched control variant sampling. Given input variants, vSampler could randomly sample control variants with matched minor allele frequency (MAF), distance to closet transcription start site (DTCT), number of nearby genes, number of variants in LD, GC content and many other properties. These matched random controls could be used to construct null distribution for GWAS enrichment analysis to estimate significance of enrichment empirically or serve as negative training/test data for regulatory variant prediction methods.
vSampler runs significantly faster than existing tools, supports both SNPs and indels and provides comprehensive state of art variant annotations as matching properties. A novel data structure and algorithm were developed to achieve this performance improvement.
Please cite vSampler as follows:
Huang D#, Wang Z#, Zhou Y#, Liang Q, Sham PC, Yao H*, Li MJ*. vSampler: fast and annotation-based matched variant sampling tool. Bioinformatics. 2021 Jul 27;37(13):1915-1917.