Sample SNPs
Fast ordered sampling of rows from large text or binary files, with special handling for DNA variant files (.bed, VCF, HapMap, etc.).
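As a rough illustration of what "ordered sampling" means, the sketch below uses plain selection sampling (Knuth's Algorithm S): one pass over the rows, keeping each row with probability (rows still needed) / (rows still remaining), so the sample comes out in file order. This is a simpler stand-in for the faster skip-based methods in the Vitter papers listed here, and it is not necessarily how this tool is implemented. The function name `sample_in_order` and its parameters are hypothetical, chosen only for this example. Python's `random.Random` happens to use the Mersenne Twister generator.

```python
import random
from typing import Iterable, Iterator, TypeVar

T = TypeVar("T")


def sample_in_order(rows: Iterable[T], total: int, k: int,
                    rng: random.Random) -> Iterator[T]:
    """Yield k of the `total` rows uniformly at random, preserving input order.

    Hypothetical helper for illustration; assumes `rows` really
    contains `total` items (otherwise fewer than k may be yielded).
    """
    if not 0 <= k <= total:
        raise ValueError("k must be between 0 and total")
    remaining = total  # rows not yet examined
    needed = k         # rows still to be selected
    for row in rows:
        if needed == 0:
            break
        # Keep this row with probability needed / remaining.
        if rng.random() * remaining < needed:
            yield row
            needed -= 1
        remaining -= 1


# Usage: pick 10 of 1000 row indices with a seeded generator.
rng = random.Random(42)
picked = list(sample_in_order(range(1000), total=1000, k=10, rng=rng))
```

Selection sampling examines every row, so its cost grows with the file size; the skip-based methods avoid this by jumping directly to the next selected row, which matters most when the sample is small relative to the file.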
M. Matsumoto and T. Nishimura. Mersenne Twister: a 623-dimensionally equidistributed uniform pseudo-random number generator. ACM Trans. Model. Comput. Simul., 8(1):3–30, January 1998.
Jeffrey Scott Vitter. Faster methods for random sampling. Commun. ACM, 27(7):703–718, July 1984.
Jeffrey S. Vitter. An efficient algorithm for sequential random sampling. ACM Trans. Math. Softw., 13(1):58–67, March 1987.