Hot processor speeds up U.K. genome analysis
The Genome Analysis Centre (TGAC) is the first institute in the U.K. to deploy a new bioinformatics processor called DRAGEN™, which dramatically reduces genomic pipeline runtimes from hours to minutes. This collaboration between Edico Genome and TGAC resulted in the first adaptation of the DRAGEN technology for the analysis of non-human genomes as part of the Institute’s endeavours to sequence the DNA of plant, animal and microbial species to promote a sustainable bioeconomy.
NORWICH, England and SAN DIEGO, Oct. 28, 2015 — TGAC’s high performance computing (HPC) infrastructure will benefit from the addition of Edico Genome’s DRAGEN™, the world’s first processor designed to analyse specific sequencing data tasks. DRAGEN will be used to accelerate TGAC’s next-generation sequencing workflows.
Initial evaluations of DRAGEN showed that mapping against the ash tree genome was 177 times faster per processing core than TGAC’s local HPC systems, requiring only 7 minutes instead of 3 hours on one of the larger datasets. Alignment runs on the rice genome that take approximately two hours on TGAC’s HPC servers took just three minutes using DRAGEN.
Project Lead Dr. Tim Stitt, Head of Scientific Computing at TGAC, said: “We are really excited to be Edico Genome’s first DRAGEN customer in the U.K., and we hotly anticipate utilising this ground-breaking technology to advance our mission to promote a sustainable bioeconomy and maintain the U.K.’s food security.
“In particular, we are really interested to see how DRAGEN handles the wheat genome, which is five times bigger than the human genome and much more complex. Wheat is the staple diet for over 35 percent of the world’s population, which is predicted to increase to 9 billion people by 2050.
“By understanding the genomic building blocks of wheat, and its diversity, we can better inform breeders on how to improve their yields, particularly in areas where wheat is prone to disease and drought. Obviously the sooner we do this the better and DRAGEN can greatly help us in this mission.
“Alignment against reference genomes is a fundamental task undertaken daily by TGAC researchers. Thanks to our partnership with Edico Genome, our DRAGEN system will contain both genome and transcriptome highly optimised analysis pipelines.
“TGAC is proud to be a leader in bringing new and disruptive technologies into the hands of the bioscience community and our collaboration with Edico Genome continues to illustrate our leadership in this area.”
The DRAGEN Bio-IT Processor is integrated on a PCIe card and available in a pre-configured server, enabling seamless integration into bioinformatics workflows. DRAGEN is highly reconfigurable, using a field-programmable gate array (FPGA) to provide hardware-accelerated implementations of BCL conversion, compression, mapping, alignment, sorting, duplicate marking, haplotype variant calling and joint genotyping.
The DRAGEN system therefore is much faster than traditional approaches that execute algorithmic implementations in software. In a recent study published in Genome Medicine, DRAGEN sped up analysis of a whole genome from 22.5 hours to 41 minutes, while also achieving sensitivity and specificity of 99.5 percent. Similar efficiency gains could make an enormous impact due to the high throughput of genomic data processed at TGAC, where sequence alignment is critical to many sequencing projects.
“Our collaboration with TGAC, a powerhouse in genomics that is home to one of the largest computing hardware facilities in Europe, is a great example of the benefits DRAGEN holds for sequencing centres,” said Pieter van Rooyen, Ph.D., chief executive officer of Edico Genome. “We look forward to continuing to work with researchers and clinicians around the world with a need to analyse next-generation sequencing data rapidly and cost effectively without compromising accuracy.”
Note to Editors
- The hardware modifications were carried out by Edico Genome engineers based on in-house testing using the datasets provided by TGAC. This testing allowed adaption of the pipelines to handle these non-human datasets.
- The DRAGEN technology has been shown to accurately analyse over 50 whole human genomes (from FASTQ to VCF) in less than a day. TGAC plans to incorporate the system into the existing HPC platform as a resource within the batch submission system.
The Genome Analysis Centre (TGAC) is a research institute focused on the application of state of the art genomics and bioinformatics to advance plant, animal and microbial research to promote a sustainable bioeconomy. TGAC is a hub for innovative bioinformatics founded on research, analysis and interpretation of multiple, complex data sets. TGAC hosts one of the largest computing hardware facilities dedicated to life science research in Europe.
About Edico Genome
Edico Genome has created the world’s first bioinformatics processor designed to analyze next-generation sequencing data, DRAGEN™. The use of next-generation sequencing is growing at an unprecedented pace, creating a need for a technology that can process this big data rapidly and accurately. Edico Genome’s computing platform has been shown to speed whole genome data analysis from hours to minutes, while maintaining high accuracy and reducing costs, enabling clinicians and researchers to reveal answers more quickly. For more information, visit www.EdicoGenome.com or follow @EdicoGenome.