Copyright © All Rights Reserved. World of Software.

Scientists Created A Genetic Code Search Engine Like ‘Google For DNA’ – BGR

News Room
Last updated: 2025/11/24 at 2:53 AM
News Room Published 24 November 2025
FOTOGRIN/Shutterstock

DNA sequencing is one of today’s most critical scientific fields, powering leaps in humanity’s understanding of the genetic causes of cancer, neurodegenerative diseases, and diabetes. One issue facing the field is an overabundance of information. With scientists sharing their sequencing results in unprecedented volumes, massive datasets numbering in the petabytes are now stored in repositories like the American Sequence Read Archive and the European Nucleotide Archive. These datasets contain almost as much information as all the text on the internet, and harnessing them has proven as difficult as analyzing them. Researchers at ETH Zurich have begun to tackle this problem by creating a DNA search engine that allows scientists to look up and isolate genetic sequences. In a paper published in the scientific journal Nature, the team describes how its search engine, dubbed MetaGraph, transforms these massive, disparate databases into a single searchable database housing nearly 600 million distinct sequences and 21 million gigabytes of sequence data.

Such advancements build on the chain-termination method of Nobel laureate Fred Sanger, who pioneered the field with his 1977 breakthrough in genome sequencing. Since then, scientists have pursued next-generation sequencing technologies to develop tests that identify almost any infection, catalog the SARS-CoV-2 genome behind the COVID-19 pandemic, and even revive the dire wolf species. Professor Gunnar Rätsch, a data scientist in the Department of Computer Science at ETH Zurich, describes MetaGraph as a “Google for DNA,” and researchers hope its search functionality will vastly accelerate this form of genetic research.

A searchable genome database


A digital rendering of a helixed DNA structure represents sequences within a database.
Bymuratdeniz/Getty Images

The research team at ETH Zurich has been building MetaGraph since 2020. Its strength lies in streamlining searches through DNA and RNA sequencing data by compressing that data into full-text searchable indexes, reducing the average data size by a factor of 300. To do so, all data entering the system undergoes a refining process that transforms raw data into error-corrected graphs, which are subsequently merged into the group’s unified index. This has allowed researchers to compress 100 TB datasets like GTEx and TCGA into just 10 GB each.

The datasets feature viral, microbial, fungal, plant, bacterial, and human DNA sequences, including human gut metagenome and metazoan samples. The scientists also added raw metagenomic data and other critical datasets. The team used advanced mathematical graphs to organize the datasets efficiently, similar to how values are ordered in a spreadsheet. The connections between raw data and metadata have allowed the team to remove several redundancies, vastly compressing the dataset.
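
The idea of removing redundancy while keeping the data searchable can be illustrated with a toy example. The sketch below is not MetaGraph's implementation, which this article does not describe in detail. It assumes a common sequence-indexing technique: each sequence is split into overlapping substrings of length k (k-mers), each distinct k-mer is stored once, and every k-mer is annotated with the samples that contain it. The sample names, sequences, and function names are all hypothetical.

```python
from collections import defaultdict


def kmers(sequence, k):
    """Yield every overlapping substring of length k."""
    for i in range(len(sequence) - k + 1):
        yield sequence[i:i + k]


def build_index(samples, k):
    """Map each distinct k-mer to the set of sample labels containing it."""
    index = defaultdict(set)
    for label, sequence in samples.items():
        for kmer in kmers(sequence, k):
            index[kmer].add(label)
    return index


def search(index, query, k):
    """Return, per sample, the fraction of the query's k-mers found in it."""
    query_kmers = list(kmers(query, k))
    if not query_kmers:  # query shorter than k: nothing to look up
        return {}
    hits = defaultdict(int)
    for kmer in query_kmers:
        for label in index.get(kmer, ()):
            hits[label] += 1
    return {label: count / len(query_kmers) for label, count in hits.items()}


# Hypothetical toy data; real sequencing reads are far longer and noisier.
samples = {
    "sample_a": "ACGTACGTGA",
    "sample_b": "TTACGTACGG",
}
K = 4

index = build_index(samples, K)

# Shared and repeated k-mers are stored only once, so the index holds
# fewer entries than the total number of k-mer occurrences in the input.
total_occurrences = sum(len(seq) - K + 1 for seq in samples.values())
print(f"{total_occurrences} k-mer occurrences, {len(index)} distinct k-mers")

result = search(index, "ACGTACG", K)
print(result)
```

A query is answered by looking up its k-mers in the index, so the raw sequences never need to be downloaded or scanned. Real systems add error correction and far more compact encodings on top of this basic idea.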

One benefit of MetaGraph is that it allows researchers to search the dataset without downloading reams of information. Previously, researchers needed to download individual datasets before searching through the raw sequences, making the research process slow and expensive. Another benefit is that this form of search is much more cost-efficient than previous data collation methods. For instance, the entire scope of publicly available biological sequencing data can now fit on a few hard drives, with each search costing a matter of cents and the total cost coming to roughly $2,500.
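
The storage figures quoted in this article can be checked with some quick arithmetic. The sketch below assumes decimal units (1 TB = 1,000 GB), assumes that the average compression factor of 300 applies to the full 21 million gigabytes, and uses a hypothetical drive capacity of 20 TB. None of these assumptions comes from the MetaGraph team.

```python
import math

GB_PER_TB = 1_000  # decimal units (assumption)

# Figures quoted in the article.
raw_total_gb = 21_000_000  # "21 million gigabytes of sequence data"
average_factor = 300       # average compression factor

# Size of the whole collection after average compression, in terabytes.
compressed_tb = raw_total_gb / average_factor / GB_PER_TB

# Hypothetical drive capacity, used only to gauge "a few hard drives".
drive_capacity_tb = 20
drives_needed = math.ceil(compressed_tb / drive_capacity_tb)

# Compression factor implied by the GTEx and TCGA example (100 TB to 10 GB).
gtex_raw_tb, gtex_index_gb = 100, 10
gtex_factor = gtex_raw_tb * GB_PER_TB / gtex_index_gb

print(f"Compressed collection: {compressed_tb:.0f} TB")
print(f"Drives needed at {drive_capacity_tb} TB each: {drives_needed}")
print(f"GTEx/TCGA compression factor: {gtex_factor:.0f}")
```

Under these assumptions, the average factor brings the collection down to about 70 TB, which is in line with the claim that it fits on a few hard drives. The GTEx and TCGA example works out to a factor of 10,000, well above the quoted average.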

The future of DNA sequencing


A DNA sequencing database shows various processes for analyzing genetic structures.
Black_kira/Getty Images

Roughly half of the world’s sequencing datasets are currently available through MetaGraph’s search functions. The team at ETH expects the rest of the publicly available data to be online by the end of 2025. Critically, MetaGraph’s approach is scalable, so users should continue to experience high search speeds even as its dataset multiplies. MetaGraph is an open source resource, and the team believes it will attract a range of users, from pharmaceutical companies, educators, scientists, and researchers to, possibly, private individuals. As Dr. André Kahles, a member of the Biomedical Informatics Group at ETH Zurich, said in a university press release, “In the early days, even Google didn’t know exactly what a search engine was good for. If the rapid development in DNA sequencing continues, it may become commonplace to identify your balcony plants more precisely.”

MetaGraph’s developers hope their new program will facilitate genetic research. For instance, scientists used genomic sequencers to map out the SARS-CoV-2 virus, a key step in developing the COVID vaccine. Others have analyzed the DNA sequences of earthworms to study evolution. MetaGraph’s database could facilitate this kind of research by making it quicker and cheaper to search, structure, and test genome sequences. Such developments should make the next generation of genome sequencing technologies better and cheaper and, ultimately, benefit human health.

If you want to play with it, you can visit MetaGraph’s Open Data repository to execute searches within the group’s cloud database. For amateurs and prospective users looking to visualize the databases’ results, several examples are available on their website, including visualizations of famous proteins and antimicrobial resistance genes.


