By using this site, you agree to the Privacy Policy and Terms of Use.
Accept
World of SoftwareWorld of SoftwareWorld of Software
  • News
  • Software
  • Mobile
  • Computing
  • Gaming
  • Videos
  • More
    • Gadget
    • Web Stories
    • Trending
    • Press Release
Search
  • Privacy
  • Terms
  • Advertise
  • Contact
Copyright © All Rights Reserved. World of Software.
Reading: Scientists Created A Genetic Code Search Engine Like ‘Google For DNA’ – BGR
Share
Sign In
Notification Show More
Font ResizerAa
World of SoftwareWorld of Software
Font ResizerAa
  • Software
  • Mobile
  • Computing
  • Gadget
  • Gaming
  • Videos
Search
  • News
  • Software
  • Mobile
  • Computing
  • Gaming
  • Videos
  • More
    • Gadget
    • Web Stories
    • Trending
    • Press Release
Have an existing account? Sign In
Follow US
  • Privacy
  • Terms
  • Advertise
  • Contact
Copyright © All Rights Reserved. World of Software.
World of Software > News > Scientists Created A Genetic Code Search Engine Like ‘Google For DNA’ – BGR
News

Scientists Created A Genetic Code Search Engine Like ‘Google For DNA’ – BGR

News Room
Last updated: 2025/11/24 at 2:53 AM
News Room Published 24 November 2025
Share
Scientists Created A Genetic Code Search Engine Like ‘Google For DNA’ – BGR
SHARE






FOTOGRIN/Shutterstock

DNA sequencing is one of today’s most critical scientific fields, powering leaps in humanity’s understanding of genetic causes of cancer, neurodegenerative diseases, and diabetes. One issue facing the industry is an overabundance of information. With scientists sharing their sequencing results in previously unrealized droves, massive datasets numbering in the petabytes have begun to be stored in repositories like the American Sequence Read Archive and European Nucleotide Archive. Containing almost as much information as all the text on the internet, harnessing these massive datasets has proven as difficult as analyzing them. Researchers at ETH Zurich have begun to tackle this problem by creating a DNA search engine that will allow scientists to look up and isolate genetic sequences. In a paper published in the scientific journal Nature, the team describes how its search engine, dubbed MetaGraph,  transforms these massive, disparate databases into a single searchable database housing nearly 600 million distinct sequences and 21 million gigabytes of sequence data.  

Such advancements build off the chain termination methods of Nobel laureate Fred Sanger, who pioneered the field with his 1977 breakthrough in genome sequencing. Since then, scientists have pursued next-generation sequencing technologies to develop tests to identify almost any infection, catalog the SARS-CoV-2 genome behind the COVID-19 pandemic, and even revive the dire wolf species. Described as a “Google for DNA” by Professor Gunnar Rätsch, a data scientist at the Department of Computer Science at ETH Zurich, researchers hope that MetaGraph’s search functionalities will vastly accelerate this form of genetic research.

A searchable genonome database


A digital rendering of a helixed DNA structure represents sequences within a database.
Bymuratdeniz/Getty Images

The research team at ETH Zurich has been building MetaGraph since 2020. Its strength is in its ability to streamline searching through DNA and RNA sequencing data by compressing it into full-text searchable indexes, reducing the average data size by a factor of 300. To do so, all data within the system undergoes a refining process, taking raw data and transforming it into error-corrected, refined graphs that are subsequently merged into the group’s unified index. This has allowed researchers to compress 100 TB datasets like GTEx and TCGA into just 10 GB each. 

The datasets feature virus, microbe, fungi, plant, bacteria, and human DNA sequences, including human gut metagenome and metazoan samples. The scientists also added raw metagenomic data and other critical datasets. The team used advanced mathematical graphs to efficiently organize the datasets, similar to how values are ordered in a spreadsheet. The connections between raw data and metadata have allowed the team to remove several redundancies, vastly compressing the dataset.   

One benefit of MetaGraph is that it allows researchers to search through the dataset without downloading large reams of information. Previously, researchers needed to download individual datasets before searching through the raw data sequences, making the research process slow and expensive. Another benefit is that this form of search is much more cost-efficient than previous data collation methods. For instance, the entire scope of publicly available biological sequencing data can now fit on a few hard drives, with each search costing a matter of cents, making the total cost roughly $2,500.

The future of DNA sequencing


A DNA sequencing database shows various processes for analyzing genetic structures.
Black_kira/Getty Images

As it stands, roughly half of the world’s sequencing datasets are currently available through MetaGraph’s search functions. The team at ETH expects the rest of the publicly available dataset to be online by the end of 2025. Critically, MetaGraph’s approach is scalable, ensuring that users continue to experience high search speeds even as its dataset multiplies. An open source resource, MetaGraph believes that it will attract various users, ranging from pharmaceutical companies, educators, scientists, researchers, and, possibly, private individuals. As Dr. André Kahles, a member of the Biomedical Informatics Group at ETH Zurich, said in a university press release, “In the early days, even Google didn’t know exactly what a search engine was good for. If the rapid development in DNA sequencing continues, it may become commonplace to identify your balcony plants more precisely.”

MetaGraph’s team of developers hopes their new program will facilitate genetic research. For instance, scientists used genomic sequencers to map out the SARS-CoV-2 virus, a key step in developing the COVID vaccine. Others have analyzed the DNA sequences of earthworms to study evolution. MetaGraph’s database could facilitate this research by making it easier to search, structure, and test genome sequences more quickly and cheaply. Such developments will make the next generation of genome sequencing technologies better, cheaper, and ultimately, healthier.

If you want to play with it, you can visit MetaGraph’s Open Data repository to execute searches within the group’s cloud database. For amateurs and prospective users looking to visualize the databases’ results, several examples are available on their website, including visualizations of famous proteins and antimicrobial resistance genes.



Sign Up For Daily Newsletter

Be keep up! Get the latest breaking news delivered straight to your inbox.
By signing up, you agree to our Terms of Use and acknowledge the data practices in our Privacy Policy. You may unsubscribe at any time.
Share This Article
Facebook Twitter Email Print
Share
What do you think?
Love0
Sad0
Happy0
Sleepy0
Angry0
Dead0
Wink0
Previous Article Survey reveals why Google’s Wicked Pixel Drop was a total letdown Survey reveals why Google’s Wicked Pixel Drop was a total letdown
Next Article How to transfer files from Android to iPhone: 5 methods that work How to transfer files from Android to iPhone: 5 methods that work
Leave a comment

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Stay Connected

248.1k Like
69.1k Follow
134k Pin
54.3k Follow

Latest News

The 5 best noise-cancelling earbuds to shop on Black Friday, tested and reviewed
The 5 best noise-cancelling earbuds to shop on Black Friday, tested and reviewed
News
Hotel Chocolat All-New Velvetiser Review
Hotel Chocolat All-New Velvetiser Review
Gadget
Beehiiv’s CEO isn’t worried about newsletter saturation |  News
Beehiiv’s CEO isn’t worried about newsletter saturation | News
News
Is The Universe Actually Infinite? Here’s What Astrophysicists Say – BGR
Is The Universe Actually Infinite? Here’s What Astrophysicists Say – BGR
News

You Might also Like

The 5 best noise-cancelling earbuds to shop on Black Friday, tested and reviewed
News

The 5 best noise-cancelling earbuds to shop on Black Friday, tested and reviewed

3 Min Read
Beehiiv’s CEO isn’t worried about newsletter saturation |  News
News

Beehiiv’s CEO isn’t worried about newsletter saturation | News

26 Min Read
Is The Universe Actually Infinite? Here’s What Astrophysicists Say – BGR
News

Is The Universe Actually Infinite? Here’s What Astrophysicists Say – BGR

5 Min Read
OnePlus 15 proves it’s the toughest flagship you can actually buy
News

OnePlus 15 proves it’s the toughest flagship you can actually buy

3 Min Read
//

World of Software is your one-stop website for the latest tech news and updates, follow us now to get the news that matters to you.

Quick Link

  • Privacy Policy
  • Terms of use
  • Advertise
  • Contact

Topics

  • Computing
  • Software
  • Press Release
  • Trending

Sign Up for Our Newsletter

Subscribe to our newsletter to get our newest articles instantly!

World of SoftwareWorld of Software
Follow US
Copyright © All Rights Reserved. World of Software.
Welcome Back!

Sign in to your account

Lost your password?