In the COVID era, computational biology is having a heyday – and machine learning is playing a massive role. With billions upon billions of compounds to search through for any given therapeutic application, strictly brute-force simulations are wildly unfeasible, necessitating more artificially intelligent methods of whittling down the options. Now, researchers from IRB Barcelona’s Structural Bioinformatics and Network Biology lab have developed a deep learning method that predicts the biological activity of any given molecule – even in the absence of experimental data.
The researchers, led by Patrick Aloy, are applying deep machine learning to a massive dataset: the Chemical Checker, which provides processed, harmonized, and integrated bioactivity data on 800,000 small molecules and is also produced by the Structural Bioinformatics and Network Biology lab. In total, any given molecule has 25 bioactivity “spaces,” but for most molecules, data on only a few are known – if that.
Using the new deep learning tool, that’s changing. The Chemical Checker database contains data on all 25 bioactivity spaces from each of those 800,000 molecules, and the tool, having been trained on that data, can predict all the bioactivity spaces of any molecules with incomplete bioactivity data. “The new tool … allows us to forecast the bioactivity spaces of new molecules, and this is crucial in the drug discovery process as we can select the most suitable candidates and discard those that, for one reason or another, would not work,” explained Aloy.
Of course, the prediction isn’t perfect, and assessing molecules with more available data will allow the tool to produce higher-confidence predictions. Some molecules, as well, prove simply more or less difficult for the tool to assess. “All models are wrong, but some are useful,” said Martino Bertoni, first author on the paper describing the research. “A measure of confidence allows us to better interpret the results and highlight which spaces of bioactivity of a molecule are accurate and in which ones an error rate can be contemplated.”
The researchers chose a challenging case for validation: a cancer-related transcription factor that was broadly considered an “undruggable” target. The tool identified 131 compounds that fit the target by predicting their bioactivity spaces, and their ability to degrade the target was experimentally confirmed.
The research described in this article was published as “Bioactivity descriptors for uncharacterized chemical compounds” in the June 2021 issue of Nature Communications. The article was written by Martino Bertoni, Miquel Duran-Frigola, Pau Badia-i-Mompel, Eduardo Pauls, Modesto Orozco-Ruiz, Oriol Guitart-Pla, Víctor Alcalde, Víctor M. Diaz, Antoni Berenguer-Llergo, Isabelle Brun-Heath, Núria Villegas, Antonio García de Herreros and Patrick Aloy. To read it, click here.
[wpsr_share_icons icons="twitter,facebook,linkedin,reddit,email" icon_size="40px" icon_bg_color="" icon_shape="circle" hover_effect="opacity" sm_screen_width="768" lg_screen_action="show" sm_screen_action="show" page_url="https://www.bigdatawire.com/2021/07/16/new-deep-learning-tool-predicts-bioactivity-identifies-therapeutics/" page_title="New Deep Learning Tool Predicts Bioactivity, Identifies Therapeutics" page_excerpt="In the COVID era, computational biology is having a heyday – and machine learning is playing a massive role. With billions upon billions of compounds to search through for any given therapeutic application, strictly brute-force simulations are wildly unfeasible, necessitating more artificially intelligent methods of whittling down the options. Read more…
" share_counter=""]January 30, 2025
- DeepSeek-R1 models now available on AWS
- DeepSeek-R1 Now Live With NVIDIA NIM
- Komprise Unveils Sensitive Data Management Capabilities for AI Data Governance and Cybersecurity
- ServiceNow Expands Workflow Data Fabric with Oracle Integration for AI-Driven Insights
- Kurrent Introduces Public Internet Access for Event-Native Data Platform
- VAST Data Supports Canada’s Sovereign AI Strategy with Hypertec Cloud Collaboration
- Lovelytics Acquires Datalytics to Expand Databricks Consulting and Global Reach
- Astronomer Announces Winners of the Inaugural Astronomer Data Excellence Awards
- LexisNexis Introduces Conversational Search in Nexis+ AI for Faster Insights
- Cerebras Launches Record-Breaking DeepSeek R1 Distill Llama 70B Inference
- YugabyteDB Levels Up its PostgreSQL Compatibility with PG15 Features and Seamless Upgrades
- SiMa.ai Expands MLSoC Lineup with Modalix for GenAI and Computer Vision
- FlutterFlow Announces AI-Powered Solution for CPG, Partners with Google Cloud and Accenture
January 29, 2025
- Scality Launches Cloud & Service Provider Program with Pay-as-You-Go Licensing for Veeam Partner Ecosystem
- KIOXIA Open Sources AiSAQ to Enable High-Performance RAG on SSDs
- Rocket Software Advances Hybrid Cloud Data Integration with Rocket DataEdge
- Genpact Launches AI Gigafactory to Accelerate Enterprise Value
- SAS Report Highlights AI’s Role in Boosting Government Efficiency
- Marvell to Showcase Accelerated Infrastructure Silicon at DesignCon 2025
- Perforce Introduces AI Validation: Adaptive, Intelligent AI Testing for Enterprise Teams