
Extremely fast chemical search in vast chemical space

ChemAxon explores the complex nature of chemical graphs, which offer an immense source of variability for drug designers tackling optimization challenges along the project pathway towards candidates.

The difficulty lies in exploring chemical space, whether through the chemical intuition of medicinal chemists or through enabling technologies such as cheminformatics tools. Real and virtual chemical spaces span a broad range of compound numbers and hold vast potential to be exploited. An especially valuable subgroup is the one where measured data exist and are stored, most commonly in relational databases. In our study both types were evaluated: a very large compound collection and a medium-sized database with extensive assay data.

As a read-out we used the cost associated with answering chemical questions: the search time. In the first use case, the aim was to suggest novel analogues of known drugs using the largest publicly available enumerated compound collection, GDB-13, which contains 977 million unique entries. This collection was screened with an ultra-fast similarity search technique, using a subset of marketed drugs as queries; an elapsed search time of ~4 seconds was measured consistently on a commercially available server (EC2, r3.8xlarge) using a standard 1k fingerprint. The top 100 most similar compounds were cross-filtered against the database of exemplified structures from patents (SureChEMBL DB) to fetch novel moieties with a higher tendency to lie in freedom-to-operate space (Fig. 1).
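The screening workflow described above (rank a library by fingerprint similarity to a query, keep the top 100, then drop anything already exemplified in patents) can be sketched in a few lines of Python. This is only an illustration under stated assumptions: fingerprints are taken to be 1024-bit vectors stored as Python integers, Tanimoto is assumed as the similarity measure, the library is random toy data, and the function names (`tanimoto`, `top_k_similar`, `novel_hits`) are hypothetical. It is a brute-force scan, not the engine used in the study.

```python
import heapq
import random

FP_BITS = 1024  # assumption: "1k fingerprint" read as 1024 bits


def tanimoto(a: int, b: int) -> float:
    """Tanimoto similarity of two bit-vector fingerprints stored as ints."""
    union = bin(a | b).count("1")
    if union == 0:
        return 0.0
    return bin(a & b).count("1") / union


def top_k_similar(query: int, library: dict, k: int = 100) -> list:
    """Return the k (score, compound_id) pairs most similar to the query."""
    scored = ((tanimoto(query, fp), cid) for cid, fp in library.items())
    return heapq.nlargest(k, scored)


def novel_hits(hits: list, patented_ids: set) -> list:
    """Keep only hits whose id is absent from the patent-exemplified set."""
    return [(score, cid) for score, cid in hits if cid not in patented_ids]


def random_fingerprint(rng: random.Random, density: float = 0.1) -> int:
    """Toy stand-in for a real structural fingerprint (hypothetical data)."""
    fp = 0
    for bit in range(FP_BITS):
        if rng.random() < density:
            fp |= 1 << bit
    return fp


rng = random.Random(0)
# Hypothetical library; a real one would hold fingerprints computed from structures.
library = {f"cmpd-{i}": random_fingerprint(rng) for i in range(2000)}
query = library["cmpd-42"]          # pretend this is a marketed drug
patented = {"cmpd-42", "cmpd-7"}    # hypothetical ids found in a patent database

hits = top_k_similar(query, library, k=100)
candidates = novel_hits(hits, patented)
print(len(hits), len(candidates))
```

A linear scan like this touches every fingerprint for every query, so it would not reach the search times reported above on hundreds of millions of entries; it only shows the shape of the ranking and cross-filtering steps.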

In the second part, search performance on the entire ChEMBL DB data set was measured with three search types (duplicate, similarity and substructure) and with joined queries. These joined queries represent the complex questions asked of data warehouses in the pharmaceutical industry, where performance is a key indicator because of the massive load. The aim is to provide realistic speed statistics measured with the chemical cartridge extending Oracle and with the new-generation engine running on PostgreSQL. A significant speed-up was measured with the new search engine, especially on combined queries, where a 100x speed-up was achieved and the median search time was in the range of ~100 milliseconds, falling below the recognition time limit.
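For readers who want to reproduce this kind of comparison on their own systems, the two statistics mentioned above (median search time and speed-up) can be computed as follows. The timing lists are illustrative placeholders, not measurements from the study, and the helper names `summarize` and `speed_up` are hypothetical; defining speed-up as a ratio of medians is also an assumption.

```python
from statistics import median


def summarize(timings_ms: list) -> dict:
    """Median and worst-case search time for one engine, in milliseconds."""
    return {"median_ms": median(timings_ms), "max_ms": max(timings_ms)}


def speed_up(baseline_ms: list, candidate_ms: list) -> float:
    """Speed-up defined here as the ratio of median search times."""
    return median(baseline_ms) / median(candidate_ms)


# Placeholder timings for the same set of queries on two engines (made up).
baseline = [9500.0, 10200.0, 11000.0, 9800.0, 10500.0]
candidate = [95.0, 102.0, 110.0, 98.0, 105.0]

print(summarize(baseline))
print(summarize(candidate))
print(f"speed-up: {speed_up(baseline, candidate):.0f}x")
```

The median is used rather than the mean so that a single slow query does not dominate the summary.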

Read the poster by ChemAxon here: https://chemaxon.com/poster/finding-answers-from-chemical-space-extremely-fast

 
