Accéder au contenu
Merck

Cheminformatics analysis of assertions mined from literature that describe drug-induced liver injury in different species.

Chemical research in toxicology (2009-12-18)
Denis Fourches, Julie C Barnes, Nicola C Day, Paul Bradley, Jane Z Reed, Alexander Tropsha
RÉSUMÉ

Drug-induced liver injury is one of the main causes of drug attrition. The ability to predict the liver effects of drug candidates from their chemical structures is critical to help guide experimental drug discovery projects toward safer medicines. In this study, we have compiled a data set of 951 compounds reported to produce a wide range of effects in the liver in different species, comprising humans, rodents, and nonrodents. The liver effects for this data set were obtained as assertional metadata, generated from MEDLINE abstracts using a unique combination of lexical and linguistic methods and ontological rules. We have analyzed this data set using conventional cheminformatics approaches and addressed several questions pertaining to cross-species concordance of liver effects, chemical determinants of liver effects in humans, and the prediction of whether a given compound is likely to cause a liver effect in humans. We found that the concordance of liver effects was relatively low (ca. 39-44%) between different species, raising the possibility that species specificity could depend on specific features of chemical structure. Compounds were clustered by their chemical similarity, and similar compounds were examined for the expected similarity of their species-dependent liver effect profiles. In most cases, similar profiles were observed for members of the same cluster, but some compounds appeared as outliers. The outliers were the subject of focused assertion regeneration from MEDLINE as well as other data sources. In some cases, additional biological assertions were identified, which were in line with expectations based on compounds' chemical similarities. The assertions were further converted to binary annotations of underlying chemicals (i.e., liver effect vs no liver effect), and binary quantitative structure-activity relationship (QSAR) models were generated to predict whether a compound would be expected to produce liver effects in humans. Despite the apparent heterogeneity of data, models have shown good predictive power assessed by external 5-fold cross-validation procedures. The external predictive power of binary QSAR models was further confirmed by their application to compounds that were retrieved or studied after the model was developed. To the best of our knowledge, this is the first study for chemical toxicity prediction that applied QSAR modeling and other cheminformatics techniques to observational data generated by the means of automated text mining with limited manual curation, opening up new opportunities for generating and modeling chemical toxicology data.

MATÉRIAUX
Référence du produit
Marque
Description du produit

Sigma-Aldrich
Ethyl alcohol, Pure, 200 proof, for molecular biology
Sigma-Aldrich
2-Propanol, suitable for HPLC, 99.9%
Sigma-Aldrich
2-Propanol, ACS reagent, ≥99.5%
Sigma-Aldrich
Ethyl alcohol, Pure, 200 proof, ACS reagent, ≥99.5%
Sigma-Aldrich
Acide acétique, glacial, ACS reagent, ≥99.7%
Sigma-Aldrich
Glycérol, ACS reagent, ≥99.5%
Sigma-Aldrich
Acide acétique, glacial, ReagentPlus®, ≥99%
Sigma-Aldrich
Glycérol, for molecular biology, ≥99.0%
Sigma-Aldrich
Tamoxifène, ≥99%
Sigma-Aldrich
Bicarbonate de sodium, ACS reagent, ≥99.7%
Sigma-Aldrich
Glycérol, ReagentPlus®, ≥99.0% (GC)
Sigma-Aldrich
Saccharose, for molecular biology, ≥99.5% (GC)
Sigma-Aldrich
Glycine, ReagentPlus®, ≥99% (HPLC)
Sigma-Aldrich
Acetate de sodium, anhydrous, ReagentPlus®, ≥99.0%
Sigma-Aldrich
Bicarbonate de sodium, powder, BioReagent, for molecular biology, suitable for cell culture, suitable for insect cell culture
Sigma-Aldrich
Solution de formol, tamponnée à pH neutre (10 %), histological tissue fixative
Sigma-Aldrich
Ethyl alcohol, Pure, 200 proof, meets USP testing specifications
Sigma-Aldrich
Saccharose, ≥99.5% (GC)
Sigma-Aldrich
L-glutamine solution, 200 mM, solution, sterile-filtered, BioXtra, suitable for cell culture
Sigma-Aldrich
Dexaméthasone, powder, BioReagent, suitable for cell culture, ≥97%
Sigma-Aldrich
Glycine, suitable for electrophoresis, ≥99%
Sigma-Aldrich
Acide rétinoïque, ≥98% (HPLC), powder
Sigma-Aldrich
Ethyl alcohol, Pure, 190 proof, for molecular biology
Sigma-Aldrich
Guanidine hydrochloride, for molecular biology, ≥99%
Sigma-Aldrich
Guanidine hydrochloride, ≥98%
Sigma-Aldrich
Bromure d'hexadécyltriméthylammonium, ≥98%
Sigma-Aldrich
2-Propanol, HPLC Plus, for HPLC, GC, and residue analysis, 99.9%
Sigma-Aldrich
Isopropanol, 70% in H2O
Sigma-Aldrich
Formaldéhyde solution, for molecular biology, 36.5-38% in H2O
Sigma-Aldrich
2-Propanol, for molecular biology, BioReagent, ≥99.5%