A QSAR model was developed from a dataset consisting in 546 organic molecules to predict acute aquatic toxicity towards Daphnia Magna. LC50 data, which is the concentration that causes death in 50% of test Daphnia Magna over a test duration of 48 hours, was used as model response. The 546 molecules of the dataset were randomly divided into training (436) and external test sets (110). The dataset included 8 molecular descriptors: TPSA(Tot) (Molecular properties), SAacc (Molecular properties), H-050 (Atom-centred fragments), MLOGP (Molecular properties), RDCHI (Connectivity indices), GATS1p (2D autocorrelations), nN (Constitutional indices), C-040 (Atom-centred fragments).
Here the dataset (togheter with the molecular SMILES) is provided for free as excel file. For further details on the dataset, molecular descriptors and the consequent published QSAR model, please have a look to the referenced scientific paper. The dataset is also uploaded on the OCHEM web site.
Conditions and warranty
The dataset is freeware and may be used if proper reference is given to the authors. Please, refer to the following paper:
M. Cassotti, D. Ballabio, V. Consonni, A. Mauri, I. Tetko, R. Todeschini (2014). Prediction of acute aquatic toxicity toward daphnia magna using the GA-kNN method, Alternatives to Laboratory Animals (ATLA), 42,31:41
Download
Fill in the following form. Your personal data will be used only for notification via email of new releases of the dataset and will not be communicated to external third parties. Once the form has been submitted, open the rar file and extract the excel data file. If you experience any problem to downlaod the toolbox, write to davide.ballabio@unimib.it.