We compare a range of computational methods for the prediction of sublimation thermodynamics (enthalpy, entropy and free energy of sublimation). These include a model from theoretical chemistry that utilizes crystal lattice energy minimization (with the DMACRYS program) and QSPR models generated by both machine learning (Random Forest and Support Vector Machines) and regression (Partial Least Squares) methods. Using these methods we investigate the predictability of the enthalpy, entropy and free energy of sublimation, with consideration of whether such a method may be able to improve solubility prediction schemes. Previous work has suggested that the major source of error in solubility prediction schemes involving a thermodynamic cycle via the solid state is in the modeling of the free energy change away from the solid state. Yet contrary to this conclusion other work has found that the inclusion of terms such as the enthalpy of sublimation in QSPR methods does not improve the predictions of solubility. We suggest the use of theoretical chemistry terms, detailed explicitly in the methods section, as descriptors for the prediction of the enthalpy and free energy of sublimation. A dataset of 158 molecules with experimental sublimation thermodynamics values and some CSD refcodes has been collected from the literature and is provided with their original source references.
- drug discovery
McDonagh, J., Palmer, D. S., van Mourik, T., & Mitchell, J. B. O. (2016). Are the sublimation thermodynamics of organic molecules predictable? Journal of Chemical Information and Modeling . https://doi.org/10.1021/acs.jcim.6b00033