Universal dynamical properties preclude standard clustering in a large class of biochemical data

Gomez, Florian ; Stoop, Ralph L. ; Stoop, Ruedi

In: Bioinformatics, 2014, vol. 30, no. 17, p. 2486-2493

Ajouter à la liste personnelle
    Summary
    Motivation: Clustering of chemical and biochemical data based on observed features is a central cognitive step in the analysis of chemical substances, in particular in combinatorial chemistry, or of complex biochemical reaction networks. Often, for reasons unknown to the researcher, this step produces disappointing results. Once the sources of the problem are known, improved clustering methods might revitalize the statistical approach of compound and reaction search and analysis. Here, we present a generic mechanism that may be at the origin of many clustering difficulties. Results: The variety of dynamical behaviors that can be exhibited by complex biochemical reactions on variation of the system parameters are fundamental system fingerprints. In parameter space, shrimp-like or swallow-tail structures separate parameter sets that lead to stable periodic dynamical behavior from those leading to irregular behavior. We work out the genericity of this phenomenon and demonstrate novel examples for their occurrence in realistic models of biophysics. Although we elucidate the phenomenon by considering the emergence of periodicity in dependence on system parameters in a low-dimensional parameter space, the conclusions from our simple setting are shown to continue to be valid for features in a higher-dimensional feature space, as long as the feature-generating mechanism is not too extreme and the dimension of this space is not too high compared with the amount of available data. Availability and implementation: For online versions of super-paramagnetic clustering see http://stoop.ini.uzh.ch/research/clustering. Contact: ruedi@ini.phys.ethz.ch Supplementary information: Supplementary data are available at Bioinformatics online