DATA-DRIVEN SUPERVISED LEARNING FOR LIFE SCIENCE DATA

Data-Driven Supervised Learning for Life Science Data

Data-Driven Supervised Learning for Life Science Data

Blog Article

Life science data are often encoded in a non-standard way by means of alpha-numeric sequences, graph representations, numerical vectors of variable length, or other formats.Domain-specific or data-driven similarity measures like alignment functions have been VTX employed with great success.The vast majority of more complex data analysis algorithms require fixed-length vectorial input data, asking for substantial preprocessing of life science data.Data-driven measures are widely ignored in favor of simple encodings.These preprocessing steps are not always easy to perform nor particularly effective, with a potential loss of information and interpretability.

We present some strategies and concepts of how to employ data-driven similarity measures in the life science context and other complex biological systems.In particular, we show how to use data-driven Arm Chair (set of 4) similarity measures effectively in standard learning algorithms.

Report this page