Schema Inference and Machine Learning

Schema Inference and Machine Learning

(Own Funds)

Overall project:
Project leader:
Project members:
Start date: 01/08/2018
End date:
Acronym: SIML
Funding source:


Within the framework of the project SIML (Schema Inference and Machine Learning), unstructured and semi-structured data are to be used to generate information from which a partial conceptual schema can be derived. Methods of topological data analysis (TDA) are used in combination with machine learning techniques to automate this as far as possible. In particular, we are interested in a stable, persistent form of natural data when using unsupervised learning methods. As a core concept, functional dependencies after data processing are to be investigated, with the help of which a suitable schema can then be defined. There are parallels and differences for time series and persistent data, which are also to be worked out.

The motivation of the work is to prove that schemata have a natural geometric structure in the form of a simplicial complex which can be investigated or made visible by topological methods.