• Skip navigation
  • Skip to navigation
  • Skip to the bottom
Simulate organization breadcrumb open Simulate organization breadcrumb close
Friedrich-Alexander-Universität Chair of Computer Science 6 CS6
  • FAUTo the central FAU website
  1. Friedrich-Alexander-Universität
  2. Technische Fakultät
  3. Department Informatik
Suche öffnen
    • Imprint
    • Privacy
    • Accessibility
    1. Friedrich-Alexander-Universität
    2. Technische Fakultät
    3. Department Informatik
    Friedrich-Alexander-Universität Chair of Computer Science 6 CS6
    Navigation Navigation close
    • Chair
      • About us
      • Staff
      • Contact
      Chair main page
    • Research
      • Evolutionary Information Systems
        • Sprechaktbasiertes Fallmanagement
        • Open and Collaborative Query-Driven Analytics
        • Processing Heterogeneous Assets and Resources to discover Ontologies and Semantics
        • Schema Inference and Machine Learning
      • Data Quality
        • DQ-Step – Verbesserung der Datenqualität bei AREVA NP / Abteilung NEM-G
        • Data quality and innovative capability of medical products
      • Data Integration
        • DQ-Step – Verbesserung der Datenqualität bei AREVA NP / Abteilung NEM-G
        • Open and Collaborative Query-Driven Analytics
        • Processing Heterogeneous Assets and Resources to discover Ontologies and Semantics
        • Schema Inference and Machine Learning
      • Process Management
        • Sprechaktbasiertes Fallmanagement
      • Database Systems
        • REAPER: A Framework for Materializing and Reusing Deep-Learning Models
        • Data Stream Application Manager
        • Know Your Queries!
        • Assessment of Data Management Systems
        • Query Optimisation and Near-Data Processing on Reconfigurable SoCs for Big Data Analysis
        • Query Optimisation and Near-Data Processing on Reconfigurable SoCs for Big Data Analysis (Phase II)
        • Schema Inference and Machine Learning
        • Architecture of Non-Multiple Autoencoders for Non-Lossy Information Agglomeration (working title, preliminary)
      • Datastream Systems
        • Data Stream Application Manager
        • Cross-system Optimization of Data-stream Queries
      • Data Management in the Digital Humanities
        • Franken in historischen Reiseberichten
        • Campusnetzwerk Digitale Geistes- und Sozialwissenschaften
      • Modern Database Systems
      • Finished Research Projects
      • Publications
      Research main page
    • Teaching
      • Study recommendations
      • Courses
      • Curriculum
      • Supervised theses
      • Exam information
      Teaching main page

    Chair of Computer Science 6

    Data Management

    In page navigation: Research
    • Finished Research Projects
    • Publications
    • Evolutionary Information Systems
    • Data Quality
    • Data Integration
    • Process Management
    • Database Systems
      • REAPER: A Framework for Materializing and Reusing Deep-Learning Models
      • Data Stream Application Manager
      • Know Your Queries!
      • Assessment of Data Management Systems
      • Query Optimisation and Near-Data Processing on Reconfigurable SoCs for Big Data Analysis
      • Query Optimisation and Near-Data Processing on Reconfigurable SoCs for Big Data Analysis (Phase II)
      • Schema Inference and Machine Learning
      • Architecture of Non-Multiple Autoencoders for Non-Lossy Information Agglomeration (working title, preliminary)
    • Datastream Systems
    • Data Management in the Digital Humanities
    • Modern Database Systems

    Query Optimisation and Near-Data Processing on Reconfigurable SoCs for Big Data Analysis (Phase II)

    Maximilian Langohr

    Maximilian Langohr, M. Sc.

    Department of Computer Science
    Chair of Computer Science 6 (Data Management)

    Room: Room 08.151
    Martensstr. 3
    91058 Erlangen
    Germany
    • Phone number: +49 9131 85-27800
    • Email: maximilian.langohr@fau.de
    • Website: https://www.cs6.tf.fau.de/person/maximilian-langohr/
    • Xing: Page of Maximilian Langohr
    • Google Scholar: Page of Maximilian Langohr
    • ORCID: Page of Maximilian Langohr

    Query Optimisation and Near-Data Processing on Reconfigurable SoCs for Big Data Analysis (Phase II)

    (Third Party Funds Single)

    Overall project:
    Project leader: Klaus Meyer-Wegener
    Project members: Maximilian Langohr
    Start date: 01/08/2021
    End date: 31/07/2024
    Acronym: ReProVide II INF6
    Funding source: Deutsche Forschungsgemeinschaft (DFG)
    URL: https://www.dfg-spp2037.de/me943-9/

    Abstract

    Analysing petabytes of data in an affordable amount of time and energy requires massively parallel processing of data at their source. Active research is therefore directed towards emerging hardware architectures to reduce the volume of data close to their source and towards sound query analysis and optimisation techniques to exploit such novel architectures. The goal of the ReProVide project is to investigate FPGA-based solutions for smart storage and near-data processing together with novel query-optimisation techniques that exploit the speed and reconfigurability of FPGA hardware for a scalable and powerful (pre-)filtering of Big Data.
    In the first funding phase, we have fostered the fundamentals for this endeavour. In particular, we have designed an FPGA-based PSoC architecture of so-called Reconfigurable Data Provider Units (RPUs). For data processing and filtering, an RPU exploits the capabilities of dynamic (run-time) hardware reconfiguration of modern FPGAs to load pre-designed hardware accelerators on-the-fly. An RPU is able to process SQL queries or parts of them in hardware in combination with CPU cores also available on the PSoC. For the integration of RPUs into a DBMS, new cost models had to be developed, taking the capabilities and characteristics of an RPU into account. Here, we have elaborated a novel hierarchical (multi-level) query optimisation to determine which operations are worthwhile to be assigned to a RPU (query partitioning) and how to deploy and execute the assigned (sub-)queries or database operators on the RPU (query placement). The implemented query optimiser shares the work between the global optimiser of the DBMS (in our case Apache Calcite) and an architecture-specific local optimiser running on the RPU. 
    In the second funding phase, our major research goals will be: 
    1.) Stream processing: RPUs could equally be beneficial for the filtering of streams. Here, a plethora of fundamentally new module functionality will have to be investigated to support non-standard operators, leading to RPUs applicable to a much more diverse class of tasks including window operations and data-preparation functionality. 
    2.) Scalability: User interaction with modern databases usually involves not only one, but a sequence of queries. At the same time, multiple applications are running concurrently. Here, we will design an eight-node RPU cluster attached to storage and network to enable the distributed and parallel data processing of large databases and data streams. Also required are concepts for data partitioning and novel query optimisation techniques, making use of query-sequence information. 
    3.) Demonstrator & Evaluation: As a testbed and a proof of the benefits of the ReProVide approach in general and an FPGA-based RPU cluster in particular, we want to analytically as well as experimentally evaluate the margins of energy reductions that become possible through near-data processing.

    Publications

    • Langohr M., Vogler T., Meyer-Wegener K.:
      SKYSHARK: A Benchmark with Real-world Data for Line-rate Stream Processing with FPGAs
      Lernen, Wissen, Daten, Analysen (LWDA) (Marburg, 09/10/2023 - 11/10/2023)
      In: Leyer M, Wichmann J (ed.): Lernen, Wissen, Daten, Analysen (LWDA) Conference Proceedings, Marburg, Germany, October 9-11, 2023 2023
      Open Access: https://ceur-ws.org/Vol-3630/LWDA2023-paper9.pdf
      URL: https://ceur-ws.org/Vol-3630/LWDA2023-paper9.pdf
      BibTeX: Download
    • Hahn T., Langohr M., Meißner S., Döring B., Wildermann S., Meyer-Wegener K., Teich J.:
      ReProVide: Query Optimisation and Near-Data Processing on Reconfigurable SoCs for Big Data Analysis
      21st Conference on Database Systems for Business, Technology and Web (Bamberg, 03/03/2025 - 07/03/2025)
      In: Proceedings of the 21st Conference on Database Systems for Business, Technology and Web 2025
      DOI: 10.18420/BTW2025-57
      BibTeX: Download
    • Hahn T., Langohr M., Becher A., Beena Gopalakrishnan Nair L., Meyer-Wegener K., Teich J., Wildermann S.:
      ReProVide: Query Optimization and Near-Data Processing on Reconfigurable SoCs for Big Data Analysis
      In: Scalable Data Management for Future Hardware, 2025, p. 171-197
      ISBN: 9783031740961

      DOI: 10.1007/978-3-031-74097-8_7
      BibTeX: Download

    Staff

    Project Heads

    Maximilian Langohr

    Maximilian Langohr, M. Sc.

    Martensstr. 3
    91058 Erlangen
    Germany
    • Phone number: +49 9131 85-27800
    • Email: maximilian.langohr@fau.de
    • Website: https://www.cs6.tf.fau.de/person/maximilian-langohr/
    • Xing: Page of Maximilian Langohr
    • Google Scholar: Page of Maximilian Langohr
    • ORCID: Page of Maximilian Langohr
    More › Details for Maximilian Langohr

    Klaus Meyer-Wegener

    Prof. i. R. Dr. Klaus Meyer-Wegener

    Martensstraße 3
    91058 Erlangen
    • Phone number: +49 9131 85-27892
    • Email: klaus.meyer-wegener@fau.de
    • Website: https://www.cs6.tf.fau.de/person/klaus-meyer-wegener/
    More › Details for Klaus Meyer-Wegener

    Further Participants

    • Stefan Meißner
    Chair of Computer Science 6 (Data Management)
    Friedrich-Alexander-Universität Erlangen-Nürnberg

    Martensstraße 3
    91058 Erlangen
    Germany
    • Contact
    • Imprint
    • Privacy
    • Accessibility
    • Facebook
    • RSS Feed
    • Twitter
    • Xing
    Up