Fabian Wenz

PROJECTS

BEAVER

BEAVER - a enterprise dataset for text-to-SQL - is sourced from enterprise data warehouses with natural language queries and accurate SQL statements. Unlike public datasets, it highlights LLM limitations in complex environments. Future research can leverage this dataset to build advanced text-to-SQL systems.

GOBY

GOBY is a benchmark dataset designed for evaluating data integration techniques specifically for enterprise data. It was derived from a real-world production workload in the event promotion and marketing domain, compiled around 2017. Unlike public benchmarks, GOBY focuses on private datasets, making it more representative of enterprise challenges.

BENCHPRESS

BENCHPRESS is an interactive annotation system for rapidly creating text-to-SQL benchmarks tailored to enterprise data tasks. It uses a human-in-the-loop approach, where annotators refine or repair LLM-generated SQL-NL pairs. BENCHPRESS was instrumental in creating the BEAVER benchmark and includes plans for scalable query log annotation, semantic context enrichment, and robustness evaluation, addressing the unique challenges of enterprise data.

RUBICON

RUBICON is an agent-centric information system designed to answer complex, cross-source queries in domains with heterogeneous, multimodal, and partially incompatible data. Each information source—such as regulatory documents, technical guidelines, databases, or visual artifacts—is wrapped by a dedicated agent that exposes its capabilities through a unified Agentic Query Language (AQL). Rather than relying on a fully autonomous coordinating agent, Rubicon places the human in the loop as the explicit coordinator, allowing users to decompose queries, orchestrate agents, and iteratively refine intermediate results. This design avoids brittle global planning, increases transparency and controllability, and enables robust reasoning across sources that differ in structure, modality, and interpretability.

coming soon ...

WORK

Celonis

Developed and implemented new features and algorithms for Celonis’ query engine in C++, Java, and Python, improving performance and reliability.

Sep 2022 – Mar 2024

Amundi

Created data visualizations for fund mandates and automated monthly reports using Python and MySQL.

Jul 2021 – Mar 2022

Ernst & Young

Developed a web crawler for pharmaceutical regulations, an automated email newsletter, and tax-related applications using Python and Power BI.

Nov 2019 – Jul 2020

BMW Group

Implemented optimization algorithms for generating realistic load cases and automated scripts for strain case calculations in Lua and Python.

Mar 2018 – Sep 2018

Technical University of Munich

Served as a mathematics tutor and exam corrector, and scripted LaTeX documents.

Feb 2019 – Jul 2019

Hi, I am Fabian Wenz.

A Data Scientist and Developer.

PROJECTS

BEAVER

GOBY

BENCHPRESS

RUBICON

WORK

Celonis

Amundi

Ernst & Young

BMW Group

Technical University of Munich

SKILLS

C++

Python

Java

LaTeX

Lua

Matlab

Git

Haskell

Algorithms

AI

HTML

CONTACT