Selected projects from our research program.
Automated Database Tuning
This project focuses on automatically optimizing database performance, reducing reliance on costly expert intervention. We investigate both physical design tuning (e.g., indexes and materialized views) and configuration tuning, leveraging techniques such as multi-armed bandits and large language models.
[
ICDE'21
,
ICDM'21
,
VLDB'22
,
TKDE'23
,
KAIS'23
,
ICDM'24
,
SIGMOD'26
,
MLSys'26
]
(Machine) Learned Indexes
In this line of work, we exploit data distributions to construct compact, ML-based indexes for faster and more efficient data retrieval. We study the efficiency and practicality of existing learned index models and propose new algorithms suitable for on-disk deployment, a critical requirement for industrial adoption. This line of work also includes learned spatial indexes and spatial query processing, including semantic search.
[
ADC'20
,
SIGMOD'23
,
ICDE'24
,
VLDB'25
,
ADC'25 x 2
,
ICDE'26
]
Semantics-Driven Prefetching
This project investigates how to anticipate and preload relevant data to reduce query latency and improve exploration efficiency. We leverage data and query semantics to predict complex access patterns that arise during exploratory data analysis, particularly in scientific domains.
[
VLDB'24
]
Cardinality Estimation
This project aims to predict query result sizes in order to improve query planning and execution performance. Our approach leverages copulas from statistical machine learning theory to model complex data correlations more accurately.
[
VLDB'25
,
ICDE'26
]
Predicting Next Actions in Data Exploration
This project focuses on learning and recommending relevant next steps during interactive data exploration, guiding users through complex analysis workflows and reducing the cognitive and technical burden of exploratory analysis.
[
CIKM'25
]
Learning-Based Traffic Optimisation
This line of work develops learning-based methods for traffic modelling, prediction, and optimisation in intelligent transportation systems. We leverage data-driven and machine learning techniques to support traffic management, routing, and decision making under dynamic and uncertain conditions.
[
SIGSPATIAL'20
,
ECML/PKDD'20
,
SIGSPATIAL'22 x 3
,
IV'22
,
TIST'23
,
PAKDD'24
,
SIGSPATIAL'25
]