Managing semantic evolution in databases: from theory to implementation
July 3, 2025
1:15 p.m.
Manufacture des Tabacs, Room MH003
Kelly Rosa Braghetto
Abstract: Semantic heterogeneity often arises in long-term datasets due to changes in how data is categorized, named, or measured. This complicates querying, as users must manually account for inconsistencies over time—a time-consuming and error-prone task. The seminar introduces theoretical foundations and tools to tackle this issue through two strategies: query rewriting and data preprocessing. Both rely on tailored storage models and algorithms to ensure correct and semantically consistent retrieval. A prototype, MellowDB, enables querying heterogeneous data as if it were homogeneous, given a complete history of semantic changes. The approach models changes as discrete, time-stamped operations—translation, grouping, and ungrouping—with the possibility of extension. The system was tested on a real dataset: causes of death in Brazil (1979–2021).
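To make the idea of discrete, time-stamped changes more concrete, here is a minimal Python sketch of how a query phrased in current labels might be rewritten for an earlier year. It is an illustration under stated assumptions, not MellowDB's API or storage model: the `Change` class, the `labels_at` function, and the category labels are all hypothetical. It also assumes that each operation can be described as a set of old labels replaced by a set of new ones (translation is one-to-one, grouping many-to-one, ungrouping one-to-many).

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Change:
    """One time-stamped semantic change (hypothetical representation)."""
    year: int        # first year in which the new labels apply
    kind: str        # "translation", "grouping", or "ungrouping"
    old: frozenset   # labels valid before `year`
    new: frozenset   # labels valid from `year` onward


def labels_at(current_labels, year, history):
    """Rewrite labels from the latest vocabulary into the one used in `year`.

    Returns (labels, exact). `exact` is False when an ungrouping had to be
    undone for only some of its parts, so the older label covers more than
    what was asked for.
    """
    labels = set(current_labels)
    exact = True
    # Walk backward in time, undoing every change made after `year`.
    for change in sorted(history, key=lambda c: c.year, reverse=True):
        if change.year <= year:
            break  # this change and all older ones already apply in `year`
        if labels & change.new:
            if not change.new <= labels:
                exact = False  # only part of a split category was requested
            labels = (labels - change.new) | change.old
    return labels, exact


# Made-up labels, not real cause-of-death codes.
history = [
    Change(1996, "translation", frozenset({"A1"}), frozenset({"B1"})),
    Change(2005, "grouping", frozenset({"B1", "B2"}), frozenset({"C"})),
    Change(2015, "ungrouping", frozenset({"D"}), frozenset({"D1", "D2"})),
]

print(labels_at({"C"}, 1990, history))
print(labels_at({"D1"}, 2010, history))
```

In this sketch, a query for category "C" over 1990 data is rewritten to the labels "A1" and "B2", while a query for "D1" over 2010 data can only be answered approximately through the coarser label "D". How the actual system treats such partially recoverable cases is not stated in the abstract.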