AI-Driven Data Catalog Management Systems

Modern organizations (companies, scientific domains, and government agencies) increasingly rely on data catalogs for two key reasons.

Much academic research has addressed individual components of data catalog systems, but little has focused on integrating these advances into end-to-end solutions. SmartCat is a new project (started in 2025) at UW–Madison that aims to bridge this gap. SmartCat is distinguished by:

Overall, we believe that building data catalog management systems is a compelling direction for data management research. It brings together and advances multiple previously disparate research areas, while also producing software that serves an increasingly critical real-world need.