Magellan About Research Software Data Users Lessons Learned

Fast and Accurate Entity Matching with AI

From research to real-world scale

Started in 2015, Magellan is a major R&D project at UW–Madison focused on entity matching (EM)—a foundational challenge in data science and AI that affects data integration, analytics, and downstream modeling.

Our mission is to advance the science and practice of entity matching by building software, collaborating with real users, transferring technology to industry, and publishing high-impact research.

Over the years, Magellan has produced three major EM platforms and two startups:

Publications from the Magellan project have been cited thousands of times and have received Research Highlight Awards from both SIGMOD and ACM.

In 2025, the project inspired a new startup, MadMatcher, founded by Dev Ahluwalia, a CS graduate student at UW–Madison. MadMatcher builds on and extends SparkMatcher with Generative AI–based entity matching capabilities.

Looking for Entity Matching Software?