Citeseer
Abstract
This data set lists highly cited authors and their homepages.
The data set is used for the purpose of object matching.
Contribution
Original Owner and Donor
AnHai Doan
Department of Computer Science
University of Illinois, Champaign-Urbana
anhai@cs.uiuc.edu
Date Donated: February 6, 2004
Description
- This data was obtained from a Web page that lists highly cited authors and their homepages in 2003.
- This data was collected as a designed experiment for the purpose of object matching.
- As of now, the publications that have used this data are:
  - Object Matching for Data Integration: A Profile-Based Approach, A. Doan, Y. Lu, Y. Lee, and J. Han. Proc. of the IJCAI-03 Workshop on Information Integration on the Web, 2003.
  - Object Matching for Data Integration: A Profile-Based Approach, A. Doan, Y. Lu, Y. Lee, and J. Han. IEEE Intelligent Systems, Special Issue on Information Integration on the Web, 2003.
  - Semi-automated discovery of matches between schemas, ontologies and data fragments of disparate data sources, R. Dhamankar. Master's Thesis, Department of Computer Science, University of Illinois at Urbana-Champaign, 2004.
Data Format
This data set consists of two data sources and a mapping file between sources.
- The first data source lists highly cited authors with rank and name.
- The second data source contains suggested homepages for authors. We manually converted each homepage into an XML file by extracting homepage information such as the person's name, the name and rank of the current university, the position, and the year that the person obtained his or her PhD.
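
As a rough illustration of how the two sources might be loaded and compared, the Python sketch below parses one homepage record and pairs it with an author entry by normalized name. The element names (`homepage`, `name`, `university`, `position`, `phd_year`), the `rank` attribute, and the sample person are all hypothetical, since the actual XML layout of the files is not described here. The exact-name comparison is only a naive baseline, not the profile-based approach of the publications listed above.

```python
import xml.etree.ElementTree as ET
from dataclasses import dataclass
from typing import Dict, Iterable, List, Optional, Tuple

# Hypothetical record layout and made-up person; the real files may use
# different element names and structure.
SAMPLE_HOMEPAGE_XML = """
<homepage>
  <name>Jane Q. Example</name>
  <university rank="5">Example University</university>
  <position>Professor</position>
  <phd_year>1990</phd_year>
</homepage>
"""


@dataclass
class Homepage:
    name: str
    university: Optional[str]
    university_rank: Optional[int]
    position: Optional[str]
    phd_year: Optional[int]


def parse_homepage(xml_text: str) -> Homepage:
    """Parse one homepage record in the assumed (hypothetical) layout."""
    root = ET.fromstring(xml_text.strip())
    uni = root.find("university")
    rank = uni.get("rank") if uni is not None else None
    year = root.findtext("phd_year")
    return Homepage(
        name=(root.findtext("name") or "").strip(),
        university=uni.text.strip() if uni is not None and uni.text else None,
        university_rank=int(rank) if rank else None,
        position=root.findtext("position"),
        phd_year=int(year) if year else None,
    )


def normalize(name: str) -> str:
    """Lowercase, drop periods, and collapse whitespace."""
    return " ".join(name.lower().replace(".", " ").split())


def match_authors(
    authors: Iterable[Tuple[int, str]], homepages: List[Homepage]
) -> Dict[int, Homepage]:
    """Map author rank to the homepage whose normalized name is identical."""
    by_name = {normalize(h.name): h for h in homepages}
    return {
        rank: by_name[normalize(name)]
        for rank, name in authors
        if normalize(name) in by_name
    }
```

A result produced this way could be checked against the mapping file, which presumably records the correct author-to-homepage correspondences.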
Data Files
Illini Semantic Integration Archive
Department of Computer Science
University of Illinois, Champaign-Urbana
Urbana, IL 61801
Last modified: February 6, 2004