Publications by Year
2014
2013
- Social Media Analytics: the Kosmix Story, with many authors.
IEEE Data Engineering Bulletin, Sept 2013.
- Entity Extraction,
Linking, Classification, and Tagging for Social Media: A
Wikipedia-Based Approach, A. Gattani, D. Lamba, N. Garera,
M. Tiwari, X. Chai, S. Das, S. Subramaniam, A. Rajaraman,
V. Harinarayan, and A. Doan. VLDB-13, industrial paper. slides
- Building, Maintaining, and Using
Knowledge Bases: A Report from the Trenches, O. Deshpande,
D. Lamba, M. Tourn, S. Das, S. Subramaniam, A. Rajaraman,
V. Harinarayan, A. Doan. SIGMOD-13, industrial paper. slides
2012
- Muppet: MapReduce-Style
Processing of Fast Data, W. Lam, L. Liu, S. Prasad,
A. Rajaraman, Z. Vacheri, A. Doan. VLDB-12, industrial
paper.
- Principles of Data Integration, A. Doan, A. Halevy, Z. Ives. Morgan Kaufmann, textbook. 2012.
2011
2010
2009
- Optimizing Complex
Extraction Programs over Evolving Text Data, F. Chen, B. Gao,
A. Doan, J. Yang, R. Ramakrishnan. SIGMOD-09. 63/397=15.9%.
- Efficiently Incorporating User Feedback into Information
Extraction and Integration Programs, X. Chai, B. Vuong, A. Doan, J. Naughton.
SIGMOD-09. 63/397=15.9%.
- Combining Keyword Search and Forms for Ad Hoc Querying of Databases,
E. Chu, A. Baid, X. Chai, A. Doan, J. Naughton. SIGMOD-09. 63/397=15.9%. slides
-
The Case for a Structured Approach to Managing Unstructured Data,
A. Doan, J. F. Naughton, A. Baid, X. Chai, F. Chen, T. Chen, E. Chu, P. DeRose, B. Gao,
C. Gokhale, J. Huang, W. Shen, B. Vuong. CIDR-09.
- Join Optimization of Information Extraction
Output: Quality Matters!, A. Jain, P. G. Ipeirotis, A. Doan, L. Gravano.
ICDE-09. 93/554 = 16.8%.
- Weighted Proximity Best-Joins for Information Retrieval,
R. Thonangi, H. He, A. Doan, H. Wang, J. Yang. ICDE-09. 93/554 = 16.8%.
2008
-
The Claremont Report on Database Research, with many others. SIGMOD Record, Fall 08.
- Introduction to the Special Issue on
Managing Information Extraction, A. Doan, L. Gravano, S. Vaithyanathan, R. Ramakrishnan.
SIGMOD Record, Winter 08.
- Information Extraction Challenges
in Managing Unstructured Data, with many others. SIGMOD Record, Winter 08,
Special Issue on Managing Information Extraction.
-
Analyzing and Revising Data Integration Schemas to Improve Their Matchability,
X. Chai, M. Sayyadian, A. Doan, A. Rosenthal, L. Seligman. VLDB-08. 49/296 = 16.5%.
-
On the Provenance of Non-Answers to Queries over Extracted Data,
J. Huang, T. Chen, A. Doan, J. Naughton. VLDB-08. 49/296 = 16.5%.
-
Toward Best-Effort Information Extraction, W. Shen, P. DeRose,
R. McCann, A. Doan, R. Ramakrishnan. SIGMOD-08. 78/435 = 17.9%.
- Building Community Wikipedias: A Human-Machine Approach,
P. DeRose, X. Chai, B. Gao, W. Shen, A. Doan, P. Bohannon, J. Zhu. ICDE-08. 75/617 = 12.1%.
- Efficient Information Extraction over Evolving Text Data,
F. Chen, A. Doan, J. Yang, R. Ramakrishnan. ICDE-08. 75/617 = 12.1%.
- Optimizing SQL Queries over Text Databases, A. Jain,
A. Doan, L. Gravano. ICDE-08. 75/617 = 12.1%.
- Matching Schemas in Online Communities: A Web 2.0
Approach, R. McCann, W. Shen, A. Doan. ICDE-08. 119/617 = 19.2%.
-
Databases and Web 2.0 Panel at VLDB 2007, S. Amer-Yahia, A. Halevy, G. Alonso, D. Kossmann, V. Markl,
A. Doan, G. Weikum.
2007
- A Relational Approach to Incrementally
Extracting and Querying Structure in Unstructured Data, E. Chu, A. Baid, T. Chen,
A. Doan, J. Naughton. VLDB-07. 45/276 = 16.3%.
- Building Structured Web Community Portals:
A Top-Down, Compositional, and Incremental Approach, P. DeRose, W. Shen, F. Chen,
A. Doan, R. Ramakrishnan. VLDB-07. 45/276 = 16.3%.
- Declarative Information Extraction Using
Datalog with Embedded Extraction Predicates, W. Shen,
A. Doan, J. Naughton, R. Ramakrishnan. VLDB-07. 45/276 = 16.3%.
- OLAP over Imprecise Data with Domain Constraints, D. Burdick, A. Doan,
R. Ramakrishnan, S. Vaithyanathan. VLDB-07. 45/276 = 16.3%.
-
Efficient Keyword Search across Heterogeneous Relational Databases,
M. Sayyadian, H. LeKhac, A. Doan, L. Gravano. ICDE-07. 122/659 = 18%.
- Souce-aware Entity Matching: A Compositional Approach,
W. Shen, P. DeRose, L. Vu, A. Doan, R. Ramakrishnan. ICDE-07. 122/659 = 18%.
- SQL Queries over Unstructured Text Databases,
A. Jain, A. Doan, L. Gravano. ICDE-07 (poster). 182/659 = 28%.
- User-Centric Research Challenges in Community
Information Management Systems, A. Doan, P. Bohannon, R. Ramakrishnan, X. Chai, P. DeRose,
B. Gao, W. Shen. IEEE Data Engineering Bulletin, special issue on data management in social
networks. 2007, invited.
- DBLife: A Community Information Management Platform for
the Database Research Community, P. DeRose, W. Shen, F. Chen, Y. Lee, D. Burdick,
A. Doan, R. Ramakrishnan. CIDR-07 (demo).
2006
- Learning from the Web to
Match Deep-Web Query Interfaces, W. Wu,
A. Doan, C. Yu. ICDE-06. PPT slides. 89/456 = 20%.
- Managing Information Extraction (PPT slides), A. Doan,
R. Ramakrishnan, S. Vaithyanathan. SIGMOD-06 Tutorial (see the 2-page description
here)
-
eTuner: Tuning Schema Matching Software Using Synthetic Scenarios,
Y. Lee, M. Sayyadian, A. Doan, A. Rosenthal. VLDB Journal Special Issue, Best Papers
of VLDB-05. 2006. Invited.
- Community Information
Management, A. Doan, R. Ramakrishnan, F. Chen, P. DeRose,
Y. Lee, R. McCann, M. Sayyadian, and W. Shen. IEEE Data
Engineering Bulletin, Special Issue on Probabilistic Databases,
29(1), 2006. Invited.
- Best-Effort Data Integration,
A. Doan. The NSF/EPA/ONR/NARA/AHRQ/NCO
Workshop on Data Integration (position statement). 2006.
2005
- Maveric: Mapping
Maintenance for Data Integration Systems, R. McCann, B.
AlShelbi, Q. Le, H. Nguyen, L. Vu, A. Doan. VLDB-05. 85/517 = 16%.
PPT slides.
- eTuner: Tuning Schema
Matching Software Using Synthetic Scenarios, M. Sayyadian,
Y. Lee, A. Doan, A. Rosenthal. VLDB-05. 85/517 = 16%.
PPT slides.
- Constraint-Based Entity
Matching, W. Shen, X. Li, A. Doan. AAAI-05 (Nat. Conf. on
AI). 148/803 = 18%. PPT slides.
- Integrating Data from
Disparate Sources: A Mass Collaboration Approach, R. McCann,
A. Kramnik, W. Shen, V. Varadarajan, O. Sobulo,
A. Doan. ICDE-05. Poster. 100/521 = 19%.
- Corpus-based Schema
Matching, J. Madhavan, P. Bernstein, A. Doan, A. Halevy.
ICDE-05. 67/521 = 13%.
- Merging Interface Schemas on the
Deep Web via Clustering Aggregation,
W. Wu, A. Doan, and C. Yu. IEEE Int. Conf. on Data Mining (ICDM-05).
141/630 = 22%.
- Semantic
Integration Research in the Database Community: A Brief
Survey, A. Doan and A. Halevy. AI Magazine, Special Issue
on Semantic Integration, Spring 2005.
-
Special Issue on Semantic Integration, N. Noy, A. Doan,
A. Halevy (editors).
AI Magazine, Spring 2005.
- Proceedings of the
Eighth Int. Workshop on the Web and Databases (WebDB-05),
A. Doan, F. Neven, R. McCann, G. J. Bex (editors).
- Bootstrapping Domain
Ontology for Semantic Web Services from Source Web Sites, W. Wu,
A. Doan, C. Yu, and W. Meng. In Proc. of the VLDB-05 Workshop on
Technologies for E-Services.
2004
- An
Interactive Clustering-based Approach to Integrating Source Query
interfaces on the Deep Web, W. Wu, C. Yu, A. Doan, and
W. Meng. SIGMOD-04. 69/431 = 16%.
- iMAP: Discovering Complex Semantic
Matches between Database Schemas, R. Dhamanka, Y. Lee,
A. Doan, A. Halevy, and P. Domingos. SIGMOD-2004. 69/431 = 16%.
- Privacy Preserving Data Integration and
Sharing, C. Clifton, A. Doan, A. Elmagarmid,
M. Kantarcioglu, G. Schadow, D. Suciu, and J. Vaidya.
Proc. of the 9th Int. Workshop on Data Mining and Knowledge Discovery (DMKD-04).
8/34 = 24%.
- Special Issue on Semantic Integration, A. Doan,
N. Noy, A. Halevy (editors).
ACM SIGMOD Record, 33(4), 2004.
- Report on the
Semantic Integration Workshop at the 2nd Int. Semantic Web
Conf. (ISWC-03), A. Doan, A. Halevy, and N. Noy.
SIGMOD Record, 33(1):138-140, 2004.
A related version appeared in AI Magazine, Spring 2004.
- Ontology Matching: A
Machine Learning Approach, A. Doan, J. Madhavan, P. Domingos,
and A. Halevy.
Handbook on Ontologies in Information Systems, S. Staab and R. Studer (eds.), Springer-Velag,
2004. Invited paper. Pages 397-416.
2003
- Building Data Integration
Systems via Mass Collaboration, R. McCann, A. Doan,
A. Kramnik, and V. Varadarajan. Proc. of the Int. Workshop on
Web and Databases (WebDB-03). 17/74 = 23%.
- Crossing the Structure Chasm,
O. Etzioni, A. Halevy, A. Doan, Z. Ives, J.
Madhavan, L. McDowell, I. Tatarinov. Conf. on Innovative Database Research (CIDR-2003). 26/87 = 30%.
- Learning to Match Ontologies
on the Semantic Web, A. Doan, J. Madhavan, R. Dhamankar,
P. Domingos, and A. Halevy. VLDB Journal, Special Issue on the
Semantic Web, 2003. PDF
version
- Learning to Match the Schemas of Databases: A Multistrategy
Approach, A. Doan, P. Domingos, and A. Halevy. Machine Learning Journal,
50, Pages 279-301, 2003.
- The Proceedings of the Semantic Integration Workshop at ISWC-03,
edited by A. Doan, A. Halevy, and N. Noy.
- Building Data
Integration Systems: A Mass Collaboration Approach, A. Doan
and R. McCann. Proc. of the IJCAI-03 Workshop on Information
Integration on the Web.
- Profile-based Object Matching, A. Doan, Y. Lu,
Y. Lee, and J. Han. IEEE Intelligent Systems, Special Issue
on Information Integration on the Web, 2003. Invited
paper.
-
Object Matching for Data Integration: A Profile-Based Approach, A. Doan, Y. Lu, Y. Lee,
and J. Han. Proc. of the IJCAI-03 Workshop on Information Integration on the Web.
- Mining for Information Discovery on the Web: Overview and Illustrative Research, H. Yu, A.
Doan, and J. Han. Intelligent Technologies - Advances in Agents, Data Mining, and Learning,
N. Zhong (ed.), Springer-Velag, 2003. Invited paper.
- Research on
Statistical Relational Learning at the University of Washington, with various coauthors.
Proc. of the IJCAI-03 Workshop on Learning Statistical Models from Relational Data, 2003.
2002
- Learning to Map between Structured
Representations of Data,
A. Doan. Ph.D. Dissertation, Univ. of Washington-Seattle,
2002. Received the ACM Doctoral
Dissertation Award in 2003.
- Learning to Map between Ontologies
on the Semantic Web, A. Doan, J. Madhavan, P. Domingos, and
A. Halevy. Proc. of the World-Wide Web Conf. (WWW-2002). 72/454 = 16%.
- Efficiently Ordering Query Plans for
Data Integration, A. Doan
and A. Halevy. Proc. of the 18th IEEE Int. Conf. on Data Engineering
(ICDE-2002) . 54/287 = 19%.
- Database Research at
UIUC. M. Winslett, K. Chang, A. Doan, J. Han, C. Zhai, and
Y. Zhou. SIDMOD Record, 31(3):97-102, 2002.
2001
2000
- Learning Source Descriptions for
Data Integration, A. Doan, P. Domingos, and
A. Levy. Proceedings of the 3rd International Workshop on
the Web and Databases (WebDB-2000). 20/69 = 29%.
- Data Integration: A "Killer App" for
Multi-Strategy Learning, A. Doan, P. Domingos, and A. Levy. Proceedings of
the Workshop on Multi-Strategy Learning (MSL-00), 2000, Guimaraes, Portugal.
Invited Paper.
- Learning Mappings between Data Schemas
, A. Doan, P. Domingos, and
A. Levy. Proceedings of the AAAI-2000 Workshop on
Learning Statistical Models from Relational Data, 2000, Austin, TX.
1994-1999
- Efficiently Ordering
Query Plans for Data Integration, A. Doan and A. Levy.
The IJCAI-99 Workshop on Intelligent Information Integration,
Stockholm, Sweden, 1999.
-
Geometric Foundations for Interval-Based Probabilities,
V. Ha, A. Doan, V. Vu, and P. Haddawy. Annals of Mathematics and Artificial Intelligence,
24 (1-4), 1998.
-
Decision-Theoretic Refinement Planning in Medical
Decision Making: Management of Acute Deep Venous Thrombosis, P. Haddawy, A. Doan, and
C. Kahn. Journal of Medical Decision Making, 1996.
- Sound Abstraction of
Probabilistic Actions in the Constraint Mass Assignment
Framework, A. Doan and P. Haddawy, Proceedings of the 12th
National Conference on Uncertainty in AI (UAI-96), Portland,
Oregon, 1996, pages 228-235.
- Modeling Probabilistic
Actions for Practical Decision-Theoretic Planning, A. Doan.
Proceedings of the 3rd International Conference on AI
Planning Systems (AIPS-96), Edinburgh, Scotland, May 1996.
- Decision-Theoretic Planning for Clinical Decision
Analysis, A. Doan, P. Haddawy, and C. Kahn. The Working
Papers of AI in Medicine Spring Symposium, Stanford, 1996.
- Management of Acute Deep Venous Thrombosis of the Lower
Extremities (abstract), C. Kahn, A. Doan. and P. Haddawy.
American Roentgen Ray Society Meeting, San Diego, May
1996.
- Efficient
Decision-Theoretic Planning: Techniques and Empirical
Analysis, P. Haddawy, A. Doan, and
R. Goodwin. Proceedings of the 11th National Conference on
Uncertainty in AI (UAI-95), Montreal, Canada, August 1995,
pages 229-236.
- An Abstraction-Based
Approach to Decision-Theoretic Planning for Partially
Observable Metric Domains, A. Doan. Masters
Thesis. Technical Report TR-95-12-01, Dept. of Electrical
Engineering and Computer Science, University of
Wisconsin-Milwaukee.
- Decision-Theoretic
Refinement Planning: A New Method for Clinical Decision
Analysis, A. Doan, P. Haddawy, and C. Kahn. Proceedings
of the 19th Annual Symposium on Computer Applications in
Medical Care (SCAMC-95), New Orleans, 1995, pages 299-303.
- Generating Macro
Operators, A. Doan and P. Haddawy. AAAI Spring Symposium
on Extended Theories of Action Representation, Stanford
1995.
- Abstracting
Probabilistic Actions, P. Haddawy and
A. Doan. Proceedings of the 10th National Conference on
Uncertainty in AI (UAI-94), Seattle, July 1994.