Work Done at Kosmix (2010-2011)
While at Kosmix I was involved in a number of projects that built and
used Web-scale knowledge bases, especially for information extraction and integration,
entity disambiguation, and social media analytics. Parts of these
projects are described in the following papers:
- Social Media Analytics: the Kosmix Story, with many authors.
IEEE Data Engineering Bulletin, Sept 2013.
- Entity Extraction,
Linking, Classification, and Tagging for Social Media: A
Wikipedia-Based Approach, A. Gattani, D. Lamba, N. Garera,
M. Tiwari, X. Chai, S. Das, S. Subramaniam, A. Rajaraman,
V. Harinarayan, and A. Doan. VLDB-13, industrial paper. slides
- Building, Maintaining, and Using
Knowledge Bases: A Report from the Trenches, O. Deshpande,
D. Lamba, M. Tourn, S. Das, S. Subramaniam, A. Rajaraman,
V. Harinarayan, A. Doan. SIGMOD-13, industrial paper. slides
- Muppet: MapReduce-Style
Processing of Fast Data, W. Lam, L. Liu, S. Prasad,
A. Rajaraman, Z. Vacheri, A. Doan. VLDB-12, industrial
paper. slides
I also worked on event detection and monitoring for social media (with
papers describing these under preparation).
A talk that describes work at Kosmix at a high level:
Social Media, Data Integration, and
Human Computation.