I am a fifth year Ph.D. student in the Dept. of Computer Sciences at the University of Wisconsin - Madison. My advisor is Professor AnHai Doan. My research involves building a system we call CloudMatcher. CloudMatcher provides a hands-off cloud/crowd service for Entity Matching(EM) using machine learning techniques. It provides a robust and scalable self-service framework to build macro/micro services and do end-to-end entity matching and other steps in the EM space. We envision CloudMatcher to be fast, easy-to-use, scalable and highly available service on the web. Our CloudMatcher code is being deployed at American Family Insurance, a Fortune 500 company.
Before graduate school, I worked for 7 years in the insurance sector as a software engineer where my last stint was at Humana Inc. Green Bay - WI. In 2007, I graduated with a Bachelors degree in Computer Sciences from Pt. Ravi Shankar Shukla University.
Data Integration, Entity Matching, Data Cleaning, HILDA
Towards building a cloud/crowd-based self-service framework to do Entity Matching(EM). A platform to support macro and micro services to perform different steps in the EM space.
Worked on VidyaMap project by integrating digital text in design-based science classes using D3, Java and MySQL.
Backend developer for Macademia application at UW Carbone Cancer Center. Developed WCF services to extract publication data from PubMed.
Worked on understanding the CoW (Copy on Write) behaviour of B-tree file system (Btrfs) and how isolation of data and metadata is done in Btrfs.
Working to deploy/build the CloudMatcher solution at AmFam to match customers across multiple databases and solve other matching usecases in the insurance domain.
Worked on extending the IceFS solution to isolate metadata in Ext3 file system dynamically based on the size of file system. Added space isolation: a cube(abstraction) will be allocated a specific number of block groups and changes can be done only by an administrator using an online tool. Enhanced user level tools (mke2fs, dumpe2fs, e2fsck, etc.).
Worked on developing and maintaining web-services and solutions for agent reporting, commissions and bonuses as a backend developer. Developed ETL SSIS packages and did performance enhancement of SQL queries and packages.
Worked as a Mainframe developer/production support analyst for CIGNA.