My main project is Quickstep, which aims to build a high-performance converged analytics data platform that leverages the full capabilities of modern hardware, including large main memory, NVRAM storage, and multi-core/multi-socket processors. Quickstep targets application surfaces including SQL (for data warehousing applications), graph analytics, document stores, and machine learning. While the dual goals of high performance and a broad application surface sound ambitious, we have shown that there are big benefits to starting with an extended relational kernel and building graph analytics (see Grail), document stores (see Argo), and various machine learning methods (see QuickFOIL, GLMs over normalized data) on that core kernel. These results support both the feasibility and the advantages of the overall vision. Bringing these aspects together in a single system is the current focus of the Quickstep project.

Quickstep is an Apache (incubating) open-source project.

My CV is here.

