764 Project ideas
These are the list of 764 project ideas. If you like to discuss about the projects, talk to the respective persons assigned. Refer to the list of
references before coming and talking to the assigned persons.Incorporating alternatives for evaluating Star Joins
Generating a query plan that considers the existing bitmap and projection indexes. Consider different kinds of query plans such as left-deep or right-deep query trees. This involves coming up with cost formulas for the different alternatives and incorporating these into the optimizer so that a good plan is chosen.
Contact Person: Amit Shukla (
samit@cs.wisc.edu)
Execution of Star Join
This set of projects involves implementing various operators for executing star joins. This could be using bitmap join indexes with fact file scan or right deep query trees. There are some other combinations possible such as combining bitmap indexes with projection indexes. More thought is needed on this.
Contact Person: Yihong Zhao (
zhao@cs.wisc.edu)
Bulkloading bitmap indexes
Given a table create a bitmap index for that. Explore efficient ways of doing this. Some possibilities are: building few bitmaps (as many can be held in memory) at a time and using multiple scans, successive refinement of equiwidth, creating bitmaps in segments using single scan (might lead to unclustered indexes).
Contact Person: Karthikeyan Ramasamy (
karthik@cs.wisc.edu)
Implementing bitmap operations
Bitmaps can be stored in compressed or uncompressed format depending on selectivity. The operations should take care of these different formats for the input bitmaps. There should be a way for dynamically selecting the representation for the output bitmap depending on the operation and the expected selectivity of the output.
Contact Person: Prasad Deshpande (
pmd@cs.wisc.edu)
Incorporating Materialized Views
This involves rewriting the queries to make use of materialized views. The optimizer should consider alternative forms for the query and chooses a form with the least expected cost.
Contact Person: Prasad Deshpande (
pmd@cs.wisc.edu)
DBA Tool
This project involves developing a frontend for the DBA in order to manage the creation of indexes, estimation of sizes, selection of views, precomputing them and identifying query profiles.
Contact Person: Amit Shukla (
samit@cs.wisc.edu)
SQL Generator
Study the type of queries an OLAP user might require. Consider some multidimensional data model and see if it is useful. Write to tool to finally convert the requests into SQL queries.
Contact Person: Prasad Deshpande (
pmd@cs.wisc.edu)
Multidimensional API specification in Paradise
This project involves studying the OLAP council specification for multidimensional API and implementing it on top of the Paradise cursor.
Contact Person: Yihong Zhao (
zhao@cs.wisc.edu)
Concurrency Control / Recovery for Fact File
Fact File is an efficient file organization supporting skipped sequential access. It uses a extent based tree with leaf extents chained. This project requires to come up with a CC / Recovery protocol and implement it.
Contact Person: Karthikeyan Ramasamy (
karthik@cs.wisc.edu)
Supporting updates in Fact File
Currently, fact file provides a single dimensional array like interface. Using this interface, we can read, update and delete nth record. Upon deletion, we allow the existence of holes. This project requires to implement an efficient mechanism for managing and reusing the holes. A related component associated with this project is appending records to fact file.
Contact Person: Karthikeyan Ramasamy (
karthik@cs.wisc.edu)
Implementing JDBC for Paradise
Studying JDBC specification and alternative methods of implementing them either map the calls to Java native methods on top of Paradise cursor class or make changes on the server side so that it directly returns data in the format that Java uses. Implement the best approach.
Contact Person: Karthikeyan Ramasamy (
karthik@cs.wisc.edu)Implementing Histograms for Paradise
Histograms are used to estimate the result size of a query. They aid the optimizer to chose the cheapest plan. This project aims to incorporate histograms into Paradise optimzer. You have to implement both the statistics collection and incorporate it into the optimizer.
Contact Person: Navin Kabra (
navin@cs.wisc.edu)