Class Project
To do
- Mar 6: send me a short email that lists the members
of your team (1-3 persons) and briefly describes what the
team wants to do. If I deem a project not well-suited for
764, I will let you know.
- Around March 27: I will meet with each team to discuss
its progress, and make suggestions.
- Each team will present their project
in the May 8-10 period. The presentation will be either in
the class (if we still have slots), or to me.
- Each team submits a project report (and optionally
the code). Date to be decided.
Topics
The default project is to extend a community
management system such as DBLife, drawing from the lessons we
learn in the class on relational data management. Please feel free to
propose other ideas.
On DBLife, you can
- browse and study the system, and find something that the current
system is not doing well, or think of some feature that it would be
great to add to the current system.
- think about (a) a plausible line of attack, (b) logistics: how much
code you need from DBLife, how much data, etc., how much time you have.
- if all appear okay, then you have a possible project topic. Pls let
me know.
If your team does a good job, we would love to have your project result being
incorporated into the live DBLife system.
Possible DBLife topics (not an exclusive list), will be discussed in class:
- Data provenance: if DBLife infers that X is related to Y, we want to know
why.
- Keyword search: it currently is searching only over pages, people,
and papers, and it doesn't do a very good job. Can you improve and expand
its coverage?
- Citations: add citation information to each paper, each person, and also
provide aggregate citation counts, rankings.
- Pub recognizer: build a little "engine" that recognizes that there is a publication
in a given page, and extracts names, paper titles, etc. from it.
- Picture retrieval: improve the way current pictures are obtained and voted upon.
- Research summary: build a summary of research for each author, by examining
his or her publication page.
- Wiki: add wiki capabilities, so that users can correct data.
- Add data sources: learn how to recognize important new data sources, and add
them to the system.
- more soon ...
Logistics: you can have access to anything in DBLife: code, data, etc. However, note
that you can do a project on DBLife without touching its code and data. For example,
if you work on research summary, you can try to just experiment over DBLP pages first.