CS766 Computer Vision, Fall 2009

Computer Vision

CS766, Fall 2010
Time: Tu Th 2:30pm - 3:45pm
Room: COMP S&ST 1325

Professor: Li Zhang
Email: lizhang@cs.wisc.edu
Telephone: (608)262-5083
Office: COMP S&ST 6387
Office Hours: 3:50PM-4:50PM Thursday (or by appointment)

TA: Mark Chapman
Email: chapman@cs.wisc.edu
Telephone: (608)262-6600
Office: COMP S&ST 1301
Office Hours: 1:00-2:00PM Tuesday and Wednesday

The goal of computer vision is to compute properties of the three-dimensional world from digital images. Specific problems in this field include reconstructing the 3D shape of an environment, determining how things are moving, and recognizing familiar people and objects and their activities, all through analysis of images and videos.

This course will provide an introduction to computer vision, including such topics as image formation, feature detection, motion estimation, image mosaics, 3D shape reconstruction, and object recognition. Applications of these techniques include building 3D maps, creating virtual characters, organizing photo and video databases, human computer interaction, video surveillance, automatic vehicle navigation, and others.

Prerequisites
This course will be self-contained; students do not need to have computer vision background. Prerequisites are linear algebra, calculus, and C/C++ programming.
If you are not sure whether you can take it, please send me email or talk to me!

Textbooks
Forsyth & Ponce, Computer Vision: A Modern Approach, Pearson, 2002, ISBN 0130851981

Recommended readings
Richard Szeliski, Computer Vision: Algorithms and Applications (Incomplete draft), 2008.
L. Shapiro and G. Stockman, Computer Vision, Prentice-Hall, Upper Saddle River, N.J., 2000.
R. Hartley and A. Zisserman, Multiple View Geometry in Computer Vision, Cambridge University Press, Cambridge, UK, 2000.
M. Sonka, V. Hlavac and R. Boyle, Image Processing, Analysis, and Machine Vision, Brooks/Cole Publishing, 1999.
E. Trucco and A. Verri, Introductory Techniques for 3-D Computer Vision, Prentice-Hall, Upper Saddle River, N.J., 1998.
A. Watt and F. Policarpo, The Computer Image, Addison-Wesley, Harlow, UK, 1998.
R. Jain, R. Kasturi and B. G. Schunck, Machine Vision, McGraw-Hill, New York, 1995.
V. S. Nalwa, A Guided Tour of Computer Vision, Addison-Wesley, Reading, Mass., 1993.
O. Faugeras, Three-Dimensional Computer Vision, MIT Press, Cambridge, Mass., 1993.
B. K. P. Horn, Robot Vision, McGraw-Hill, New York, 1986.

Grading
The grade is based on one written assignment (5%), three programming projects (15% each), paper presentation (15%), and a final project (35%). There will be no exams. The final project is research-oriented: students propose their own project topics, subject to the instructor's approval. The final project can be a pure vision project or an application of vision technique in the student's own research area.

Administrative
Email list: compsci766-1-f09@lists.wisc.edu
Computer account: Everyone registered in this class will get a Computer Systems Lab account to do project assignments.