This course gives an overview of fundamental methods in computer vision from a computational perspective. Methods include computation of 3-D geometric constraints from binocular stereo, motion, texture, shape-from-shading, and photometric stereo. Edge detection and color perception are studied as well. Elements of machine vision and biological vision are also included.
Announcements
- 05/09/2008: TA Office hours to collect/check final exams (All in Clark 317):
- Friday, 05/09/2008: 2 p.m. to 5 p.m.
- Monday, 05/12/2008: 2 p.m. to 4 p.m.
- 05/09/2008: A solution to the final can be found HERE. Please have a look at it before coming to contest your marks for the final.
- 05/05/2008: Review session for the final on Tuesday 05/06/2008 in Shaffer 202 at 4 p.m.
- 05/02/2008: Homework 4 solution is up!
- 05/01/2008: Homework 3 solution is up!
- 04/23/2008: You can find all the individual frames of the project video in PNG format HERE
- 04/22/2008: A few important announcements regarding the project presentations:
- Every student in every group has to present for 5 minutes each. Hence there would be 10 minutes per group of two students and 15 minutes for the group of three students. After the presentation, 5 minutes will be left for questions for every group. Students are advised to STRICTLY adhere to the 5 minute limit per student otherwise marks will be deducted for every minute over the limit.
- The presentations will be held in class Monday April 28 and Wednesday April 30. Every group is required to be ready to present their work on Monday. The groups that will eventually present on Monday will be chosen by a draw in class. This makes it fair for all the students to have the same amount of time to prepare the presentations.
- A member from every group is advised to come half an hour before class on both days with their laptops so that their laptops can be tested to work with the presentation equipment in the class rooms. If your laptops do not work at the time of your presentation, any time that you take to reconfigure/trouble-shoot/reboot etc will be taken out of your presentation time.
- The detailed 6 page report (title, abstract, intro, problem description, proposed solution, experimental evaluation, conclusions, references), would be due Wednesday April 30, 6 p.m. in class. Your report must conform to the same document style as in this document. i.e. it must be 6 pages, double column, single line spacing and Times New Roman font size 10 document. These specifications MUST be adhered to and failure will result in loss of marks. Microsoft Word and LaTeX templates can be found HERE
- 04/16/2008: Newly added to Useful Computer Vision resources at the bottom of the page:
- Dictionary of Computer Vision and Image Processing
- HIPR-2: Image Processing Learning Resources
- 04/10/2008: The project description can be found under Project in the Grading section.
- 04/08/2008: Here is the final list of the groups for the final project.
Group |
Students |
1 |
Gagan Bansal, Sandeep Mullur |
2 |
Christopher McFarland, Tiffany Chen |
3 |
Giancarlo Troni, Raphael Sznitman |
4 |
Bogdan Vigaru, Michael Kutzer |
5 |
Samantha Mercer, Aru Sahni, Chuan Huang |
6 |
Steve Swedish, Steve Barolak |
7 |
Pin Wu, Sneha Verma |
8 |
Ehsan Elhamifar, Roberto Tron |
9 |
Jeffrey Cheng, Kapil Dalwani |
10 |
Yue Xing, Yin Chen |
11 |
Eric Lin, Chris Ecker |
- 04/02/2008: Check the Handouts section for a very good reference for Lucas Kanade based Tracking.
- 04/01/2008: IMPORTANT: The full Indian traffic videos for the Project can be found HERE (compressed) and HERE (uncompressed - WARNING 500 MB file)
- 04/01/2008: An updated solution of Homework 1 with complete figures is available now. Please check the updated link.
- 03/26/2008: For the final project, if you have already decided the person you are going to work with (at most 2 persons per group), then email the TA your name and your group partner's name. For people who have not found a group partner yet, here's a current list of people looking for group partners. Once you have decided who you want to work with, email the TA your names so that your name can be removed from the list of people looking for partners to the list of decided groups. If you do not see your groups then email the TA ASAP.
- 03/26/2008: A solution to Homework 1 can be found HERE. Some diagrams for Q 2(a) are missing and will be updated ASAP.
- 03/24/2008: A solution to homework 2 can be found HERE.
- 03/24/2008: Class timings and locations have changed and there are several changes. Please check the individual class timings and locations in the table.
- 03/14/2008: Assignment 4 has been posted!
- 03/08/2008: For people who could not finish their filter implementation in Assignment 2, here is the code for the S, LM and RFS filters and a function that gives the output of the filters when applied to an image. It is not necessary that you use this code if your own code works fine.
- 03/05/2008: Assignment 3 has been updated again. Please download the latest version.
- 03/04/2008: Assignment 3 has been updated. Please download the new version.
- 03/02/2008: Assignment 3 has been posted!
- 02/18/2008: The updated Assignment 2 is now AVAILABLE.
- 02/17/2008: Assignment 2 is temporarily unavailable and will be updated soon.
- 02/16/2008: Assignment 2 has been posted!
- 02/06/2008: Assignment 1 has been updated and question 1(a) has been replaced. Please download the updated version of the assignment from the Homeworks section below.
- 02/01/2008: Assignment 1 has been posted!
Class notes
Most of the slideds used in class are based on slides available on the web at the following websites:
- Prof. Gregory Hager: Computer Vision, Johns Hopkins University, Fall 2006
- Prof. Ko Nishino: Introduction to Computer Vision, Drexel University, Winter 2008
- Profs. David Forsyth and Jean Ponce - Computer Vision: A Modern Approach book slides
- Prof. Pietro Perona, Visual Recognition, Caltech, Spring 2007
- Prof. Steve Seitz, Computer Vision, Washington University, Winter 2008
The copyright of the slides belongs to the respective authors.
Textbooks
- Forsyth and Ponce (FP): Computer Vision a Modern Approach
- Trucco and Verry (TV): Introductory Techniques for 3-D Computer Vision
- Shapiro and Stockman (SS): Computer Vision
Syllabus
- Cameras
- FP1, TV 2.2.2, SS 2.2
- Photometry
- Radiometry: FP 4, TV 2.2.3, SS 2.1
- Shading: FP 5, TV 9, SS 6.6
- Color: SS6, FP6
- Image Enhancement and Filtering
- Enhancement: SS 5.1-5.2
- Filtering: FP 7, 8.1-8.2, TV 3, SS 5.3-5.4
- Edge detection: SS 5.6-5.8, FP 8.3, TV 4.2
- Corner detection: TV 4.3
- Texture
- FP9, SS7, TV 9.5.1
- Image Segmentation
- K-means and EM: FP 14.4
- Spectral clustering and NCut: FP 14.5
- Color Segmentation: SS 6.5
- Texture Segmentation: SS 7.4
- Motion, Optical Flow, Image Registration and Matching
- TV 8
- Structure from Motion
- Camera Calibration: FP 2-3, TV 2.4, 6
- Stereo: FP 11
- Two-View Geometry: FP 10.1
- Affine SFM: FP12
- Object Recognition
Grading
- Homeworks (40%)
- HW1: Shape from Shading (DUE DATE: February 15)
Face Data, Sphere Data
SOLUTION
- HW2: Image Filtering, Edge Detection, and Texture Classification (DUE DATE: February 29)
CURET dataset
SOLUTION
- HW3: Image Segmentation (DUE DATE: March 14)
Images for Intensity segmentation
Images for Color segmentation
Images for Texture segmentation
Datasets for clustering
SOLUTION
- HW4: Image Mosaicing (DUE DATE: April 4)
Images for Lucas Kanade Tracker
Russian glass plates
Mosaicing problem data sets: Set 1 Set 2 (Right click on these links and choose "Save As" to save the MAT files)
SOLUTION
- Project (30%)
- Description: title and problem description (April 9)
- Progress report: 3 pages (title, abstract, intro, problem description, proposed solution) (April 18)
- Presentations: 10 min + 3 min questions (April 28-30)
- Final report: 6 pages (progress report + proposed solution, experimental evaluation, conclusions, references) (April 30)
- Exam (30%)
- Thursday May 8th (2-5PM)
Administrative
- Late policy:
- Homeworks and projects are due on the specified dates.
- No late homeworks or projects will be accepted.
- Honor policy:
The strength of the university depends on academic and personal
integrity. In this course, you must be honest and truthful. Ethical
violations include cheating on exams, plagiarism, reuse of
assignments, improper use of the Internet and electronic devices,
unauthorized collaboration, alteration of graded assignments, forgery
and falsification, lying, facilitating academic dishonesty, and unfair
competition.
- Homeworks and exams are strictly individual
- Projects can be done in teams of two students
Useful Computer Vision Resources