Computer Science & Engineering Department

CS773C Machine Intelligence Advanced Applications

Spring 2008: Object Recognition

Meets: TR 1:00pm - 2:15pm (SEM 257)

Instructor: Dr. George Bebis

Email: bebis@cse.unr.edu
Phone: 784-6463
Office: 235 SEM
Office Hours: MW 1:00pm - 2:30pm or by appointment.

Prerequisites

Good background in image processing (CS674), computer vision (CS685), pattern recognition (CS679), linear algebra, probabilities, and statistics.

Texts

We will not use any text in this course; all of the material will be drawn from lecture notes and research papers.

Useful Texts

Emanuele Trucco, Alessandro Verri, Introductory Techniques for 3-D Computer Vision, Prentice Hall, 1998.
Forsyth and Ponce, Computer Vision - A modern approach, Prentice Hall, 2002.
Shapiro and Stockman, Computer Vision, Prentice Hall, 2001.

Computer Vision Resources

Object Recognition Resources

Object Recognition Challenges and Datasets

Segmentation Datasets and Benchmarks

The Berkeley Segmentation Dataset and Benchmark

Useful Software

Intel Computer Vision Library (OpenCV)
Image processing and computer vision algorithms optimized to run on Intel microprocessors.
Matlab
Matlab(1) is a numeric computation and visualization environment. The image processing and signal processing toolboxes are especially useful. See also: Matlib Tutorial (Univ Utah), Matlab Basics (RPI), Matlab Primer (200K postscript; 25 pages).
Interest Point Detectors - Local Descriptors (by Gyuri Dorko).
Lear's Software
Daniel Huttenlocher's page (has code for various algorithms based on his research)

Description and Objectives

Recognizing objects from images has been a challenging task in computer vision. This is because objects may look very different from different viewing positions. The most successful approach is in the context of "model-based" object recognition, where the environment is rather constrained and recognition relies upon the existence of a set of predefined model objects. Given an unknown scene, recognition implies: (i) the identification of a set of features from the unknown scene which approximately match a set of features from a known view of a model object, (ii) the recovery of the geometric transformation that the model object has undergone (i.e., pose recovering) and, (iii) verification that other features coincide with predictions. Since usually there is no a-priori knowledge of which model points correspond to which scene points, recognition can be computationally too expensive, even for a moderate number of models. Our goal in this course would be to study several well known techniques in object recognition.

This course is primarily intended for highly motivated students interested in doing research in object recognition and computer vision in general. It will be essential for students to have a solid understanding of basic topics in math, such as linear algebra, probability and statistics, and calculus. It will also be useful to have some knowledge of computer vision, image processing, and geometry. In general, the more math a student knows, the easier the course will be.

Topics

Image Formation and Perspective Projection
Approximations to Perspective Projection
Segmentation and Feature Extraction
2D Object Recognition Using Geometric Models
3D Object Recognition Using Geometric Models
Object Recognition Using Appearance Models
Grouping
Error Analysis

Course Requirements

This course is primarily intended for highly motivated students interested in doing research in object recognition and computer vision in general. It will be essential for students to have a solid understanding of basic topics in math, such as linear algebra, probability and statistics, and calculus. It will also be useful to have some knowledge of computer vision, image processing, and geometry. There would be no exams in this course. Grading will be based on paper presentations, reports, class participation, and a project. Details are provided in the course syllabus.

Syllabus

Spring 2008 Syllabus

Schedule of Presentations

Course Objectives and Requirements

Perspective projection

Introduction to Object Recognition

Geometric Hashing

Object Recognition Using Algebraic Functions of Views

Object Recognition Using Genetic Algorithms

Color-based object recognition

Building local part models for category-level recognition

reading material:

Normalized Cuts and Image Segmentation

Trainable visual models for object classification

reading material:

Learning to Detect Natural Image Boundaries Using Local Brightness, Color and Texture Cues

Trainable visual models for object classification

reading material:

Distinctive Image Features from Scale-Invariant Keypoints

Generative Models for Visual Objects and Object Recognition via Bayesian Inference

reading material:

Learning and Recognizing Visual Object Categories

(Daniel Huttenlocher, Cornell University) (reading material: ORP [8])

3/13

A Component-based Framework for Face Detection and Identification

Wide base-line stereo matching based on local, affinely invariant regions

Object Recognition Using Local Affine Frames on Maximally Stable Extremal Regions

Visual Categorization with Bags of Keypoints

reading material:

Video Google: Efficient Visual Search of Videos

Learning shared representations for Object Recognition

(Antonio Torralba, MIT) (reading material: ORLD [4])

4/15 Video Lecture:

Learning Visual Distance Function for Object Identification from one Example

reading material:

Discriminative Training for Object Recognition Using Image Patches

reading material:

Pascal Challenge 101 Objects

(Chris Williams, University of Edinburg) (reading material: DIOR [1])
4/22 Video Lecture:

Overview of the Challenge and Results

(Mark Everingham, University of Oxford) (reading material: DIOR [1])

4/24

Scale and Affine Invariant Interest Point Detectors

Learning issues in image segmentation

(Joachim M. Buhmann, Institute of Computational Science) (reading material: )

5/1

Perceptual Grouping of Natural Shapes in Cluttered Backgrounds

Learning issues in image segmentation

(Joachim M. Buhmann, Institute of Computational Science) (reading material: )

5/9 Final Reports, Presentations and Demos

Video Lectures (VL)

Energy-based models and Learning for Invariant Image Recognition

Graph Based Shapes Representation and Recognition

Object Identification by Statistical Methods

Machine Learning in Vision

Machine Learning, Probability and Graphical Models

Papers

10.

"Comparison of Generative and Discriminative Techniques for Object Detection and Classification"

11.

"Shape Matching and Object Recognition"

12.

"Multi-view matching for unordered image sets, or How do I organize my holiday snaps?",

13.

"Visual Categorization with Bags of Keypoints",

14.

"Learning Visual Similarity Measures for Comparing Never Seen Objects",

15.

"Local Greyvalue Invariants for Image Retrieval",

16.

"Object Class Recognition by Unsupervised Scale-Invariant Learning",

17.

"A visual category filter for google images",

18.

"An Affine Invariant Interest Point Detector ",

19.

"Indexing Based on Scale Invariant Interest Points",

20.

"Semi-Local Affine Parts for Object Recognition",

21.

"Toward True 3D Object Recognition",

22.

"Unsupervised Learning of Models for Recognition",

23.

"Scale and Affine Invariant Interest Point Detectors",

24.

"A Comprison of Affine Region Detectors",

25.

"Discriminative training for object recognition using image patches",

Object Recognition using Parts (ORP)

"Components for Object Detection and Identification

"A Component-based Framework for Face Detection and Identification",

"Learning to Detect Objects in Images via a Sparse, Part-Based Representations",

"Shape Matching and Object Recognition Using Shape Contexts,

"Object Recognition Using Locality-Sensitive Hashing of Shape Contexts",

"3D Object Modeling and Recognition Using Affine-Invariant Patches and Multi-View Spatial Constraints",

"Visual Classification by a Hierarchy of Extended Fragments",

"Pictorial Structures for Object Recognition",

Object Recognition by Combining Geometric and Appearance Models (ORGAP)

"Object Recognition by Combining Appearance and Geometry",

"Fusing shape and appearance information for object category detection",

Object Categorization (GOR)

"Generic Object Recognition with Boosting",

"One-Shot Learning of Object Categories",

Dataset Issues in Object Recognition (DIOR)

"Dataset Issues in Object Recognition",

Object Recognition Applications (ORA)

"Recognizing Groceries in situ Using in vitro Training Data",

here

"Industry and Object Recognition: Applications, Applied Research and Challenges",

Project Topics

Department of Computer Science & Engineering, University of Nevada, Reno, NV 89557
Page created and maintained by: Dr. George Bebis (bebis@cse.unr.edu)

Computer Science & Engineering Department

CS773C Machine Intelligence Advanced Applications

Spring 2008: Object Recognition

Meets: TR 1:00pm - 2:15pm (SEM 257) Instructor: Dr. George Bebis Email: bebis@cse.unr.edu Phone: 784-6463 Office: 235 SEM Office Hours: MW 1:00pm - 2:30pm or by appointment.

Prerequisites

Texts

Useful Texts

Computer Vision Resources

Object Recognition Resources

Object Recognition Challenges and Datasets

Segmentation Datasets and Benchmarks

Useful Software

Description and Objectives

Topics

Course Requirements

Syllabus

Schedule of Presentations

Video Lectures (VL)

Papers

Review Papers (REV)

2D Object Recognition using Geometric Models (2DORGM)

3D Object Recognition using Geometric Models (3DORGM)

Pose Clustering (PC)

Segmentation (S)

Grouping (G)

Object Recognition using Local Descriptors (ORLD)

Object Recognition using Parts (ORP)

Object Recognition by Combining Geometric and Appearance Models (ORGAP)

Object Categorization (GOR)

Dataset Issues in Object Recognition (DIOR)

Object Recognition Applications (ORA)

Project Topics

Meets: TR 1:00pm - 2:15pm (SEM 257)

Instructor: Dr. George Bebis

Email: bebis@cse.unr.edu
Phone: 784-6463
Office: 235 SEM
Office Hours: MW 1:00pm - 2:30pm or by appointment.