CS280: Computer Vision

Description: [SCS dragon logo]

CS280: Computer Vision
Computer Science Division
University of California Berkeley

Location: 306 SODA Hall

Time: Wednesday & Friday, 10:30AM - 12:00PM

Previous sites: http://inst.eecs.berkeley.edu/~cs280/archives.html

INSTRUCTOR: Prof. Alyosha Efros
INSTRUCTOR: Prof. Jitendra Malik
INSTRUCTOR: Prof. Stella Yu, OH: Wed. 12-1p, Soda 329
GSI: Pulkit Agrawal, OH: Fri. 9:30a-10:30a, Soda 651
GSI: Yi Wu, OH: Fri. 2-3p, Cory 367 (Start from Feb 2.)
UNITS: 3
SEMESTER: Spring 2018

COURSE OVERVIEW
Syllabus

Computer vision seeks to develop algorithms that replicate one of the most amazing capabilities ofthe human brain – inferring properties of the external world purely by means of the light reflectedfrom various objects to the eyes. We can determine how far away these objects are, how they areoriented with respect to us, and in relationship to various other objects. We reliably guess theircolors and textures, and we can recognize them - this is a chair, this is my dog Fido, this is a pictureof Bill Clinton smiling. We can segment out regions of space corresponding to particular objectsand track them over time, such as a basketball player weaving through the court.

In this course, we will study the concepts and algorithms behind some of the remarkable suc-cesses of computer vision – capabilities such as face detection, handwritten digit recognition, re-constructing three-dimensional models of cities, automated monitoring of activities, segmentingout organs or tissues in biological images, and sensing for control of robots. We will build thisup from fundamentals – an understanding of the geometry and radiometry of image formation,core image processing operations, as well as tools from statistical machine learning. On completingthis course a student would understand the key ideas behind the leading techniques for the mainproblems of computer vision - reconstruction, recognition and segmentation – and have a sense ofwhat computers today can or can not do.

TOPICS TO BE COVERED

Introduction - The Three R's - Recognition, Reconstruction, Reorganization
Static Perspective - the pinhole camera model
Transformations - rotation, translation, affine and projective
Dynamic perspective and optical flow
Radiometry of image formation
Basic image processing operations - filters, features and flow
Biological visual processing - retina, V1 and beyond
The feedforward model of visual processing - convolutional networks
Object recognition case study - Identifying digits with multiple approaches
Recognizing objects in scene - sliding windows and object proposals.
Feature Histograms
Convolutional Neural Network (ConvNet) based approaches to visual recognition of objects and scenes
Attributes, pose and actions
Controur detection and bottom-up segmentation, Gestalt grouping heuristics
Semantic Segmentations - instance segmentation and pixel classification
3D reconstruction from multiple views
3D reconstruction from pictorial cues
Scene understanding from RGBD images
Face Recognition
Video Analysis

Homeworks

Transformation, package, due on Feb 4, 2018
Filter and Edget Dectection, package, due on Mar 6, 2018
Deep Learning, TBA (tentatively release on Mar 7)
3D Reconstruction, TBA
TBD