VIS | Computer Vision |
If you’ve ever wondered how Google earth builds 3D models of entire cities, how computers can automatically diagnose medical conditions from images, how robot vacuum cleaners can find their way around a house or how augmented reality headsets work, then this course is for you. Computer vision, through what is known as the ImageNet moment, sparked the current resurgence in AI and led to the mass adoption of Deep Learning in both industry and academia. It is also one of the fastest growing areas in computer science with exponentially increasing impact on companies across many industries. Computer vision systems are already used to solve problems in a wide range of domains such as:
- Augmented reality
- Smart agriculture
- Fintech and Insurtech
- Image and video indexing
- Automated inspection of equipment
- Smart retail management
- Home robots
- Automated quality control
- Virtual and automated training and skill development
This course covers the essentials for getting started with developing computer vision systems from a practitioner's perspective and focuses on how computer vision can be applied to solve real-world problems. We cover the basics of how images are formed, how to deal with multiple views of the same scene, how useful features and representations can be extracted from images. We will look at tasks such as 3D reconstruction, detecting objects, estimating the pose of humans and animals, and captioning images.
Frequency
This course will run twice a year.
Course dates
3rd February 2025 | Oxford University Department of Computer Science - Held in the Department | 0 places remaining. |
16th June 2025 | Oxford University Department of Computer Science - Held in the Department | 0 places remaining. |
10th November 2025 | Oxford University Department of Computer Science - Held in the Department | 0 places remaining. |
18th May 2026 | Oxford University Department of Computer Science - Held in the Department | 08 places remaining. |
Objectives
At the end of the course, students will:
- Understand the fundamentals and have working knowledge of computer vision
- Design a computer vision system to solve real-world problems
- Be aware of the broad range of applications of computer vision
Assessment criteria
The course assignment will determine the following:
- Have you understood the principles of computer vision: feature extraction, matching, correspondence estimation,epipolar geometry
- Have you demonstrated the ability to apply the fundamentals to solve common tasks such as: object detection, segmentation, and image captioning
- Can you reason about an unseen problem and present a solution, clearly discussing its strengths and weaknesses?
- Can you demonstrate fluency in Python, OpenCV and scikit-image?
Contents
- What computer vision is: introduction and overview
- How images are formed
- Projections and transformations
- Pixels and photometry
- Representation learning and feature extraction
- SIFT, CNNs, ResNet
- Matching and correspondence estimation
- Semantics and high-level vision:
- Object detection and tracking
- Semantic segmentation and human pose estimation
- Image captioning and tagging
- Multi-view computer vision:
- 3D reconstruction and stereo
- Basics of epipolar geometry
- Applications and case study
Requirements
This course is related to DNN and CML, but neither is a formal prerequisite. Useful concepts to know are: convolutional neural networks and the training loop, from DNN; and loss and reward functions, and different training methods (unsupervised, supervised etc), from CML. So if taking either of DNN or CML as well as VIS, it would be better to take VIS last.