Brief Introduction
Learn how to design robot vision systems that avoid collisions, safely work with humans and understand their environment.
Description
How do robots “see”, respond to and learn from their interactions with the world around them? This is the fascinating field of visual intelligence and machine learning. Visual intelligence allows a robot to “sense” and “recognize” the surrounding environment. It also enables a robot to “learn” from the memory of past experiences by extracting patterns in visual signals.
You will understand how Machine Learning extracts statistically meaningful patterns in data that support classification, regression and clustering. Then by studying Computer Vision and Machine Learning together you will be able to build recognition algorithms that can learn from data and adapt to new environments.
By the end of this course, part of the Robotics MicroMasters program, you will be able to program vision capabilities for a robot such as robot localization as well as object recognition using machine learning.
Projects in this course will utilize MATLAB and OpenCV and will include real examples of video stabilization, recognition of 3D objects, coding a classifier for objects, building a perceptron, and designing a convolutional neural network (CNN) using one of the standard CNN frameworks.
Knowledge
- The fundamentals of image filtering and tracking, and how to apply those principles to face detection, mosaicking and stabilization
- How to use geometric transformations to determine 3D poses from 2D images for augmented reality tasks and visual odometry for robot localization
- How to recognize objects and the basics of visual learning and neural networks for the purpose of classification