course-details-portlet

TDT4265

Computer Vision and Deep Learning

Choose study year

Credits 7.5

Level Second degree level

Course start Spring 2022

Duration 1 semester

Language of instruction English

Location Trondheim

Examination arrangement Portfolio assessment

About the course

Course content

Computer vision techniques build the basis for an automatic understanding and interpretation of digital images that today can be recorded with a multitude of different imaging devices (e.g. mobile phones, webcams, 3D depth-images, MRI, Ultrasound... ).

The content of the course ranges from the classical feature extraction and classification approach of vision to the more modern machine / deep learning based way of making sense of images and video. The course also contains a short summary of the programming skills and mathematical background needed as well as a recap of basic image processing & analysis methods in order to make sure that everybody is on the same page.

Topics covered are the human eye and the image formation process, filtering in the spatial and frequency domain, segmentation and mathematical morphology, Hierarchical Structures and Scale space, Principal Components Analysis (PCA), feature extraction (e.g. Harris, SIFT) and matching (e.g. RANSAC), unsupervised and supervised classification & recognition as well as various machine learning methods, stereo vision and essential & fundamental matrixes, optical flow, tracking (Kalman/Particle) and last but not least deep learning for vision, i.e. fully connected feedforward NNs (Shallow ANNs, forward pass, matrix-based notation and activation functions etc.), learning (cost functions, gradient descent and backpropagation etc.), generalization (overfitting, regularization, initialization, hyper-parameters, vanishing gradients / unstable gradients and deep NNs etc.), CNNs and Image Classification (different layers, especially the conv layer, learning features and sharing parameters, AlexNet and later versions etc.), object detection and semantic segmentation (R-CNN, Fast R-CNN, Faster R-CNN, R-FCN, YOLO / YOLO v2, SSD and Mask R-CNN etc.).

Examples will be taken from key application domains like medicine, autonomy (drones, cars and ships) & robotics, industrial inspection etc.

Learning outcome

The course provides an overview and understanding of several fundamental techniques in Computer Vision. Advanced knowledge in this field is becoming increasingly more and more important. This is in particular true in view of the ever increasing availability of cameras and other imaging devices in nearly all areas of our society. The course helps to build the skills to design and construct advanced computer vision modules that function within a system to achieve the vision system's goals. Application fields include industrial areas, autonomy (drones, cars and ships), robotics and medical image analysis. The learned subjects can be the basis of employment in industry or the public sector, or could be followed for doctoral research in Norway or overseas.

Learning methods and activities

Lectures and exercises. Lectures will be given in English. An important part of the exercises is a project that address a real-world problem.

Further on evaluation

Portfolio assessment is the basis for the grade in the course. The portfolio includes a final written exam 60% and exercises 40%. The results for the parts are given in %-scores, while the entire portfolio is assigned a letter grade. Achieved points for exercises may be used for a possible later examination. If there is a re-sit examination, the examination form may change from written to oral. The examination papers will be given in English only.

In the case that the student receives an F/Fail as a final grade after both ordinary and re-sit exam, then the student must retake the course in its entirety. Submitted work that counts towards the final grade will also have to be retaken.

Recommended previous knowledge

TDT4195 Visual Computing fundamentals or equivalent.

Course materials

Book: Digital Image Processing, Rafael C. Gonzalez, Richard E. Woods (Publisher: Pearson)

Book: Neural Networks and Deep Learning, Michael Nielsen (online)

Book: Deep Learning, Ian Goodfellow et. al. (online)

Credit reductions

Course code	Reduction	From
SIF8066	7.5 sp

This course has academic overlap with the course in the table above. If you take overlapping courses, you will receive a credit reduction in the course where you have the lowest grade. If the grades are the same, the reduction will be applied to the course completed most recently.

Subject areas

Informatics
Technological subjects

Contact information

Course coordinator

Frank Lindseth

Department with academic responsibility

Department of Computer Science

Examination

Examination arrangement: Portfolio assessment

Grade: Letter grades

Ordinary examination - Spring 2022

Work

Weighting 40/100 Date Submission 2022-05-06 Time Submission 14:00

Home exam (1)

Weighting 60/100 Date Release 2022-05-21
Submission 2022-05-21 Time Release 09:00
Submission 13:00 Duration 4 hours Exam system Inspera Assessment

Other comments
1) Merk at eksamensform er endret som et smittevernstiltak i den pågående koronasituasjonen.

Re-sit examination - Summer 2022

Work

Weighting 40/100

Home exam

Weighting 60/100 Duration 4 hours Exam system Inspera Assessment

All about examinations at NTNU

Språkvelger

Course - Computer Vision and Deep Learning - TDT4265

course-details-portlet

Computer Vision and Deep Learning

About

About the course

Course content

Learning outcome

Learning methods and activities

Further on evaluation

Recommended previous knowledge

Course materials

Credit reductions

Subject areas

Contact information

Course coordinator

Department with academic responsibility

Examination

Examination

Ordinary examination - Spring 2022

Work

Home exam (1)

Re-sit examination - Summer 2022

Work

Home exam