course-details-portlet

TKP4198

Data Science and Machine Learning in Natural Sciencesogy

New from the academic year 2025/2026

Credits 7.5
Level Second degree level
Course start Autumn 2025
Duration 1 semester
Language of instruction English
Location Trondheim
Examination arrangement Project

About

About the course

Course content

This course offers a concise yet comprehensive dive into ML’s integration and utility in the natural science domain. Participants will learn about clustering and dimensionality reduction to the group and visualize high-dimensional data, machine learning methods such as Quantitative Structure-Activity Relationship (QSAR) for molecular property prediction and classification, physics-informed methods for combining physics principles with machine learning, and surrogate models to accelerate model run time. The course equips attendees with the skills and knowledge to apply machine learning effectively in their respective fields by combining theory with practical examples. Additionally, students will develop an understanding of the relationship between ML principles and traditional fields such as mathematics, chemometrics, and automatic control, thereby demystifying machine learning and providing a clear path to its practical application in natural sciences.

Learning outcome

Learning Outcomes:

  1. Understand the relationship between data science principles and traditional fields such as chemometrics and automatic control and effectively translate this understanding into developing machine learning models to address practical challenges in the natural sciences.
  2. Apply machine learning methodologies, incorporating domain-specific knowledge, to solve natural science problems, ensuring the models are reliable and applicable in world contexts.

Learning Objectives - After completing this course, you will be able to:

  • Program in Python or Julia within a Jupyter Notebook environment.
  • Load CSV files into Python Data Frames, visualize variables, select columns, summarize data, and fill in missing data.
  • Solve supervised learning tasks like regression and classification.
  • Solve unsupervised learning tasks, such as clustering, dimensionality reduction, and density estimation.
  • Understand and use the mathematical structure of deep neural networks and their applications to regression and classification.
  • Understand basic automatic control theory and use this in data science and machine learning contexts. Be able to set up simple control structures.
  • Develop and apply models that combine physical knowledge with machine learning to enhance predictive power.
  • Formulate and solve the molecular property prediction problem as a regression/classification task.
  • Formulate and solve time series forecast problems as a regression task.

Learning methods and activities

Problem-driven learning takes place weekly (4 hours) with associated exercises and guidance (2 hours)

Compulsory assignments

  • Compulsory exercises

Further on evaluation

Report and presentation from the project work counts 100% of the grade. 6 excersices must be approved in order to get the grade.

Subject areas

  • Technological subjects

Contact information

Course coordinator

Lecturers

Department with academic responsibility

Department of Chemical Engineering

Examination

Examination

Examination arrangement: Project
Grade: Letter grades

Ordinary examination - Autumn 2025

Project
Weighting 100/100 Exam system Inspera Assessment