Navigation

  • Skip to Content
NTNU Home NTNU Home

ntnu.edu

  • Studies
    • Master's programmes in English
    • For exchange students
    • PhD opportunities
    • All programmes of study
    • Courses
    • Financing
    • Language requirements
    • Application process
    • Academic calendar
    • FAQ
  • Research and innovation
    • NTNU research
    • Research excellence
    • Strategic research areas
    • Innovation resources
    • PhD opportunities
  • Life and housing
    • Student in Trondheim
    • Student in Gjøvik
    • Student in Ålesund
    • For researchers
    • Life and housing
  • About NTNU
    • Contact us
    • Faculties and departments
    • Libraries
    • International researcher support
    • Vacancies
    • About NTNU
    • Maps
  1. Employees

Språkvelger

Norsk

Giampiero Salvi

Download press photo
Download press photo
Foto:

Giampiero Salvi

Professor
Department of Electronic Systems

giampiero.salvi@ntnu.no
Elektro B, B323, Gløshaugen
Google Scholar My profile at KTH
About Publications Teaching Outreach

About

Giampiero Salvi (Senior Member IEEE) is a Full Professor at the Department of Electronic Systems. He is a member of the Signal Processing Group. He is also an Associate Professor with KTH Royal Institute of Technology in Sweden. Professor Salvi has an M.Sc. in Electronic Engineering from La Sapienza University in Rome, Italy, and a PhD in Computer Science from KTH Royal Institue of Technology in Sweden.

Research Interests

  • Machine Learning
  • Speech Technology
  • Cognitive Systems

Current Projects

  • SCRIBE: Machine transcription of Norwegian conversational speech [link]
  • Teflon: Technology-enhanced foreign and second-language learning of Nordic languages [link]
  • NordTrans: Technology for automatic speech transcription in selected Nordic languages [link]
  • Center for Geophysical Forecasting [link]

Past Projects

  • Gesture learning and language acquisition in humanoid robots, Fundação para a Ciência e a Tecnologia, IST, Lisbon, Portugal
  • Biologically inspired statistical methods for flexible automatic speech understanding, Swedish Research Council, KTH, Stockholm, Sweden
  • Interactive Grounded Language Understanding, CHIST-ERA (EU) and Swedish Research Council, KTH, Stockholm Sweden. [link]

Competencies

  • Artificial Intelligence
  • Digital signal processing
  • Human machine interaction
  • Language Technology
  • Machine learning
  • Pattern Recognition
  • Signal processing
  • Speech recognition
  • artificial intelligence
  • big data
  • kunstig intelligens
  • probabilistic ai
  • probabilistisk ki
  • stordata

Publications

  • Chronological
  • By category
  • All publications registered in NVA

2025

  • Salvi, Giampiero. (2025) TeflonNorL2 NOCASA Challenge Dataset.
    Other
  • Yaroslav, Getman,; Tamás, Grósz,; Mikko, Kurimo,; Salvi, Giampiero. (2025) [2504.20678] Non-native Children's Automatic Speech Assessment Challenge (NOCASA).
    Academic chapter/article/Conference paper
  • Parsons, Phoebe Luree Turner; Bremnes, Heming Strømholt; Kvale, Knut; Svendsen, Torbjørn; Salvi, Giampiero. (2025) Effects of Prosodic Information on Dialect Classification Using Whisper Features.
    Academic chapter/article/Conference paper
  • Fan, Zijian; Cao, Xinwei; Salvi, Giampiero; Svendsen, Torbjørn. (2025) Improving Phone Recognition through Informed Initialization and Path-Aligned CTC Loss.
    Academic chapter/article/Conference paper
  • Cao, Xinwei; Fan, Zijian; Svendsen, Torbjørn; Salvi, Giampiero. (2025) Child speech assessment through large language model speech synthesis: Preliminary results.
    Academic chapter/article/Conference paper
  • Dymbe, Simen; Siniscalchi, Sabato Marco; Svendsen, Torbjørn; Salvi, Giampiero. (2025) Using Cross-Attention for Conversational ASR over the Telephone.
    Academic chapter/article/Conference paper
  • Rugayan, Janine Lizbeth Cabrera; Salvi, Giampiero; Svendsen, Torbjørn Karl. (2025) Optimizing ASR Models with Semantic Information.
    Academic chapter/article/Conference paper
  • Adiban, Mohammad; Stefanov, Kalin; Siniscalchi, Sabato Marco; Salvi, Giampiero. (2025) S-HR-VQVAE: Sequential Hierarchical Residual Learning Vector Quantized Variational Autoencoder for Video Prediction. IEEE transactions on multimedia
    Academic article
  • Parsons, Phoebe Luree Turner; Solberg, Per Erik; Kvale, Knut; Svendsen, Torbjørn Karl; Salvi, Giampiero. (2025) Adding Metadata to Existing Parliamentary Speech Corpus.
    Academic chapter/article/Conference paper
  • Parsons, Phoebe Luree Turner; Kvale, Knut; Svendsen, Torbjørn Karl; Salvi, Giampiero. (2025) Match ‘em: Multi-Tiered Alignment for Error Analysis in ASR.
    Academic chapter/article/Conference paper

2024

  • Salvi, Giampiero. (2024) Teflon at Fonetik 2024.
    Other
  • Salvi, Giampiero. (2024) Teflon at LREC-Coling 2024, Torino, Italy.
    Other
  • Salvi, Giampiero. (2024) Challenges collecting and sharing speech data from children.
    Other
  • Cao, Xinwei; Fan, Zijian; Svendsen, Torbjørn Karl; Salvi, Giampiero. (2024) A Framework for Phoneme-Level Pronunciation Assessment Using CTC. Interspeech
    Academic article
  • Fan, Zijian; Cao, Xinwei; Salvi, Giampiero; Svendsen, Torbjørn Karl. (2024) Towards Better Recognition of Spontaneous Children's Speech: Speaker-Clustering Fine-Tuning of Whisper. Machine Learning for Signal Processing
    Academic article
  • Quatra, Moreno La; Turco, Maria Francesca; Svendsen, Torbjørn Karl; Salvi, Giampiero; Orozco-Arroyave, Juan Rafael; Siniscalchi, Sabato Marco. (2024) Exploiting Foundation Models and Speech Enhancement for Parkinson’s Disease Detection from Speech in Real-World Operative Conditions. Interspeech
    Academic article
  • Kynych, Frantisek; Cerva, Petr; Zdansky, Jindrich; Svendsen, Torbjørn Karl; Salvi, Giampiero. (2024) A lightweight approach to real-time speaker diarization: from audio toward audio-visual data streams. EURASIP Journal on Audio, Speech, and Music Processing
    Academic article
  • Olstad, Anne Marte Haug; Smolander, Anna; Strömbergsson, Sofia; Ylinen, Sari; Lehtonen, Minna; Kurimo, Mikko. (2024) Collecting Linguistic Resources for Assessing Children’s Pronunciation of Nordic Languages. Proceedings of LREC
    Academic article

2023

  • Salvi, Giampiero. (2023) NTNU (Teflon) at NNL2P.
    Other
  • Salvi, Giampiero. (2023) TEFLON at SLaTE 2023.
    Other
  • Salvi, Giampiero. (2023) Second face-to-face Teflon meeting in Trondheim, June 2023.
    Other
  • Stenwig, Eline; Salvi, Giampiero; Rossi, Pierluigi Salvo; Skjaervold, Nils Kristian. (2023) Comparison of correctly and incorrectly classified patients for in-hospital mortality prediction in the intensive care unit. BMC Medical Research Methodology
    Academic article
  • Solberg, Per Erik; Cabello, Pablo Ortiz; Parsons, Phoebe; Svendsen, Torbjørn Karl; Salvi, Giampiero. (2023) Improving Generalization of Norwegian ASR with Limited Linguistic Resources.
    Academic chapter/article/Conference paper
  • Parsons, Phoebe; Kvale, Knut; Svendsen, Torbjørn Karl; Salvi, Giampiero. (2023) A character-based analysis of impacts of dialects on end-to-end Norwegian ASR.
    Academic chapter/article/Conference paper
  • Getman, Yaroslav; Phan, Nhan; Al-Ghezi, Ragheb; Voskoboinik, Ekaterina; Singh, Mittul; Grosz, Tamas. (2023) Developing an AI-Assisted Low-Resource Spoken Language Learning App for Children. IEEE Access
    Academic article
  • Adiban, Mohammad; Siniscalchi, Sabato Marco; Salvi, Giampiero. (2023) A step-by-step training method for multi generator GANs with application to anomaly detection and cybersecurity. Neurocomputing
    Academic article
  • Rugayan, Janine Lizbeth Cabrera; Salvi, Giampiero; Svendsen, Torbjørn Karl. (2023) Perceptual and Task-Oriented Assessment of a Semantic Metric for ASR Evaluation. Interspeech (USB)
    Academic article
  • Fan, Zijian; Cao, Xinwei; Salvi, Giampiero; Svendsen, Torbjørn Karl. (2023) Using Modified Adult Speech as Data Augmentation for Child Speech Recognition. Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing
    Academic article
  • Cao, Xinwei; Fan, Zijian; Svendsen, Torbjørn Karl; Salvi, Giampiero. (2023) An Analysis of Goodness of Pronunciation for Child Speech. Interspeech
    Academic article

2022

  • Rugayan, Janine Lizbeth Cabrera; Svendsen, Torbjørn Karl; Salvi, Giampiero. (2022) Semantically Meaningful Metrics for Norwegian ASR Systems. Interspeech (USB)
    Academic article
  • Abdelnour, Jerome; Rouat, Jean; Salvi, Giampiero. (2022) NAAQA: A Neural Architecture for Acoustic Question Answering. IEEE Transactions on Pattern Analysis and Machine Intelligence
    Academic article
  • Getman, Yaroslav; Al-Ghezi, Ragheb; Voskoboinik, Ekaterina; Grósz, Tamás; Kurimo, Mikko; Salvi, Giampiero. (2022) wav2vec2-based Speech Rating System for Children with Speech Sound Disorder. Interspeech (USB)
    Academic article
  • Stenwig, Eline; Salvi, Giampiero; Rossi, Pierluigi Salvo; Skjaervold, Nils Kristian. (2022) Comparative analysis of explainable machine learning prediction models for hospital mortality. BMC Medical Research Methodology
    Academic article

2021

  • Shahrebabaki, Abdolreza Sabzi; Salvi, Giampiero; Svendsen, Torbjørn Karl; Siniscalchi, Sabato Marco. (2021) Acoustic-to-Articulatory Mapping With Joint Optimization of Deep Speech Enhancement and Articulatory Inversion Models. IEEE/ACM Transactions on Audio, Speech, and Language Processing (TASLP)
    Academic article
  • Stefanov, Kalin; Adiban, Mohammad; Salvi, Giampiero. (2021) Spatial Bias in Vision-Based Voice Activity Detection. International Conference on Pattern Recognition
    Academic article
  • Adiban, Mohammad; Safari, Arash; Salvi, Giampiero. (2021) STEP-GAN: A One-Class Anomaly Detection Model with Applications to Power System Security. Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing
    Academic article
  • Shahrebabaki, Abdolreza Sabzi; Siniscalchi, Sabato Marco; Salvi, Giampiero; Svendsen, Torbjørn Karl. (2021) A DNN Based Speech Enhancement Approach to Noise Robust Acoustic-to-Articulatory Inversion.
    Academic chapter/article/Conference paper

2020

  • Shahrebabaki, Abdolreza Sabzi; Olfati, Negar; Siniscalchi, Sabato Marco; Salvi, Giampiero; Svendsen, Torbjørn. (2020) Transfer learning of articulatory information through phone information. Interspeech (USB)
    Academic article
  • Shahrebabaki, Abdolreza Sabzi; Siniscalchi, Marco; Salvi, Giampiero; Svendsen, Torbjørn Karl. (2020) Sequence-to-sequence articulatory inversion through time convolution of sub-band frequency signals. Interspeech (USB)
    Academic article

2019

  • Stefanov, Kalin; Salvi, Giampiero; Kontogiorgos, Dimosthenis; Kjellström, Hedvig; Beskow, Jonas. (2019) Modeling of Human Visual Attention in Multiparty Open-World Dialogues. ACM Transactions on Human-Robot Interaction
    Academic article
  • Selamtzis, Andreas; Castellana, Antonella; Salvi, Giampiero; Carullo, Alessio. (2019) Effect of vowel context in cepstral and entropy analysis of pathological voices. Biomedical Signal Processing and Control
    Academic article
  • Saponaro, Giovanni; Jamone, Lorenzo; Alexandre, Bernardino; Salvi, Giampiero. (2019) Beyond the Self: Using Grounded Affordances to Interpret and Describe Others' Actions. IEEE Transactions on Cognitive and Developmental Systems
    Academic article
  • Stefanov, Kalin; Beskow, Jonas; Salvi, Giampiero. (2019) Self-Supervised Vision-Based Detection of the Active Speaker as Support for Socially-Aware Language Acquisition. IEEE Transactions on Cognitive and Developmental Systems
    Academic article

2015

  • Strömbergsson, Sofia; Salvi, Giampiero; House, David. (2015) Acoustic and perceptual evaluation of category goodness of /t/ and /k/ in typical and misarticulated children's speech. Journal of the Acoustical Society of America
    Academic article

2013

  • Koniaris, Christos; Salvi, Giampiero; Engwall, Olov. (2013) On mispronunciation analysis of individual foreign speakers using auditory periphery models. Speech Communication
    Academic article
  • Neiberg, Daniel; Salvi, Giampiero; Gustafson, Joakim. (2013) Semi-supervised methods for exploring the acoustics of simple productive feedback. Speech Communication
    Academic article

2012

  • Salvi, Giampiero; Montesano, Luis; Bernardino, Alexandre; Santos-Victor, José. (2012) Language bootstrapping: Learning Word Meanings From Perception-Action Association. IEEE Transactions on Cybernetics
    Academic article

2009

  • Salvi, Giampiero; Beskow, Jonas; Moubayed, Samer Al; Grandström, Björn. (2009) SynFace-Speech-Driven Facial Animation for Virtual Speech-Reading Support. EURASIP Journal on Audio, Speech, and Music Processing
    Academic article

2006

  • Salvi, Giampiero. (2006) Dynamic behaviour of connectionist speech recognition with strong latency constraints. Speech Communication
    Academic article
  • Salvi, Giampiero. (2006) Segment boundary detection via class entropy measurements in connectionist phoneme recognition. Speech Communication
    Academic article

2004

  • Siciliano, Catherine; Williams, Geoff; Faulkner, Andrew J.; Salvi, Giampiero. (2004) Intelligibility of an ASR-controlled synthetic talking face. Journal of the Acoustical Society of America
    Academic article

Journal publications

  • Adiban, Mohammad; Stefanov, Kalin; Siniscalchi, Sabato Marco; Salvi, Giampiero. (2025) S-HR-VQVAE: Sequential Hierarchical Residual Learning Vector Quantized Variational Autoencoder for Video Prediction. IEEE transactions on multimedia
    Academic article
  • Cao, Xinwei; Fan, Zijian; Svendsen, Torbjørn Karl; Salvi, Giampiero. (2024) A Framework for Phoneme-Level Pronunciation Assessment Using CTC. Interspeech
    Academic article
  • Fan, Zijian; Cao, Xinwei; Salvi, Giampiero; Svendsen, Torbjørn Karl. (2024) Towards Better Recognition of Spontaneous Children's Speech: Speaker-Clustering Fine-Tuning of Whisper. Machine Learning for Signal Processing
    Academic article
  • Quatra, Moreno La; Turco, Maria Francesca; Svendsen, Torbjørn Karl; Salvi, Giampiero; Orozco-Arroyave, Juan Rafael; Siniscalchi, Sabato Marco. (2024) Exploiting Foundation Models and Speech Enhancement for Parkinson’s Disease Detection from Speech in Real-World Operative Conditions. Interspeech
    Academic article
  • Kynych, Frantisek; Cerva, Petr; Zdansky, Jindrich; Svendsen, Torbjørn Karl; Salvi, Giampiero. (2024) A lightweight approach to real-time speaker diarization: from audio toward audio-visual data streams. EURASIP Journal on Audio, Speech, and Music Processing
    Academic article
  • Olstad, Anne Marte Haug; Smolander, Anna; Strömbergsson, Sofia; Ylinen, Sari; Lehtonen, Minna; Kurimo, Mikko. (2024) Collecting Linguistic Resources for Assessing Children’s Pronunciation of Nordic Languages. Proceedings of LREC
    Academic article
  • Stenwig, Eline; Salvi, Giampiero; Rossi, Pierluigi Salvo; Skjaervold, Nils Kristian. (2023) Comparison of correctly and incorrectly classified patients for in-hospital mortality prediction in the intensive care unit. BMC Medical Research Methodology
    Academic article
  • Getman, Yaroslav; Phan, Nhan; Al-Ghezi, Ragheb; Voskoboinik, Ekaterina; Singh, Mittul; Grosz, Tamas. (2023) Developing an AI-Assisted Low-Resource Spoken Language Learning App for Children. IEEE Access
    Academic article
  • Adiban, Mohammad; Siniscalchi, Sabato Marco; Salvi, Giampiero. (2023) A step-by-step training method for multi generator GANs with application to anomaly detection and cybersecurity. Neurocomputing
    Academic article
  • Rugayan, Janine Lizbeth Cabrera; Salvi, Giampiero; Svendsen, Torbjørn Karl. (2023) Perceptual and Task-Oriented Assessment of a Semantic Metric for ASR Evaluation. Interspeech (USB)
    Academic article
  • Fan, Zijian; Cao, Xinwei; Salvi, Giampiero; Svendsen, Torbjørn Karl. (2023) Using Modified Adult Speech as Data Augmentation for Child Speech Recognition. Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing
    Academic article
  • Cao, Xinwei; Fan, Zijian; Svendsen, Torbjørn Karl; Salvi, Giampiero. (2023) An Analysis of Goodness of Pronunciation for Child Speech. Interspeech
    Academic article
  • Rugayan, Janine Lizbeth Cabrera; Svendsen, Torbjørn Karl; Salvi, Giampiero. (2022) Semantically Meaningful Metrics for Norwegian ASR Systems. Interspeech (USB)
    Academic article
  • Abdelnour, Jerome; Rouat, Jean; Salvi, Giampiero. (2022) NAAQA: A Neural Architecture for Acoustic Question Answering. IEEE Transactions on Pattern Analysis and Machine Intelligence
    Academic article
  • Getman, Yaroslav; Al-Ghezi, Ragheb; Voskoboinik, Ekaterina; Grósz, Tamás; Kurimo, Mikko; Salvi, Giampiero. (2022) wav2vec2-based Speech Rating System for Children with Speech Sound Disorder. Interspeech (USB)
    Academic article
  • Stenwig, Eline; Salvi, Giampiero; Rossi, Pierluigi Salvo; Skjaervold, Nils Kristian. (2022) Comparative analysis of explainable machine learning prediction models for hospital mortality. BMC Medical Research Methodology
    Academic article
  • Shahrebabaki, Abdolreza Sabzi; Salvi, Giampiero; Svendsen, Torbjørn Karl; Siniscalchi, Sabato Marco. (2021) Acoustic-to-Articulatory Mapping With Joint Optimization of Deep Speech Enhancement and Articulatory Inversion Models. IEEE/ACM Transactions on Audio, Speech, and Language Processing (TASLP)
    Academic article
  • Stefanov, Kalin; Adiban, Mohammad; Salvi, Giampiero. (2021) Spatial Bias in Vision-Based Voice Activity Detection. International Conference on Pattern Recognition
    Academic article
  • Adiban, Mohammad; Safari, Arash; Salvi, Giampiero. (2021) STEP-GAN: A One-Class Anomaly Detection Model with Applications to Power System Security. Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing
    Academic article
  • Shahrebabaki, Abdolreza Sabzi; Olfati, Negar; Siniscalchi, Sabato Marco; Salvi, Giampiero; Svendsen, Torbjørn. (2020) Transfer learning of articulatory information through phone information. Interspeech (USB)
    Academic article
  • Shahrebabaki, Abdolreza Sabzi; Siniscalchi, Marco; Salvi, Giampiero; Svendsen, Torbjørn Karl. (2020) Sequence-to-sequence articulatory inversion through time convolution of sub-band frequency signals. Interspeech (USB)
    Academic article
  • Stefanov, Kalin; Salvi, Giampiero; Kontogiorgos, Dimosthenis; Kjellström, Hedvig; Beskow, Jonas. (2019) Modeling of Human Visual Attention in Multiparty Open-World Dialogues. ACM Transactions on Human-Robot Interaction
    Academic article
  • Selamtzis, Andreas; Castellana, Antonella; Salvi, Giampiero; Carullo, Alessio. (2019) Effect of vowel context in cepstral and entropy analysis of pathological voices. Biomedical Signal Processing and Control
    Academic article
  • Saponaro, Giovanni; Jamone, Lorenzo; Alexandre, Bernardino; Salvi, Giampiero. (2019) Beyond the Self: Using Grounded Affordances to Interpret and Describe Others' Actions. IEEE Transactions on Cognitive and Developmental Systems
    Academic article
  • Stefanov, Kalin; Beskow, Jonas; Salvi, Giampiero. (2019) Self-Supervised Vision-Based Detection of the Active Speaker as Support for Socially-Aware Language Acquisition. IEEE Transactions on Cognitive and Developmental Systems
    Academic article
  • Strömbergsson, Sofia; Salvi, Giampiero; House, David. (2015) Acoustic and perceptual evaluation of category goodness of /t/ and /k/ in typical and misarticulated children's speech. Journal of the Acoustical Society of America
    Academic article
  • Koniaris, Christos; Salvi, Giampiero; Engwall, Olov. (2013) On mispronunciation analysis of individual foreign speakers using auditory periphery models. Speech Communication
    Academic article
  • Neiberg, Daniel; Salvi, Giampiero; Gustafson, Joakim. (2013) Semi-supervised methods for exploring the acoustics of simple productive feedback. Speech Communication
    Academic article
  • Salvi, Giampiero; Montesano, Luis; Bernardino, Alexandre; Santos-Victor, José. (2012) Language bootstrapping: Learning Word Meanings From Perception-Action Association. IEEE Transactions on Cybernetics
    Academic article
  • Salvi, Giampiero; Beskow, Jonas; Moubayed, Samer Al; Grandström, Björn. (2009) SynFace-Speech-Driven Facial Animation for Virtual Speech-Reading Support. EURASIP Journal on Audio, Speech, and Music Processing
    Academic article
  • Salvi, Giampiero. (2006) Dynamic behaviour of connectionist speech recognition with strong latency constraints. Speech Communication
    Academic article
  • Salvi, Giampiero. (2006) Segment boundary detection via class entropy measurements in connectionist phoneme recognition. Speech Communication
    Academic article
  • Siciliano, Catherine; Williams, Geoff; Faulkner, Andrew J.; Salvi, Giampiero. (2004) Intelligibility of an ASR-controlled synthetic talking face. Journal of the Acoustical Society of America
    Academic article

Part of book/report

  • Salvi, Giampiero. (2025) TeflonNorL2 NOCASA Challenge Dataset.
    Other
  • Yaroslav, Getman,; Tamás, Grósz,; Mikko, Kurimo,; Salvi, Giampiero. (2025) [2504.20678] Non-native Children's Automatic Speech Assessment Challenge (NOCASA).
    Academic chapter/article/Conference paper
  • Parsons, Phoebe Luree Turner; Bremnes, Heming Strømholt; Kvale, Knut; Svendsen, Torbjørn; Salvi, Giampiero. (2025) Effects of Prosodic Information on Dialect Classification Using Whisper Features.
    Academic chapter/article/Conference paper
  • Fan, Zijian; Cao, Xinwei; Salvi, Giampiero; Svendsen, Torbjørn. (2025) Improving Phone Recognition through Informed Initialization and Path-Aligned CTC Loss.
    Academic chapter/article/Conference paper
  • Cao, Xinwei; Fan, Zijian; Svendsen, Torbjørn; Salvi, Giampiero. (2025) Child speech assessment through large language model speech synthesis: Preliminary results.
    Academic chapter/article/Conference paper
  • Dymbe, Simen; Siniscalchi, Sabato Marco; Svendsen, Torbjørn; Salvi, Giampiero. (2025) Using Cross-Attention for Conversational ASR over the Telephone.
    Academic chapter/article/Conference paper
  • Rugayan, Janine Lizbeth Cabrera; Salvi, Giampiero; Svendsen, Torbjørn Karl. (2025) Optimizing ASR Models with Semantic Information.
    Academic chapter/article/Conference paper
  • Parsons, Phoebe Luree Turner; Solberg, Per Erik; Kvale, Knut; Svendsen, Torbjørn Karl; Salvi, Giampiero. (2025) Adding Metadata to Existing Parliamentary Speech Corpus.
    Academic chapter/article/Conference paper
  • Parsons, Phoebe Luree Turner; Kvale, Knut; Svendsen, Torbjørn Karl; Salvi, Giampiero. (2025) Match ‘em: Multi-Tiered Alignment for Error Analysis in ASR.
    Academic chapter/article/Conference paper
  • Salvi, Giampiero. (2024) Teflon at Fonetik 2024.
    Other
  • Salvi, Giampiero. (2024) Teflon at LREC-Coling 2024, Torino, Italy.
    Other
  • Salvi, Giampiero. (2024) Challenges collecting and sharing speech data from children.
    Other
  • Salvi, Giampiero. (2023) NTNU (Teflon) at NNL2P.
    Other
  • Salvi, Giampiero. (2023) TEFLON at SLaTE 2023.
    Other
  • Salvi, Giampiero. (2023) Second face-to-face Teflon meeting in Trondheim, June 2023.
    Other
  • Solberg, Per Erik; Cabello, Pablo Ortiz; Parsons, Phoebe; Svendsen, Torbjørn Karl; Salvi, Giampiero. (2023) Improving Generalization of Norwegian ASR with Limited Linguistic Resources.
    Academic chapter/article/Conference paper
  • Parsons, Phoebe; Kvale, Knut; Svendsen, Torbjørn Karl; Salvi, Giampiero. (2023) A character-based analysis of impacts of dialects on end-to-end Norwegian ASR.
    Academic chapter/article/Conference paper
  • Shahrebabaki, Abdolreza Sabzi; Siniscalchi, Sabato Marco; Salvi, Giampiero; Svendsen, Torbjørn Karl. (2021) A DNN Based Speech Enhancement Approach to Noise Robust Acoustic-to-Articulatory Inversion.
    Academic chapter/article/Conference paper

Teaching

Courses

  • TT8108 - PhD Seminar in Signal Processing
  • TFE4595 - Electronic Systems Design, Specialization Course
  • TTT4185 - Machine Learning for Signal Processing
  • TFE4590 - Electronic Systems Design, Specialization Project
  • TT8111 - Signal and Estimation Theory

Outreach

2025

  • Academic lecture
    Parsons, Phoebe Luree Turner; Solberg, Per Erik; Kvale, Knut; Svendsen, Torbjørn; Salvi, Giampiero. (2025) Adding Metadata to Existing Parliamentary Speech Corpus. Proceedings of the Joint 25th Nordic Conference on Computational Linguistics and 11th Baltic Conference on Human Language Technologies (NoDaLiDa/Baltic-HLT 2025) 2025-03-01 - 2025-03-03
  • Academic lecture
    Parsons, Phoebe Luree Turner; Bremnes, Heming Strømholt; Kvale, Knut; Svendsen, Torbjørn; Salvi, Giampiero. (2025) Effects of Prosodic Information on Dialect Classification Using Whisper Features. Interspeech 2025 2025-08-16 - 2025-08-20
  • Academic lecture
    Rugayan, Janine Lizbeth Cabrera; Salvi, Giampiero; Svendsen, Torbjørn. (2025) Optimizing ASR Models with Semantic Information. Text, Speech and Dialogue 2025-08-24 - 2025-08-27
  • Academic lecture
    Dymbe, Simen; Siniscalchi, Sabato Marco; Svendsen, Torbjørn; Salvi, Giampiero. (2025) Using Cross-Attention for Conversational ASR over the Telephone. Text, Speech and Dialogue 2025-08-24 - 2025-08-27
  • Academic lecture
    Fan, Zijian; Cao, Xinwei; Salvi, Giampiero; Svendsen, Torbjørn. (2025) Improving Phone Recognition through Informed Initialization and Path-Aligned CTC Loss. 2025 IEEE 35th International Workshop on Machine Learning for Signal Processing (MLSP) 2025-08-30 - 2025-09-02
  • Academic lecture
    Cao, Xinwei; Fan, Zijian; Svendsen, Torbjørn; Salvi, Giampiero. (2025) Child speech assessment through large language model speech synthesis: Preliminary results. 2025 IEEE 35th International Workshop on Machine Learning for Signal Processing (MLSP) 2025-08-30 - 2025-09-02

2024

  • Academic lecture
    Olstad, Anne Marte Haug; Smolander, Anna; Strömbergsson, Sofia; Ylinen, Sari; Lehtonen, Minna; Kurimo, Mikko. (2024) Collecting Linguistic Resources for Assessing Children’s Pronunciation of Nordic Languages. LREC-COLING , Turin, Italy 2024-05-20 - 2024-05-24
  • Academic lecture
    Quarta, Moreno La; Turco, Maria Francesca; Svendsen, Torbjørn; Salvi, Giampiero; Orozco-Arroyave, Juan Rafael; Siniscalchi, Sabato Marco. (2024) oundation Models and Speech Enhancement for Parkinson’s Disease Detection from Speech in Real-World Operative Conditions. Interspeech , Kos, Greece 2024-09-01 - 2024-09-05
  • Academic lecture
    Fan, Zijian; Cao, Xinwei; Salvi, Giampiero; Svendsen, Torbjørn Karl. (2024) Towards Better Recognition of Spontaneous Children's Speech: Speaker-Clustering Fine-Tuning of Whisper. chine Learning for Signal Processing , London, UK 2024-09-22 - 2024-09-25
  • Academic lecture
    Cao, Xinwei; Fan, Zijian; Svendsen, Torbjørn Karl; Salvi, Giampiero. (2024) Framework for Phoneme-Level Pronunciation Assessment Using CTC. Interspeech , Kos, Greece 2024-09-01 - 2024-09-05
  • Academic lecture
    Salvi, Giampiero. (2024) Speech Research at NTNU. Visit at Politecnico di Torino , Turin, Italy 2024-05-27 - 2024-05-27
  • Academic lecture
    Salvi, Giampiero. (2024) Speech Research at NTNU. Visit at Electical Engineering, Sapienza University , Rome, Italy 2024-05-14 - 2024-05-14
  • Academic lecture
    Parsons, Phoebe Luree Turner; Bremnes, Heming Strømholt; Kvale, Knut; Svendsen, Torbjørn Karl; Salvi, Giampiero. (2024) Norwegian dialect identification: is prosody enough?. Fonetik , Stockholm 2024-06-03 - 2024-06-05

2023

  • Lecture
    Salvi, Giampiero. (2023) Speech Research at NTNU. Visit at Italian Institute of Technology , Genova, Italy 2023-08-01 - 2023-08-01
  • Academic lecture
    Salvi, Giampiero. (2023) Speech Research at NTNU. Visit at Computer Engineering, Sapienza University , Rome, Italy 2023-12-13 - 2023-12-13
  • Academic lecture
    Parsons, Phoebe Luree Turner; Kvale, Knut; Svendsen, Torbjørn Karl; Salvi, Giampiero. (2023) A character-based analysis of impacts of dialects on end-to-end Norwegian ASR. 24th Nordic Conference on Computational Linguistics (NoDaLiDa) , Tórshavn, Faroe Islands 2023-05-14 - 2023-05-18
  • Academic lecture
    Rugayan, Janine Lizbeth Cabrera; Salvi, Giampiero; Svendsen, Torbjørn Karl. (2023) Perceptual and Task-Oriented Assessment of a Semantic Metric for ASR Evaluation. Interspeech , Dublin, Irland 2023-08-20 - 2023-08-24
  • Academic lecture
    Fan, Zijian; Cao, Xinwei; Salvi, Giampiero; Svendsen, Torbjørn Karl. (2023) Using Modified Adult Speech as Data Augmentation for Child Speech Recognition. ICASSP , Rhodes, Greece 2023-06-04 - 2023-06-10
  • Academic lecture
    Cao, Xinwei; Fan, Zijian; Svendsen, Torbjørn Karl; Salvi, Giampiero. (2023) An Analysis of Goodness of Pronunciation for Child Speech. Interspeech , Dublin, Irland 2023-08-20 - 2023-08-24
  • Academic lecture
    Solberg, Per Erik; Cabello, Pablo Ortiz; Parsons, Phoebe Luree Turner; Svendsen, Torbjørn Karl; Salvi, Giampiero. (2023) Improving Generalization of Norwegian ASR with Limited Linguistic Resources. 24th Nordic Conference on Computational Linguistics (NoDaLiDa) , Tórshavn, Faroe Islands 2023-05-15 - 2023-05-18

2022

  • Academic lecture
    Getman, Yaroslav; Al-Ghezi, Ragheb; Voskoboinik, Ekaterina; Grósz, Tamás; Kurimo, Mikko; Salvi, Giampiero. (2022) wav2vec2-based Speech Rating System for Children with Speech Sound Disorder. Interspeech , Incheon, Korea 2022-09-18 - 2022-09-22
  • Academic lecture
    Rugayan, Janine Lizbeth Cabrera; Svendsen, Torbjørn Karl; Salvi, Giampiero. (2022) Semantically Meaningful Metrics for Norwegian ASR Systems. Interspeech , Incheon, Korea 2022-09-18 - 2022-09-22

NTNU – Norwegian University of Science and Technology

  • For employees
  • |
  • For students
  • |
  • Intranet
  • |
  • Blackboard

Studies

  • Master's programmes in English
  • For exchange students
  • PhD opportunities
  • Courses
  • Career development
  • Continuing education
  • Application process

News

  • NTNU News
  • Vacancies

About NTNU

  • About the university
  • Libraries
  • NTNU's strategy
  • Research excellence
  • Strategic research areas
  • Organizational chart

Contact

  • Contact NTNU
  • Employees
  • Find experts
  • Press contacts
  • Researcher support
  • Maps

NTNU in three cities

  • NTNU in Gjøvik
  • NTNU in Trondheim
  • NTNU in Ålesund

About this website

  • Use of cookies
  • Accessibility statement
  • Privacy policy
  • Editorial responsibility
Facebook Instagram Linkedin Snapchat Tiktok Youtube
Sign In
NTNU logo