Navigation

  • Skip to Content
NTNU Home

ntnu.edu

  • Studies
    • Master's programmes in English
    • For exchange students
    • PhD opportunities
    • All programmes of study
    • Courses
    • Financing
    • Language requirements
    • Application process
    • Academic calendar
    • FAQ
  • Research and innovation
    • NTNU research
    • Research excellence
    • Strategic research areas
    • Innovation resources
    • PhD opportunities
  • Life and housing
    • Student in Trondheim
    • Student in Gjøvik
    • Student in Ålesund
    • For researchers
    • Life and housing
  • About NTNU
    • Contact us
    • Faculties and departments
    • Libraries
    • International researcher support
    • Vacancies
    • About NTNU
    • Maps
  1. Employees

Språkvelger

Norsk

Torbjørn Karl Svendsen

Torbjørn Karl Svendsen

Professor
Department of Electronic Systems

torbjorn.svendsen@ntnu.no
+4773591481 +4793080477 Elektro C, C335, Gløshaugen, O. S. Bragstads plass 2
About Publications Teaching Media

About

Torbjørn Svendsen (1955) is a Professor at the Department of Electronic Systems. Professor Svendsen holds a MScEE, and a PhD both from the NTNU.

Fields of interest and present research activities

My research interests have from the outset in 1979 been speech signal processing. The first period was focused on source coding, i.e. speech compression, which was also the subject of my doctoral thesis. From the mid 80’s the research interests have been mainly on automatic speech recognition, but also areas like spoken dialogue systems and speech synthesis have been included in my research.  Speech analysis methods and lexical modelling, e.g. pronunciation modelling have been two central areas. Realizing that current approaches to speech recognition seem to be nearing a saturation point in terms of performance, a major activity in the last 5-year period has been to investigate new paradigms for speech recognition, aiming to integrate phonetic and linguistic knowledge in a statistical framework based on detection of (language universal) phonetic features.

Work experience

  • NTNU (1979-1981 Research assistant, 1983-1984 doctoral fellowship, 1988-1995 Associate professor, 1995-present Professor), Director NTNU Digital (2015-2021)
  • SINTEF (1981-1987, Research scientist)
  • Research visits at AT&T Bell Labs, Murray Hill, NJ (1986-1987, 1990); Griffith University, Brisbane, Australia (1996-97); AT&T Labs, Florham Park, NJ (2000); Queensland University of Technology, Brisbane, Australia (2002-03); Computer Science and Artificial Intelligence Lab, Massachusetts Institute of Technology, Cambridge, MA (2013)

Professional merits

Peer review and professional evaluation work:

  • Reviewer for international journals like IEEE Transactions (Communications; Signal Processing; Audio, Speech and Language Processing; Multimedia); EURASIP Journal on Applied Signal Processing, Signal, Image and Video Processing; and Speech Communication, and various conferences and workshops on speech and signal processing.
  • Member of Speech Communication journal Editorial Board
  • Reviewer for EU's Language Engineering program and the Information Society Research Programme of the Academy of Finland. Project reviews for the Norwegian, Australian, Swiss, Dutch, Belgian and South African Research Councils
  • Opponent/member of examination boards for 26 doctoral theses

Membership in academic and professional committees

  • Various appointments at the national level, e.g. in the Research Council of Norway, incl. grant committee member for the IKTPLUSS program, program board chair for the VERDIKT program, and in the Norwegian Language Council.
  • Member of advisory board, Norwegian Language Bank (“Språkbanken”)
  • Member of Technical committees, Eurospeech2001 and Interspeech2012, and organizing committee of Eurospeech2001.
  • Senior Member, IEEE Signal Processing Society Speech Technical Committee (1998-2001)
  • Elected member, Norwegian Academy of Technological Sciences
  • Vice president, International Speech Communication Association (ISCA)

Other professional merits

  • Project manager, "Atomic Units for Language Universal Speech" (current), "Spoken dialog systems for telephony"; "Speech interfaces and reasoning systems"; "Norwegian corpus for language technology"; “Voice centric user interfaces for location based services”; “Tools for realistic speech synthesis in”; “Spoken Information Retrieval by Knowledge Utilization in Statistical Speech Processing”;  “Rundkast – A transcribed broadcast news for applications in language technology”(past projects).
  • Vice chair, COST action 278; WG chair COST actions 232 and 249; Advisory Scientific Board member, EU project ACORNS; Board member, Nordic Graduate School of Language Technology (former actions and activities)
  • Previous NTNU appointments: Department Head, Department of Telecommunications; Vice Dean, Faculty of Electrical Engineering and Telecommunications; member of several NTNU committees
  • 16 PhD students graduated (2 as co-supervisor). Currently supervising 2 PhD students.
  • ~80 Master degree students graduated
  • ~85 papers in international journals and conferences

Competencies

  • Artificial intelligence
  • Biometry
  • Digital signal processing
  • Human-machine system
  • Information and communication technology
  • Language Technology
  • Language resources
  • Machine learning
  • Pattern Recognition
  • Signal processing
  • Speech recognition

Publications

  • Chronological
  • By category
  • See all publications in Cristin

2022

  • Getman, Yaroslav; Al-Ghezi, Ragheb; Voskoboinik, Ekaterina; Grósz, Tamás; Kurimo, Mikko; Salvi, Giampiero; Svendsen, Torbjørn Karl; Strömbergsson, Sofia. (2022) wav2vec2-based Speech Rating System for Children with Speech Sound Disorder. Interspeech (USB).
    Academic article
  • Oudijk, Esmée; Hasler, Oliver Kevin; Øveraas, Henning; Marty, Sabine; Williamson, David Roddan; Svendsen, Torbjørn Karl; Berg, Simen; Birkeland, Roger; Halvorsen, Daniel Ørnes; Bakken, Sivert; Henriksen, Marie Bøe; Alver, Morten; Johnsen, Geir; Johansen, Tor Arne; Stahl, Annette; Kvaløy, Pål; Dallolio, Alberto; Majaneva, Sanna; Fragoso, Glaucia Moreira. (2022) Campaign For Hyperspectral Data Validation In North Atlantic Coastal Waters. Workshop on Hyperspectral Image and Signal Processing, Evolution in Remote Sensing.
    Academic article
  • Rugayan, Janine Lizbeth Cabrera; Svendsen, Torbjørn Karl; Salvi, Giampiero. (2022) Semantically Meaningful Metrics for Norwegian ASR Systems. Interspeech (USB).
    Academic article

2021

  • Sabzi Shahrebabaki, Abdolreza; Olfati, Negar; Imran, Ali Shariq; Johnsen, Magne Hallstein; Siniscalchi, Sabato Marco; Svendsen, Torbjørn Karl. (2021) A Two-Stage Deep Modeling Approach to Articulatory Inversion. ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
    Academic chapter/article/Conference paper
  • Sabzi Shahrebabaki, Abdolreza; Salvi, Giampiero; Svendsen, Torbjørn Karl; Siniscalchi, Sabato Marco. (2021) Acoustic-to-Articulatory Mapping With Joint Optimization of Deep Speech Enhancement and Articulatory Inversion Models. IEEE/ACM Transactions on Audio, Speech, and Language Processing (TASLP). volum 30.
    Academic article
  • Sabzi Shahrebabaki, Abdolreza; Siniscalchi, Sabato Marco; Salvi, Giampiero; Svendsen, Torbjørn Karl. (2021) A DNN Based Speech Enhancement Approach to Noise Robust Acoustic-to-Articulatory Inversion. Proceedings 2021 IEEE International Symposium on Circuits and Systems.
    Academic chapter/article/Conference paper
  • Sabzi Shahrebabaki, Abdolreza; Siniscalchi, Sabato Marco; Svendsen, Torbjørn Karl. (2021) Raw Speech-to-Articulatory Inversion by Temporal Filtering and Decimation. Interspeech.
    Academic article

2020

  • Sabzi Shahrebabaki, Abdolreza; Olfati, Negar; Siniscalchi, Sabato Marco; Salvi, Giampiero; Svendsen, Torbjørn. (2020) Transfer learning of articulatory information through phone information. Interspeech (USB).
    Academic article
  • Sabzi Shahrebabaki, Abdolreza; Siniscalchi, Marco; Salvi, Giampiero; Svendsen, Torbjørn Karl. (2020) Sequence-to-sequence articulatory inversion through time convolution of sub-band frequency signals. Interspeech (USB).
    Academic article

2019

  • Imran, Ali Shariq; Haflan, Vetle; Sabzi Shahrebabaki, Abdolreza; Olfati, Negar; Svendsen, Torbjørn Karl. (2019) Evaluating Acoustic Feature Maps in 2D-CNN for Speaker Identification. ICMLC '19 Proceedings of the 2019 11th International Conference on Machine Learning and Computing.
    Academic chapter/article/Conference paper
  • Imran, Ali Shariq; Kastrati, Zenun; Svendsen, Torbjørn Karl; Kurti, Arianit. (2019) Text-Independent Speaker ID for Automatic Video Lecture Classification Using Deep Learning. ICCAI '19 Proceedings of the 2019 5th International Conference on Computing and Artificial Intelligence.
    Academic chapter/article/Conference paper
  • Imran, Ali Shariq; Sabzi Shahrebabaki, Abdolreza; Olfati, Negar; Svendsen, Torbjørn Karl. (2019) A Study on the Performance Evaluation of Machine Learning Models for Phoneme Classification. ICMLC '19 Proceedings of the 2019 11th International Conference on Machine Learning and Computing.
    Academic chapter/article/Conference paper
  • Sabzi Shahrebabaki, Abdolreza; Imran, Ali Shariq; Olfati, Negar; Svendsen, Torbjørn Karl. (2019) A Comparative Study of Deep Learning Techniques on Frame-Level Speech Data Classification. Circuits, systems, and signal processing. volum 38.
    Academic article
  • Sabzi Shahrebabaki, Abdolreza; Olfati, Negar; Imran, Ali Shariq; Sabato Marco, Siniscalchi; Svendsen, Torbjørn Karl. (2019) A Phonetic-Level Analysis of Different Input Features for Articulatory Inversion. Interspeech (USB).
    Academic article

2018

  • Sabzi Shahrebabaki, Abdolreza; Imran, Ali Shariq; Olfati, Negar; Svendsen, Torbjørn Karl. (2018) Acoustic Feature Comparison for Different Speaking Rates. Human-Computer Interaction. Interaction Technologies.
    Academic chapter/article/Conference paper

2015

  • Svendsen, Torbjørn Karl; Hamar, Jarle Bauck. (2015) Combining NdHMM and Phonetic Feature Detection for Speech Recognition. Proceedings of European Signal Processing Conference.
    Academic chapter/article/Conference paper

2014

  • Siniscalchi, Sabato Marco; Svendsen, Torbjørn; Lee, Chin-Hui. (2014) An artificial neural network approach to automatic speech processing. Neurocomputing. volum 140.
    Academic article
  • Soufifar, Mehdi; Svendsen, Torbjørn; Burget, Lukas. (2014) Subspace Modeling of Discrete features for Language Recognition. 2014. ISBN 978-82-326-0496-8.
    Doctoral dissertation

2013

  • Doddipatla, Rama Sanand; Svendsen, Torbjørn. (2013) Synthetic Speaker Models Using VTLN to Improve the Performance of Children in Mismatched Speaker Conditions for ASR. Interspeech (USB).
    Academic article
  • Hamar, Jarle Bauck; Doddipatla, Rama Sanand; Svendsen, Torbjørn; Sreenivas, Thippur. (2013) Non-Negative Durational HMM. Proceedings of IEEE International Workshop on Machine Learning for Signal Processing 2013.
    Academic chapter/article/Conference paper

2012

  • Siniscalchi, Sabato Marco; Lyu, DC; Svendsen, Torbjørn; Lee, CH. (2012) Experiments on Cross-Language Attribute Detection and Phone Recognition With Minimal Target-Specific Training Data. IEEE Transactions on Audio, Speech, and Language Processing. volum 20 (3).
    Academic article
  • Siniscalchi, Sabato Marco; Reed, Jeremy; Svendsen, Torbjørn; Lee, Chin-Hui. (2012) Universal attribute characterization of spoken languages for automatic spoken language recognition. Computer Speech and Language. volum 27 (1).
    Academic article
  • Svendsen, Torbjørn. (2012) Data med barnestemme. Forskning.no.
    publications.INTERVJUSKRIFTL

2011

  • Adde, Line; Svendsen, Torbjørn. (2011) Pronunciation Variation Modeling of Non-Natie Proper Names by Discriminative Tree Search. Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing.
    Academic article
  • Kvale, Knut; Nordgård, Torbjørn; Svendsen, Torbjørn; Lyse, Gunn Inger; Gjesdal, Anje Müller. (2011) Datamaskinen må skjønne norsk. Bergens Tidende.
    Feature article
  • Siniscalchi, Sabato Marco; Svendsen, Torbjørn; Lee, Chin-Hui. (2011) A Bottom-Up Stepwise Knowledge-Integration Approach to Large Vocabulary Continuous Speech Recognition Using Weighted Finite State Machines. Interspeech.
    Academic article
  • Skogstad, Trond; Svendsen, Torbjørn. (2011) Frequency-Warped and Stabilized Time-Varying Cepstral Coefficients. Interspeech.
    Academic article
  • Soufifar, Mehdi; Kockmann, Marcel; Burget, Lukas; Plchot, Oldrich; Glembek, Ondrej; Svendsen, Torbjørn. (2011) iVector Approach to Phonotactic Language Recognition. Interspeech.
    Academic article

2010

  • Adde, Line; Reveil, Bert; Martens, Jean-Pierre; Svendsen, Torbjørn. (2010) A Minimum Classification Error Approach to Pronunciation Variation Modeling of Non-Native Proper Names. Interspeech.
    Academic article
  • Adde, Line; Svendsen, Torbjørn. (2010) A Comparative Analysis of Discriminative and Non-Discriminative Pronunciation Priors in Pronunciation Variation Modeling. Proceedings of 2010 IEEE Workshop on Spoken Language Technology.
    Other
  • Adde, Line; Svendsen, Torbjørn. (2010) NameDat: A Database of English Proper Names Spoken by Native Norwegians. Proceedings of the Seventh conference on International Language Resources and Evaluation (LREC'10).
    Academic chapter/article/Conference paper
  • Saeidi, Rahim; Soufifar, Mehdi; Kinnunen, Tomi; Svendsen, Torbjørn; Fränti, Pasi. (2010) UEF-NTNU System Description for Albayzin 2010 Language Recognition Evaluation. Proceedings FALA 2010.
    Other
  • Sikveland, Rein Ove; Öttl, Anton; Amdal, Ingunn; Ernestus, Mirjam; Svendsen, Torbjørn; Edlund, Jens. (2010) Spontal-N: A Corpus of Interactional Spoken Norwegian. Proceedings of the Seventh conference on International Language Resources and Evaluation (LREC'10).
    Other
  • Siniscalchi, Sabato Marco; Reed, Jeremy; Svendsen, Torbjørn; Lee, Chin-Hui. (2010) Exploiting Context-Dependency and Acoustic Resolution of Universal Speech Attribute Models in Spoken Language Recognition. Interspeech.
    Academic article
  • Siniscalchi, Sabato Marco; Svendsen, Torbjørn; Lee, Chin-Hui. (2010) A Survey on Recent Progress in the ASAT/SIRKUS Paradigm. Proceedings of 7th International Symposium on Chinese Spoken Language.
    Other
  • Siniscalchi, Sabato Marco; Svendsen, Torbjørn; Sorbello, Filippo; Lee, Chin-Hui. (2010) Experimental Studies on Continuous Speech Recognition Using Neural Architectures with ‘Adaptive’ Hidden Activation Functions. Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing.
    Academic article
  • Skogstad, Trond; Svendsen, Torbjørn. (2010) Intra-Frame Variability As a Predictor of Frame Classifiability. Interspeech.
    Academic article

2009

  • Mertens, Timo Pascal; Schneider, Daniel; Næss, Arild Brandrud; Svendsen, Torbjørn. (2009) Lexicon Adaptation for Subword Speech Recognition. Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition & Understanding.
    Academic chapter/article/Conference paper
  • Siniscalchi, Sabato Marco; Reed, Jeremy; Svendsen, Torbjørn; Lee, Chin-Hui. (2009) Exploring Universal Attribute Characterization of Spoken Languages for Spoken Language Recognition. Interspeech.
    Academic article
  • Siniscalchi, Sabato Marco; Svendsen, Torbjørn; Lee, Chin-Hui. (2009) A Phonetic Feature Based Lattice Rescoring Approach to LVCSR. Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing.
    Academic article

2008

  • Amdal, Ingunn; Strand, Ole Morten; Almberg, Jørn; Svendsen, Torbjørn. (2008) RUNDKAST: An Annotated Norwegian Broadcast News Speech Corpus. Proceedings of the 6th International Language Resources and Evaluation (LREC 2008).
    Academic chapter/article/Conference paper
  • Siniscalchi, Sabato Marco; Birkenes, Øystein; Johnsen, Magne Hallstein; Svendsen, Torbjørn. (2008) Joint Optimization of Event Detectors and Evidence Merger for Continuous Speech Recognition. Proceedings ISCA ITRW on Speech Analysis and Processing for Knowledge Discovery.
    Other
  • Siniscalchi, Sabato Marco; Svendsen, Torbjørn; lee, chin-hui. (2008) A Penalized Logistic Regression Approach to Detection Based Phone Classification. Interspeech.
    Academic article
  • Siniscalchi, Sabato Marco; Svendsen, Torbjørn; lee, chin-hui. (2008) Toward a Detector-Based Universal Phone Recognizer. Proc. 2008 IEEE International Conference on Acoustics, Speech and Signal Processing.
    Other
  • Skogstad, Trond; Svendsen, Torbjørn. (2008) Time-Varying Cepstral Coefficients. Proceedings ISCA ITRW on Speech Analysis and Processing for Knowledge Discovery.
    Other

2007

  • Siniscalchi, Sabato Marco; Svendsen, Torbjørn; Lee, Chin-Hui. (2007) Towards Bottom-Up Continuous Phone Recognition. Proceedings 2007 IEEE Workshop on Automatic Speech Recognition and Understanding.
    Academic chapter/article/Conference paper

2006

  • Amdal, Ingunn; Johnsen, Magne Hallstein; Svendsen, Torbjørn. (2006) Log Likelihood Ratio Based Annotation Verification of a Norwegian Speech Synthesis Database. Proceedings of the 7th Nordic Signal Processing Symposium (NORSIG 2006).
    Academic chapter/article/Conference paper
  • Amdal, Ingunn; Svendsen, Torbjørn. (2006) FonDat1: A Speech Synthesis Corpus for Norwegian. Proceedings of the 5th International Conference on Language Resources and Evaluation (LREC 2006).
    Academic chapter/article/Conference paper

2005

  • Amdal, Ingunn; Svendsen, Torbjørn. (2005) Unit Selection Synthesis Database Development Using Utterance Verification. Eurospeech : Proceedings of the European Conference on Speech Communication and Technology. volum 9.
    Academic article
  • Bjørkan, Ingmund; Svendsen, Torbjørn. (2005) Comparing Spectral Distance Measures for Join Cost Optmization. Eurospeech : Proceedings of the European Conference on Speech Communication and Technology.
    Academic article
  • Bjørkan, Ingmund; Svendsen, Torbjørn; Farner, Snorre. (2005) Comparing Spectral Distance Measures for Join Cost Optimization in Concatenative Speech Synthesis. Eurospeech : Proceedings of the European Conference on Speech Communication and Technology. volum 9.
    Academic article
  • Meen, Dyre; Svendsen, Torbjørn; Natvig, Jon-Emil. (2005) Improving Phone Label Alignment Accuracy by Utilizing Voicing Information. SPECOM 2005 Proceedings.
    Academic chapter/article/Conference paper
  • Skogstad, Trond; Svendsen, Torbjørn. (2005) Distributed ASR Using Speech Coder Data for Efficient Feature Vector Representation. Eurospeech : Proceedings of the European Conference on Speech Communication and Technology. volum 9.
    Academic article
  • Svendsen, Torbjørn; Amdal, Ingunn; Bjørkan, Ingmund; Meen, Dyre; Heggtveit, Per Olav; Natvig, Jon Emil. (2005) FONEMA - Tools for realistic speech synthesis in Norwegian. Proceedings of Norwegian Signal Processing Symposium 2005 (NORSIG-05).
    Academic chapter/article/Conference paper
  • Svendsen, Torbjørn; Egeberg, Andreas; Holter, Trym; Skogstad, Trond. (2005) VOCALS - Voice centric user interfaces for location based services. Proceedings of Norwegian Signal Processing Symposium 2005 (NORSIG-05).
    Academic chapter/article/Conference paper

2004

  • Nordgård, Torbjørn; Svendsen, Torbjørn; Harborg, Erik; Kvale, Knut. (2004) Language Technology Towards 2020. Infosam2020, Information Society of 2020.
    Academic chapter/article/Conference paper

2003

  • Svendsen, Torbjørn. (2003) Speech Technology: Past, Present and Future. Telektronikk. volum 99 (2).
    Academic article

2002

  • Nordgård, Torbjørn; Svendsen, Torbjørn; Breivik, Torbjørg. (2002) Samling og tilgjengeleggjering av norske språkteknologiressursar. 2002.
    Report
  • Nordgård, Torbjørn; Svendsen, Torbjørn; Natvig, Jon Emil. (2002) Talsmann talesyntese som hjelpemiddel for dyslektikere. 2002.
    Report
  • Svendsen, Torbjørn. (2002) Roles for Speech And Language Technology in The Information Society. Perspectives on the age of the information society.
    Academic chapter/article/Conference paper

2001

  • Braverman, Marc; Svendsen, Torbjørn; Lund, Karl Erik; Aarø, Leif Edvard. (2001) Tobacco use by early adolescents in Norway. European Journal of Public Health. volum 11.
    Academic article
  • Svendsen, Torbjørn. (2001) Nordisk forskningssamarbeid innen språkteknologi. Språknytt. volum 3/2001.
    Popular scientific article

2000

  • Amdal, Ingunn; Holter, Trym; Svendsen, Torbjørn. (2000) Modellering av uttalevariasjon for automatisk talegjenkjenning. Nordlyd. volum 28-2000.
    Academic article
  • Foldvik, Arne Kjell; Nordgård, Torbjørn; Svendsen, Torbjørn; Thygesen, Ragnar. (2000) Dysleksi og språkteknologi. Adresseavisen.
    Feature article

1999

  • Holter, Trym; Svendsen, Torbjørn. (1999) Maximum likelihood modelling of pronunciation variation. Speech Communication. volum 29 (2-4).
    Academic article
  • Svendsen, Torbjørn. (1999) Taleteknologi. Språk i Norden.
    Academic article
  • Svendsen, Torbjørn; Johnsen, Magne Hallstein; Nordgård, Torbjørn; Hofland, Knut; Hofland, Knut; Ore, Christian Emil; Ore, Christian Emil. (1999) Nasjonalt korpus for språkteknologi - forprosjekt. 1999.
    Report

1998

  • Svendsen, Torbjørn. (1998) Blir norsk gresk for språkteknologien?. Språknytt. volum 26 (4-98).
    Academic article

1995

  • Harborg, Erik; Johnsen, Magne Hallstein; Svendsen, Torbjørn. (1995) Talegjenkjenning II. 1995.
    Report
  • Harborg, Erik; Johnsen, Magne Hallstein; Svendsen, Torbjørn. (1995) Talegjenkjenning for teksting av direktesendte programmer - en studie. 1995.
    Report

1994

  • Svendsen, Torbjørn. (1994) Talebaserte brukergrensesnitt. NORSIGnalet : organ for NORSIG, Norsk forening for signalbehandling.
    Popular scientific article

Journal publications

  • Getman, Yaroslav; Al-Ghezi, Ragheb; Voskoboinik, Ekaterina; Grósz, Tamás; Kurimo, Mikko; Salvi, Giampiero; Svendsen, Torbjørn Karl; Strömbergsson, Sofia. (2022) wav2vec2-based Speech Rating System for Children with Speech Sound Disorder. Interspeech (USB).
    Academic article
  • Oudijk, Esmée; Hasler, Oliver Kevin; Øveraas, Henning; Marty, Sabine; Williamson, David Roddan; Svendsen, Torbjørn Karl; Berg, Simen; Birkeland, Roger; Halvorsen, Daniel Ørnes; Bakken, Sivert; Henriksen, Marie Bøe; Alver, Morten; Johnsen, Geir; Johansen, Tor Arne; Stahl, Annette; Kvaløy, Pål; Dallolio, Alberto; Majaneva, Sanna; Fragoso, Glaucia Moreira. (2022) Campaign For Hyperspectral Data Validation In North Atlantic Coastal Waters. Workshop on Hyperspectral Image and Signal Processing, Evolution in Remote Sensing.
    Academic article
  • Rugayan, Janine Lizbeth Cabrera; Svendsen, Torbjørn Karl; Salvi, Giampiero. (2022) Semantically Meaningful Metrics for Norwegian ASR Systems. Interspeech (USB).
    Academic article
  • Sabzi Shahrebabaki, Abdolreza; Salvi, Giampiero; Svendsen, Torbjørn Karl; Siniscalchi, Sabato Marco. (2021) Acoustic-to-Articulatory Mapping With Joint Optimization of Deep Speech Enhancement and Articulatory Inversion Models. IEEE/ACM Transactions on Audio, Speech, and Language Processing (TASLP). volum 30.
    Academic article
  • Sabzi Shahrebabaki, Abdolreza; Siniscalchi, Sabato Marco; Svendsen, Torbjørn Karl. (2021) Raw Speech-to-Articulatory Inversion by Temporal Filtering and Decimation. Interspeech.
    Academic article
  • Sabzi Shahrebabaki, Abdolreza; Olfati, Negar; Siniscalchi, Sabato Marco; Salvi, Giampiero; Svendsen, Torbjørn. (2020) Transfer learning of articulatory information through phone information. Interspeech (USB).
    Academic article
  • Sabzi Shahrebabaki, Abdolreza; Siniscalchi, Marco; Salvi, Giampiero; Svendsen, Torbjørn Karl. (2020) Sequence-to-sequence articulatory inversion through time convolution of sub-band frequency signals. Interspeech (USB).
    Academic article
  • Sabzi Shahrebabaki, Abdolreza; Imran, Ali Shariq; Olfati, Negar; Svendsen, Torbjørn Karl. (2019) A Comparative Study of Deep Learning Techniques on Frame-Level Speech Data Classification. Circuits, systems, and signal processing. volum 38.
    Academic article
  • Sabzi Shahrebabaki, Abdolreza; Olfati, Negar; Imran, Ali Shariq; Sabato Marco, Siniscalchi; Svendsen, Torbjørn Karl. (2019) A Phonetic-Level Analysis of Different Input Features for Articulatory Inversion. Interspeech (USB).
    Academic article
  • Siniscalchi, Sabato Marco; Svendsen, Torbjørn; Lee, Chin-Hui. (2014) An artificial neural network approach to automatic speech processing. Neurocomputing. volum 140.
    Academic article
  • Doddipatla, Rama Sanand; Svendsen, Torbjørn. (2013) Synthetic Speaker Models Using VTLN to Improve the Performance of Children in Mismatched Speaker Conditions for ASR. Interspeech (USB).
    Academic article
  • Siniscalchi, Sabato Marco; Lyu, DC; Svendsen, Torbjørn; Lee, CH. (2012) Experiments on Cross-Language Attribute Detection and Phone Recognition With Minimal Target-Specific Training Data. IEEE Transactions on Audio, Speech, and Language Processing. volum 20 (3).
    Academic article
  • Siniscalchi, Sabato Marco; Reed, Jeremy; Svendsen, Torbjørn; Lee, Chin-Hui. (2012) Universal attribute characterization of spoken languages for automatic spoken language recognition. Computer Speech and Language. volum 27 (1).
    Academic article
  • Svendsen, Torbjørn. (2012) Data med barnestemme. Forskning.no.
    publications.INTERVJUSKRIFTL
  • Adde, Line; Svendsen, Torbjørn. (2011) Pronunciation Variation Modeling of Non-Natie Proper Names by Discriminative Tree Search. Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing.
    Academic article
  • Kvale, Knut; Nordgård, Torbjørn; Svendsen, Torbjørn; Lyse, Gunn Inger; Gjesdal, Anje Müller. (2011) Datamaskinen må skjønne norsk. Bergens Tidende.
    Feature article
  • Siniscalchi, Sabato Marco; Svendsen, Torbjørn; Lee, Chin-Hui. (2011) A Bottom-Up Stepwise Knowledge-Integration Approach to Large Vocabulary Continuous Speech Recognition Using Weighted Finite State Machines. Interspeech.
    Academic article
  • Skogstad, Trond; Svendsen, Torbjørn. (2011) Frequency-Warped and Stabilized Time-Varying Cepstral Coefficients. Interspeech.
    Academic article
  • Soufifar, Mehdi; Kockmann, Marcel; Burget, Lukas; Plchot, Oldrich; Glembek, Ondrej; Svendsen, Torbjørn. (2011) iVector Approach to Phonotactic Language Recognition. Interspeech.
    Academic article
  • Adde, Line; Reveil, Bert; Martens, Jean-Pierre; Svendsen, Torbjørn. (2010) A Minimum Classification Error Approach to Pronunciation Variation Modeling of Non-Native Proper Names. Interspeech.
    Academic article
  • Siniscalchi, Sabato Marco; Reed, Jeremy; Svendsen, Torbjørn; Lee, Chin-Hui. (2010) Exploiting Context-Dependency and Acoustic Resolution of Universal Speech Attribute Models in Spoken Language Recognition. Interspeech.
    Academic article
  • Siniscalchi, Sabato Marco; Svendsen, Torbjørn; Sorbello, Filippo; Lee, Chin-Hui. (2010) Experimental Studies on Continuous Speech Recognition Using Neural Architectures with ‘Adaptive’ Hidden Activation Functions. Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing.
    Academic article
  • Skogstad, Trond; Svendsen, Torbjørn. (2010) Intra-Frame Variability As a Predictor of Frame Classifiability. Interspeech.
    Academic article
  • Siniscalchi, Sabato Marco; Reed, Jeremy; Svendsen, Torbjørn; Lee, Chin-Hui. (2009) Exploring Universal Attribute Characterization of Spoken Languages for Spoken Language Recognition. Interspeech.
    Academic article
  • Siniscalchi, Sabato Marco; Svendsen, Torbjørn; Lee, Chin-Hui. (2009) A Phonetic Feature Based Lattice Rescoring Approach to LVCSR. Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing.
    Academic article
  • Siniscalchi, Sabato Marco; Svendsen, Torbjørn; lee, chin-hui. (2008) A Penalized Logistic Regression Approach to Detection Based Phone Classification. Interspeech.
    Academic article
  • Amdal, Ingunn; Svendsen, Torbjørn. (2005) Unit Selection Synthesis Database Development Using Utterance Verification. Eurospeech : Proceedings of the European Conference on Speech Communication and Technology. volum 9.
    Academic article
  • Bjørkan, Ingmund; Svendsen, Torbjørn. (2005) Comparing Spectral Distance Measures for Join Cost Optmization. Eurospeech : Proceedings of the European Conference on Speech Communication and Technology.
    Academic article
  • Bjørkan, Ingmund; Svendsen, Torbjørn; Farner, Snorre. (2005) Comparing Spectral Distance Measures for Join Cost Optimization in Concatenative Speech Synthesis. Eurospeech : Proceedings of the European Conference on Speech Communication and Technology. volum 9.
    Academic article
  • Skogstad, Trond; Svendsen, Torbjørn. (2005) Distributed ASR Using Speech Coder Data for Efficient Feature Vector Representation. Eurospeech : Proceedings of the European Conference on Speech Communication and Technology. volum 9.
    Academic article
  • Svendsen, Torbjørn. (2003) Speech Technology: Past, Present and Future. Telektronikk. volum 99 (2).
    Academic article
  • Braverman, Marc; Svendsen, Torbjørn; Lund, Karl Erik; Aarø, Leif Edvard. (2001) Tobacco use by early adolescents in Norway. European Journal of Public Health. volum 11.
    Academic article
  • Svendsen, Torbjørn. (2001) Nordisk forskningssamarbeid innen språkteknologi. Språknytt. volum 3/2001.
    Popular scientific article
  • Amdal, Ingunn; Holter, Trym; Svendsen, Torbjørn. (2000) Modellering av uttalevariasjon for automatisk talegjenkjenning. Nordlyd. volum 28-2000.
    Academic article
  • Foldvik, Arne Kjell; Nordgård, Torbjørn; Svendsen, Torbjørn; Thygesen, Ragnar. (2000) Dysleksi og språkteknologi. Adresseavisen.
    Feature article
  • Holter, Trym; Svendsen, Torbjørn. (1999) Maximum likelihood modelling of pronunciation variation. Speech Communication. volum 29 (2-4).
    Academic article
  • Svendsen, Torbjørn. (1999) Taleteknologi. Språk i Norden.
    Academic article
  • Svendsen, Torbjørn. (1998) Blir norsk gresk for språkteknologien?. Språknytt. volum 26 (4-98).
    Academic article
  • Svendsen, Torbjørn. (1994) Talebaserte brukergrensesnitt. NORSIGnalet : organ for NORSIG, Norsk forening for signalbehandling.
    Popular scientific article

Part of book/report

  • Sabzi Shahrebabaki, Abdolreza; Olfati, Negar; Imran, Ali Shariq; Johnsen, Magne Hallstein; Siniscalchi, Sabato Marco; Svendsen, Torbjørn Karl. (2021) A Two-Stage Deep Modeling Approach to Articulatory Inversion. ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
    Academic chapter/article/Conference paper
  • Sabzi Shahrebabaki, Abdolreza; Siniscalchi, Sabato Marco; Salvi, Giampiero; Svendsen, Torbjørn Karl. (2021) A DNN Based Speech Enhancement Approach to Noise Robust Acoustic-to-Articulatory Inversion. Proceedings 2021 IEEE International Symposium on Circuits and Systems.
    Academic chapter/article/Conference paper
  • Imran, Ali Shariq; Haflan, Vetle; Sabzi Shahrebabaki, Abdolreza; Olfati, Negar; Svendsen, Torbjørn Karl. (2019) Evaluating Acoustic Feature Maps in 2D-CNN for Speaker Identification. ICMLC '19 Proceedings of the 2019 11th International Conference on Machine Learning and Computing.
    Academic chapter/article/Conference paper
  • Imran, Ali Shariq; Kastrati, Zenun; Svendsen, Torbjørn Karl; Kurti, Arianit. (2019) Text-Independent Speaker ID for Automatic Video Lecture Classification Using Deep Learning. ICCAI '19 Proceedings of the 2019 5th International Conference on Computing and Artificial Intelligence.
    Academic chapter/article/Conference paper
  • Imran, Ali Shariq; Sabzi Shahrebabaki, Abdolreza; Olfati, Negar; Svendsen, Torbjørn Karl. (2019) A Study on the Performance Evaluation of Machine Learning Models for Phoneme Classification. ICMLC '19 Proceedings of the 2019 11th International Conference on Machine Learning and Computing.
    Academic chapter/article/Conference paper
  • Sabzi Shahrebabaki, Abdolreza; Imran, Ali Shariq; Olfati, Negar; Svendsen, Torbjørn Karl. (2018) Acoustic Feature Comparison for Different Speaking Rates. Human-Computer Interaction. Interaction Technologies.
    Academic chapter/article/Conference paper
  • Svendsen, Torbjørn Karl; Hamar, Jarle Bauck. (2015) Combining NdHMM and Phonetic Feature Detection for Speech Recognition. Proceedings of European Signal Processing Conference.
    Academic chapter/article/Conference paper
  • Hamar, Jarle Bauck; Doddipatla, Rama Sanand; Svendsen, Torbjørn; Sreenivas, Thippur. (2013) Non-Negative Durational HMM. Proceedings of IEEE International Workshop on Machine Learning for Signal Processing 2013.
    Academic chapter/article/Conference paper
  • Adde, Line; Svendsen, Torbjørn. (2010) A Comparative Analysis of Discriminative and Non-Discriminative Pronunciation Priors in Pronunciation Variation Modeling. Proceedings of 2010 IEEE Workshop on Spoken Language Technology.
    Other
  • Adde, Line; Svendsen, Torbjørn. (2010) NameDat: A Database of English Proper Names Spoken by Native Norwegians. Proceedings of the Seventh conference on International Language Resources and Evaluation (LREC'10).
    Academic chapter/article/Conference paper
  • Saeidi, Rahim; Soufifar, Mehdi; Kinnunen, Tomi; Svendsen, Torbjørn; Fränti, Pasi. (2010) UEF-NTNU System Description for Albayzin 2010 Language Recognition Evaluation. Proceedings FALA 2010.
    Other
  • Sikveland, Rein Ove; Öttl, Anton; Amdal, Ingunn; Ernestus, Mirjam; Svendsen, Torbjørn; Edlund, Jens. (2010) Spontal-N: A Corpus of Interactional Spoken Norwegian. Proceedings of the Seventh conference on International Language Resources and Evaluation (LREC'10).
    Other
  • Siniscalchi, Sabato Marco; Svendsen, Torbjørn; Lee, Chin-Hui. (2010) A Survey on Recent Progress in the ASAT/SIRKUS Paradigm. Proceedings of 7th International Symposium on Chinese Spoken Language.
    Other
  • Mertens, Timo Pascal; Schneider, Daniel; Næss, Arild Brandrud; Svendsen, Torbjørn. (2009) Lexicon Adaptation for Subword Speech Recognition. Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition & Understanding.
    Academic chapter/article/Conference paper
  • Amdal, Ingunn; Strand, Ole Morten; Almberg, Jørn; Svendsen, Torbjørn. (2008) RUNDKAST: An Annotated Norwegian Broadcast News Speech Corpus. Proceedings of the 6th International Language Resources and Evaluation (LREC 2008).
    Academic chapter/article/Conference paper
  • Siniscalchi, Sabato Marco; Birkenes, Øystein; Johnsen, Magne Hallstein; Svendsen, Torbjørn. (2008) Joint Optimization of Event Detectors and Evidence Merger for Continuous Speech Recognition. Proceedings ISCA ITRW on Speech Analysis and Processing for Knowledge Discovery.
    Other
  • Siniscalchi, Sabato Marco; Svendsen, Torbjørn; lee, chin-hui. (2008) Toward a Detector-Based Universal Phone Recognizer. Proc. 2008 IEEE International Conference on Acoustics, Speech and Signal Processing.
    Other
  • Skogstad, Trond; Svendsen, Torbjørn. (2008) Time-Varying Cepstral Coefficients. Proceedings ISCA ITRW on Speech Analysis and Processing for Knowledge Discovery.
    Other
  • Siniscalchi, Sabato Marco; Svendsen, Torbjørn; Lee, Chin-Hui. (2007) Towards Bottom-Up Continuous Phone Recognition. Proceedings 2007 IEEE Workshop on Automatic Speech Recognition and Understanding.
    Academic chapter/article/Conference paper
  • Amdal, Ingunn; Johnsen, Magne Hallstein; Svendsen, Torbjørn. (2006) Log Likelihood Ratio Based Annotation Verification of a Norwegian Speech Synthesis Database. Proceedings of the 7th Nordic Signal Processing Symposium (NORSIG 2006).
    Academic chapter/article/Conference paper
  • Amdal, Ingunn; Svendsen, Torbjørn. (2006) FonDat1: A Speech Synthesis Corpus for Norwegian. Proceedings of the 5th International Conference on Language Resources and Evaluation (LREC 2006).
    Academic chapter/article/Conference paper
  • Meen, Dyre; Svendsen, Torbjørn; Natvig, Jon-Emil. (2005) Improving Phone Label Alignment Accuracy by Utilizing Voicing Information. SPECOM 2005 Proceedings.
    Academic chapter/article/Conference paper
  • Svendsen, Torbjørn; Amdal, Ingunn; Bjørkan, Ingmund; Meen, Dyre; Heggtveit, Per Olav; Natvig, Jon Emil. (2005) FONEMA - Tools for realistic speech synthesis in Norwegian. Proceedings of Norwegian Signal Processing Symposium 2005 (NORSIG-05).
    Academic chapter/article/Conference paper
  • Svendsen, Torbjørn; Egeberg, Andreas; Holter, Trym; Skogstad, Trond. (2005) VOCALS - Voice centric user interfaces for location based services. Proceedings of Norwegian Signal Processing Symposium 2005 (NORSIG-05).
    Academic chapter/article/Conference paper
  • Nordgård, Torbjørn; Svendsen, Torbjørn; Harborg, Erik; Kvale, Knut. (2004) Language Technology Towards 2020. Infosam2020, Information Society of 2020.
    Academic chapter/article/Conference paper
  • Svendsen, Torbjørn. (2002) Roles for Speech And Language Technology in The Information Society. Perspectives on the age of the information society.
    Academic chapter/article/Conference paper

Report

  • Soufifar, Mehdi; Svendsen, Torbjørn; Burget, Lukas. (2014) Subspace Modeling of Discrete features for Language Recognition. 2014. ISBN 978-82-326-0496-8.
    Doctoral dissertation
  • Nordgård, Torbjørn; Svendsen, Torbjørn; Breivik, Torbjørg. (2002) Samling og tilgjengeleggjering av norske språkteknologiressursar. 2002.
    Report
  • Nordgård, Torbjørn; Svendsen, Torbjørn; Natvig, Jon Emil. (2002) Talsmann talesyntese som hjelpemiddel for dyslektikere. 2002.
    Report
  • Svendsen, Torbjørn; Johnsen, Magne Hallstein; Nordgård, Torbjørn; Hofland, Knut; Hofland, Knut; Ore, Christian Emil; Ore, Christian Emil. (1999) Nasjonalt korpus for språkteknologi - forprosjekt. 1999.
    Report
  • Harborg, Erik; Johnsen, Magne Hallstein; Svendsen, Torbjørn. (1995) Talegjenkjenning II. 1995.
    Report
  • Harborg, Erik; Johnsen, Magne Hallstein; Svendsen, Torbjørn. (1995) Talegjenkjenning for teksting av direktesendte programmer - en studie. 1995.
    Report

Teaching

Courses

  • TT8108 - PhD Seminar in Signal Processing

Media

2018

  • Popular scientific lecture
    Øien, Geir Egil Dahle; Mengshoel, Ole Jakob; Ramampiaro, Heri; Svendsen, Torbjørn Karl. (2018) NTNUs strategiske satsing på kunstig intelligens (AI) – bakgrunn, aktiviteter og fremtidsvyer. Medlemsmøte, Det Kongelige Norske Vitenskapers Selskap . Det Kongelige Norske Vitenskapers Selskap; Trondheim. 2018-11-12 - 2018-11-12.

2011

  • Academic lecture
    Javier Rodriguez-Fuentes, Luis; Penagarikano, Mikel; Varona, Amparo; Diez, Mireia; Bordel, German; Martinez, David; Villalba, Jesus; Miguel, Antonio; Ortega, Alfonso; Lleida, Eduardo; Abad, Alberto; Koller, Oscar; Trancoso, Isabel; Lopez-Otero, Paula; Docio-Fernandez, Laura; Garcia-Mateo, Carmen; Saeidi, Rahim; Soufifar, Mehdi; Kinnunen, Tomi. (2011) MULTI-SITE HETEROGENEOUS SYSTEM FUSIONS FOR THE ALBAYZIN 2010 LANGUAGE RECOGNITION EVALUATION. Automatic Speech Recognition and Understanding . IEEE; Big Island, Hawaii. 2011-12-11 - 2011-12-15.
  • Popular scientific lecture
    Svendsen, Torbjørn. (2011) Hva er det med tale? Forskningsutfordringer og aktiviteter innen taleteknologi. På snakkis med teknologien . MediaLT; Oslo. 2011-11-09 - 2011-11-09.
  • Academic lecture
    Svendsen, Torbjørn. (2011) Universal Speech Attribute Characterization for Automatic Speech Recognition and Spoken Language Recognition. CSAIL Seminar . MIT CSAIL; Boston. 2011-12-05 - 2011-12-05.

2010

  • Academic lecture
    Sikveland, Rein Ove; Öttl, Anton; Amdal, Ingunn; Ernestus, Mirjam; Svendsen, Torbjørn; Edlund, Jens. (2010) Spontal-N: A Corpus of Interactional Spoken Norwegian. LREC . ELDA; Valetta. 2010-05-17 - 2010-05-23.
  • Academic lecture
    Adde, Line; Reveil, Bert; Martens, Jean-Pierre; Svendsen, Torbjørn. (2010) A Minimum Classification Error Approach to Pronunciation Variation Modeling of Non-Native Proper Names. Interspeech 2010 . ISCA; Makuhari. 2010-09-27 - 2010-09-30.
  • Academic lecture
    Adde, Line; Svendsen, Torbjørn. (2010) A Comparative Analysis of Discriminative and Non-Discriminative Pronunciation Priors in Pronunciation Variation Modeling. IEEE Workshop on Spoken Language Technology 2010 . IEEE; Berkeley, California. 2010-12-12 - 2010-12-15.
  • Academic lecture
    Adde, Line; Svendsen, Torbjørn. (2010) NameDat: A Database of English Proper Names Spoken by Native Norwegians. LREC . ELDA; Valetta. 2010-05-17.
  • Academic lecture
    Meen, Dyre; Svendsen, Torbjørn. (2010) The NTNU Concatenative Speech Synthesizer. Blizzard Challenge Workshop . ISCA; Kyoto. 2010-09-25 - 2010-09-25.
  • Academic lecture
    Saeidi, Rahim; Soufifar, Mehdi; Kinnunen, Tomi; Svendsen, Torbjørn; Fränti, Pasi. (2010) UEF-NTNU System Description for Albayzin 2010 Language Recognition Evaluation. FALA 2010 . University of Vigo; Vigo. 2010-10-10 - 2010-10-12.
  • Academic lecture
    Siniscalchi, Sabato Marco; Reed, Jeremy; Svendsen, Torbjørn; Lee, Chin-Hui. (2010) Exploiting Context-Dependency and Acoustic Resolution of Universal Speech Attribute Models in Spoken Language Recognition. Interspeech 2010 . ISCA; Makuhari. 2010-09-27 - 2010-09-30.
  • Academic lecture
    Siniscalchi, Sabato Marco; Svendsen, Torbjørn; Lee, Chin-Hui. (2010) A Survey on Recent Progress in the ASAT/SIRKUS Paradigm. ISCSLP 2010 . IEEE; Tainan. 2010-11-21 - 2010-12-03.
  • Academic lecture
    Siniscalchi, Sabato Marco; Svendsen, Torbjørn; Sorbello, Filippo; Lee, Chin-Hui. (2010) Experimental Studies on Continuous Speech Recognition Using Neural Architectures with ‘Adaptive’ Hidden Activation Functions. ICASSP 2010 . IEEE; Dallas, Texas. 2010-03-14 - 2010-03-19.
  • Academic lecture
    Skogstad, Trond; Svendsen, Torbjørn. (2010) Intra-Frame Variability As a Predictor of Frame Classifiability. Interspeech 2010 . ISCA; Makuhari. 2010-09-27 - 2010-09-30.

2009

  • Academic lecture
    Siniscalchi, Sabato Marco; Reed, Jeremy; Svendsen, Torbjørn; Lee, Chin-Hui. (2009) Exploring Universal Attribute Characterization of Spoken Languages for Spoken Language Recognition. Interspeech . ISCA; Brighton. 2009-09-06 - 2009-09-10.
  • Academic lecture
    Siniscalchi, Sabato Marco; Svendsen, Torbjørn; Lee, Chin-Hui. (2009) A Phonetic Feature Based Lattice Rescoring Approach to LVCSR. IEEE International Conference on Acoustics, Speech and Signal Processing . IEEE; Taipei. 2009-04-19 - 2009-04-24.
  • Interview
    Svendsen, Torbjørn. (2009) Språkteknologien gjør fremskritt igjen. forskning.no [Internett]. 2009-04-09.
  • Interview
    Svendsen, Torbjørn. (2009) VERDIKT på Forskningsdagene. Nytt fra VERDIKT [Avis]. 2009-11-03.

2008

  • Academic lecture
    Amdal, Ingunn; Strand, Ole Morten; Almberg, Jørn; Svendsen, Torbjørn. (2008) RUNDKAST: An Annotated Norwegian Broadcast News Speech Corpus. LREC 2008 . European Language Resources Association; Marrakech. 2008-05-26 - 2008-05-31.
  • Academic lecture
    Amdal, Ingunn; Svendsen, Torbjørn; Johnsen, Magne Hallstein; Siniscalchi, Sabato Marco; Hamar, Jarle Bauck; Martinez, Del Hoyo Canterla A.. (2008) SIRKUS - A new paradigm for speech recognition. VERDIKT Conference 2008 . Norges forskningsråd; Bergen. 2008-10-29 - 2008-10-30.
  • Academic lecture
    Siniscalchi, Sabato Marco; Birkenes, Øystein; Johnsen, Magne Hallstein; Svendsen, Torbjørn. (2008) Joint Optimization of Event Detectors and Evidence Merger for Continuous Speech Recognition. ISCA ITRW on Speech Analysis and Processing for Knowledge Discovery . ISCA; Aalborg. 2008-06-04 - 2008-06-06.
  • Academic lecture
    Siniscalchi, Sabato Marco; Svendsen, Torbjørn; lee, chin-hui. (2008) A Penalized Logistic Regression Approach to Detection Based Phone Classification. Interspeech 2008 . ISCA; Brisbane. 2008-09-22 - 2008-09-26.
  • Academic lecture
    Siniscalchi, Sabato Marco; Svendsen, Torbjørn; lee, chin-hui. (2008) Toward a Detector-Based Universal Phone Recognizer. International Conference on Acoustics, Speech and Signal Processing . IEEE; Las Vegas. 2008-03-30 - 2008-04-04.
  • Academic lecture
    Skogstad, Trond; Svendsen, Torbjørn. (2008) Time-Varying Cepstral Coefficients. ISCA ITRW on Speech Analysis and Processing for Knowledge Discovery . ISCA; Aalborg. 2008-06-04 - 2008-06-06.
  • Interview
    Svendsen, Torbjørn. (2008) Norsk språkbank. Språkteigen, NRK P2 [Radio]. 2008-08-24.
  • Interview
    Svendsen, Torbjørn. (2008) Norsk talesyntese. P4 [Radio]. 2008-02-08.
  • Interview
    Svendsen, Torbjørn. (2008) Taleteknologi. God morgen Norge [TV]. 2008-02-08.

2007

  • Academic lecture
    Siniscalchi, Sabato Marco; Svendsen, Torbjørn; Lee, Chin-Hui. (2007) Towards Bottom-Up Continuous Phone Recognition. 2007 IEEE Workshop on Automatic Speech Recognition and Understanding . IEEE; Kyoto. 2007-12-09 - 2007-12-13.
  • Academic lecture
    Svendsen, Torbjørn. (2007) Articulatory Features and Segmental Information for Automatic Speech Recognition. ESF Exploratory Workshop on Models of Language Evolution, Acquisition and Processing . European Science Foundation; Leuven. 2007-11-25 - 2008-11-28.
  • Interview
    Svendsen, Torbjørn; Abelsen, Atle. (2007) IKE i hver puslebit. Bladet Forskning [Avis]. 2007-12-01.

2006

  • Academic lecture
    Svendsen, Torbjørn. (2006) Task and speaker adaptation. WISSAP'06 . IEEE og ISCA; 2006-01-04 - 2006-01-07.
  • Academic lecture
    Amdal, Ingunn; Johnsen, Magne Hallstein; Svendsen, Torbjørn. (2006) Log Likelihood Ratio Based Annotation Verification of a Norwegian Speech Synthesis Database. NORSIG 2006 . NORSIG; Reykjavik. 2006-06-07 - 2006-06-09.
  • Poster
    Amdal, Ingunn; Svendsen, Torbjørn. (2006) FonDat1: A Speech Synthesis Corpus for Norwegian. LREC 2006 . European Language Resources Association; Genova. 2006-05-22 - 2006-05-28.
  • Academic lecture
    Nordgård, Torbjørn; Svendsen, Torbjørn. (2006) Et norsk uttaleleksikon møter en spontan virkelighet. Oslomålet - et seminar med forskning fra NoTa-korpuset . Universitetet i Oslo; Oslo. 2006-11-23 - 2006-11-24.

2005

  • Poster
    Amdal, Ingunn; Svendsen, Torbjørn. (2005) Unit Selection Synthesis Database Development Using Utterance Verification. Interspeech 2005 . ISCA; Lisboa. 2005-09-04 - 2005-09-08.
  • Poster
    Bjørkan, Ingmund; Svendsen, Torbjørn; Farner, Snorre. (2005) Comparing Spectral Distance Measures for Join Cost Optimization in Concatenative Speech Synthesis. Interspeech 2005 . ISCA; Lisboa. 2005-09-04 - 2005-09-08.
  • Poster
    Meen, Dyre; Svendsen, Torbjørn; Natvig, Jon-Emil. (2005) Improving Phone Label Aligment Accuracy by Utilizing Voicing Information. SPECOM 2005 . University of Patras, Wire Communications Laboratory; Patras. 2005-10-17 - 2005-10-19.
  • Poster
    Skogstad, Trond; Svendsen, Torbjørn. (2005) Distributed ASR Using Speech Coder Data for Efficient Feature Vector Representation. Eurospeech 2005 . ISCA; Lisboa. 2005-09-04 - 2005-09-08.
  • Academic lecture
    Svendsen, Torbjørn; Amdal, Ingunn; Bjørkan, Ingmund; Meen, Dyre; Heggtveit, Per Olav; Natvig, Jon Emil. (2005) FONEMA - Tools for realistic speech synthesis in Norwegian. NORSIG 05 . NORSIG; Stavanger. 2005-09-22 - 2005-09-24.
  • Academic lecture
    Svendsen, Torbjørn; Egeberg, Andreas; Holter, Trym. (2005) VOCALS - Voice centric user interfaces for location based services. NORSIG 05 . NORSIG; Stavanger. 2005-09-22 - 2005-09-24.

2004

  • Academic lecture
    Svendsen, Torbjørn. (2004) Pronunciation Modeling for Speech Technology. 2004 International Conference on Signal Processing and Communications . IEEE Signal Processing Society and Indian Institute of Scien; Bangalore. 2004-12-11 - 2004-12-14.
  • Academic lecture
    Øien, Geir Egil; Holte, Nils; Andresen, Steinar; Svendsen, Torbjørn; Hammer, Mikael. (2004) Communication technology towards 2020. INFOSAM-2020 conference . IME-fakultetet, NTNU/Teknologirådet; Trondheim. 2004-04-19 - 2004-04-20.

2003

  • Poster
    Martin, Terrence; Svendsen, Torbjørn; Sridharan, Sridha. (2003) Cross-Lingual Pronunciation Modelling for Indonesian Speech Recognition. Eurospeech 2003 . [Mangler data]; Geneve. 2003-09-04.
  • Popular scientific lecture
    Svendsen, Torbjørn. (2003) FONEMA - Metodeutvikling for naturtro norsk talesyntese. KUNSTI-seminar 2003 . [Mangler data]; Bergen. 2003-11-18.
  • Academic lecture
    Svendsen, Torbjørn. (2003) Pronunciation Modelling for Speech Technology. [Mangler data] . Queenslad University of Technology; Brisbane, Australia. 2003-05-30.
  • Popular scientific lecture
    Svendsen, Torbjørn. (2003) Snakke dialekt med mobilen? Om dialektbruk i ny språkteknologi. [Mangler data] . Noregs mållag; Oslo. 2003-09-28.
  • Popular scientific lecture
    Svendsen, Torbjørn. (2003) Speech Processing Activities at NTNU: An Overview. Nordic Speech Technology Seminar . [Mangler data]; Stockholm. 2003-11-14.
  • Poster
    Wong, Eddie; Martin, Terrence; Svendsen, Torbjørn; Sridharan, Sridha. (2003) Multilingual Phone Clustering for Recognition of Spontaneous Indonesian Speech Utilising Pronunciation Modelling Techniques. Eurospeech 2003 . [Mangler data]; Geneve. 2003-09-04.

2002

  • Academic lecture
    Amdal, Ingunn; Svendsen, Torbjørn. (2002) Evaluation of pronunciation variants in the ASR lexicon for different speaking styles. Third International Conference on Language Resources and Evaluation . [Mangler data]; Las Palmas de Gran Canaria, Spain. 2002-05-31.

2001

  • Academic lecture
    Johnsen, Magne Hallstein; Harborg, Erik; Svendsen, Torbjørn; Amble, Tore; Holter, Trym; Myrvoll, Tor Andre; Nordgård, Torbjørn. (2001) SPODIS - Spoken Dialog Systems for Telephony. NORSIG-2001, Norwegian Signal Processing Symposium . [Mangler data]; Trondheim, Norway, October 18-20 2001.
  • Poster
    Myrvoll, Tor Andre; Paliwal, Kuldip K.; Svendsen, Torbjørn. (2001) Fast Adaptation using Constrained Affine Transformations with Hierarchical Priors. Eurospeech 2001 . [Mangler data]; Aalborg, Sept 3-7, 2001.

2000

  • Academic lecture
    Holter, Trym; Harborg, Erik; Johnsen, Magne Hallstein; Svendsen, Torbjørn. (2000) ASR-Based Subtitiling of Live TV-Programs for the Hearing Impaired. 6th International Conference on Spoken Language Processing . [Mangler data]; Beijing, Oct. 16-20, 2000.
  • Academic lecture
    Johnsen, Magne Hallstein; Holter, Trym; Svendsen, Torbjørn; Harborg, Erik. (2000) Stochastic Modelling of Semantic Content for Use in a Spoken Dialogue System. 6th International Conference on Spoken Language Processing . [Mangler data]; Beijing, Oct. 16-20, 2000.
  • Academic lecture
    Johnsen, Magne Hallstein; Svendsen, Torbjørn; Amble, Tore; Holter, Trym; Harborg, Erik. (2000) TABOR - A Norwegian Spoken Dialogue System for Bus Travel Information. 6th International Conference on Spoken Language Processing . [Mangler data]; Beijing, Oct. 16-20, 2000.
  • Popular scientific lecture
    Svendsen, Torbjørn. (2000) Norsk språkbank, et nasjonalt korpus for språkteknologi. [Mangler data] . [Mangler data]; Statssekretærutvalget for IT, Oslo, 12. januar, 2000.
  • Popular scientific lecture
    Svendsen, Torbjørn. (2000) Ordets makt � om taleteknologi som hjelpemiddel for funksjonshemmede. [Mangler data] . [Mangler data]; "Selvstendig liv", Sjølyst, 12. april, 2000.
  • Academic lecture
    Svendsen, Torbjørn. (2000) Pronunciation modeling for improved recognition of names. [Mangler data] . [Mangler data]; AT&T Labs, Florham Park, New Jersey, 15. september 2000.
  • Popular scientific lecture
    Svendsen, Torbjørn. (2000) Taleteknologi- teknologi med potensiale for kvalitetsheving og effektivisering ved håndtering av informasjon i sykehus. [Mangler data] . [Mangler data]; Norges tekniske vitenskapsakademi, Trondheim, 22. februar, 2000.
  • Popular scientific lecture
    Svendsen, Torbjørn; Johnsen, Magne Hallstein. (2000) �Sesam sesam!� - Kan taleteknologi bli en døråpner for funksjonshemmede?. [Mangler data] . [Mangler data]; Rehabiliteringskonferansen, Trondheim, 20. juni, 2000.

1999

  • Academic lecture
    Amdal, Ingunn; Holter, Trym; Svendsen, Torbjørn. (1999) Maximum likelihood pronunciation modelling of Norwegian natural numbers for automatic speech recognition. NORSIG'99 . [Mangler data]; Asker, september 1999.
  • Academic lecture
    Amdal, Ingunn; Holter, Trym; Svendsen, Torbjørn. (1999) Modellering av uttalevariasjon for automatisk talegjenkjenning. Møte om norsk språk (MONS 8) . [Mangler data]; Tromsø, 18.-20. november 1999.
  • Academic lecture
    Harborg, Erik; Holter, Trym; Johnsen, Magne Hallstein; Svendsen, Torbjørn. (1999) Generation of closed captions for live TV-programs using speech recognition. Norsig'99 . [Mangler data]; Asker, September 1999.
  • Academic lecture
    Harborg, Erik; Holter, Trym; Johnsen, Magne Hallstein; Svendsen, Torbjørn. (1999) On-line captioning of TV-programs for the hearing impaired. EuroSpeech'99 . [Mangler data]; Budapest, Ungarn.
  • Poster
    Harborg, Erik; Holter, Trym; Johnsen, Magne Hallstein; Svendsen, Torbjørn. (1999) Subtitling of live broadcast TV-programs for the hearing impaired. AAATE'99 . [Mangler data]; Dusseldorf, November 1999.
  • Academic lecture
    Johnsen, Magne Hallstein; Svendsen, Torbjørn. (1999) Menneske/maskin-kommunikasjon basert på tale. MONS-8 (8nde Møte Om Norsk Språk) . [Mangler data]; Tromsø, Norway, Nov. 1999.
  • Popular scientific lecture
    Yang, Qian; Cremelie, Nick; Holter, Trym; Martens, Jean-Pierre; Svendsen, Torbjørn; Ringland, Simon. (1999) Lexicon building and word accuracy in continuous speech recognition. COST 249 meeting, Prague . [Mangler data]; Prague, Czech Republic, February 1999.

1998

  • Academic lecture
    Holter, Trym; Svendsen, Torbjørn. (1998) Maximum likelihood modelling of pronunciation variation. ESCA Tutorial and Research Workshop on Modeling Pronunciation Variation for ASR . [Mangler data]; Rolduc.
  • Academic lecture
    Svendsen, Torbjørn. (1998) SPODIS - Spoken dialog systems for telephony services. Studiemøtet i elektronikk og data . [Mangler data]; Kristiansand.
  • Popular scientific lecture
    Svendsen, Torbjørn. (1998) Speech processing activities at NTNU. [Mangler data] . [Mangler data]; KTH, Stockholm.
  • Popular scientific lecture
    Svendsen, Torbjørn. (1998) Taleteknolog. Nordisk språkmøte . [Mangler data]; Trondheim.
  • Popular scientific lecture
    Svendsen, Torbjørn. (1998) Taleteknologi ved NTNU. Aalborg workshop in speech communication . [Mangler data]; Aalborg.

1997

  • Academic lecture
    Holter, Trym; Svendsen, Torbjørn. (1997) A joint segmentation and labelling scheme for use in acoustic subword based speech recognition. Norwegian Signal Processing Symposium . [Mangler data]; Tromsø.
  • Academic lecture
    Holter, Trym; Svendsen, Torbjørn. (1997) Combined optimisation of baseforms and model parameters in speech recognition based on acoustic subword units. IEEE Speech recognition Workshop . [Mangler data]; Santa Barbara, Calif..
  • Popular scientific lecture
    Holter, Trym; Svendsen, Torbjørn. (1997) Combined optimisation of baseforms and model parameters in speech recognition based on acoustic sub-word units. [Mangler data] . [Mangler data]; AT&T Labs, Florham Park, NJ, USA.
  • Academic lecture
    Holter, Trym; Svendsen, Torbjørn. (1997) Incorporating linguistic knowledge and automatic baseform generation in acoustic subword unit based speech recognition. Eurospeech '97 . [Mangler data]; Rhodos.
  • Popular scientific lecture
    Svendsen, Torbjørn. (1997) Acoustic subwords - some applications in speech processing. [Mangler data] . [Mangler data]; Griffith University, Brisbane, Australia.
  • Popular scientific lecture
    Svendsen, Torbjørn. (1997) Some topics from recent work in speech processing. [Mangler data] . [Mangler data]; Motorola Research Labs, Sydney og University of Wollongong.
  • Popular scientific lecture
    Svendsen, Torbjørn. (1997) Speech recognition based on acoustic subword units. [Mangler data] . [Mangler data]; Telenor FoU, Kjeller.

1996

  • Academic lecture
    Pihl, Johnny; Johnsen, Magne Hallstein; Svendsen, Torbjørn. (1996) A VLSI implementation of pdf computations in HMM based speech recognition. TENCON-96 . IEEE; Perth. 1996-11-27 - 1996-11-29.

1995

  • Academic lecture
    Johnsen, Magne Hallstein; Svendsen, Torbjørn; Harborg, Erik. (1995) Experiments on cepstral mean subtraction and Rasta-filtering applied to SAMPA phoneme recognition. COST249 . COST; Nancy. 1995-05-06 - 1995-05-07.

1994

  • Popular scientific lecture
    Svendsen, Torbjørn. (1994) Acoustic segmentation of speech : applications in speech processing. [Mangler data] . [Mangler data]; [Mangler data].
  • Popular scientific lecture
    Svendsen, Torbjørn. (1994) Acoustic segmentation of speech : applications in speech processing. [Mangler data] . [Mangler data]; [Mangler data].
  • Academic lecture
    Svendsen, Torbjørn. (1994) Segmental quantization of speech spectral information. IEEE International Conference on Acoustics, Speech and Signal Processing . [Mangler data]; [Mangler data].

1993

  • Academic lecture
    Svendsen, Torbjørn. (1993) Efficient quantization of speech spectral information. EUROSPEECH '93 (1993 : Berlin) . [Mangler data]; [Mangler data].

1989

  • Academic lecture
    Svendsen, Torbjørn Karl; Paliwal, Kuldip K.; Harborg, Erik; Husøy, Per Ove. (1989) An Improved Sub-Word Based Speech Recognizer. International Conference on Acoustics, Speech, and Signal Processing (ICASSP) ; Glasgow. 1989-05-01.

1988

  • Academic lecture
    Svendsen, Torbjørn Karl; Paliwal, K.K.; Harborg, Erik; Husøy, P.O.. (1988) Experiments with a Sub-Word Based Speech Recognizer. International Conference on Speech Science and Technology (ICSST) ; Sydney. 1988-12-01.
NTNU
Studies
  • Master's programmes in English
  • For exchange students
  • PhD opportunities
  • Courses
  • Career development
  • Continuing education
  • Application process
Contact
  • Contact NTNU
  • Employees
  • For alumni
  • Press contacts
  • Researcher support
Discover NTNU
  • Experts
  • Vacancies
  • Pictures from NTNU
  • Innovation resources
  • NTNU in Gjøvik
  • NTNU in Trondheim
  • NTNU in Ålesund
  • Maps
About NTNU
  • NTNU's strategy
  • Research excellence
  • Strategic research areas
  • Organizational chart
  • Libraries
  • About the university
Services
  • For employees
  • For students
  • Blackboard
  • Intranet

Norwegian University of Science and Technology

Use of cookies
Accessibility statement (in Norwegian)
Privacy policy
Editoral responsibility
Sign In