Torbjørn Karl Svendsen
About
Torbjørn Svendsen (1955) is a Professor at the Department of Electronic Systems. He holds an MScEE and a PhD, both from NTNU.
Fields of interest and present research activities
My research has, from the outset in 1979, centred on speech signal processing. The first period focused on source coding, i.e. speech compression, which was also the subject of my doctoral thesis. Since the mid-1980s my research has mainly concerned automatic speech recognition, but it has also included areas such as spoken dialogue systems and speech synthesis. Speech analysis methods and lexical modelling, e.g. pronunciation modelling, have been two central areas. Because current approaches to speech recognition seem to be nearing a saturation point in terms of performance, a major activity in the last 5-year period has been to investigate new paradigms for speech recognition, aiming to integrate phonetic and linguistic knowledge in a statistical framework based on detection of (language-universal) phonetic features.
Work experience
- NTNU (1979-1981 Research assistant, 1983-1984 doctoral fellowship, 1988-1995 Associate professor, 1995-present Professor), Director NTNU Digital (2015-2021)
- SINTEF (1981-1987, Research scientist)
- Research visits at AT&T Bell Labs, Murray Hill, NJ (1986-1987, 1990); Griffith University, Brisbane, Australia (1996-97); AT&T Labs, Florham Park, NJ (2000); Queensland University of Technology, Brisbane, Australia (2002-03); Computer Science and Artificial Intelligence Lab, Massachusetts Institute of Technology, Cambridge, MA (2013)
Professional merits
Peer review and professional evaluation work:
- Reviewer for international journals such as IEEE Transactions (Communications; Signal Processing; Audio, Speech and Language Processing; Multimedia); EURASIP Journal on Applied Signal Processing; Signal, Image and Video Processing; and Speech Communication, as well as for various conferences and workshops on speech and signal processing.
- Member of Speech Communication journal Editorial Board
- Reviewer for EU's Language Engineering program and the Information Society Research Programme of the Academy of Finland. Project reviews for the Norwegian, Australian, Swiss, Dutch, Belgian and South African Research Councils
- Opponent/member of examination boards for 26 doctoral theses
Membership in academic and professional committees
- Various appointments at the national level, e.g. in the Research Council of Norway, incl. grant committee member for the IKTPLUSS program, program board chair for the VERDIKT program, and in the Norwegian Language Council.
- Member of advisory board, Norwegian Language Bank (“Språkbanken”)
- Member of technical committees, Eurospeech 2001 and Interspeech 2012, and of the organizing committee of Eurospeech 2001.
- Senior Member, IEEE Signal Processing Society Speech Technical Committee (1998-2001)
- Elected member, Norwegian Academy of Technological Sciences
- Vice president, International Speech Communication Association (ISCA)
Other professional merits
- Project manager, "Atomic Units for Language Universal Speech" (current). Past projects: "Spoken dialog systems for telephony"; "Speech interfaces and reasoning systems"; "Norwegian corpus for language technology"; "Voice centric user interfaces for location based services"; "Tools for realistic speech synthesis in Norwegian"; "Spoken Information Retrieval by Knowledge Utilization in Statistical Speech Processing"; "Rundkast – A transcribed broadcast news for applications in language technology".
- Vice chair, COST action 278; WG chair COST actions 232 and 249; Advisory Scientific Board member, EU project ACORNS; Board member, Nordic Graduate School of Language Technology (former actions and activities)
- Previous NTNU appointments: Department Head, Department of Telecommunications; Vice Dean, Faculty of Electrical Engineering and Telecommunications; member of several NTNU committees
- 16 PhD students graduated (2 as co-supervisor). Currently supervising 2 PhD students.
- ~80 Master degree students graduated
- ~85 papers in international journals and conferences
Publications
2022
-
Getman, Yaroslav;
Al-Ghezi, Ragheb;
Voskoboinik, Ekaterina;
Grósz, Tamás;
Kurimo, Mikko;
Salvi, Giampiero;
Svendsen, Torbjørn Karl;
Strömbergsson, Sofia.
(2022)
wav2vec2-based Speech Rating System for Children with Speech Sound Disorder.
Interspeech (USB).
Academic article
-
Oudijk, Esmée;
Hasler, Oliver Kevin;
Øveraas, Henning;
Marty, Sabine;
Williamson, David Roddan;
Svendsen, Torbjørn Karl;
Berg, Simen;
Birkeland, Roger;
Halvorsen, Daniel Ørnes;
Bakken, Sivert;
Henriksen, Marie Bøe;
Alver, Morten;
Johnsen, Geir;
Johansen, Tor Arne;
Stahl, Annette;
Kvaløy, Pål;
Dallolio, Alberto;
Majaneva, Sanna;
Fragoso, Glaucia Moreira.
(2022)
Campaign For Hyperspectral Data Validation In North Atlantic Coastal Waters.
Workshop on Hyperspectral Image and Signal Processing, Evolution in Remote Sensing.
Academic article
-
Rugayan, Janine Lizbeth Cabrera;
Svendsen, Torbjørn Karl;
Salvi, Giampiero.
(2022)
Semantically Meaningful Metrics for Norwegian ASR Systems.
Interspeech (USB).
Academic article
2021
-
Sabzi Shahrebabaki, Abdolreza;
Olfati, Negar;
Imran, Ali Shariq;
Johnsen, Magne Hallstein;
Siniscalchi, Sabato Marco;
Svendsen, Torbjørn Karl.
(2021)
A Two-Stage Deep Modeling Approach to Articulatory Inversion.
ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
Academic chapter/article/Conference paper
-
Sabzi Shahrebabaki, Abdolreza;
Salvi, Giampiero;
Svendsen, Torbjørn Karl;
Siniscalchi, Sabato Marco.
(2021)
Acoustic-to-Articulatory Mapping With Joint Optimization of Deep Speech Enhancement and Articulatory Inversion Models.
IEEE/ACM Transactions on Audio, Speech, and Language Processing (TASLP).
volume 30.
Academic article
-
Sabzi Shahrebabaki, Abdolreza;
Siniscalchi, Sabato Marco;
Salvi, Giampiero;
Svendsen, Torbjørn Karl.
(2021)
A DNN Based Speech Enhancement Approach to Noise Robust Acoustic-to-Articulatory Inversion.
Proceedings 2021 IEEE International Symposium on Circuits and Systems.
Academic chapter/article/Conference paper
-
Sabzi Shahrebabaki, Abdolreza;
Siniscalchi, Sabato Marco;
Svendsen, Torbjørn Karl.
(2021)
Raw Speech-to-Articulatory Inversion by Temporal Filtering and Decimation.
Interspeech.
Academic article
2020
-
Sabzi Shahrebabaki, Abdolreza;
Olfati, Negar;
Siniscalchi, Sabato Marco;
Salvi, Giampiero;
Svendsen, Torbjørn.
(2020)
Transfer learning of articulatory information through phone information.
Interspeech (USB).
Academic article
-
Sabzi Shahrebabaki, Abdolreza;
Siniscalchi, Sabato Marco;
Salvi, Giampiero;
Svendsen, Torbjørn Karl.
(2020)
Sequence-to-sequence articulatory inversion through time convolution of sub-band frequency signals.
Interspeech (USB).
Academic article
2019
-
Imran, Ali Shariq;
Haflan, Vetle;
Sabzi Shahrebabaki, Abdolreza;
Olfati, Negar;
Svendsen, Torbjørn Karl.
(2019)
Evaluating Acoustic Feature Maps in 2D-CNN for Speaker Identification.
ICMLC '19 Proceedings of the 2019 11th International Conference on Machine Learning and Computing.
Academic chapter/article/Conference paper
-
Imran, Ali Shariq;
Kastrati, Zenun;
Svendsen, Torbjørn Karl;
Kurti, Arianit.
(2019)
Text-Independent Speaker ID for Automatic Video Lecture Classification Using Deep Learning.
ICCAI '19 Proceedings of the 2019 5th International Conference on Computing and Artificial Intelligence.
Academic chapter/article/Conference paper
-
Imran, Ali Shariq;
Sabzi Shahrebabaki, Abdolreza;
Olfati, Negar;
Svendsen, Torbjørn Karl.
(2019)
A Study on the Performance Evaluation of Machine Learning Models for Phoneme Classification.
ICMLC '19 Proceedings of the 2019 11th International Conference on Machine Learning and Computing.
Academic chapter/article/Conference paper
-
Sabzi Shahrebabaki, Abdolreza;
Imran, Ali Shariq;
Olfati, Negar;
Svendsen, Torbjørn Karl.
(2019)
A Comparative Study of Deep Learning Techniques on Frame-Level Speech Data Classification.
Circuits, systems, and signal processing.
volume 38.
Academic article
-
Sabzi Shahrebabaki, Abdolreza;
Olfati, Negar;
Imran, Ali Shariq;
Siniscalchi, Sabato Marco;
Svendsen, Torbjørn Karl.
(2019)
A Phonetic-Level Analysis of Different Input Features for Articulatory Inversion.
Interspeech (USB).
Academic article
2018
-
Sabzi Shahrebabaki, Abdolreza;
Imran, Ali Shariq;
Olfati, Negar;
Svendsen, Torbjørn Karl.
(2018)
Acoustic Feature Comparison for Different Speaking Rates.
Human-Computer Interaction. Interaction Technologies.
Academic chapter/article/Conference paper
2015
-
Svendsen, Torbjørn Karl;
Hamar, Jarle Bauck.
(2015)
Combining NdHMM and Phonetic Feature Detection for Speech Recognition.
Proceedings of European Signal Processing Conference.
Academic chapter/article/Conference paper
2014
-
Siniscalchi, Sabato Marco;
Svendsen, Torbjørn;
Lee, Chin-Hui.
(2014)
An artificial neural network approach to automatic speech processing.
Neurocomputing.
volume 140.
Academic article
-
Soufifar, Mehdi;
Svendsen, Torbjørn;
Burget, Lukas.
(2014)
Subspace Modeling of Discrete features for Language Recognition.
2014. ISBN 978-82-326-0496-8.
Doctoral dissertation
2013
-
Doddipatla, Rama Sanand;
Svendsen, Torbjørn.
(2013)
Synthetic Speaker Models Using VTLN to Improve the Performance of Children in Mismatched Speaker Conditions for ASR.
Interspeech (USB).
Academic article
-
Hamar, Jarle Bauck;
Doddipatla, Rama Sanand;
Svendsen, Torbjørn;
Sreenivas, Thippur.
(2013)
Non-Negative Durational HMM.
Proceedings of IEEE International Workshop on Machine Learning for Signal Processing 2013.
Academic chapter/article/Conference paper
2012
-
Siniscalchi, Sabato Marco;
Lyu, DC;
Svendsen, Torbjørn;
Lee, CH.
(2012)
Experiments on Cross-Language Attribute Detection and Phone Recognition With Minimal Target-Specific Training Data.
IEEE Transactions on Audio, Speech, and Language Processing.
volume 20 (3).
Academic article
-
Siniscalchi, Sabato Marco;
Reed, Jeremy;
Svendsen, Torbjørn;
Lee, Chin-Hui.
(2012)
Universal attribute characterization of spoken languages for automatic spoken language recognition.
Computer Speech and Language.
volume 27 (1).
Academic article
-
Svendsen, Torbjørn.
(2012)
Data med barnestemme.
Forskning.no.
Written interview
2011
-
Adde, Line;
Svendsen, Torbjørn.
(2011)
Pronunciation Variation Modeling of Non-Native Proper Names by Discriminative Tree Search.
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing.
Academic article
-
Kvale, Knut;
Nordgård, Torbjørn;
Svendsen, Torbjørn;
Lyse, Gunn Inger;
Gjesdal, Anje Müller.
(2011)
Datamaskinen må skjønne norsk.
Bergens Tidende.
Feature article
-
Siniscalchi, Sabato Marco;
Svendsen, Torbjørn;
Lee, Chin-Hui.
(2011)
A Bottom-Up Stepwise Knowledge-Integration Approach to Large Vocabulary Continuous Speech Recognition Using Weighted Finite State Machines.
Interspeech.
Academic article
-
Skogstad, Trond;
Svendsen, Torbjørn.
(2011)
Frequency-Warped and Stabilized Time-Varying Cepstral Coefficients.
Interspeech.
Academic article
-
Soufifar, Mehdi;
Kockmann, Marcel;
Burget, Lukas;
Plchot, Oldrich;
Glembek, Ondrej;
Svendsen, Torbjørn.
(2011)
iVector Approach to Phonotactic Language Recognition.
Interspeech.
Academic article
2010
-
Adde, Line;
Reveil, Bert;
Martens, Jean-Pierre;
Svendsen, Torbjørn.
(2010)
A Minimum Classification Error Approach to Pronunciation Variation Modeling of Non-Native Proper Names.
Interspeech.
Academic article
-
Adde, Line;
Svendsen, Torbjørn.
(2010)
A Comparative Analysis of Discriminative and Non-Discriminative Pronunciation Priors in Pronunciation Variation Modeling.
Proceedings of 2010 IEEE Workshop on Spoken Language Technology.
Other
-
Adde, Line;
Svendsen, Torbjørn.
(2010)
NameDat: A Database of English Proper Names Spoken by Native Norwegians.
Proceedings of the Seventh conference on International Language Resources and Evaluation (LREC'10).
Academic chapter/article/Conference paper
-
Saeidi, Rahim;
Soufifar, Mehdi;
Kinnunen, Tomi;
Svendsen, Torbjørn;
Fränti, Pasi.
(2010)
UEF-NTNU System Description for Albayzin 2010 Language Recognition Evaluation.
Proceedings FALA 2010.
Other
-
Sikveland, Rein Ove;
Öttl, Anton;
Amdal, Ingunn;
Ernestus, Mirjam;
Svendsen, Torbjørn;
Edlund, Jens.
(2010)
Spontal-N: A Corpus of Interactional Spoken Norwegian.
Proceedings of the Seventh conference on International Language Resources and Evaluation (LREC'10).
Other
-
Siniscalchi, Sabato Marco;
Reed, Jeremy;
Svendsen, Torbjørn;
Lee, Chin-Hui.
(2010)
Exploiting Context-Dependency and Acoustic Resolution of Universal Speech Attribute Models in Spoken Language Recognition.
Interspeech.
Academic article
-
Siniscalchi, Sabato Marco;
Svendsen, Torbjørn;
Lee, Chin-Hui.
(2010)
A Survey on Recent Progress in the ASAT/SIRKUS Paradigm.
Proceedings of 7th International Symposium on Chinese Spoken Language.
Other
-
Siniscalchi, Sabato Marco;
Svendsen, Torbjørn;
Sorbello, Filippo;
Lee, Chin-Hui.
(2010)
Experimental Studies on Continuous Speech Recognition Using Neural Architectures with ‘Adaptive’ Hidden Activation Functions.
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing.
Academic article
-
Skogstad, Trond;
Svendsen, Torbjørn.
(2010)
Intra-Frame Variability As a Predictor of Frame Classifiability.
Interspeech.
Academic article
2009
-
Mertens, Timo Pascal;
Schneider, Daniel;
Næss, Arild Brandrud;
Svendsen, Torbjørn.
(2009)
Lexicon Adaptation for Subword Speech Recognition.
Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition & Understanding.
Academic chapter/article/Conference paper
-
Siniscalchi, Sabato Marco;
Reed, Jeremy;
Svendsen, Torbjørn;
Lee, Chin-Hui.
(2009)
Exploring Universal Attribute Characterization of Spoken Languages for Spoken Language Recognition.
Interspeech.
Academic article
-
Siniscalchi, Sabato Marco;
Svendsen, Torbjørn;
Lee, Chin-Hui.
(2009)
A Phonetic Feature Based Lattice Rescoring Approach to LVCSR.
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing.
Academic article
2008
-
Amdal, Ingunn;
Strand, Ole Morten;
Almberg, Jørn;
Svendsen, Torbjørn.
(2008)
RUNDKAST: An Annotated Norwegian Broadcast News Speech Corpus.
Proceedings of the 6th International Language Resources and Evaluation (LREC 2008).
Academic chapter/article/Conference paper
-
Siniscalchi, Sabato Marco;
Birkenes, Øystein;
Johnsen, Magne Hallstein;
Svendsen, Torbjørn.
(2008)
Joint Optimization of Event Detectors and Evidence Merger for Continuous Speech Recognition.
Proceedings ISCA ITRW on Speech Analysis and Processing for Knowledge Discovery.
Other
-
Siniscalchi, Sabato Marco;
Svendsen, Torbjørn;
Lee, Chin-Hui.
(2008)
A Penalized Logistic Regression Approach to Detection Based Phone Classification.
Interspeech.
Academic article
-
Siniscalchi, Sabato Marco;
Svendsen, Torbjørn;
Lee, Chin-Hui.
(2008)
Toward a Detector-Based Universal Phone Recognizer.
Proc. 2008 IEEE International Conference on Acoustics, Speech and Signal Processing.
Other
-
Skogstad, Trond;
Svendsen, Torbjørn.
(2008)
Time-Varying Cepstral Coefficients.
Proceedings ISCA ITRW on Speech Analysis and Processing for Knowledge Discovery.
Other
2007
-
Siniscalchi, Sabato Marco;
Svendsen, Torbjørn;
Lee, Chin-Hui.
(2007)
Towards Bottom-Up Continuous Phone Recognition.
Proceedings 2007 IEEE Workshop on Automatic Speech Recognition and Understanding.
Academic chapter/article/Conference paper
2006
-
Amdal, Ingunn;
Johnsen, Magne Hallstein;
Svendsen, Torbjørn.
(2006)
Log Likelihood Ratio Based Annotation Verification of a Norwegian Speech Synthesis Database.
Proceedings of the 7th Nordic Signal Processing Symposium (NORSIG 2006).
Academic chapter/article/Conference paper
-
Amdal, Ingunn;
Svendsen, Torbjørn.
(2006)
FonDat1: A Speech Synthesis Corpus for Norwegian.
Proceedings of the 5th International Conference on Language Resources and Evaluation (LREC 2006).
Academic chapter/article/Conference paper
2005
-
Amdal, Ingunn;
Svendsen, Torbjørn.
(2005)
Unit Selection Synthesis Database Development Using Utterance Verification.
Eurospeech : Proceedings of the European Conference on Speech Communication and Technology.
volume 9.
Academic article
-
Bjørkan, Ingmund;
Svendsen, Torbjørn.
(2005)
Comparing Spectral Distance Measures for Join Cost Optimization.
Eurospeech : Proceedings of the European Conference on Speech Communication and Technology.
Academic article
-
Bjørkan, Ingmund;
Svendsen, Torbjørn;
Farner, Snorre.
(2005)
Comparing Spectral Distance Measures for Join Cost Optimization in Concatenative Speech Synthesis.
Eurospeech : Proceedings of the European Conference on Speech Communication and Technology.
volume 9.
Academic article
-
Meen, Dyre;
Svendsen, Torbjørn;
Natvig, Jon-Emil.
(2005)
Improving Phone Label Alignment Accuracy by Utilizing Voicing Information.
SPECOM 2005 Proceedings.
Academic chapter/article/Conference paper
-
Skogstad, Trond;
Svendsen, Torbjørn.
(2005)
Distributed ASR Using Speech Coder Data for Efficient Feature Vector Representation.
Eurospeech : Proceedings of the European Conference on Speech Communication and Technology.
volume 9.
Academic article
-
Svendsen, Torbjørn;
Amdal, Ingunn;
Bjørkan, Ingmund;
Meen, Dyre;
Heggtveit, Per Olav;
Natvig, Jon Emil.
(2005)
FONEMA - Tools for realistic speech synthesis in Norwegian.
Proceedings of Norwegian Signal Processing Symposium 2005 (NORSIG-05).
Academic chapter/article/Conference paper
-
Svendsen, Torbjørn;
Egeberg, Andreas;
Holter, Trym;
Skogstad, Trond.
(2005)
VOCALS - Voice centric user interfaces for location based services.
Proceedings of Norwegian Signal Processing Symposium 2005 (NORSIG-05).
Academic chapter/article/Conference paper
2004
-
Nordgård, Torbjørn;
Svendsen, Torbjørn;
Harborg, Erik;
Kvale, Knut.
(2004)
Language Technology Towards 2020.
Infosam2020, Information Society of 2020.
Academic chapter/article/Conference paper
2003
-
Svendsen, Torbjørn.
(2003)
Speech Technology: Past, Present and Future.
Telektronikk.
volume 99 (2).
Academic article
2002
-
Nordgård, Torbjørn;
Svendsen, Torbjørn;
Breivik, Torbjørg.
(2002)
Samling og tilgjengeleggjering av norske språkteknologiressursar.
2002.
Report
-
Nordgård, Torbjørn;
Svendsen, Torbjørn;
Natvig, Jon Emil.
(2002)
Talsmann talesyntese som hjelpemiddel for dyslektikere.
2002.
Report
-
Svendsen, Torbjørn.
(2002)
Roles for Speech And Language Technology in The Information Society.
Perspectives on the age of the information society.
Academic chapter/article/Conference paper
2001
-
Braverman, Marc;
Svendsen, Torbjørn;
Lund, Karl Erik;
Aarø, Leif Edvard.
(2001)
Tobacco use by early adolescents in Norway.
European Journal of Public Health.
volume 11.
Academic article
-
Svendsen, Torbjørn.
(2001)
Nordisk forskningssamarbeid innen språkteknologi.
Språknytt.
volume 3/2001.
Popular scientific article
2000
-
Amdal, Ingunn;
Holter, Trym;
Svendsen, Torbjørn.
(2000)
Modellering av uttalevariasjon for automatisk talegjenkjenning.
Nordlyd.
volume 28-2000.
Academic article
-
Foldvik, Arne Kjell;
Nordgård, Torbjørn;
Svendsen, Torbjørn;
Thygesen, Ragnar.
(2000)
Dysleksi og språkteknologi.
Adresseavisen.
Feature article
1999
-
Holter, Trym;
Svendsen, Torbjørn.
(1999)
Maximum likelihood modelling of pronunciation variation.
Speech Communication.
volume 29 (2-4).
Academic article
-
Svendsen, Torbjørn.
(1999)
Taleteknologi.
Språk i Norden.
Academic article
-
Svendsen, Torbjørn;
Johnsen, Magne Hallstein;
Nordgård, Torbjørn;
Hofland, Knut;
Ore, Christian Emil.
(1999)
Nasjonalt korpus for språkteknologi - forprosjekt.
1999.
Report
1998
-
Svendsen, Torbjørn.
(1998)
Blir norsk gresk for språkteknologien?
Språknytt.
volume 26 (4-98).
Academic article
1995
-
Harborg, Erik;
Johnsen, Magne Hallstein;
Svendsen, Torbjørn.
(1995)
Talegjenkjenning II.
1995.
Report
-
Harborg, Erik;
Johnsen, Magne Hallstein;
Svendsen, Torbjørn.
(1995)
Talegjenkjenning for teksting av direktesendte programmer - en studie.
1995.
Report
1994
-
Svendsen, Torbjørn.
(1994)
Talebaserte brukergrensesnitt.
NORSIGnalet : organ for NORSIG, Norsk forening for signalbehandling.
Popular scientific article
Journal publications
-
Getman, Yaroslav;
Al-Ghezi, Ragheb;
Voskoboinik, Ekaterina;
Grósz, Tamás;
Kurimo, Mikko;
Salvi, Giampiero;
Svendsen, Torbjørn Karl;
Strömbergsson, Sofia.
(2022)
wav2vec2-based Speech Rating System for Children with Speech Sound Disorder.
Interspeech (USB).
Academic article
-
Oudijk, Esmée;
Hasler, Oliver Kevin;
Øveraas, Henning;
Marty, Sabine;
Williamson, David Roddan;
Svendsen, Torbjørn Karl;
Berg, Simen;
Birkeland, Roger;
Halvorsen, Daniel Ørnes;
Bakken, Sivert;
Henriksen, Marie Bøe;
Alver, Morten;
Johnsen, Geir;
Johansen, Tor Arne;
Stahl, Annette;
Kvaløy, Pål;
Dallolio, Alberto;
Majaneva, Sanna;
Fragoso, Glaucia Moreira.
(2022)
Campaign For Hyperspectral Data Validation In North Atlantic Coastal Waters.
Workshop on Hyperspectral Image and Signal Processing, Evolution in Remote Sensing.
Academic article
-
Rugayan, Janine Lizbeth Cabrera;
Svendsen, Torbjørn Karl;
Salvi, Giampiero.
(2022)
Semantically Meaningful Metrics for Norwegian ASR Systems.
Interspeech (USB).
Academic article
-
Sabzi Shahrebabaki, Abdolreza;
Salvi, Giampiero;
Svendsen, Torbjørn Karl;
Siniscalchi, Sabato Marco.
(2021)
Acoustic-to-Articulatory Mapping With Joint Optimization of Deep Speech Enhancement and Articulatory Inversion Models.
IEEE/ACM Transactions on Audio, Speech, and Language Processing (TASLP).
volume 30.
Academic article
-
Sabzi Shahrebabaki, Abdolreza;
Siniscalchi, Sabato Marco;
Svendsen, Torbjørn Karl.
(2021)
Raw Speech-to-Articulatory Inversion by Temporal Filtering and Decimation.
Interspeech.
Academic article
-
Sabzi Shahrebabaki, Abdolreza;
Olfati, Negar;
Siniscalchi, Sabato Marco;
Salvi, Giampiero;
Svendsen, Torbjørn.
(2020)
Transfer learning of articulatory information through phone information.
Interspeech (USB).
Academic article
-
Sabzi Shahrebabaki, Abdolreza;
Siniscalchi, Sabato Marco;
Salvi, Giampiero;
Svendsen, Torbjørn Karl.
(2020)
Sequence-to-sequence articulatory inversion through time convolution of sub-band frequency signals.
Interspeech (USB).
Academic article
-
Sabzi Shahrebabaki, Abdolreza;
Imran, Ali Shariq;
Olfati, Negar;
Svendsen, Torbjørn Karl.
(2019)
A Comparative Study of Deep Learning Techniques on Frame-Level Speech Data Classification.
Circuits, systems, and signal processing.
volume 38.
Academic article
-
Sabzi Shahrebabaki, Abdolreza;
Olfati, Negar;
Imran, Ali Shariq;
Siniscalchi, Sabato Marco;
Svendsen, Torbjørn Karl.
(2019)
A Phonetic-Level Analysis of Different Input Features for Articulatory Inversion.
Interspeech (USB).
Academic article
-
Siniscalchi, Sabato Marco;
Svendsen, Torbjørn;
Lee, Chin-Hui.
(2014)
An artificial neural network approach to automatic speech processing.
Neurocomputing.
volume 140.
Academic article
-
Doddipatla, Rama Sanand;
Svendsen, Torbjørn.
(2013)
Synthetic Speaker Models Using VTLN to Improve the Performance of Children in Mismatched Speaker Conditions for ASR.
Interspeech (USB).
Academic article
-
Siniscalchi, Sabato Marco;
Lyu, DC;
Svendsen, Torbjørn;
Lee, CH.
(2012)
Experiments on Cross-Language Attribute Detection and Phone Recognition With Minimal Target-Specific Training Data.
IEEE Transactions on Audio, Speech, and Language Processing.
volume 20 (3).
Academic article
-
Siniscalchi, Sabato Marco;
Reed, Jeremy;
Svendsen, Torbjørn;
Lee, Chin-Hui.
(2012)
Universal attribute characterization of spoken languages for automatic spoken language recognition.
Computer Speech and Language.
volume 27 (1).
Academic article
-
Svendsen, Torbjørn.
(2012)
Data med barnestemme.
Forskning.no.
Written interview
-
Adde, Line;
Svendsen, Torbjørn.
(2011)
Pronunciation Variation Modeling of Non-Native Proper Names by Discriminative Tree Search.
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing.
Academic article
-
Kvale, Knut;
Nordgård, Torbjørn;
Svendsen, Torbjørn;
Lyse, Gunn Inger;
Gjesdal, Anje Müller.
(2011)
Datamaskinen må skjønne norsk.
Bergens Tidende.
Feature article
-
Siniscalchi, Sabato Marco;
Svendsen, Torbjørn;
Lee, Chin-Hui.
(2011)
A Bottom-Up Stepwise Knowledge-Integration Approach to Large Vocabulary Continuous Speech Recognition Using Weighted Finite State Machines.
Interspeech.
Academic article
-
Skogstad, Trond;
Svendsen, Torbjørn.
(2011)
Frequency-Warped and Stabilized Time-Varying Cepstral Coefficients.
Interspeech.
Academic article
-
Soufifar, Mehdi;
Kockmann, Marcel;
Burget, Lukas;
Plchot, Oldrich;
Glembek, Ondrej;
Svendsen, Torbjørn.
(2011)
iVector Approach to Phonotactic Language Recognition.
Interspeech.
Academic article
-
Adde, Line;
Reveil, Bert;
Martens, Jean-Pierre;
Svendsen, Torbjørn.
(2010)
A Minimum Classification Error Approach to Pronunciation Variation Modeling of Non-Native Proper Names.
Interspeech.
Academic article
-
Siniscalchi, Sabato Marco;
Reed, Jeremy;
Svendsen, Torbjørn;
Lee, Chin-Hui.
(2010)
Exploiting Context-Dependency and Acoustic Resolution of Universal Speech Attribute Models in Spoken Language Recognition.
Interspeech.
Academic article
-
Siniscalchi, Sabato Marco;
Svendsen, Torbjørn;
Sorbello, Filippo;
Lee, Chin-Hui.
(2010)
Experimental Studies on Continuous Speech Recognition Using Neural Architectures with ‘Adaptive’ Hidden Activation Functions.
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing.
Academic article
-
Skogstad, Trond;
Svendsen, Torbjørn.
(2010)
Intra-Frame Variability As a Predictor of Frame Classifiability.
Interspeech.
Academic article
-
Siniscalchi, Sabato Marco;
Reed, Jeremy;
Svendsen, Torbjørn;
Lee, Chin-Hui.
(2009)
Exploring Universal Attribute Characterization of Spoken Languages for Spoken Language Recognition.
Interspeech.
Academic article
-
Siniscalchi, Sabato Marco;
Svendsen, Torbjørn;
Lee, Chin-Hui.
(2009)
A Phonetic Feature Based Lattice Rescoring Approach to LVCSR.
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing.
Academic article
-
Siniscalchi, Sabato Marco;
Svendsen, Torbjørn;
Lee, Chin-Hui.
(2008)
A Penalized Logistic Regression Approach to Detection Based Phone Classification.
Interspeech.
Academic article
-
Amdal, Ingunn;
Svendsen, Torbjørn.
(2005)
Unit Selection Synthesis Database Development Using Utterance Verification.
Eurospeech : Proceedings of the European Conference on Speech Communication and Technology.
volume 9.
Academic article
-
Bjørkan, Ingmund;
Svendsen, Torbjørn.
(2005)
Comparing Spectral Distance Measures for Join Cost Optimization.
Eurospeech : Proceedings of the European Conference on Speech Communication and Technology.
Academic article
-
Bjørkan, Ingmund;
Svendsen, Torbjørn;
Farner, Snorre.
(2005)
Comparing Spectral Distance Measures for Join Cost Optimization in Concatenative Speech Synthesis.
Eurospeech : Proceedings of the European Conference on Speech Communication and Technology.
volume 9.
Academic article
-
Skogstad, Trond;
Svendsen, Torbjørn.
(2005)
Distributed ASR Using Speech Coder Data for Efficient Feature Vector Representation.
Eurospeech : Proceedings of the European Conference on Speech Communication and Technology.
volume 9.
Academic article
-
Svendsen, Torbjørn.
(2003)
Speech Technology: Past, Present and Future.
Telektronikk.
volume 99 (2).
Academic article
-
Braverman, Marc;
Svendsen, Torbjørn;
Lund, Karl Erik;
Aarø, Leif Edvard.
(2001)
Tobacco use by early adolescents in Norway.
European Journal of Public Health.
volume 11.
Academic article
-
Svendsen, Torbjørn.
(2001)
Nordisk forskningssamarbeid innen språkteknologi.
Språknytt.
volume 3/2001.
Popular scientific article
-
Amdal, Ingunn;
Holter, Trym;
Svendsen, Torbjørn.
(2000)
Modellering av uttalevariasjon for automatisk talegjenkjenning.
Nordlyd.
volume 28-2000.
Academic article
-
Foldvik, Arne Kjell;
Nordgård, Torbjørn;
Svendsen, Torbjørn;
Thygesen, Ragnar.
(2000)
Dysleksi og språkteknologi.
Adresseavisen.
Feature article
-
Holter, Trym;
Svendsen, Torbjørn.
(1999)
Maximum likelihood modelling of pronunciation variation.
Speech Communication.
volume 29 (2-4).
Academic article
-
Svendsen, Torbjørn.
(1999)
Taleteknologi.
Språk i Norden.
Academic article
-
Svendsen, Torbjørn.
(1998)
Blir norsk gresk for språkteknologien?
Språknytt.
volume 26 (4-98).
Academic article
-
Svendsen, Torbjørn.
(1994)
Talebaserte brukergrensesnitt.
NORSIGnalet : organ for NORSIG, Norsk forening for signalbehandling.
Popular scientific article
Part of book/report
-
Sabzi Shahrebabaki, Abdolreza;
Olfati, Negar;
Imran, Ali Shariq;
Johnsen, Magne Hallstein;
Siniscalchi, Sabato Marco;
Svendsen, Torbjørn Karl.
(2021)
A Two-Stage Deep Modeling Approach to Articulatory Inversion.
ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
Academic chapter/article/Conference paper
-
Sabzi Shahrebabaki, Abdolreza;
Siniscalchi, Sabato Marco;
Salvi, Giampiero;
Svendsen, Torbjørn Karl.
(2021)
A DNN Based Speech Enhancement Approach to Noise Robust Acoustic-to-Articulatory Inversion.
Proceedings 2021 IEEE International Symposium on Circuits and Systems.
Academic chapter/article/Conference paper
-
Imran, Ali Shariq;
Haflan, Vetle;
Sabzi Shahrebabaki, Abdolreza;
Olfati, Negar;
Svendsen, Torbjørn Karl.
(2019)
Evaluating Acoustic Feature Maps in 2D-CNN for Speaker Identification.
ICMLC '19 Proceedings of the 2019 11th International Conference on Machine Learning and Computing.
Academic chapter/article/Conference paper
-
Imran, Ali Shariq;
Kastrati, Zenun;
Svendsen, Torbjørn Karl;
Kurti, Arianit.
(2019)
Text-Independent Speaker ID for Automatic Video Lecture Classification Using Deep Learning.
ICCAI '19 Proceedings of the 2019 5th International Conference on Computing and Artificial Intelligence.
Academic chapter/article/Conference paper
-
Imran, Ali Shariq;
Sabzi Shahrebabaki, Abdolreza;
Olfati, Negar;
Svendsen, Torbjørn Karl.
(2019)
A Study on the Performance Evaluation of Machine Learning Models for Phoneme Classification.
ICMLC '19 Proceedings of the 2019 11th International Conference on Machine Learning and Computing.
Academic chapter/article/Conference paper
-
Sabzi Shahrebabaki, Abdolreza;
Imran, Ali Shariq;
Olfati, Negar;
Svendsen, Torbjørn Karl.
(2018)
Acoustic Feature Comparison for Different Speaking Rates.
Human-Computer Interaction. Interaction Technologies.
Academic chapter/article/Conference paper
-
Svendsen, Torbjørn Karl;
Hamar, Jarle Bauck.
(2015)
Combining NdHMM and Phonetic Feature Detection for Speech Recognition.
Proceedings of European Signal Processing Conference.
Academic chapter/article/Conference paper
-
Hamar, Jarle Bauck;
Doddipatla, Rama Sanand;
Svendsen, Torbjørn;
Sreenivas, Thippur.
(2013)
Non-Negative Durational HMM.
Proceedings of IEEE International Workshop on Machine Learning for Signal Processing 2013.
Academic chapter/article/Conference paper
-
Adde, Line;
Svendsen, Torbjørn.
(2010)
A Comparative Analysis of Discriminative and Non-Discriminative Pronunciation Priors in Pronunciation Variation Modeling.
Proceedings of 2010 IEEE Workshop on Spoken Language Technology.
Other
-
Adde, Line;
Svendsen, Torbjørn.
(2010)
NameDat: A Database of English Proper Names Spoken by Native Norwegians.
Proceedings of the Seventh conference on International Language Resources and Evaluation (LREC'10).
Academic chapter/article/Conference paper
-
Saeidi, Rahim;
Soufifar, Mehdi;
Kinnunen, Tomi;
Svendsen, Torbjørn;
Fränti, Pasi.
(2010)
UEF-NTNU System Description for Albayzin 2010 Language Recognition Evaluation.
Proceedings FALA 2010.
Other
-
Sikveland, Rein Ove;
Öttl, Anton;
Amdal, Ingunn;
Ernestus, Mirjam;
Svendsen, Torbjørn;
Edlund, Jens.
(2010)
Spontal-N: A Corpus of Interactional Spoken Norwegian.
Proceedings of the Seventh conference on International Language Resources and Evaluation (LREC'10).
Other
-
Siniscalchi, Sabato Marco;
Svendsen, Torbjørn;
Lee, Chin-Hui.
(2010)
A Survey on Recent Progress in the ASAT/SIRKUS Paradigm.
Proceedings of 7th International Symposium on Chinese Spoken Language.
Other
-
Mertens, Timo Pascal;
Schneider, Daniel;
Næss, Arild Brandrud;
Svendsen, Torbjørn.
(2009)
Lexicon Adaptation for Subword Speech Recognition.
Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition & Understanding.
Academic chapter/article/Conference paper
-
Amdal, Ingunn;
Strand, Ole Morten;
Almberg, Jørn;
Svendsen, Torbjørn.
(2008)
RUNDKAST: An Annotated Norwegian Broadcast News Speech Corpus.
Proceedings of the 6th International Language Resources and Evaluation (LREC 2008).
Academic chapter/article/Conference paper
-
Siniscalchi, Sabato Marco;
Birkenes, Øystein;
Johnsen, Magne Hallstein;
Svendsen, Torbjørn.
(2008)
Joint Optimization of Event Detectors and Evidence Merger for Continuous Speech Recognition.
Proceedings ISCA ITRW on Speech Analysis and Processing for Knowledge Discovery.
Other
-
Siniscalchi, Sabato Marco;
Svendsen, Torbjørn;
Lee, Chin-Hui.
(2008)
Toward a Detector-Based Universal Phone Recognizer.
Proc. 2008 IEEE International Conference on Acoustics, Speech and Signal Processing.
Other
-
Skogstad, Trond;
Svendsen, Torbjørn.
(2008)
Time-Varying Cepstral Coefficients.
Proceedings ISCA ITRW on Speech Analysis and Processing for Knowledge Discovery.
Other
-
Siniscalchi, Sabato Marco;
Svendsen, Torbjørn;
Lee, Chin-Hui.
(2007)
Towards Bottom-Up Continuous Phone Recognition.
Proceedings 2007 IEEE Workshop on Automatic Speech Recognition and Understanding.
Academic chapter/article/Conference paper
-
Amdal, Ingunn;
Johnsen, Magne Hallstein;
Svendsen, Torbjørn.
(2006)
Log Likelihood Ratio Based Annotation Verification of a Norwegian Speech Synthesis Database.
Proceedings of the 7th Nordic Signal Processing Symposium (NORSIG 2006).
Academic chapter/article/Conference paper
-
Amdal, Ingunn;
Svendsen, Torbjørn.
(2006)
FonDat1: A Speech Synthesis Corpus for Norwegian.
Proceedings of the 5th International Conference on Language Resources and Evaluation (LREC 2006).
Academic chapter/article/Conference paper
-
Meen, Dyre;
Svendsen, Torbjørn;
Natvig, Jon-Emil.
(2005)
Improving Phone Label Alignment Accuracy by Utilizing Voicing Information.
SPECOM 2005 Proceedings.
Academic chapter/article/Conference paper
-
Svendsen, Torbjørn;
Amdal, Ingunn;
Bjørkan, Ingmund;
Meen, Dyre;
Heggtveit, Per Olav;
Natvig, Jon Emil.
(2005)
FONEMA - Tools for realistic speech synthesis in Norwegian.
Proceedings of Norwegian Signal Processing Symposium 2005 (NORSIG-05).
Academic chapter/article/Conference paper
-
Svendsen, Torbjørn;
Egeberg, Andreas;
Holter, Trym;
Skogstad, Trond.
(2005)
VOCALS - Voice centric user interfaces for location based services.
Proceedings of Norwegian Signal Processing Symposium 2005 (NORSIG-05).
Academic chapter/article/Conference paper
-
Nordgård, Torbjørn;
Svendsen, Torbjørn;
Harborg, Erik;
Kvale, Knut.
(2004)
Language Technology Towards 2020.
Infosam2020, Information Society of 2020.
Academic chapter/article/Conference paper
-
Svendsen, Torbjørn.
(2002)
Roles for Speech And Language Technology in The Information Society.
Perspectives on the age of the information society.
Academic chapter/article/Conference paper
Report
-
Soufifar, Mehdi;
Svendsen, Torbjørn;
Burget, Lukas.
(2014)
Subspace Modeling of Discrete features for Language Recognition.
2014. ISBN 978-82-326-0496-8.
Doctoral dissertation
-
Nordgård, Torbjørn;
Svendsen, Torbjørn;
Breivik, Torbjørg.
(2002)
Samling og tilgjengeleggjering av norske språkteknologiressursar.
2002.
Report
-
Nordgård, Torbjørn;
Svendsen, Torbjørn;
Natvig, Jon Emil.
(2002)
Talsmann talesyntese som hjelpemiddel for dyslektikere.
2002.
Report
-
Svendsen, Torbjørn;
Johnsen, Magne Hallstein;
Nordgård, Torbjørn;
Hofland, Knut;
Ore, Christian Emil.
(1999)
Nasjonalt korpus for språkteknologi - forprosjekt.
1999.
Report
-
Harborg, Erik;
Johnsen, Magne Hallstein;
Svendsen, Torbjørn.
(1995)
Talegjenkjenning II.
1995.
Report
-
Harborg, Erik;
Johnsen, Magne Hallstein;
Svendsen, Torbjørn.
(1995)
Talegjenkjenning for teksting av direktesendte programmer - en studie.
1995.
Report
Teaching
Courses
Media
2018
-
Popular scientific lecture: Øien, Geir Egil Dahle; Mengshoel, Ole Jakob; Ramampiaro, Heri; Svendsen, Torbjørn Karl. (2018) NTNUs strategiske satsing på kunstig intelligens (AI) – bakgrunn, aktiviteter og fremtidsvyer. Medlemsmøte, Det Kongelige Norske Vitenskapers Selskap. Det Kongelige Norske Vitenskapers Selskap; Trondheim. 2018-11-12 - 2018-11-12.
2011
-
Academic lecture: Javier Rodriguez-Fuentes, Luis; Penagarikano, Mikel; Varona, Amparo; Diez, Mireia; Bordel, German; Martinez, David; Villalba, Jesus; Miguel, Antonio; Ortega, Alfonso; Lleida, Eduardo; Abad, Alberto; Koller, Oscar; Trancoso, Isabel; Lopez-Otero, Paula; Docio-Fernandez, Laura; Garcia-Mateo, Carmen; Saeidi, Rahim; Soufifar, Mehdi; Kinnunen, Tomi. (2011) Multi-Site Heterogeneous System Fusions for the Albayzin 2010 Language Recognition Evaluation. Automatic Speech Recognition and Understanding. IEEE; Big Island, Hawaii. 2011-12-11 - 2011-12-15.
-
Popular scientific lecture: Svendsen, Torbjørn. (2011) Hva er det med tale? Forskningsutfordringer og aktiviteter innen taleteknologi. På snakkis med teknologien. MediaLT; Oslo. 2011-11-09 - 2011-11-09.
-
Academic lecture: Svendsen, Torbjørn. (2011) Universal Speech Attribute Characterization for Automatic Speech Recognition and Spoken Language Recognition. CSAIL Seminar. MIT CSAIL; Boston. 2011-12-05 - 2011-12-05.
2010
-
Academic lecture: Sikveland, Rein Ove; Öttl, Anton; Amdal, Ingunn; Ernestus, Mirjam; Svendsen, Torbjørn; Edlund, Jens. (2010) Spontal-N: A Corpus of Interactional Spoken Norwegian. LREC. ELDA; Valletta. 2010-05-17 - 2010-05-23.
-
Academic lecture: Adde, Line; Reveil, Bert; Martens, Jean-Pierre; Svendsen, Torbjørn. (2010) A Minimum Classification Error Approach to Pronunciation Variation Modeling of Non-Native Proper Names. Interspeech 2010. ISCA; Makuhari. 2010-09-27 - 2010-09-30.
-
Academic lecture: Adde, Line; Svendsen, Torbjørn. (2010) A Comparative Analysis of Discriminative and Non-Discriminative Pronunciation Priors in Pronunciation Variation Modeling. IEEE Workshop on Spoken Language Technology 2010. IEEE; Berkeley, California. 2010-12-12 - 2010-12-15.
-
Academic lecture: Adde, Line; Svendsen, Torbjørn. (2010) NameDat: A Database of English Proper Names Spoken by Native Norwegians. LREC. ELDA; Valletta. 2010-05-17.
-
Academic lecture: Meen, Dyre; Svendsen, Torbjørn. (2010) The NTNU Concatenative Speech Synthesizer. Blizzard Challenge Workshop. ISCA; Kyoto. 2010-09-25 - 2010-09-25.
-
Academic lecture: Saeidi, Rahim; Soufifar, Mehdi; Kinnunen, Tomi; Svendsen, Torbjørn; Fränti, Pasi. (2010) UEF-NTNU System Description for Albayzin 2010 Language Recognition Evaluation. FALA 2010. University of Vigo; Vigo. 2010-10-10 - 2010-10-12.
-
Academic lecture: Siniscalchi, Sabato Marco; Reed, Jeremy; Svendsen, Torbjørn; Lee, Chin-Hui. (2010) Exploiting Context-Dependency and Acoustic Resolution of Universal Speech Attribute Models in Spoken Language Recognition. Interspeech 2010. ISCA; Makuhari. 2010-09-27 - 2010-09-30.
-
Academic lecture: Siniscalchi, Sabato Marco; Svendsen, Torbjørn; Lee, Chin-Hui. (2010) A Survey on Recent Progress in the ASAT/SIRKUS Paradigm. ISCSLP 2010. IEEE; Tainan. 2010-11-21 - 2010-12-03.
-
Academic lecture: Siniscalchi, Sabato Marco; Svendsen, Torbjørn; Sorbello, Filippo; Lee, Chin-Hui. (2010) Experimental Studies on Continuous Speech Recognition Using Neural Architectures with ‘Adaptive’ Hidden Activation Functions. ICASSP 2010. IEEE; Dallas, Texas. 2010-03-14 - 2010-03-19.
-
Academic lecture: Skogstad, Trond; Svendsen, Torbjørn. (2010) Intra-Frame Variability As a Predictor of Frame Classifiability. Interspeech 2010. ISCA; Makuhari. 2010-09-27 - 2010-09-30.
2009
-
Academic lecture: Siniscalchi, Sabato Marco; Reed, Jeremy; Svendsen, Torbjørn; Lee, Chin-Hui. (2009) Exploring Universal Attribute Characterization of Spoken Languages for Spoken Language Recognition. Interspeech. ISCA; Brighton. 2009-09-06 - 2009-09-10.
-
Academic lecture: Siniscalchi, Sabato Marco; Svendsen, Torbjørn; Lee, Chin-Hui. (2009) A Phonetic Feature Based Lattice Rescoring Approach to LVCSR. IEEE International Conference on Acoustics, Speech and Signal Processing. IEEE; Taipei. 2009-04-19 - 2009-04-24.
-
Interview: Svendsen, Torbjørn. (2009) Språkteknologien gjør fremskritt igjen. forskning.no [Internet]. 2009-04-09.
-
Interview: Svendsen, Torbjørn. (2009) VERDIKT på Forskningsdagene. Nytt fra VERDIKT [Newspaper]. 2009-11-03.
2008
-
Academic lecture: Amdal, Ingunn; Strand, Ole Morten; Almberg, Jørn; Svendsen, Torbjørn. (2008) RUNDKAST: An Annotated Norwegian Broadcast News Speech Corpus. LREC 2008. European Language Resources Association; Marrakech. 2008-05-26 - 2008-05-31.
-
Academic lecture: Amdal, Ingunn; Svendsen, Torbjørn; Johnsen, Magne Hallstein; Siniscalchi, Sabato Marco; Hamar, Jarle Bauck; Martinez, Del Hoyo Canterla A. (2008) SIRKUS - A new paradigm for speech recognition. VERDIKT Conference 2008. Norges forskningsråd; Bergen. 2008-10-29 - 2008-10-30.
-
Academic lecture: Siniscalchi, Sabato Marco; Birkenes, Øystein; Johnsen, Magne Hallstein; Svendsen, Torbjørn. (2008) Joint Optimization of Event Detectors and Evidence Merger for Continuous Speech Recognition. ISCA ITRW on Speech Analysis and Processing for Knowledge Discovery. ISCA; Aalborg. 2008-06-04 - 2008-06-06.
-
Academic lecture: Siniscalchi, Sabato Marco; Svendsen, Torbjørn; Lee, Chin-Hui. (2008) A Penalized Logistic Regression Approach to Detection Based Phone Classification. Interspeech 2008. ISCA; Brisbane. 2008-09-22 - 2008-09-26.
-
Academic lecture: Siniscalchi, Sabato Marco; Svendsen, Torbjørn; Lee, Chin-Hui. (2008) Toward a Detector-Based Universal Phone Recognizer. International Conference on Acoustics, Speech and Signal Processing. IEEE; Las Vegas. 2008-03-30 - 2008-04-04.
-
Academic lecture: Skogstad, Trond; Svendsen, Torbjørn. (2008) Time-Varying Cepstral Coefficients. ISCA ITRW on Speech Analysis and Processing for Knowledge Discovery. ISCA; Aalborg. 2008-06-04 - 2008-06-06.
-
Interview: Svendsen, Torbjørn. (2008) Norsk språkbank. Språkteigen, NRK P2 [Radio]. 2008-08-24.
-
Interview: Svendsen, Torbjørn. (2008) Norsk talesyntese. P4 [Radio]. 2008-02-08.
-
Interview: Svendsen, Torbjørn. (2008) Taleteknologi. God morgen Norge [TV]. 2008-02-08.
2007
-
Academic lecture: Siniscalchi, Sabato Marco; Svendsen, Torbjørn; Lee, Chin-Hui. (2007) Towards Bottom-Up Continuous Phone Recognition. 2007 IEEE Workshop on Automatic Speech Recognition and Understanding. IEEE; Kyoto. 2007-12-09 - 2007-12-13.
-
Academic lecture: Svendsen, Torbjørn. (2007) Articulatory Features and Segmental Information for Automatic Speech Recognition. ESF Exploratory Workshop on Models of Language Evolution, Acquisition and Processing. European Science Foundation; Leuven. 2007-11-25 - 2007-11-28.
-
Interview: Svendsen, Torbjørn; Abelsen, Atle. (2007) IKE i hver puslebit. Bladet Forskning [Newspaper]. 2007-12-01.
2006
-
Academic lecture: Svendsen, Torbjørn. (2006) Task and speaker adaptation. WISSAP'06. IEEE and ISCA; 2006-01-04 - 2006-01-07.
-
Academic lecture: Amdal, Ingunn; Johnsen, Magne Hallstein; Svendsen, Torbjørn. (2006) Log Likelihood Ratio Based Annotation Verification of a Norwegian Speech Synthesis Database. NORSIG 2006. NORSIG; Reykjavik. 2006-06-07 - 2006-06-09.
-
Poster: Amdal, Ingunn; Svendsen, Torbjørn. (2006) FonDat1: A Speech Synthesis Corpus for Norwegian. LREC 2006. European Language Resources Association; Genova. 2006-05-22 - 2006-05-28.
-
Academic lecture: Nordgård, Torbjørn; Svendsen, Torbjørn. (2006) Et norsk uttaleleksikon møter en spontan virkelighet. Oslomålet - et seminar med forskning fra NoTa-korpuset. Universitetet i Oslo; Oslo. 2006-11-23 - 2006-11-24.
2005
-
Poster: Amdal, Ingunn; Svendsen, Torbjørn. (2005) Unit Selection Synthesis Database Development Using Utterance Verification. Interspeech 2005. ISCA; Lisboa. 2005-09-04 - 2005-09-08.
-
Poster: Bjørkan, Ingmund; Svendsen, Torbjørn; Farner, Snorre. (2005) Comparing Spectral Distance Measures for Join Cost Optimization in Concatenative Speech Synthesis. Interspeech 2005. ISCA; Lisboa. 2005-09-04 - 2005-09-08.
-
Poster: Meen, Dyre; Svendsen, Torbjørn; Natvig, Jon-Emil. (2005) Improving Phone Label Alignment Accuracy by Utilizing Voicing Information. SPECOM 2005. University of Patras, Wire Communications Laboratory; Patras. 2005-10-17 - 2005-10-19.
-
Poster: Skogstad, Trond; Svendsen, Torbjørn. (2005) Distributed ASR Using Speech Coder Data for Efficient Feature Vector Representation. Eurospeech 2005. ISCA; Lisboa. 2005-09-04 - 2005-09-08.
-
Academic lecture: Svendsen, Torbjørn; Amdal, Ingunn; Bjørkan, Ingmund; Meen, Dyre; Heggtveit, Per Olav; Natvig, Jon Emil. (2005) FONEMA - Tools for realistic speech synthesis in Norwegian. NORSIG 05. NORSIG; Stavanger. 2005-09-22 - 2005-09-24.
-
Academic lecture: Svendsen, Torbjørn; Egeberg, Andreas; Holter, Trym. (2005) VOCALS - Voice centric user interfaces for location based services. NORSIG 05. NORSIG; Stavanger. 2005-09-22 - 2005-09-24.
2004
-
Academic lecture: Svendsen, Torbjørn. (2004) Pronunciation Modeling for Speech Technology. 2004 International Conference on Signal Processing and Communications. IEEE Signal Processing Society and Indian Institute of Scien; Bangalore. 2004-12-11 - 2004-12-14.
-
Academic lecture: Øien, Geir Egil; Holte, Nils; Andresen, Steinar; Svendsen, Torbjørn; Hammer, Mikael. (2004) Communication technology towards 2020. INFOSAM-2020 conference. IME-fakultetet, NTNU/Teknologirådet; Trondheim. 2004-04-19 - 2004-04-20.
2003
-
Poster: Martin, Terrence; Svendsen, Torbjørn; Sridharan, Sridha. (2003) Cross-Lingual Pronunciation Modelling for Indonesian Speech Recognition. Eurospeech 2003. [Missing data]; Geneve. 2003-09-04.
-
Popular scientific lecture: Svendsen, Torbjørn. (2003) FONEMA - Metodeutvikling for naturtro norsk talesyntese. KUNSTI-seminar 2003. [Missing data]; Bergen. 2003-11-18.
-
Academic lecture: Svendsen, Torbjørn. (2003) Pronunciation Modelling for Speech Technology. [Missing data]. Queensland University of Technology; Brisbane, Australia. 2003-05-30.
-
Popular scientific lecture: Svendsen, Torbjørn. (2003) Snakke dialekt med mobilen? Om dialektbruk i ny språkteknologi. [Missing data]. Noregs mållag; Oslo. 2003-09-28.
-
Popular scientific lecture: Svendsen, Torbjørn. (2003) Speech Processing Activities at NTNU: An Overview. Nordic Speech Technology Seminar. [Missing data]; Stockholm. 2003-11-14.
-
Poster: Wong, Eddie; Martin, Terrence; Svendsen, Torbjørn; Sridharan, Sridha. (2003) Multilingual Phone Clustering for Recognition of Spontaneous Indonesian Speech Utilising Pronunciation Modelling Techniques. Eurospeech 2003. [Missing data]; Geneve. 2003-09-04.
2002
-
Academic lecture: Amdal, Ingunn; Svendsen, Torbjørn. (2002) Evaluation of pronunciation variants in the ASR lexicon for different speaking styles. Third International Conference on Language Resources and Evaluation. [Missing data]; Las Palmas de Gran Canaria, Spain. 2002-05-31.
2001
-
Academic lecture: Johnsen, Magne Hallstein; Harborg, Erik; Svendsen, Torbjørn; Amble, Tore; Holter, Trym; Myrvoll, Tor Andre; Nordgård, Torbjørn. (2001) SPODIS - Spoken Dialog Systems for Telephony. NORSIG-2001, Norwegian Signal Processing Symposium. [Missing data]; Trondheim, Norway, October 18-20 2001.
-
Poster: Myrvoll, Tor Andre; Paliwal, Kuldip K.; Svendsen, Torbjørn. (2001) Fast Adaptation using Constrained Affine Transformations with Hierarchical Priors. Eurospeech 2001. [Missing data]; Aalborg, Sept 3-7, 2001.
2000
-
Academic lecture: Holter, Trym; Harborg, Erik; Johnsen, Magne Hallstein; Svendsen, Torbjørn. (2000) ASR-Based Subtitling of Live TV-Programs for the Hearing Impaired. 6th International Conference on Spoken Language Processing. [Missing data]; Beijing, Oct. 16-20, 2000.
-
Academic lecture: Johnsen, Magne Hallstein; Holter, Trym; Svendsen, Torbjørn; Harborg, Erik. (2000) Stochastic Modelling of Semantic Content for Use in a Spoken Dialogue System. 6th International Conference on Spoken Language Processing. [Missing data]; Beijing, Oct. 16-20, 2000.
-
Academic lecture: Johnsen, Magne Hallstein; Svendsen, Torbjørn; Amble, Tore; Holter, Trym; Harborg, Erik. (2000) TABOR - A Norwegian Spoken Dialogue System for Bus Travel Information. 6th International Conference on Spoken Language Processing. [Missing data]; Beijing, Oct. 16-20, 2000.
-
Popular scientific lecture: Svendsen, Torbjørn. (2000) Norsk språkbank, et nasjonalt korpus for språkteknologi. [Missing data]. [Missing data]; Statssekretærutvalget for IT, Oslo, 12 January 2000.
-
Popular scientific lecture: Svendsen, Torbjørn. (2000) Ordets makt - om taleteknologi som hjelpemiddel for funksjonshemmede. [Missing data]. [Missing data]; "Selvstendig liv", Sjølyst, 12 April 2000.
-
Academic lecture: Svendsen, Torbjørn. (2000) Pronunciation modeling for improved recognition of names. [Missing data]. [Missing data]; AT&T Labs, Florham Park, New Jersey, 15 September 2000.
-
Popular scientific lecture: Svendsen, Torbjørn. (2000) Taleteknologi - teknologi med potensiale for kvalitetsheving og effektivisering ved håndtering av informasjon i sykehus. [Missing data]. [Missing data]; Norges tekniske vitenskapsakademi, Trondheim, 22 February 2000.
-
Popular scientific lecture: Svendsen, Torbjørn; Johnsen, Magne Hallstein. (2000) "Sesam sesam!" - Kan taleteknologi bli en døråpner for funksjonshemmede? [Missing data]. [Missing data]; Rehabiliteringskonferansen, Trondheim, 20 June 2000.
1999
-
Academic lecture: Amdal, Ingunn; Holter, Trym; Svendsen, Torbjørn. (1999) Maximum likelihood pronunciation modelling of Norwegian natural numbers for automatic speech recognition. NORSIG'99. [Missing data]; Asker, September 1999.
-
Academic lecture: Amdal, Ingunn; Holter, Trym; Svendsen, Torbjørn. (1999) Modellering av uttalevariasjon for automatisk talegjenkjenning. Møte om norsk språk (MONS 8). [Missing data]; Tromsø, 18-20 November 1999.
-
Academic lecture: Harborg, Erik; Holter, Trym; Johnsen, Magne Hallstein; Svendsen, Torbjørn. (1999) Generation of closed captions for live TV-programs using speech recognition. Norsig'99. [Missing data]; Asker, September 1999.
-
Academic lecture: Harborg, Erik; Holter, Trym; Johnsen, Magne Hallstein; Svendsen, Torbjørn. (1999) On-line captioning of TV-programs for the hearing impaired. EuroSpeech'99. [Missing data]; Budapest, Hungary.
-
Poster: Harborg, Erik; Holter, Trym; Johnsen, Magne Hallstein; Svendsen, Torbjørn. (1999) Subtitling of live broadcast TV-programs for the hearing impaired. AAATE'99. [Missing data]; Dusseldorf, November 1999.
-
Academic lecture: Johnsen, Magne Hallstein; Svendsen, Torbjørn. (1999) Menneske/maskin-kommunikasjon basert på tale. MONS-8 (8nde Møte Om Norsk Språk). [Missing data]; Tromsø, Norway, Nov. 1999.
-
Popular scientific lecture: Yang, Qian; Cremelie, Nick; Holter, Trym; Martens, Jean-Pierre; Svendsen, Torbjørn; Ringland, Simon. (1999) Lexicon building and word accuracy in continuous speech recognition. COST 249 meeting, Prague. [Missing data]; Prague, Czech Republic, February 1999.
1998
-
Academic lecture: Holter, Trym; Svendsen, Torbjørn. (1998) Maximum likelihood modelling of pronunciation variation. ESCA Tutorial and Research Workshop on Modeling Pronunciation Variation for ASR. [Missing data]; Rolduc.
-
Academic lecture: Svendsen, Torbjørn. (1998) SPODIS - Spoken dialog systems for telephony services. Studiemøtet i elektronikk og data. [Missing data]; Kristiansand.
-
Popular scientific lecture: Svendsen, Torbjørn. (1998) Speech processing activities at NTNU. [Missing data]. [Missing data]; KTH, Stockholm.
-
Popular scientific lecture: Svendsen, Torbjørn. (1998) Taleteknolog. Nordisk språkmøte. [Missing data]; Trondheim.
-
Popular scientific lecture: Svendsen, Torbjørn. (1998) Taleteknologi ved NTNU. Aalborg workshop in speech communication. [Missing data]; Aalborg.
1997
-
Academic lecture: Holter, Trym; Svendsen, Torbjørn. (1997) A joint segmentation and labelling scheme for use in acoustic subword based speech recognition. Norwegian Signal Processing Symposium. [Missing data]; Tromsø.
-
Academic lecture: Holter, Trym; Svendsen, Torbjørn. (1997) Combined optimisation of baseforms and model parameters in speech recognition based on acoustic subword units. IEEE Speech recognition Workshop. [Missing data]; Santa Barbara, Calif.
-
Popular scientific lecture: Holter, Trym; Svendsen, Torbjørn. (1997) Combined optimisation of baseforms and model parameters in speech recognition based on acoustic sub-word units. [Missing data]. [Missing data]; AT&T Labs, Florham Park, NJ, USA.
-
Academic lecture: Holter, Trym; Svendsen, Torbjørn. (1997) Incorporating linguistic knowledge and automatic baseform generation in acoustic subword unit based speech recognition. Eurospeech '97. [Missing data]; Rhodos.
-
Popular scientific lecture: Svendsen, Torbjørn. (1997) Acoustic subwords - some applications in speech processing. [Missing data]. [Missing data]; Griffith University, Brisbane, Australia.
-
Popular scientific lecture: Svendsen, Torbjørn. (1997) Some topics from recent work in speech processing. [Missing data]. [Missing data]; Motorola Research Labs, Sydney and University of Wollongong.
-
Popular scientific lecture: Svendsen, Torbjørn. (1997) Speech recognition based on acoustic subword units. [Missing data]. [Missing data]; Telenor FoU, Kjeller.
1996
-
Academic lecture: Pihl, Johnny; Johnsen, Magne Hallstein; Svendsen, Torbjørn. (1996) A VLSI implementation of pdf computations in HMM based speech recognition. TENCON-96. IEEE; Perth. 1996-11-27 - 1996-11-29.
1995
-
Academic lecture: Johnsen, Magne Hallstein; Svendsen, Torbjørn; Harborg, Erik. (1995) Experiments on cepstral mean subtraction and Rasta-filtering applied to SAMPA phoneme recognition. COST249. COST; Nancy. 1995-05-06 - 1995-05-07.
1994
-
Popular scientific lecture: Svendsen, Torbjørn. (1994) Acoustic segmentation of speech: applications in speech processing. [Missing data]. [Missing data]; [Missing data].
-
Popular scientific lecture: Svendsen, Torbjørn. (1994) Acoustic segmentation of speech: applications in speech processing. [Missing data]. [Missing data]; [Missing data].
-
Academic lecture: Svendsen, Torbjørn. (1994) Segmental quantization of speech spectral information. IEEE International Conference on Acoustics, Speech and Signal Processing. [Missing data]; [Missing data].
1993
-
Academic lecture: Svendsen, Torbjørn. (1993) Efficient quantization of speech spectral information. EUROSPEECH '93 (1993 : Berlin). [Missing data]; [Missing data].
1989
-
Academic lecture: Svendsen, Torbjørn Karl; Paliwal, Kuldip K.; Harborg, Erik; Husøy, Per Ove. (1989) An Improved Sub-Word Based Speech Recognizer. International Conference on Acoustics, Speech, and Signal Processing (ICASSP); Glasgow. 1989-05-01.
1988
-
Academic lecture: Svendsen, Torbjørn Karl; Paliwal, K.K.; Harborg, Erik; Husøy, P.O. (1988) Experiments with a Sub-Word Based Speech Recognizer. International Conference on Speech Science and Technology (ICSST); Sydney. 1988-12-01.