Navigation

  • Skip to Content
NTNU Home NTNU Home

ntnu.edu

  • Studies
    • Master's programmes in English
    • For exchange students
    • PhD opportunities
    • All programmes of study
    • Courses
    • Financing
    • Language requirements
    • Application process
    • Academic calendar
    • FAQ
  • Research and innovation
    • NTNU research
    • Research excellence
    • Strategic research areas
    • Innovation resources
    • PhD opportunities
  • Life and housing
    • Student in Trondheim
    • Student in Gjøvik
    • Student in Ålesund
    • For researchers
    • Life and housing
  • About NTNU
    • Contact us
    • Faculties and departments
    • Libraries
    • International researcher support
    • Vacancies
    • About NTNU
    • Maps
  1. Employees

Språkvelger

Norsk

Xinwei Cao

Download press photo
Download press photo
Foto:

Xinwei Cao

PhD Candidate
Department of Electronic Systems

xinwei.cao@ntnu.no
+49 152 26795174 Elektro C, Gløshaugen
Publications Outreach

Publications

  • Chronological
  • By category
  • All publications registered in NVA

2025

  • Fan, Zijian; Cao, Xinwei; Salvi, Giampiero; Svendsen, Torbjørn. (2025) Improving Phone Recognition through Informed Initialization and Path-Aligned CTC Loss.
    Academic chapter/article/Conference paper
  • Cao, Xinwei; Fan, Zijian; Svendsen, Torbjørn; Salvi, Giampiero. (2025) Child speech assessment through large language model speech synthesis: Preliminary results.
    Academic chapter/article/Conference paper

2024

  • Cao, Xinwei. (2024) Kos-Interspeech2024.
    Other
  • Cao, Xinwei; Fan, Zijian; Svendsen, Torbjørn Karl; Salvi, Giampiero. (2024) A Framework for Phoneme-Level Pronunciation Assessment Using CTC. Interspeech
    Academic article
  • Fan, Zijian; Cao, Xinwei; Salvi, Giampiero; Svendsen, Torbjørn Karl. (2024) Towards Better Recognition of Spontaneous Children's Speech: Speaker-Clustering Fine-Tuning of Whisper. Machine Learning for Signal Processing
    Academic article
  • Olstad, Anne Marte Haug; Smolander, Anna; Strömbergsson, Sofia; Ylinen, Sari; Lehtonen, Minna; Kurimo, Mikko. (2024) Collecting Linguistic Resources for Assessing Children’s Pronunciation of Nordic Languages. Proceedings of LREC
    Academic article

2023

  • Cao, Xinwei. (2023) Interspeech 2023.
    Other
  • Fan, Zijian; Cao, Xinwei; Salvi, Giampiero; Svendsen, Torbjørn Karl. (2023) Using Modified Adult Speech as Data Augmentation for Child Speech Recognition. Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing
    Academic article
  • Cao, Xinwei; Fan, Zijian; Svendsen, Torbjørn Karl; Salvi, Giampiero. (2023) An Analysis of Goodness of Pronunciation for Child Speech. Interspeech
    Academic article

Journal publications

  • Cao, Xinwei; Fan, Zijian; Svendsen, Torbjørn Karl; Salvi, Giampiero. (2024) A Framework for Phoneme-Level Pronunciation Assessment Using CTC. Interspeech
    Academic article
  • Fan, Zijian; Cao, Xinwei; Salvi, Giampiero; Svendsen, Torbjørn Karl. (2024) Towards Better Recognition of Spontaneous Children's Speech: Speaker-Clustering Fine-Tuning of Whisper. Machine Learning for Signal Processing
    Academic article
  • Olstad, Anne Marte Haug; Smolander, Anna; Strömbergsson, Sofia; Ylinen, Sari; Lehtonen, Minna; Kurimo, Mikko. (2024) Collecting Linguistic Resources for Assessing Children’s Pronunciation of Nordic Languages. Proceedings of LREC
    Academic article
  • Fan, Zijian; Cao, Xinwei; Salvi, Giampiero; Svendsen, Torbjørn Karl. (2023) Using Modified Adult Speech as Data Augmentation for Child Speech Recognition. Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing
    Academic article
  • Cao, Xinwei; Fan, Zijian; Svendsen, Torbjørn Karl; Salvi, Giampiero. (2023) An Analysis of Goodness of Pronunciation for Child Speech. Interspeech
    Academic article

Part of book/report

  • Fan, Zijian; Cao, Xinwei; Salvi, Giampiero; Svendsen, Torbjørn. (2025) Improving Phone Recognition through Informed Initialization and Path-Aligned CTC Loss.
    Academic chapter/article/Conference paper
  • Cao, Xinwei; Fan, Zijian; Svendsen, Torbjørn; Salvi, Giampiero. (2025) Child speech assessment through large language model speech synthesis: Preliminary results.
    Academic chapter/article/Conference paper
  • Cao, Xinwei. (2024) Kos-Interspeech2024.
    Other
  • Cao, Xinwei. (2023) Interspeech 2023.
    Other

Outreach

2025

  • Academic lecture
    Fan, Zijian; Cao, Xinwei; Salvi, Giampiero; Svendsen, Torbjørn. (2025) Improving Phone Recognition through Informed Initialization and Path-Aligned CTC Loss. 2025 IEEE 35th International Workshop on Machine Learning for Signal Processing (MLSP) 2025-08-30 - 2025-09-02
  • Academic lecture
    Cao, Xinwei; Fan, Zijian; Svendsen, Torbjørn; Salvi, Giampiero. (2025) Child speech assessment through large language model speech synthesis: Preliminary results. 2025 IEEE 35th International Workshop on Machine Learning for Signal Processing (MLSP) 2025-08-30 - 2025-09-02

2024

  • Academic lecture
    Olstad, Anne Marte Haug; Smolander, Anna; Strömbergsson, Sofia; Ylinen, Sari; Lehtonen, Minna; Kurimo, Mikko. (2024) Collecting Linguistic Resources for Assessing Children’s Pronunciation of Nordic Languages. LREC-COLING , Turin, Italy 2024-05-20 - 2024-05-24
  • Academic lecture
    Fan, Zijian; Cao, Xinwei; Salvi, Giampiero; Svendsen, Torbjørn Karl. (2024) Towards Better Recognition of Spontaneous Children's Speech: Speaker-Clustering Fine-Tuning of Whisper. chine Learning for Signal Processing , London, UK 2024-09-22 - 2024-09-25
  • Academic lecture
    Cao, Xinwei; Fan, Zijian; Svendsen, Torbjørn Karl; Salvi, Giampiero. (2024) Framework for Phoneme-Level Pronunciation Assessment Using CTC. Interspeech , Kos, Greece 2024-09-01 - 2024-09-05

2023

  • Academic lecture
    Fan, Zijian; Cao, Xinwei; Salvi, Giampiero; Svendsen, Torbjørn Karl. (2023) Using Modified Adult Speech as Data Augmentation for Child Speech Recognition. ICASSP , Rhodes, Greece 2023-06-04 - 2023-06-10
  • Academic lecture
    Cao, Xinwei; Fan, Zijian; Svendsen, Torbjørn Karl; Salvi, Giampiero. (2023) An Analysis of Goodness of Pronunciation for Child Speech. Interspeech , Dublin, Irland 2023-08-20 - 2023-08-24

NTNU – Norwegian University of Science and Technology

  • For employees
  • |
  • For students
  • |
  • Intranet
  • |
  • Blackboard

Studies

  • Master's programmes in English
  • For exchange students
  • PhD opportunities
  • Courses
  • Career development
  • Continuing education
  • Application process

News

  • NTNU News
  • Vacancies

About NTNU

  • About the university
  • Libraries
  • NTNU's strategy
  • Research excellence
  • Strategic research areas
  • Organizational chart

Contact

  • Contact NTNU
  • Employees
  • Find experts
  • Press contacts
  • Researcher support
  • Maps

NTNU in three cities

  • NTNU in Gjøvik
  • NTNU in Trondheim
  • NTNU in Ålesund

About this website

  • Use of cookies
  • Accessibility statement
  • Privacy policy
  • Editorial responsibility
Facebook Instagram Linkedin Snapchat Tiktok Youtube
Sign In
NTNU logo