Below is a list of scientific publications resulting from work conducted in the context of Meetween:
- Danni Liu and Jan Niehues. “Recent Highlights in Multilingual and Multimodal Speech Translation ”. In Proceedings of the 21st International Conference on Spoken Language Translation (IWSLT 2024) (pp. 235-253).
- Tu Anh Dinh, Tobias Palzer, Jan Niehues. “Quality Estimation with k-nearest Neighbors and Automatic Evaluation for Model-specific Quality Estimation ”. arXiv preprint arXiv:2404.18031.
- Sai Koneru, Thai-Binh Nguyen, Ngoc-Quan Pham, Danni Liu, Zhaolin Li, Alexander Waibel, Jan Niehues. “Blending LLMs into Cascaded Speech Translation: KIT’s Offline Speech Translation System for IWSLT 2024 ”. arXiv preprint arXiv:2406.16777.
- Carlos Mullov, Ngoc-Quan Pham, Alexander Waibel. “Decoupled Vocabulary Learning Enables Zero-Shot Translation from Unseen Languages ”. arXiv preprint arXiv:2408.02290.
- Francesco Verdini, Pierfrancesco Melucci, Stefano Perna, Francesco Cariaggi, et al. “How to Connect Speech Foundation Models and Large Language Models? What Matters and What Does Not ”. arXiv preprint arXiv:2409.17044.
- Dogucan Yaman, Fevziye Irem Eyiokur, et al. “Audio-driven Talking Face Generation with Stabilized Synchronization Loss ”. ECCV 2024.
- Beatrice Savoldi, Sara Papi, Matteo Negri, Ana Guerberof Arenas, Luisa Bentivogli. “What the Harm? Quantifying the Tangible Impact of Gender Bias in Machine Translation with a Human-centered Study ”. EMNLP 2024 Main.
- Marco Gaido*, Sara Papi*, Luisa Bentivogli, Alessio Brutti, Mauro Cettolo, Roberto Gretter, Marco Matassoni, Mohamed Nabih, Matteo Negri. “MOSEL: 950,000 Hours of Open-Source Compliant Speech Data for EU Languages ”. EMNLP 2024 Main.
- Sara Papi, Marco Gaido, Matteo Negri and Luisa Bentivogli. “StreamAtt: Direct Streaming Speech-to-Text Speech Translation with Attention-based History Selection ”. ACL 2024.
- Marco Gaido, Sara Papi, Matteo Negri and Luisa Bentivogli. “Speech Translation with Speech Foundation Models and Large Language Models: What is There and What is Missing? ”. ACL 2024.
- Umberto Cappellazzo, Enrico Fini, Muqiao Yang, Daniele Falavigna, Alessio Brutti and Bhiksha Raj. “Continual Contrastive Spoken Language Understanding ”. ACL-findings 2024.
- Sara Papi, Marco Gaido, Matteo Negri, Luisa Bentivogli. “SimulSeamless: FBK’s submission for Simultaneous Speech Translation at IWSLT 2024 ”. IWSLT 2024.
- Beatrice Savoldi, Marco Gaido, Matteo Negri, Luisa Bentivogli. “FBK@IWSLT Test Suites Task: Gender Bias Evaluation with MuST-SHE ”. IWSLT 2024.
- Dogucan Yaman, Fevziye Irem Eyiokur, et al. “Audio-Visual Speech Representation Expert for Enhanced Talking Face Video Generation and Evaluation ”. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2024.
- Erdi Sarıtaş and Hazım Kemal Ekenel. “Analyzing the Feature Extractor Networks for Face Image Synthesis ”. 2024 IEEE 18th International Conference on Automatic Face and Gesture Recognition (FG).
- Maike Züfle and Jan Niehues. “Contrastive Learning for Task-Independent SpeechLLM-Pretraining ”. ACL-findings 2025.
- Ibrahim Said Ahmad, Antonios Anastasopoulos, Ondřej Bojar, Claudia Borg, Marine Carpuat, Roldano Cattoni, Mauro Cettolo, William Chen, Qianqian Dong, Marcello Federico, Barry Haddow, Dávid Javorský, Mateusz Krubiński, Tsz Kin Lam, Xutai Ma, Prashant Mathur, Evgeny Matusov, Chandresh Maurya, John McCrae, Kenton Murray, Satoshi Nakamura, Matteo Negri, Jan Niehues, Xing Niu, Atul Kr. Ojha, John Ortega, Sara Papi, Peter Polák, Adam Pospíšil, Pavel Pecina, Elizabeth Salesky, Nivedita Sethiya, Balaram Sarkar, Jiatong Shi, Claytone Sikasote, Matthias Sperber, Sebastian Stüker, Katsuhito Sudoh, Brian Thompson, Alex Waibel, Shinji Watanabe, Patrick Wilken, Petr Zemánek, and Rodolfo Zevallos. “Findings of the IWSLT 2024 Evaluation Campaign ”. IWSLT 2024.
- Siqi Li*, Danni Liu*, Jan Niehues. “Optimizing Rare Word Accuracy in Direct Speech Translation with a Retrieval-and-Demonstration Approach ”. EMNLP 2024.
- Chrysoula Zerva, Frédéric Blain, José GC de Souza, Diptesh Kanojia, Sourabh Deoghare, Nuno M Guerreiro, Giuseppe Attanasio, Ricardo Rei, Constantin Orasan, Matteo Negri, Marco Turchi, Rajen Chatterjee, Pushpak Bhattacharyya, Markus Freitag, André FT Martins. “Findings of the Quality Estimation Shared Task at WMT 2024: Are LLMs Closing the Gap in QE? ”. WMT 2024.
- Mauro Cettolo, Andrea Piergentili, Sara Papi, Marco Gaido, Matteo Negri, Luisa Bentivogli. “MAGNET - MAchines GeNErating Translations: A CALAMITA Challenge ”. CLiC-it 2024.
- Simona Frenda, Andrea Piergentili, Beatrice Savoldi, Marco Madeddu, Martina Rosola, Silvia Casola, Chiara Ferrando, Viviana Patti, Matteo Negri and Luisa Bentivogli. “GFG - Gender-Fair Generation: A CALAMITA Challenge ”. CLiC-it 2024.
- Dennis Fucci, Beatrice Savoldi, Marco Gaido, Matteo Negri, Mauro Cettolo and Luisa Bentivogli. “Explainability for Speech Models: On the Challenges of Acoustic Feature Selection ”. CLiC-it 2024.
- Pierfrancesco Melucci, Stefano Perna, Francesco Verdini, Francesco Cariaggi. “Talking Heads: Bootstrapping Pre-trained LLMs to Build an end-to-end Speech Foundation Model”. GenAI Autumn School 2024, Université Paris-Saclay.
- Beatrice Savoldi, Jasmijn Bastings, Luisa Bentivogli, Eva Vanmassenhove. “A decade of gender bias in Machine Translation ”. Patterns, Volume 6, Issue 6, 101257 (2025).
- Hyunji Lee, Danni Liu, Supriti Sinhamahapatra, Jan Niehues. “How do Multimodal Foundation Models Encode Text and Speech? An Analysis of Cross-Lingual and Cross-Modal Representations ”. NAACL 2025.
- Shamil Chollampatt, Minh Quang Pham, Sathish Reddy Indurthi, Marco Turchi. “Cross-lingual Evaluation of Multilingual Text Generation ”. COLING 2025.
- Beomseok Lee, Marco Gaido, Ioan Calapodescu, Laurent Besacier and Matteo Negri. “Speech Foundation Models and Crowdsourcing for Efficient, High-Quality Data Collection ”. COLING 2025.
- Christian Huber, Alexander Waibel. “Continuously Learning New Words in Automatic Speech Recognition ”. ICASSP 2025.
- Thai-Binh Nguyen, Alexander Waibel. “MSA-ASR: Efficient Multilingual Speaker Attribution with frozen ASR Models ”. ICASSP 2025.
- Enes Yavuz Ugan, Ngoc-Quan Pham, Leonard Bärmann, and Alex Waibel. “PIER: A Novel Metric for Evaluating What Matters in Code-Switching ”. ICASSP 2025.
- Julius Cheng, Maike Züfle, Vilém Zouhar, Andreas Vlachos. “A Bayesian Optimization Approach to Machine Translation Reranking ”. NAACL 2025.
- Tsz Kin Lam*, Marco Gaido*, Sara Papi, Luisa Bentivogli, Barry Haddow. “Prepending or Cross-Attention for Speech-to-Text? An Empirical Comparison ”. NAACL 2025.
- Yining Liu, Alexander Waibel. “Factorized-VITS: Decoupling Prosody and Text in End-to-End Speech Synthesis without External or Secondary Aligner ”. ICASSP 2025.
- Sara Papi, Peter Polák, Dominik Macháček, Ondřej Bojar. “How “Real” is Your Real-Time Simultaneous Speech-to-Text Translation System? ”. TACL 2025.
- Maike Züfle, Sara Papi, Beatrice Savoldi, Marco Gaido, Luisa Bentivogli, Jan Niehues. “NUTSHELL: A Dataset for Abstract Generation from Scientific Talks ”. IWSLT 2025.
- Vilém Zouhar, Maike Züfle, Beni Egressy, Julius Cheng, Jan Niehues. “Early-Exit and Instant Confidence Translation Quality Estimation ”. arXiv preprint (under review) (2025).
- Sai Koneru*, Maike Züfle*, Thai-Binh Nguyen, Seymanur Akti, Jan Niehues, Alexander Waibel. “KIT’s Offline Speech Translation and Instruction Following Submission for IWSLT 2025 ”. IWSLT 2025.
- Andrea Piergentili, Beatrice Savoldi, Matteo Negri, Luisa Bentivogli. “An LLM-as-a-judge Approach for Scalable Gender-Neutral Translation Evaluation ”. GITT Workshop @ MT-Summit 2025.
- Dennis Fucci, Marco Gaido, Matteo Negri, Luisa Bentivogli, André Martins, Giuseppe Attanasio. “Different Speech Translation Models Encode and Translate Speaker Gender Differently ”. ACL 2025.
- Beatrice Savoldi, Alan Ramponi, Matteo Negri, Luisa Bentivogli. “Translation in the Hands of Many: Centering Lay Users in Machine Translation Interactions ”. EMNLP 2025.
- Fabian Retkowski, Maike Züfle, Andreas Sudmann, Dinah Pfau, Jan Niehues, Alexander Waibel. “From Speech to Summary: A Comprehensive Survey of Speech Summarization ”. EMNLP 2025.
- Felix Schneider, Marco Turchi, Alex Waibel. “Policies and Evaluation for Online Meeting Summarization ”. arXiv preprint (2025).
- Marco Gaido*, Sara Papi*, Luisa Bentivogli, Alessio Brutti, Mauro Cettolo, Roberto Gretter, Marco Matassoni, Mohamed Nabih, Matteo Negri. “The Warmup Dilemma: How Learning Rate Strategies Impact Speech-to-Text Model Convergence ”. IWSLT 2025.
- Sara Papi*, Marco Gaido*, Luisa Bentivogli, Alessio Brutti, Mauro Cettolo, Roberto Gretter, Marco Matassoni, Mohamed Nabih, Matteo Negri. “FAMA: The First Large-Scale Open-Science Speech Foundation Model for Italian and English ”. CLiC-it 2025.
- Thai-Binh Nguyen, Ngoc-Quan Pham, Alexander Waibel. “Cocktail-Party Audio-Visual Speech Recognition ”. Interspeech 2025.
- Sara Papi, Maike Züfle, Marco Gaido, Beatrice Savoldi, Danni Liu, Ioannis Douros, Luisa Bentivogli, Jan Niehues. “MCIF: Multimodal Crosslingual Instruction-Following Benchmark from Scientific Talks ”. arXiv preprint (2025).
- Idris Abdulmumin, Victor Agostinelli, Tanel Alumäe, Antonios Anastasopoulos, Luisa Bentivogli, Ondřej Bojar, Claudia Borg, Fethi Bougares, Roldano Cattoni, Mauro Cettolo, Lizhong Chen, William Chen, Raj Dabre, Yannick Estève, Marcello Federico, Mark Fishel, Marco Gaido, Dávid Javorský, Marek Kasztelnik, Fortuné Kponou, Mateusz Krubiński, Tsz Kin Lam, Danni Liu, Evgeny Matusov, Chandresh Kumar Maurya, John P. McCrae, Salima Mdhaffar, Yasmin Moslem, Kenton Murray, Satoshi Nakamura, Matteo Negri, Jan Niehues, Atul Kr. Ojha, John E. Ortega, Sara Papi, Pavel Pecina, Peter Polák, Piotr Połeć, Ashwin Sankar, Beatrice Savoldi, Nivedita Sethiya, Claytone Sikasote, Matthias Sperber, Sebastian Stüker, Katsuhito Sudoh, Brian Thompson, Marco Turchi, Alex Waibel, Patrick Wilken, Rodolfo Zevallos, Vilém Zouhar, Maike Züfle. “Findings of the IWSLT 2025 Evaluation Campaign ”. IWSLT 2025.
- Eren Onaran, Erdi Sarıtaş, Hazım Kemal Ekenel. “Impact of Face Alignment on Face Image Quality ”. EAI ROSENET 2024.
- Mustafa İzzet Muştu, Hazım Kemal Ekenel. “Facial Attribute Based Text Guided Face Anonymization ”. arXiv preprint (2025).
- Mustafa İzzet Muştu, Hazım Kemal Ekenel. “Assessing the Use of Face Swapping Methods as Face Anonymizers in Videos ”. DSP 2025.