Yassine El Kheir - Researcher in Speech Processing and Deepfake Detection
Hi π, Iβm Yassine El Kheir
Iβm a PhD student at the German Research Center for Artificial Intelligence (DFKI) in Berlin, working on robust speech representations in self-supervised learning (SSL) models for audio deepfake detection under the supervision of Prof. Sebastian MΓΆller and Dr. Tim Polzehl.
Research Interests π
- π Self-Supervised Learning (SSL) for Speech
- π Audio Deepfake Detection, Anti-Spoofing
- π Multilingual and Non-Native Speech Processing/Recognition
- π£ Automatic Speech Recognition (ASR) and NLP
News β¨
- 2025-02-05: π Excited to announce MorphBPE Tokenizer used in Fanar Qatar LLM is published!
- 2025-01-25: π Excited to announce a new paper on Layer-wise Analysis of SSL Models for Audio Deepfake Detection model interpretability accepted to Findings of NAACL 2025!
- 2024-12-12: Invited as a researcher for a two-week project at the SDAIA Winter School, organized by SDAIA.
Education π
- π PhD in Computer Science (2024 - Exp. 2027) - DFKI, Berlin, Germany
- π MSc in Machine Learning (2021 - 2022) - KTH Royal Institute of Technology, Sweden
- π Master in Data Science (2020 - 2021) - EURECOM & TΓ©lΓ©com Paris, France
- π Master in Digital Engineering (2019 - 2022) - TΓ©lΓ©com Paris, France
- π Preparatory Classes (CPGE) (2017 - 2019) - LycΓ©e Mohammed VI, Morocco
Jobs π§βπ»
- 2024.07 - ongoing: PhD Student - Researcher @ DFKI, Berlin, Germany
- 2022.07 - 2024.07: Research Associate @ Qatar Computing Research Institute (QCRI), Qatar
- 2022.02 - 2022.07: Machine Learning Intern @ Snappet, Netherlands
Selected Publications π
- Comprehensive Layer-wise Analysis of SSL Models for Audio Deepfake Detection - NAACL Findings 2025
- Beyond Orthography: Automatic Recovery of Short Vowels and Dialectal Sounds in Arabic - ACL 2024
- Speech Representation Analysis Based on Inter- and Intra-Model Similarities - IEEE WICASSP 2024
- L1-aware Multilingual Mispronunciation Detection Framework - IEEE ICASSP 2024
For a full list of my publications, visit my Google Scholar.
Projects π
- News-Polygraph: π€ News-polygraph is a collaborative research project working on a comprehensive, multimodal technology platform for analyzing and detecting disinformation (speech part β deepfake detection).
- Fanar LLM: π€ An Arabic-centric large language model supporting multiple dialects.
- QVoice: π£οΈ The first Arabic speech mispronunciation detection system.
- AraVoiceL2 Dataset: π€ A dataset of non-native Arabic speech for phoneme-level mispronunciation detection.
Awards & Scholarships π
- π Telecom Paris Scholarship (2022-2023)
- π Excellence Scholarship FIRSI (2019-2022)
- π Prepa FIRSI Scholarship (2018-2019)
Contact π¬
- π§ Email: elkheiryassine0@gmail.com
- π Website: yassine.el_kheir.github.io
- π LinkedIn: yassine-elkheir