Zaid Alyafeai

Fostering Research in Arabic NLP via Open Source Collaboration

Open source is all you need.

Arabic is a challenging language with rich morphology. The complexity comes from the flexibility of Arabic writing which gave rise to many dialects across the Arabic region. Unfortunately, the amount of open-source research to tackle Arabic is lacking and it is mostly contained in academic papers. In this talk I focus on my journey of democratizing Arabic NLP to reach a wide range of audience. This led up to creating many open source research projects and tools that focused on revising the NLP pipeline for Arabic. It also resulted in cool results in other related modalities like speech and GANs.


Zaid is a third-year PhD student in KFUPM in Saudi Arabia. He works on Arabic NLP with a focus on how to teach language models to learn morphology effectively.

Presentation Materials

Talk Video