AL Mus’haf corpus is a Quranic corpus that was built using a semi-automatic technique that involves using the morphosyntactic analyzer of standard Arabic words “AlKhalil Morpho Sys 2” followed by a manual treatment. The obtained result is an annotated Quranic corpus in which we combine each word segment with additional morphosyntactical information such as stem, part-of-speech tags, lemma, root, and the vowelled pattern for each of the stem and lemma.

For further details, please check the following paper :

  • Zeroual, I., Lakhouaja, A. A new Quranic Corpus rich in morphosyntactical information. Int J Speech Technol 19, 339–346 (2016).
    https://doi.org/10.1007/s10772-016-9335-7

You have the opportunity to download Al Mus'haf Corpus.

Download