AL Mus’haf corpus is a Quranic corpus that was built using a semi-automatic technique that involves using the morphosyntactic analyzer of standard Arabic words “AlKhalil Morpho Sys 2” followed by a manual treatment. The obtained result is an annotated Quranic corpus in which we combine each word segment with additional morphosyntactical information such as stem, part-of-speech tags, lemma, root, and the vowelled pattern for each of the stem and lemma.
For further details, please check the following paper :
- Zeroual, I., Lakhouaja, A. A new Quranic Corpus rich in morphosyntactical information. Int J Speech Technol 19, 339–346 (2016).
https://doi.org/10.1007/s10772-016-9335-7

