ArPC a corpus for paraphrase identification in Arabic text

Alaa Altheneyan & Mohamed Menai
ArPC is an Arabic paraphrase identification corpus. It consists of 1331 sentence pairs along with their binary score that indicates weather the pairs are paraphrase or not. The corpus has been manually annotated by three Arabic native speakers.
This data repository is not currently reporting usage information. For information on how your repository can submit usage information, please see our documentation.