We are happy to announce that our first Urdu Corpus Package has been published!
Package Name: Wikipedia Urdu 20160407
Release Date: 2016-05-10
Content Type Description: Unprocessed Plain Urdu Text
License: Open Source, Creative Commons Attribution-ShareAlike 4.0 International (CC BY-SA 4.0)
Source Name: Wikipedia Urdu
Source URL: https://ur.wikipedia.org
Uncompressed File Size: 108 MB
Uncompressed File Type: Text (.txt)
Compressed File Size: 21.4 MB
Compressed File Type: RAR (.rar)
Submitted By: Syed Muhammad Humayun - firstname.lastname@example.org
Submission Date: 2016-05-10
Download URL: wikipedia-urdu-20160407.rar (21.4 MB) | (.md5) | (.sha1)
Info/Comments: For details on how we created this package, including the complete process, technology and tools involved, read the detailed post here.
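Since the download comes with .md5 and .sha1 checksum files, it may be worth verifying the archive before extracting it. The following is only a minimal sketch: the helper names (`file_digest`, `read_expected`, `verify`) are our own, and it assumes each checksum file holds the hex digest as its first whitespace-separated token, which may not match the actual files.

```python
import hashlib


def file_digest(path, algorithm="sha1", chunk_size=1 << 20):
    """Return the hex digest of the file at `path`, reading it in chunks."""
    digest = hashlib.new(algorithm)
    with open(path, "rb") as handle:
        # Read fixed-size chunks so the whole archive is never held in memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def read_expected(checksum_path):
    """Read the expected digest from a checksum file.

    Assumption: the first whitespace-separated token is the hex digest.
    """
    with open(checksum_path, "r", encoding="utf-8") as handle:
        return handle.read().split()[0].lower()


def verify(path, checksum_path, algorithm="sha1"):
    """Return True if the file's digest matches the one in the checksum file."""
    return file_digest(path, algorithm) == read_expected(checksum_path)
```

For example, `verify("wikipedia-urdu-20160407.rar", "wikipedia-urdu-20160407.rar.sha1", "sha1")` would return True for an intact download, assuming the checksum file was saved under that (hypothetical) name next to the archive. Pass `"md5"` as the algorithm to check against the .md5 file instead.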
View complete list of Published Urdu Corpus Packages.
Please leave your feedback, comments, suggestions or questions below. We would love to hear from you and will respond as soon as possible, inshaAllah.