Urdu Corpus Packages

Following is a list of Published Urdu Corpus Packages from Urdu Corpus Community. Are you looking for a specific Data Set that is not available here or you are unable to find? Leave your Questions, Comments, Feedback below or email us at smhumayun@gmail.com and we will respond you with relevant information.


Unprocessed Plain Urdu Text Packages:


  1. Package Name: Wikipedia Urdu 20160407
    Release Date: 2016-05-10
    Language: Urdu
    Content Type Description: Unprocessed Plain Urdu Text
    License: Open Source Creative Commons Attribution-ShareAlike 4.0 International License.
    Source Name: Wikipedia Urdu
    Source URL:  https://ur.wikipedia.org
    Un-compressed File Size: 108 MB
    Un-compressed File Type: Text (.txt)
    Compressed File Size: 21.4 MB
    Compressed File Type: RAR (.rar)
    Submitted By: Syed Muhammad Humayun - smhumayun@gmail.com
    Submission Date: 2016-05-10
    Download URL: wikipedia-urdu-20160407.rar (21.4 MB) | (.md5) | (.sha1)
    Info/Comments: For details on how we created this package and the complete process, technology and tools involved, read a detailed post here.

No comments:

Post a Comment