Kashmiri Data Corpus
Summary: An audio and text corpus for the Kashmiri language
Downloads (use a mirror closer to you):
kashmiri.tar.gz [394M] ( Kashmiri speech and transcripts ) Mirrors: [US] [EU] [CN]
About this resource:
This is a collection of transcribed Kashmiri recordings taken from native speakers.
The data collection and transcription was done by a group of students from Kashmir, India who were working on a project for the development of an ASR system for the Kashmiri language.
Scripts for the post-processing of this dataset can be found at https://github.com/erstan/kscp