About: Persian Speech Corpus

An Entity of Type: Thing, from Named Graph: http://dbpedia.org, within Data Space: dbpedia.org:8891

The Persian Speech Corpus is a Modern Persian speech corpus for speech synthesis. The corpus contains phonetic and orthographic transcriptions of about 2.5 hours of Persian speech aligned with recorded speech on the phoneme level, including annotations of word boundaries. Previous spoken corpora of Persian include FARSDAT, which consists of read aloud speech from newspaper texts from 100 Persian speakers and the Telephone FARsi Spoken language DATabase (TFARSDAT) which comprises seven hours of read and spontaneous speech produced by 60 native speakers of Persian from ten regions of Iran.

Property	Value
dbo:abstract	The Persian Speech Corpus is a Modern Persian speech corpus for speech synthesis. The corpus contains phonetic and orthographic transcriptions of about 2.5 hours of Persian speech aligned with recorded speech on the phoneme level, including annotations of word boundaries. Previous spoken corpora of Persian include FARSDAT, which consists of read aloud speech from newspaper texts from 100 Persian speakers and the Telephone FARsi Spoken language DATabase (TFARSDAT) which comprises seven hours of read and spontaneous speech produced by 60 native speakers of Persian from ten regions of Iran. The Persian Speech Corpus was built using the same methodologies laid out in the doctoral project on Modern Standard Arabic of Nawar Halabi at the University of Southampton. The work was funded by MicroLinkPC, who own an exclusive license to commercialise the corpus, though the corpus is available for non-commercial use through the corpus' website. It is distributed under the Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License. The corpus was built for speech synthesis purposes, but has been used for building HMM based voices in Persian. It can also be used to automatically align other speech corpora with their phonetic transcript and could be used as part of a larger corpus for training speech recognition systems. (en)
dbo:wikiPageExternalLink	http://www.arabicspeechcorpus.com https://creativecommons.org/licenses/by-nc-sa/4.0/ http://www.persianspeechcorpus.com
dbo:wikiPageID	54057693 (xsd:integer)
dbo:wikiPageLength	3515 (xsd:nonNegativeInteger)
dbo:wikiPageRevisionID	1021770535 (xsd:integer)
dbo:wikiPageWikiLink	dbc:Datasets_in_machine_learning dbr:University_of_Southampton dbr:Creative_Commons dbr:Orthography dbr:Phonetic dbr:Speech_synthesis dbr:Speech_corpus dbr:Iran dbc:Persian_language dbc:Persian_corpora dbr:Hidden_Markov_model dbr:Phoneme dbr:Comparison_of_datasets_in_machine_learning dbr:Word_boundary_(linguistics) dbr:Modern_Persian
dbp:wikiPageUsesTemplate	dbt:COI dbt:Multiple_issues dbt:Notability dbt:Primary_sources dbt:Reflist dbt:Corpus_linguistics
dcterms:subject	dbc:Datasets_in_machine_learning dbc:Persian_language dbc:Persian_corpora
rdfs:comment	The Persian Speech Corpus is a Modern Persian speech corpus for speech synthesis. The corpus contains phonetic and orthographic transcriptions of about 2.5 hours of Persian speech aligned with recorded speech on the phoneme level, including annotations of word boundaries. Previous spoken corpora of Persian include FARSDAT, which consists of read aloud speech from newspaper texts from 100 Persian speakers and the Telephone FARsi Spoken language DATabase (TFARSDAT) which comprises seven hours of read and spontaneous speech produced by 60 native speakers of Persian from ten regions of Iran. (en)
rdfs:label	Persian Speech Corpus (en)
owl:sameAs	yago-res:Persian Speech Corpus wikidata:Persian Speech Corpus dbpedia-fa:Persian Speech Corpus https://global.dbpedia.org/id/2qpkp dbr:Persian Speech Corpus
prov:wasDerivedFrom	wikipedia-en:Persian_Speech_Corpus?oldid=1021770535&ns=0
foaf:homepage	http://www.persianspeechcorpus.com
foaf:isPrimaryTopicOf	wikipedia-en:Persian_Speech_Corpus
is dbo:wikiPageWikiLink of	dbr:Outline_of_machine_learning
is foaf:primaryTopic of	wikipedia-en:Persian_Speech_Corpus