A speech corpus (or spoken corpus) is a database of speech audio files and text transcriptions.In Speech technology, speech corpora are used, among other things, to create acoustic models (which can then be used with a speech recognition engine). In Linguistics, spoken corpora are used to do research into Phonetic, Conversation analysis, Dialectology and other fields. A corpus is one such database. Corpora is the plural of corpus (i.e. it is many such databases). There are two types of Speech Corpora:

Property Value
dbo:abstract
  • En linguistique, un corpus oral est un corpus constitué de transcriptions de données orales. (fr)
  • A speech corpus (or spoken corpus) is a database of speech audio files and text transcriptions.In Speech technology, speech corpora are used, among other things, to create acoustic models (which can then be used with a speech recognition engine). In Linguistics, spoken corpora are used to do research into Phonetic, Conversation analysis, Dialectology and other fields. A corpus is one such database. Corpora is the plural of corpus (i.e. it is many such databases). There are two types of Speech Corpora: 1. * Read Speech - which includes: 2. * Book excerpts 3. * Broadcast news 4. * Lists of words 5. * Sequences of numbers 6. * Spontaneous Speech - which includes: 7. * Dialogs - between two or more people (includes meetings); 8. * Narratives - a person telling a story (one such corpus is the Buckeye Corpus); 9. * Map-tasks - one person explains a route on a map to another; 10. * Appointment-tasks - two people try to find a common meeting time based on individual schedules. A special kind of speech corpora are non-native speech databases that contain speech with foreign accent. (en)
  • 口语语料库为语言音频文件和文字副本的数据库。在语音技术里,口语语料库可用于创建声学模型,配合语音识别引擎使用。在语言学里,口语语料库可用于语音学、会话分析、方言学等方面的研究。 口语语料库主要分为朗读语料和自然口语两类。 (zh)
dbo:wikiPageExternalLink
dbo:wikiPageID
  • 11322771 (xsd:integer)
dbo:wikiPageRevisionID
  • 731344832 (xsd:integer)
dct:subject
http://purl.org/linguistics/gold/hypernym
rdf:type
rdfs:comment
  • En linguistique, un corpus oral est un corpus constitué de transcriptions de données orales. (fr)
  • 口语语料库为语言音频文件和文字副本的数据库。在语音技术里,口语语料库可用于创建声学模型,配合语音识别引擎使用。在语言学里,口语语料库可用于语音学、会话分析、方言学等方面的研究。 口语语料库主要分为朗读语料和自然口语两类。 (zh)
  • A speech corpus (or spoken corpus) is a database of speech audio files and text transcriptions.In Speech technology, speech corpora are used, among other things, to create acoustic models (which can then be used with a speech recognition engine). In Linguistics, spoken corpora are used to do research into Phonetic, Conversation analysis, Dialectology and other fields. A corpus is one such database. Corpora is the plural of corpus (i.e. it is many such databases). There are two types of Speech Corpora: (en)
rdfs:label
  • Corpus oral (fr)
  • Speech corpus (en)
  • 口语语料库 (zh)
owl:sameAs
prov:wasDerivedFrom
foaf:isPrimaryTopicOf
is dbo:wikiPageDisambiguates of
is foaf:primaryTopic of