About: Speaker diarisation

Property	Value
dbo:abstract	Speaker diarization (or diarization) (clarification: a human speaker is meant) is the process of partitioning an input audio stream into homogeneous segments according to the speaker identity. It can enhance the readability of an automatic speech transcription by structuring the audio stream into speaker turns and, when used together with speaker recognition systems, by providing the speaker’s true identity. It is used to answer the question "who spoke when?"Speaker diarisation is a combination of speaker segmentation and speaker clustering. The first aims at finding speaker change points in an audio stream. The second aims at grouping together speech segments on the basis of speaker characteristics. With the increasing number of broadcasts, meeting recordings and voice mail collected every year, speaker diarisation has received much attention by the speech community, as is manifested by the specific evaluations devoted to it under the auspices of the National Institute of Standards and Technology for telephone speech, broadcast news and meetings. (en) Диаризация (или разделение дикторов) — процесс разделения входящего аудиопотока на однородные сегменты в соответствии с принадлежностью аудиопотока тому или иному говорящему. Диаризация повышает качество текстов при автоматическом транскрибировании, а также может использоваться совместно с системой распознавания речи, значительно её улучшая. Диаризация используется для ответа на вопрос «Кто сейчас говорит?». Диаризация является сочетанием методов сегментации и кластеризации дикторов. Первый направлен на поиск точек смены диктора, второй — на группирование выделенных в речи диктора речевых сегментов. Одним из популярных методов при диаризации является использование алгоритмов на основе гауссовых смесей для моделирования каждого из говорящих и закрепление выделенных фрагментов за каждым из дикторов с помощью скрытой марковской модели. (ru)
dbo:wikiPageExternalLink	http://alize.univ-avignon.fr/svn/LIA_RAL/branches/2.0/LIA_SpkSeg/ http://shout-toolkit.sourceforge.net/ http://www-lium.univ-lemans.fr/diarization/doku.php/welcome%7CLIUM http://gforge.inria.fr/projects/audioseg http://www-lium.univ-lemans.fr/fr/content/liumspkdiarization https://github.com/pyannote/pyannote-audio https://github.com/tyiannak/pyAudioAnalysis https://www.springer.com/computer/image+processing/book/978-0-387-77591-3 http://www.icsi.berkeley.edu/~fractor/papers/friedland_146.pdf http://www.eurecom.fr/publication/3152 http://www.eurecom.fr/util/publidownload.fr.htm%3Fid=3000
dbo:wikiPageID	23322684 (xsd:integer)
dbo:wikiPageLength	5921 (xsd:nonNegativeInteger)
dbo:wikiPageRevisionID	1105281544 (xsd:integer)
dbo:wikiPageWikiLink	dbc:Speech_processing dbr:Mixture_model dbr:American_and_British_English_spelling_differences dbr:Graphics_processing_unit dbr:Speaker_recognition dbr:Speech_recognition dbr:Hidden_Markov_Model dbr:Artificial_neural_network dbc:Speech_recognition dbr:National_Institute_of_Standards_and_Technology dbr:ALIZE_Speaker_Diarization dbr:Audioseg dbr:PyAudioAnalysis dbr:Pyannote.audio dbr:SHoUT
dbp:wikiPageUsesTemplate	dbt:Cite_book dbt:Cite_journal dbt:More_citations_needed
dcterms:subject	dbc:Speech_processing dbc:Speech_recognition
gold:hypernym	dbr:Process
rdf:type	dbo:Election
rdfs:comment	Speaker diarization (or diarization) (clarification: a human speaker is meant) is the process of partitioning an input audio stream into homogeneous segments according to the speaker identity. It can enhance the readability of an automatic speech transcription by structuring the audio stream into speaker turns and, when used together with speaker recognition systems, by providing the speaker’s true identity. It is used to answer the question "who spoke when?"Speaker diarisation is a combination of speaker segmentation and speaker clustering. The first aims at finding speaker change points in an audio stream. The second aims at grouping together speech segments on the basis of speaker characteristics. (en) Диаризация (или разделение дикторов) — процесс разделения входящего аудиопотока на однородные сегменты в соответствии с принадлежностью аудиопотока тому или иному говорящему. Диаризация повышает качество текстов при автоматическом транскрибировании, а также может использоваться совместно с системой распознавания речи, значительно её улучшая. Диаризация используется для ответа на вопрос «Кто сейчас говорит?». Диаризация является сочетанием методов сегментации и кластеризации дикторов. Первый направлен на поиск точек смены диктора, второй — на группирование выделенных в речи диктора речевых сегментов. (ru)
rdfs:label	Speaker diarisation (en) Диаризация (ru)
owl:sameAs	freebase:Speaker diarisation wikidata:Speaker diarisation dbpedia-ru:Speaker diarisation https://global.dbpedia.org/id/4vtfV dbr:Speaker diarisation
prov:wasDerivedFrom	wikipedia-en:Speaker_diarisation?oldid=1105281544&ns=0
foaf:isPrimaryTopicOf	wikipedia-en:Speaker_diarisation
is dbo:wikiPageRedirects of	dbr:Open_source_speaker_diarisation_software dbr:Speaker_diarization dbr:Diarization
is dbo:wikiPageWikiLink of	dbr:Speaker_recognition dbr:Speech_recognition dbr:Open_source_speaker_diarisation_software dbr:Time-series_segmentation dbr:Speaker_diarization dbr:Diarization
is foaf:primaryTopic of	wikipedia-en:Speaker_diarisation