About: LipNet

An Entity of Type: Thing, from Named Graph: http://dbpedia.org, within Data Space: dbpedia.org:8891

LipNet is a deep neural network for visual speech recognition. It was created by Yannis Assael, Brendan Shillingford, Shimon Whiteson, and Nando de Freitas, researchers at the University of Oxford. The technique, outlined in a paper published in November 2016, decodes text from the movement of a speaker's mouth. Traditional visual speech recognition approaches separated the problem into two stages: designing or learning visual features, and prediction. LipNet was the first end-to-end sentence-level lipreading model to learn spatiotemporal visual features and a sequence model simultaneously. Audio-visual speech recognition has enormous practical potential, with applications in improved hearing aids, medical settings such as improving the recovery and wellbeing of critically ill patients, and speech recognition in noisy environments, such as Nvidia's autonomous vehicles.

Property Value
dbo:abstract
  • LipNet is a deep neural network for visual speech recognition. It was created by Yannis Assael, Brendan Shillingford, Shimon Whiteson, and Nando de Freitas, researchers at the University of Oxford. The technique, outlined in a paper published in November 2016, decodes text from the movement of a speaker's mouth. Traditional visual speech recognition approaches separated the problem into two stages: designing or learning visual features, and prediction. LipNet was the first end-to-end sentence-level lipreading model to learn spatiotemporal visual features and a sequence model simultaneously. Audio-visual speech recognition has enormous practical potential, with applications in improved hearing aids, medical settings such as improving the recovery and wellbeing of critically ill patients, and speech recognition in noisy environments, such as Nvidia's autonomous vehicles. (en)
dbo:wikiPageExternalLink
dbo:wikiPageID
  • 65950731 (xsd:integer)
dbo:wikiPageLength
  • 2112 (xsd:nonNegativeInteger)
dbo:wikiPageRevisionID
  • 1109254909 (xsd:integer)
dbo:wikiPageWikiLink
dbp:wikiPageUsesTemplate
dcterms:subject
rdfs:comment
  • LipNet is a deep neural network for visual speech recognition. It was created by Yannis Assael, Brendan Shillingford, Shimon Whiteson, and Nando de Freitas, researchers at the University of Oxford. The technique, outlined in a paper published in November 2016, decodes text from the movement of a speaker's mouth. Traditional visual speech recognition approaches separated the problem into two stages: designing or learning visual features, and prediction. LipNet was the first end-to-end sentence-level lipreading model to learn spatiotemporal visual features and a sequence model simultaneously. Audio-visual speech recognition has enormous practical potential, with applications in improved hearing aids, medical settings such as improving the recovery and wellbeing of critically ill patients, and speech recognition in noisy environments, such as Nvidia's autonomous vehicles. (en)
rdfs:label
  • LipNet (en)
owl:sameAs
prov:wasDerivedFrom
foaf:isPrimaryTopicOf
is dbo:wikiPageWikiLink of
is foaf:primaryTopic of
This content was extracted from Wikipedia and is licensed under the Creative Commons Attribution-ShareAlike 3.0 Unported License