This HTML5 document contains 65 embedded RDF statements represented using HTML+Microdata notation.

The embedded RDF content will be recognized by any processor of HTML5 Microdata.

Namespace Prefixes

Prefix        IRI
dcterms       http://purl.org/dc/terms/
dbo           http://dbpedia.org/ontology/
n14           http://research.microsoft.com/~joshuago/
foaf          http://xmlns.com/foaf/0.1/
n9            http://www.clsp.jhu.edu/people/jelinek/
n23           https://global.dbpedia.org/id/
n8            http://www-cs-students.stanford.edu/~magerman/
n16           http://www.cs.brown.edu/people/ec/
n5            http://www.it.usyd.edu.au/about/people/staff/
rdfs          http://www.w3.org/2000/01/rdf-schema#
freebase      http://rdf.freebase.com/ns/
n21           https://www.cs.columbia.edu/~mcollins/
dbpedia-fa    http://fa.dbpedia.org/resource/
rdf           http://www.w3.org/1999/02/22-rdf-syntax-ns#
owl           http://www.w3.org/2002/07/owl#
wikipedia-en  http://en.wikipedia.org/wiki/
dbpedia-fr    http://fr.dbpedia.org/resource/
dbc           http://dbpedia.org/resource/Category:
prov          http://www.w3.org/ns/prov#
dbp           http://dbpedia.org/property/
xsdh          http://www.w3.org/2001/XMLSchema#
wikidata      http://www.wikidata.org/entity/
gold          http://purl.org/linguistics/gold/
dbr           http://dbpedia.org/resource/

Statements

Subject Item
dbr:Parsing
dbo:wikiPageWikiLink
dbr:Statistical_parsing
Subject Item
dbr:Statistical_parsing
rdf:type
dbo:Band
rdfs:label
Statistical parsing
Analyse syntaxique de la langue naturelle
rdfs:comment
Statistical parsing is a group of parsing methods within natural language processing. The methods have in common that they associate grammar rules with a probability. Grammar rules are traditionally viewed in computational linguistics as defining the valid sentences in a language. Within this mindset, the idea of associating each rule with a probability then provides the relative frequency of any given grammar rule and, by deduction, the probability of a complete parse for a sentence. (The probability associated with a grammar rule may be induced, but the application of that grammar rule within a parse tree and the computation of the probability of the parse tree based on its component rules is a form of deduction.) Using this concept, statistical parsers make use of a procedure to search over a space of all candidate parses, and the computation of each candidate's probability, to derive the most probable parse of a sentence.
In computational linguistics and natural language processing (NLP), syntactic parsing refers to the automated process of analyzing a string of words, representing a sentence, in order to obtain the relations that hold between those words, in the form of a syntax tree. When starting from raw text, the text must first be segmented into lexical units (tokenization). Usually, a lexical analysis (lemmatization, morphosyntactic tagging, and so on) is carried out before syntactic parsing proper, in order to identify the lexical units and their properties. The result of the analysis is typically used as the basis for semantic analysis, which builds a representation of the meaning of the text, or directly in applications such as grammar checking.
dcterms:subject
dbc:Statistical_natural_language_processing dbc:Natural_language_parsing
dbo:wikiPageID
7024544
dbo:wikiPageRevisionID
1050636548
dbo:wikiPageWikiLink
dbr:Natural_language_processing dbr:Fred_Jelinek dbr:Artificial_intelligence dbr:Search_algorithm dbr:Stack_search dbc:Statistical_natural_language_processing dbr:Parsing dbr:Michael_Collins_(computational_linguist) dbr:Eugene_Charniak dbr:Statistical_semantics dbr:Stochastic_context-free_grammar dbr:Grammar dbr:Viterbi_algorithm dbr:Baum-Welch_algorithm dbr:Joshua_Goodman dbr:James_Curran_(linguist) dbr:Statistical_machine_translation dbr:Colorless_green_ideas_sleep_furiously dbr:Entropy_maximization dbc:Natural_language_parsing dbr:Chart_parsing dbr:Can_can dbr:David_Magerman dbr:Computational_linguistics
dbo:wikiPageExternalLink
n5:james.shtml n8: n9: n14: n16:home.html n21:
owl:sameAs
dbpedia-fr:Analyse_syntaxique_de_la_langue_naturelle dbpedia-fa:تجزیه‌کننده_آماری freebase:m.0h0v_v wikidata:Q7604407 n23:4vyLD
dbo:abstract
In computational linguistics and natural language processing (NLP), syntactic parsing refers to the automated process of analyzing a string of words, representing a sentence, in order to obtain the relations that hold between those words, in the form of a syntax tree. When starting from raw text, the text must first be segmented into lexical units (tokenization). Usually, a lexical analysis (lemmatization, morphosyntactic tagging, and so on) is carried out before syntactic parsing proper, in order to identify the lexical units and their properties. The result of the analysis is typically used as the basis for semantic analysis, which builds a representation of the meaning of the text, or directly in applications such as grammar checking. For a question-answering or information-retrieval system, for example, it would be difficult to answer the query "which works were written by French-speaking authors before 1900?" correctly without recognizing the subject "works", since the system must understand that the user wants a list of works, not a list of authors. The parsing process may rely on a formal grammar and/or make use of statistical methods.
Statistical parsing is a group of parsing methods within natural language processing. The methods have in common that they associate grammar rules with a probability. Grammar rules are traditionally viewed in computational linguistics as defining the valid sentences in a language. Within this mindset, the idea of associating each rule with a probability then provides the relative frequency of any given grammar rule and, by deduction, the probability of a complete parse for a sentence.
(The probability associated with a grammar rule may be induced, but the application of that grammar rule within a parse tree and the computation of the probability of the parse tree based on its component rules is a form of deduction.) Using this concept, statistical parsers make use of a procedure to search over a space of all candidate parses, and the computation of each candidate's probability, to derive the most probable parse of a sentence. The Viterbi algorithm is one popular method of searching for the most probable parse. "Search" in this context is an application of search algorithms in artificial intelligence. As an example, think about the sentence "The can can hold water". A reader would instantly see that there is an object called "the can" and that this object is performing the action 'can' (i.e. is able to); and the thing the object is able to do is "hold"; and the thing the object is able to hold is "water". Using more linguistic terminology, "The can" is a noun phrase composed of a determiner followed by a noun, and "can hold water" is a verb phrase which is itself composed of a verb followed by a verb phrase. But is this the only interpretation of the sentence? Certainly "The can can" is a perfectly valid noun-phrase referring to a type of dance, and "hold water" is also a valid verb-phrase, although the coerced meaning of the combined sentence is non-obvious. This lack of meaning is not seen as a problem by most linguists (for a discussion on this point, see Colorless green ideas sleep furiously) but from a pragmatic point of view it is desirable to obtain the first interpretation rather than the second and statistical parsers achieve this by ranking the interpretations based on their probability. 
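The ranking described above, where each grammar rule carries a probability and a parse scores as the product of the probabilities of the rules it uses, can be sketched with a toy grammar over "The can can hold water". All rules and numbers below are invented for illustration, not drawn from any real treebank:

```python
# A parse tree's probability in a PCFG is the product of the probabilities
# of the grammar rules used to build it. All numbers below are invented
# for illustration; a real parser would estimate them from a treebank.

RULE_PROB = {
    ("S", ("NP", "VP")): 1.0,
    ("NP", ("Det", "N")): 0.5,          # "the can"
    ("NP", ("Det", "N", "N")): 0.1,     # "the can can" (the dance)
    ("NP", ("N",)): 0.4,                # bare noun, e.g. "water"
    ("VP", ("Md", "VP")): 0.4,          # modal "can" + verb phrase
    ("VP", ("V", "NP")): 0.6,
    ("Det", ("the",)): 1.0,
    ("N", ("can",)): 0.3,
    ("N", ("water",)): 0.7,
    ("Md", ("can",)): 1.0,
    ("V", ("hold",)): 1.0,
}

def tree_prob(tree):
    """Multiply the probabilities of every rule application in the tree."""
    label, children = tree
    if isinstance(children, str):       # preterminal -> word
        return RULE_PROB[(label, (children,))]
    p = RULE_PROB[(label, tuple(child[0] for child in children))]
    for child in children:
        p *= tree_prob(child)
    return p

# Reading 1: [the can] [can [hold water]] -- the container is able to hold water
literal = ("S", [
    ("NP", [("Det", "the"), ("N", "can")]),
    ("VP", [("Md", "can"),
            ("VP", [("V", "hold"),
                    ("NP", [("N", "water")])])]),
])
# Reading 2: [the can can] [hold water] -- the dance holds water
dance = ("S", [
    ("NP", [("Det", "the"), ("N", "can"), ("N", "can")]),
    ("VP", [("V", "hold"),
            ("NP", [("N", "water")])]),
])

# Rank the interpretations by probability, as a statistical parser would.
best = max([literal, dance], key=tree_prob)
```

With these made-up numbers the literal reading scores 0.01008 against 0.001512 for the dance reading, so ranking by probability returns the intended interpretation.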
(In this example various assumptions about the grammar have been made, such as a simple left-to-right derivation rather than head-driven, its use of noun-phrases rather than the currently fashionable determiner-phrases, and no type-check preventing a concrete noun being combined with an abstract verb phrase. None of these assumptions affect the thesis of the argument and a comparable argument can be made using any other grammatical formalism.) There are a number of methods that statistical parsing algorithms frequently use. While few algorithms will use all of these they give a good overview of the general field. Most statistical parsing algorithms are based on a modified form of chart parsing. The modifications are necessary to support an extremely large number of grammatical rules and therefore search space, and essentially involve applying classical artificial intelligence algorithms to the traditionally exhaustive search. Some examples of the optimisations are only searching a likely subset of the search space (stack search), for optimising the search probability (Baum-Welch algorithm) and for discarding parses that are too similar to be treated separately (Viterbi algorithm).
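The modified chart parsing described above can be sketched as a Viterbi-style CKY parser: the chart keeps, for each span and nonterminal, only the single most probable derivation, which is how the Viterbi optimisation discards parses that need not be treated separately. This is a minimal sketch over a hypothetical grammar in Chomsky normal form, with invented rules and probabilities:

```python
# Viterbi-style CKY chart parsing over a toy PCFG in Chomsky normal form.
# All rules and probabilities here are invented for illustration.
from collections import defaultdict

# Binary rules: (parent, left_child, right_child) -> probability
BINARY = {
    ("S", "NP", "VP"): 1.0,
    ("NP", "Det", "N"): 0.9,
    ("VP", "Md", "VP"): 0.4,   # modal "can" followed by a verb phrase
    ("VP", "V", "NP"): 0.6,
}
# Lexical rules: (category, word) -> probability
LEX = {
    ("Det", "the"): 1.0,
    ("N", "can"): 0.3,
    ("Md", "can"): 0.5,
    ("V", "hold"): 1.0,
    ("NP", "water"): 0.2,      # bare-noun NP, kept lexical to stay in CNF
}

def viterbi_cky(words):
    """Return (probability, backpointer) of the best S over the sentence."""
    n = len(words)
    # chart[i][j][cat] = (best probability, backpointer) for cat over words[i:j]
    chart = [[defaultdict(lambda: (0.0, None)) for _ in range(n + 1)]
             for _ in range(n + 1)]
    for i, w in enumerate(words):                      # fill width-1 spans
        for (cat, word), p in LEX.items():
            if word == w and p > chart[i][i + 1][cat][0]:
                chart[i][i + 1][cat] = (p, w)
    for span in range(2, n + 1):                       # then longer spans
        for i in range(n - span + 1):
            j = i + span
            for k in range(i + 1, j):                  # split point
                for (parent, left, right), rule_p in BINARY.items():
                    left_p, _ = chart[i][k][left]
                    right_p, _ = chart[k][j][right]
                    p = rule_p * left_p * right_p
                    if p > chart[i][j][parent][0]:
                        # Viterbi step: keep only the best derivation per cell
                        chart[i][j][parent] = (p, (k, left, right))
    return chart[0][n]["S"]

prob, back = viterbi_cky("the can can hold water".split())
# back records the winning split: "the can" | "can hold water"
```

The exhaustive variant would keep every derivation in each cell; storing only the maximum per (span, category) is what keeps the search tractable as the rule set grows.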
gold:hypernym
dbr:Group
prov:wasDerivedFrom
wikipedia-en:Statistical_parsing?oldid=1050636548&ns=0
dbo:wikiPageLength
4707
foaf:isPrimaryTopicOf
wikipedia-en:Statistical_parsing
Subject Item
dbr:Deep_learning
dbo:wikiPageWikiLink
dbr:Statistical_parsing
Subject Item
dbr:Viterbi_algorithm
dbo:wikiPageWikiLink
dbr:Statistical_parsing
Subject Item
dbr:Probabilistic_context-free_grammar
dbo:wikiPageWikiLink
dbr:Statistical_parsing
Subject Item
dbr:Co-training
dbo:wikiPageWikiLink
dbr:Statistical_parsing
Subject Item
dbr:Michael_Collins_(computational_linguist)
dbo:wikiPageWikiLink
dbr:Statistical_parsing
dbp:knownFor
dbr:Statistical_parsing
dbo:knownFor
dbr:Statistical_parsing
Subject Item
dbr:Statistical_machine_translation
dbo:wikiPageWikiLink
dbr:Statistical_parsing
Subject Item
dbr:List_of_statistics_articles
dbo:wikiPageWikiLink
dbr:Statistical_parsing
Subject Item
dbr:Stochastic_grammar
dbo:wikiPageWikiLink
dbr:Statistical_parsing
Subject Item
dbr:Outline_of_machine_learning
dbo:wikiPageWikiLink
dbr:Statistical_parsing
Subject Item
dbr:Stochastic_parsing
dbo:wikiPageWikiLink
dbr:Statistical_parsing
dbo:wikiPageRedirects
dbr:Statistical_parsing
Subject Item
wikipedia-en:Statistical_parsing
foaf:primaryTopic
dbr:Statistical_parsing