| dbp:mathStatement
|
- Let $X$ be a compact subset of $\mathbb{R}^n$. Let $\sigma : \mathbb{R} \to \mathbb{R}$ be any non-affine continuous function which is continuously differentiable at at least one point, with nonzero derivative at that point. Let $\mathcal{N}_{n,m:n+m+2}^{\sigma}$ denote the space of feed-forward neural networks with $n$ input neurons, $m$ output neurons, and an arbitrary number of hidden layers each with $n + m + 2$ neurons, such that every hidden neuron has activation function $\sigma$ and every output neuron has the identity as its activation function, with input layer $\phi$ and output layer $\rho$. Then given any $\varepsilon > 0$ and any $f \in C(X, \mathbb{R}^m)$, there exists $\hat{f} \in \mathcal{N}_{n,m:n+m+2}^{\sigma}$ such that
$$\sup_{x \in X} \left\| \hat{f}(x) - f(x) \right\| < \varepsilon.$$
In other words, $\mathcal{N}_{n,m:n+m+2}^{\sigma}$ is dense in $C(X; \mathbb{R}^m)$ with respect to the topology of uniform convergence.
Quantitative refinement: The number of layers and the width of each layer required to approximate $f$ to precision $\varepsilon$ are known; moreover, the result holds true when $X$ and $\mathbb{R}^m$ are replaced with any non-positively curved Riemannian manifold. (en)
- There exists an activation function $\sigma$ which is analytic, strictly increasing and sigmoidal and has the following property: for any $f \in C[0,1]^d$ and $\varepsilon > 0$ there exist constants $d_i$, $c_{ij}$, $\theta_{ij}$, $\gamma_i$, and vectors $\mathbf{w}^{ij} \in \mathbb{R}^d$ for which
$$\left| f(\mathbf{x}) - \sum_{i=1}^{6d+3} d_i \, \sigma\!\left( \sum_{j=1}^{3d} c_{ij} \, \sigma(\mathbf{w}^{ij} \cdot \mathbf{x} - \theta_{ij}) - \gamma_i \right) \right| < \varepsilon$$
for all $\mathbf{x} = (x_1, \dots, x_d) \in [0,1]^d$. (en)
- Let $[a,b]$ be a finite segment of the real line, $s = b - a$, and $\lambda$ be any positive number. Then one can algorithmically construct a computable sigmoidal activation function $\sigma : \mathbb{R} \to \mathbb{R}$, which is infinitely differentiable, strictly increasing on $(-\infty, s)$, $\lambda$-strictly increasing on $[s, +\infty)$, and satisfies the following properties:
# For any $f \in C[a,b]$ and $\varepsilon > 0$ there exist numbers $c_1$, $c_2$, $\theta_1$ and $\theta_2$ such that for all $x \in [a,b]$
$$|f(x) - c_1 \sigma(x - \theta_1) - c_2 \sigma(x - \theta_2)| < \varepsilon.$$
# For any continuous function $F$ on the $d$-dimensional box $[a,b]^d$ and $\varepsilon > 0$, there exist constants $e_p$, $c_{pq}$, $\theta_{pq}$ and $\zeta_p$ such that the inequality
$$\left| F(\mathbf{x}) - \sum_{p=1}^{2d+2} e_p \, \sigma\!\left( \sum_{q=1}^{d} c_{pq} \, \sigma(\mathbf{w}^{q} \cdot \mathbf{x} - \theta_{pq}) - \zeta_p \right) \right| < \varepsilon$$
holds for all $\mathbf{x} = (x_1, \dots, x_d) \in [a,b]^d$. Here the weights $\mathbf{w}^{q}$, $q = 1, \dots, d$, are fixed as follows:
$$\mathbf{w}^{1} = (1, 0, \dots, 0), \quad \mathbf{w}^{2} = (0, 1, \dots, 0), \quad \dots, \quad \mathbf{w}^{d} = (0, 0, \dots, 1).$$
In addition, all the coefficients $e_p$, except one, are equal. (en)
- For any Bochner–Lebesgue $p$-integrable function $f : \mathbb{R}^n \to \mathbb{R}^m$ and any $\varepsilon > 0$, there exists a fully connected ReLU network $\mathcal{F}$ of width exactly $w = \max\{n+1, m\}$, satisfying
$$\int_{\mathbb{R}^n} \left\| f(x) - \mathcal{F}(x) \right\|^p \, \mathrm{d}x < \varepsilon^p.$$
Moreover, there exist a function $f \in L^p(\mathbb{R}^n, \mathbb{R}^m)$ and some $\varepsilon > 0$ for which there is no fully connected ReLU network of width less than $w = \max\{n+1, m\}$ satisfying the above approximation bound.
Remark: If the activation is replaced by leaky-ReLU, and the input is restricted to a compact domain, then the exact minimum width is $w_{\min} = \max\{n, m, 2\}$.
Quantitative refinement: In the case where $f : [0,1]^n \to \mathbb{R}$ and $\sigma$ is the ReLU activation function, the exact depth and width for a ReLU network to achieve error $\varepsilon$ are also known. If, moreover, the target function $f$ is smooth, then the required number of layers and their width can be exponentially smaller. Even if $f$ is not smooth, the curse of dimensionality can be broken if $f$ admits additional "compositional structure". (A toy numerical sketch of a narrow, deep ReLU network is given after this list.) (en)
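The following is a minimal numpy sketch of a narrow, deep ReLU network of the kind discussed above. It is not the construction behind the minimum-width theorem: for simplicity it uses width 3 rather than the minimal width $\max\{n+1, m\} = 2$ (here $n = m = 1$), it measures error in the sup norm on $[0,1]$ rather than in $L^p$, and the target function, number of knots and offset constant are arbitrary choices made for this illustration.

```python
import numpy as np

def relu(z):
    return np.maximum(z, 0.0)

def target(x):
    return np.sin(2 * np.pi * x)          # toy target on [0, 1] (example choice)

# Knots of a piecewise-linear interpolant g of the target on [0, 1].
K = 20
knots = np.linspace(0.0, 1.0, K + 1)
vals = target(knots)
slopes = np.diff(vals) / np.diff(knots)
# g(x) = vals[0] + sum_j c[j] * relu(x - t[j]), with slope increments c[j].
c = np.concatenate(([slopes[0]], np.diff(slopes)))
t = knots[:-1]

M = 10.0                                   # offset keeping the accumulator nonnegative

def deep_narrow_relu(x):
    """Width-3 deep ReLU network computing g(x) for x in [0, 1].

    Hidden state per layer: (x, M + partial sum, relu(x - t[k])).
    """
    # First hidden layer: carry x, initialise the offset accumulator, first hinge.
    a = relu(np.stack([x,
                       np.full_like(x, M + vals[0]),
                       x - t[0]]))
    # One extra hidden layer per remaining knot: fold the previous hinge
    # into the accumulator, then create the next hinge.
    for k in range(1, len(t)):
        z = np.stack([a[0],
                      a[1] + c[k - 1] * a[2],
                      a[0] - t[k]])
        a = relu(z)
    # Output layer (identity activation): fold the last hinge, remove the offset.
    return a[1] + c[-1] * a[2] - M

xs = np.linspace(0.0, 1.0, 1001)
err = np.max(np.abs(deep_narrow_relu(xs) - target(xs)))
print(f"sup-norm error of the width-3 ReLU network with {len(t)} hidden layers: {err:.4f}")
```

Each hidden layer carries the input through one channel, accumulates the partial piecewise-linear sum in a second channel (offset by a constant so that ReLU acts on it as the identity), and produces the next ReLU hinge in the third; the output layer removes the offset, so the network reproduces the piecewise-linear interpolant exactly, and the error shrinks as the number of knots, and hence the depth, grows.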
|
| dbp:proof
|
- It suffices to prove the case where $m = 1$, since uniform convergence in $\mathbb{R}^m$ is just uniform convergence in each coordinate.
Let $\mathcal{F}_\sigma$ be the set of all one-hidden-layer neural networks constructed with activation function $\sigma$. Let $C_0(\mathbb{R}^n, \mathbb{R})$ be the set of all continuous functions $\mathbb{R}^n \to \mathbb{R}$ with compact support.
If $\sigma$ is a polynomial of degree $d$, then $\mathcal{F}_\sigma$ is contained in the closed subspace of all polynomials of degree $d$, so its closure is also contained in it, which is not all of $C_0(\mathbb{R}^n, \mathbb{R})$.
Otherwise, we show that the closure of $\mathcal{F}_\sigma$ is all of $C_0(\mathbb{R}^n, \mathbb{R})$. Suppose we can construct arbitrarily good approximations of the ramp function
$$\operatorname{ramp}(x) = \begin{cases} -1 & x \le -1, \\ x & |x| \le 1, \\ 1 & x \ge 1; \end{cases}$$
then these can be combined to construct an arbitrary compactly supported continuous function to arbitrary precision. It therefore remains to approximate the ramp function.
Any of the activation functions commonly used in machine learning can obviously be used to approximate the ramp function, either directly or by first approximating the ReLU and then the ramp function.
If $\sigma$ is "squashing", that is, it has finite limits $\sigma(-\infty) < \sigma(+\infty)$, then one can first affinely scale down its x-axis so that its graph looks like a step function with two sharp "overshoots", then take a linear sum of enough of them to form a "staircase" approximation of the ramp function. With more steps of the staircase, the overshoots smooth out and we obtain an arbitrarily good approximation of the ramp function (a numerical sketch of this staircase construction is given below).
The case where $\sigma$ is a generic non-polynomial function is harder, and the reader is directed to the cited reference. (en)
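To make the staircase construction in the squashing case concrete, here is a minimal numpy sketch, assuming the logistic sigmoid as the squashing activation; the number of steps `K`, the x-axis scaling factor `N`, and all names are choices made for this illustration rather than part of the original argument.

```python
import numpy as np

def sigmoid(z):
    # A "squashing" activation: limits 0 at -infinity and 1 at +infinity.
    return 1.0 / (1.0 + np.exp(-z))

def ramp(x):
    # The ramp function from the proof sketch: -1 below -1, x on [-1, 1], 1 above 1.
    return np.clip(x, -1.0, 1.0)

def staircase_ramp(x, K=50, N=500.0):
    """One-hidden-layer 'staircase' approximation of the ramp function.

    Each of the K sigmoid neurons is squeezed along the x-axis by the factor N,
    so it acts like a step of height 2/K placed at the midpoint s_k of one of
    the K subintervals of [-1, 1]; summing the steps gives a staircase that
    converges uniformly to the ramp as K and N grow.
    """
    s = -1.0 + (2.0 * np.arange(1, K + 1) - 1.0) / K      # step locations
    steps = sigmoid(N * (x[:, None] - s[None, :]))        # shape (len(x), K)
    return -1.0 + (2.0 / K) * steps.sum(axis=1)

xs = np.linspace(-2.0, 2.0, 4001)
for K in (10, 50, 250):
    err = np.max(np.abs(staircase_ramp(xs, K=K, N=10.0 * K) - ramp(xs)))
    print(f"K = {K:4d} sigmoid neurons: sup-norm error = {err:.4f}")
```

The printed sup-norm error shrinks roughly like $1/K$, matching the informal claim that taking more (and sharper) steps yields an arbitrarily good approximation of the ramp function.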
|