About: Nearest-neighbor chain algorithm

Facets (new session)
Description
Metadata
Settings
- Rule:
- Inverse Functional Properties:
- "Same As":

About: Nearest-neighbor chain algorithm Goto Sponge NotDistinct Permalink

An Entity of Type : yago:WikicatDataClusteringAlgorithms, within Data Space : dbpedia.org:8891 associated with source document(s)
QRcode icon

http://dbpedia.org:8891/describe/?url=http%3A%2F%2Fdbpedia.org%2Fresource%2FNearest-neighbor_chain_algorithm

In the theory of cluster analysis, the nearest-neighbor chain algorithm is an algorithm that can speed up several methods for agglomerative hierarchical clustering. These are methods that take a collection of points as input, and create a hierarchy of clusters of points by repeatedly merging pairs of smaller clusters to form larger clusters. The clustering methods that the nearest-neighbor chain algorithm can be used for include Ward's method, complete-linkage clustering, and single-linkage clustering; these all work by repeatedly merging the closest two clusters but use different definitions of the distance between clusters. The cluster distances for which the nearest-neighbor chain algorithm works are called reducible and are characterized by a simple inequality among certain cluster dis

Attributes	Values
rdf:type	software yago:Abstraction100002137 yago:Cognition100023271 yago:Datum105816622 yago:Information105816287 yago:PsychologicalFeature100023100 yago:WikicatDataClusteringAlgorithms
rdfs:label	Nearest-neighbor chain algorithm (en)
rdfs:comment	In the theory of cluster analysis, the nearest-neighbor chain algorithm is an algorithm that can speed up several methods for agglomerative hierarchical clustering. These are methods that take a collection of points as input, and create a hierarchy of clusters of points by repeatedly merging pairs of smaller clusters to form larger clusters. The clustering methods that the nearest-neighbor chain algorithm can be used for include Ward's method, complete-linkage clustering, and single-linkage clustering; these all work by repeatedly merging the closest two clusters but use different definitions of the distance between clusters. The cluster distances for which the nearest-neighbor chain algorithm works are called reducible and are characterized by a simple inequality among certain cluster dis (en)
foaf:depiction
dcterms:subject	Cluster analysis algorithms
Wikipage page ID	33068704 (xsd:integer)
Wikipage revision ID	1088104637 (xsd:integer)
Link from a Wikipage to another Wikipage	Prim's algorithm Binary tree Algorithm Nearest neighbor graph Outlier Quadtree Stack (abstract data type) Cluster analysis Complete-linkage clustering Path (graph theory) Priority queue Centroid Triangle inequality Data analysis Data structure Euclidean space Cardinality Taxonomic rank Jean-Paul Benzécri Taxonomy (biology) Cluster analysis algorithms Hierarchical clustering Single-linkage clustering Stack (data structure) Distance matrix Phylogenetic tree Greedy algorithm Metric space Minimum spanning tree Maximal element Ward's method Agglomerative hierarchical clustering Disjoint set Closest pair Sequential search dbr:En:k-means_clustering
sameAs	Nearest-neighbor chain algorithm Nearest-neighbor chain algorithm Nearest-neighbor chain algorithm Nearest-neighbor chain algorithm
dbp:wikiPageUsesTemplate	dbt:Good_article dbt:Harvtxt dbt:Math dbt:Mvar dbt:Reflist dbt:Short_description
thumbnail	wiki-commons:Special:FilePath/Hierarchical_clustering_diagram.png?width=300
has abstract	In the theory of cluster analysis, the nearest-neighbor chain algorithm is an algorithm that can speed up several methods for agglomerative hierarchical clustering. These are methods that take a collection of points as input, and create a hierarchy of clusters of points by repeatedly merging pairs of smaller clusters to form larger clusters. The clustering methods that the nearest-neighbor chain algorithm can be used for include Ward's method, complete-linkage clustering, and single-linkage clustering; these all work by repeatedly merging the closest two clusters but use different definitions of the distance between clusters. The cluster distances for which the nearest-neighbor chain algorithm works are called reducible and are characterized by a simple inequality among certain cluster distances. The main idea of the algorithm is to find pairs of clusters to merge by following paths in the nearest neighbor graph of the clusters. Every such path will eventually terminate at a pair of clusters that are nearest neighbors of each other, and the algorithm chooses that pair of clusters as the pair to merge. In order to save work by re-using as much as possible of each path, the algorithm uses a stack data structure to keep track of each path that it follows. By following paths in this way, the nearest-neighbor chain algorithm merges its clusters in a different order than methods that always find and merge the closest pair of clusters. However, despite that difference, it always generates the same hierarchy of clusters. The nearest-neighbor chain algorithm constructs a clustering in time proportional to the square of the number of points to be clustered. This is also proportional to the size of its input, when the input is provided in the form of an explicit distance matrix. The algorithm uses an amount of memory proportional to the number of points, when it is used for clustering methods such as Ward's method that allow constant-time calculation of the distance between clusters. However, for some other clustering methods it uses a larger amount of memory in an auxiliary data structure with which it keeps track of the distances between pairs of clusters. (en)
gold:hypernym	Method
prov:wasDerivedFrom	wikipedia-en:Nearest-neighbor_chain_algorithm?oldid=1088104637&ns=0
page length (characters) of wiki page	27316 (xsd:nonNegativeInteger)
foaf:isPrimaryTopicOf	wikipedia-en:Nearest-neighbor_chain_algorithm
is Link from a Wikipage to another Wikipage of	Nearest neighbor graph Batch effect Stack (abstract data type) Jean-Paul Benzécri Hierarchical clustering List of statistics articles Ward's method Outline of machine learning
is foaf:primaryTopic of	wikipedia-en:Nearest-neighbor_chain_algorithm

Faceted Search & Find service v1.17_git139 as of Feb 29 2024

Alternative Linked Data Documents: ODE Content Formats:

RDF

ODATA

Microdata

About

OpenLink Virtuoso version 08.03.3331 as of Sep 2 2024, on Linux (x86_64-generic-linux-glibc212), Single-Server Edition (62 GB total memory, 40 GB memory in use)
Data on this page belongs to its respective rights holders.
Virtuoso Faceted Browser Copyright © 2009-2024 OpenLink Software