Spectral clustering Python code - Linux大棚



I don't have much experience with spectral clustering and simply followed the documentation (skip to the end for the results!):

Code:

import numpy as np
import networkx as nx
from sklearn.cluster import SpectralClustering
from sklearn import metrics

np.random.seed(1)

# Get your mentioned graph
G = nx.karate_club_graph()

# Get ground truth: club labels -> transform to a 0/1 np-array
# (possibly overcomplicated networkx usage here)
gt_dict = nx.get_node_attributes(G, 'club')
gt = [gt_dict[i] for i in G.nodes()]
gt = np.array([0 if i == 'Mr. Hi' else 1 for i in gt])

# Get the adjacency matrix as a numpy array
# (nx.to_numpy_matrix was removed in NetworkX 3.0; nx.to_numpy_array replaces it)
adj_mat = nx.to_numpy_array(G)

print('ground truth')
print(gt)

# Cluster
sc = SpectralClustering(2, affinity='precomputed', n_init=100)
sc.fit(adj_mat)

# Compare ground truth and clustering results
print('spectral clustering')
print(sc.labels_)
print('just for better-visualization: invert clusters (permutation)')
print(np.abs(sc.labels_ - 1))

# Calculate some clustering metrics
print(metrics.adjusted_rand_score(gt, sc.labels_))
print(metrics.adjusted_mutual_info_score(gt, sc.labels_))

Output:

ground truth
[0 0 0 0 0 0 0 0 0 1 0 0 0 0 1 1 0 0 1 0 1 0 1 1 1 1 1 1 1 1 1 1 1 1]
spectral clustering
[1 1 0 1 1 1 1 0 0 0 1 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0]
just for better-visualization: invert clusters (permutation)
[0 0 1 0 0 0 0 1 1 1 0 1 1 1 1 1 0 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1]
0.204094758281
0.271689477828
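Cluster IDs carry no meaning of their own, which is why the script also prints the inverted labels. The following minimal sketch reuses the two label arrays from this first run and checks that the adjusted Rand index does not change when the IDs are swapped, whereas a naive element-wise comparison does. The helper name `best_agreement` is made up for this illustration and only handles the two-cluster case.

```python
import numpy as np
from sklearn import metrics

# Label arrays copied from the printed output of the first run
gt = np.array("0 0 0 0 0 0 0 0 0 1 0 0 0 0 1 1 0 0 1 0 1 0 1 1 1 1 1 1 1 1 1 1 1 1".split(), dtype=int)
pred = np.array("1 1 0 1 1 1 1 0 0 0 1 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0".split(), dtype=int)

def best_agreement(truth, labels):
    """Hypothetical helper: fraction of matching entries under the better
    of the two possible ID assignments (two clusters only)."""
    direct = np.mean(truth == labels)
    swapped = np.mean(truth == 1 - labels)
    return max(direct, swapped)

ari = metrics.adjusted_rand_score(gt, pred)
ari_swapped = metrics.adjusted_rand_score(gt, 1 - pred)

print(ari, ari_swapped)          # both about 0.204: swapping IDs does not matter
print(best_agreement(gt, pred))  # 25 of 34 members agree after the swap
```

Because the score ignores how the clusters are numbered, the inverted array is only a reading aid and plays no part in the metrics.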

The general idea:

An introduction to the data and the task, from here:

The nodes in the graph represent the 34 members in a college Karate club. (Zachary is a sociologist, and he was one of the members.) An edge between two nodes indicates that the two members spent significant time together outside normal club meetings. The dataset is interesting because while Zachary was collecting his data, there was a dispute in the Karate club, and it split into two factions: one led by “Mr. Hi”, and one led by “John A”. It turns out that using only the connectivity information (the edges), it is possible to recover the two factions.

Using sklearn and spectral clustering to tackle this problem:

If affinity is the adjacency matrix of a graph, this method can be used to find normalized graph cuts.

This describes the normalized graph cut as:

Find two disjoint partitions A and B of the vertices V of a graph, so that A ∪ B = V and A ∩ B = ∅

Given a similarity measure w(i,j) between two vertices (e.g. identity when they are connected) a cut value (and its normalized version) is defined as:

cut(A, B) = SUM u in A, v in B: w(u, v)

...

we seek the minimization of disassociation between the groups A and B and the maximization of the association within each group
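The quoted definition of cut(A, B) can be checked numerically. The sketch below uses a small hand-made graph (two triangles joined by one edge), not the karate club data. The normalized version is elided in the quote; here it is assumed to be the usual Shi-Malik form, Ncut(A, B) = cut(A, B)/assoc(A, V) + cut(A, B)/assoc(B, V), where assoc(A, V) sums the weights from A to all vertices. The function names are illustrative.

```python
import numpy as np

# Toy graph: triangle {0, 1, 2} and triangle {3, 4, 5}, joined by the edge 2-3
edges = [(0, 1), (0, 2), (1, 2), (3, 4), (3, 5), (4, 5), (2, 3)]
W = np.zeros((6, 6))
for u, v in edges:
    W[u, v] = W[v, u] = 1.0  # w(u, v) = 1 when u and v are connected

def cut(W, A, B):
    """Sum of w(u, v) over u in A and v in B."""
    return W[np.ix_(A, B)].sum()

def ncut(W, A, B):
    """Normalized cut, assuming the Shi-Malik definition."""
    everything = list(range(W.shape[0]))
    c = cut(W, A, B)
    return c / cut(W, A, everything) + c / cut(W, B, everything)

balanced = ncut(W, [0, 1, 2], [3, 4, 5])    # cuts only the edge 2-3
lopsided = ncut(W, [0], [1, 2, 3, 4, 5])    # isolates a single vertex

print(cut(W, [0, 1, 2], [3, 4, 5]))  # 1.0
print(balanced)                      # 2/7, about 0.286
print(lopsided)                      # 1 + 1/6, about 1.167
```

The normalization is what penalizes the lopsided split: cutting off one vertex severs only two edges, yet its normalized cut is far larger than that of the balanced split.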

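Sounds fine. So we create the adjacency matrix (nx.to_numpy_matrix(G) in the original answer; nx.to_numpy_array(G) in NetworkX 3.0 and later, where to_numpy_matrix was removed) and set the parameter affinity to precomputed, because the adjacency matrix is our precomputed similarity measure.

Alternatively, using precomputed, a user-provided affinity matrix can be used.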
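To see roughly what happens to a precomputed affinity matrix, the two-cluster case can be sketched by hand: build the symmetric normalized Laplacian, take the eigenvector belonging to its second-smallest eigenvalue, and split the vertices by sign. This is a simplified illustration on a made-up toy graph, not scikit-learn's actual implementation, which embeds the points and then applies k-means or discretization. The name `spectral_bipartition` is hypothetical.

```python
import numpy as np

def spectral_bipartition(W):
    """Split a connected graph in two by the sign of the second eigenvector
    of the symmetric normalized Laplacian (simplified sketch)."""
    d = W.sum(axis=1)                        # vertex degrees
    d_inv_sqrt = 1.0 / np.sqrt(d)
    # L = I - D^{-1/2} W D^{-1/2}
    L = np.eye(len(W)) - d_inv_sqrt[:, None] * W * d_inv_sqrt[None, :]
    eigvals, eigvecs = np.linalg.eigh(L)     # eigenvalues in ascending order
    second = eigvecs[:, 1]                   # eigenvector of the 2nd-smallest eigenvalue
    return (second > 0).astype(int)

# Toy affinity matrix: two triangles joined by the single edge 2-3
edges = [(0, 1), (0, 2), (1, 2), (3, 4), (3, 5), (4, 5), (2, 3)]
W = np.zeros((6, 6))
for u, v in edges:
    W[u, v] = W[v, u] = 1.0

labels = spectral_bipartition(W)
print(labels)  # each triangle gets its own label; which one is 0 is arbitrary
```

The sign of an eigenvector is not fixed, so the two triangles may come out as 0/1 or 1/0. This is the same label ambiguity seen in the clustering output.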

Edit: Although I am unfamiliar with this, I looked for parameters to tune and found assign_labels:

The strategy to use to assign labels in the embedding space. There are two ways to assign labels after the laplacian embedding. k-means can be applied and is a popular choice. But it can also be sensitive to initialization. Discretization is another approach which is less sensitive to random initialization.

So, trying the less sensitive approach:

sc = SpectralClustering(2, affinity='precomputed', n_init=100, assign_labels='discretize')

Output:

ground truth
[0 0 0 0 0 0 0 0 0 1 0 0 0 0 1 1 0 0 1 0 1 0 1 1 1 1 1 1 1 1 1 1 1 1]
spectral clustering
[0 0 1 0 0 0 0 0 1 1 0 0 0 0 1 1 0 0 1 0 1 0 1 1 1 1 1 1 1 1 1 1 1 1]
just for better-visualization: invert clusters (permutation)
[1 1 0 1 1 1 1 1 0 0 1 1 1 1 0 0 1 1 0 1 0 1 0 0 0 0 0 0 0 0 0 0 0 0]
0.771725032425
0.722546051351

That is a very good fit to the ground truth!
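For anyone re-running the comparison, the two label-assignment strategies can be put side by side in one script. This is a sketch under several assumptions: `random_state=1` is added for repeatability, `nx.to_numpy_array` stands in for `nx.to_numpy_matrix` (removed in NetworkX 3.0), and the matrix is binarized because some NetworkX releases attach edge weights to the karate club graph. Scores depend on the library versions installed, so none are shown here.

```python
import networkx as nx
import numpy as np
from sklearn import metrics
from sklearn.cluster import SpectralClustering

G = nx.karate_club_graph()
club = nx.get_node_attributes(G, "club")
gt = np.array([0 if club[n] == "Mr. Hi" else 1 for n in G.nodes()])

# Binary adjacency: 1.0 where an edge exists, regardless of any edge weights
adj = (nx.to_numpy_array(G) > 0).astype(float)

def score(assign_labels):
    """Cluster with the given strategy and return (ARI, AMI) against gt."""
    sc = SpectralClustering(n_clusters=2, affinity="precomputed", n_init=100,
                            assign_labels=assign_labels, random_state=1)
    labels = sc.fit_predict(adj)
    return (metrics.adjusted_rand_score(gt, labels),
            metrics.adjusted_mutual_info_score(gt, labels))

results = {name: score(name) for name in ("kmeans", "discretize")}
for name, (ari, ami) in results.items():
    print(f"{name:>10}: ARI={ari:.3f}  AMI={ami:.3f}")
```

Fixing `random_state` makes repeated runs on one machine comparable; it does not guarantee the numbers reported in the original answer.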


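Copyright notice: This article, titled "谱聚类python代码" (Spectral clustering Python code), was contributed by a site user, and the views expressed are solely the author's. To reprint it, contact the author and cite the source: http://www.roclinux.cn/p/1697985434a280287.html. This site only provides information storage services, holds no ownership, and assumes no related legal liability. If any content on this site is suspected of plagiarism, infringement, or violation of laws and regulations, it will be removed immediately once verified.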
