Stochastic Blockmodel.ipynb - Colab

Uploaded by

akhil.s18

12/4/24, 8:08 AM Stochastic Blockmodel.ipynb - Colab

The Stochastic Blockmodel

import networkx as nx
import numpy as np
import matplotlib.pyplot as plt
import sympy

The simplest community: a clique

G_clique = nx.from_edgelist([(i,j) for i in range(10) for j in range(10) if i!=j])
nx.draw(G_clique, pos=nx.circular_layout(G_clique))

# The adjacency matrix is (almost) all ones

A_clique = nx.adjacency_matrix(G_clique).todense()
sympy.Matrix(A_clique)

# Visualize the adjacency matrix

plt.matshow(A_clique)
plt.colorbar()

https://colab.research.google.com/drive/1J5VPLqq75tEOfmWKGVpSm0fCEP5MP6LC
<matplotlib.colorbar.Colorbar at 0x167fb20bc20>

Two communities: the Caveman graph

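# Connect i and j only when both lie on the same side of 10 (nodes 0-9 or nodes 11-19).
# Note: for node 10 the product (i-10)*(j-10) is 0, so it gets no edges and is absent
# from the graph. The result has 19 nodes: one cave of 10 and one cave of 9.
G_caveman = nx.from_edgelist([(i,j) for i in range(20) for j in range(20) if i!=j and (i-10)*(j-10)>0])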
A_caveman = nx.adjacency_matrix(G_caveman).todense()
plt.matshow(A_caveman)

<matplotlib.image.AxesImage at 0x167fca4fec0>

Suppose you didn't know who lived in which cave

In other words, the nodes were in some random order

np.random.seed(42)
order = np.random.permutation(len(A_caveman))
A_caveman2 = A_caveman[order,:][:,order]
plt.matshow(A_caveman2)

<matplotlib.image.AxesImage at 0x167fcad7c80>
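The shuffle above only relabels nodes; it does not add or remove edges. The following is a minimal sketch of that point on a small stand-in matrix (two caves of 3 nodes, an assumption made to keep the example short). It also shows that `np.argsort` of a permutation gives the inverse permutation, which undoes the shuffle. The names `A_shuffled`, `inverse`, and `A_restored` are illustrative and do not appear in the notebook.

```python
import numpy as np

rng = np.random.default_rng(42)

# Small stand-in for the caveman matrix: two caves of 3 nodes each
cave = np.array([0, 0, 0, 1, 1, 1])
A = (cave[:, None] == cave[None, :]).astype(int)
np.fill_diagonal(A, 0)

order = rng.permutation(len(A))
A_shuffled = A[np.ix_(order, order)]   # same result as A[order, :][:, order]

# argsort of a permutation is its inverse, so this undoes the shuffle
inverse = np.argsort(order)
A_restored = A_shuffled[np.ix_(inverse, inverse)]

print(A_shuffled.sum() == A.sum())       # the number of edges is unchanged
print(np.array_equal(A_restored, A))     # the original matrix is recovered
```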

How can we figure out which nodes are in the same cave?
Let's look at a few rows of the adjacency matrix.

fig, axes = plt.subplots(nrows=3, ncols=2, figsize=(12,2))

# Show six individual rows of the shuffled adjacency matrix in a 3x2 grid
for i, r in enumerate([0, 2, 4, 5, 15, 12]):
    row, col = divmod(i, 2)
    axes[row, col].matshow(A_caveman2[r:r+1])
    axes[row, col].set_title(f"Row {r}")
    axes[row, col].set_xticks([])
    axes[row, col].set_yticks([])

Idea: Run K-Means clustering with these rows as the feature vectors
It would group rows 0/4/15 into one cluster, and 2/5/12 into another
Clusters = Communities

(We'll improve upon this idea later)
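The row-clustering idea can be sketched with a tiny hand-written 2-means loop. This is an illustrative stand-in, not scikit-learn's `KMeans`; the function `two_means`, its deterministic initialisation, and the 20-node example (two caves of 10) are all assumptions made for the sketch.

```python
import numpy as np

def two_means(X, n_iter=10):
    """Tiny Lloyd's-algorithm sketch for k=2 (illustrative only)."""
    X = np.asarray(X, dtype=float)
    # Deterministic start: row 0 and the row farthest from it
    d0 = ((X - X[0]) ** 2).sum(axis=1)
    centers = np.stack([X[0], X[np.argmax(d0)]])
    for _ in range(n_iter):
        # Assign every row to its nearest centre
        dists = ((X[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
        labels = dists.argmin(axis=1)
        # Move each centre to the mean of its assigned rows
        centers = np.stack([X[labels == k].mean(axis=0) for k in range(2)])
    return labels

# Two caves of 10 nodes each, presented in a random order
cave = np.repeat([0, 1], 10)
A = (cave[:, None] == cave[None, :]).astype(int)
np.fill_diagonal(A, 0)
perm = np.random.default_rng(0).permutation(20)
A_shuffled = A[perm][:, perm]

labels = two_means(A_shuffled)   # rows of the adjacency matrix as features
true_caves = cave[perm]
print(labels)
print(true_caves)
```

Rows from the same cave differ in only two positions, while rows from different caves differ in almost all of them, so the two groups separate cleanly. The cluster numbers themselves are arbitrary: cluster 0 may correspond to either cave.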

How would we check if the communities were good?

Suppose someone told us what the communities are.

Maybe they found them by running K-Means.

How would we check?

We would reorder the nodes so that nodes from the same cave are grouped together
Then, we would look at the new adjacency matrix

np.where(order>=10)[0]

array([ 2, 5, 7, 8, 9, 12, 14, 16, 17], dtype=int64)

someone_says_community1 = [0, 1, 3, 4, 6, 10, 11, 13, 15, 18]

someone_says_community2 = [2, 5, 7, 8, 9, 12, 14, 16, 17]

# Reordering the nodes

ordering = np.concatenate([someone_says_community1, someone_says_community2])
ordering

array([ 0,  1,  3,  4,  6, 10, 11, 13, 15, 18,  2,  5,  7,  8,  9, 12, 14,
       16, 17])

# Adjacency matrix with new ordering

A_caveman2_ordered = A_caveman2[ordering][:, ordering]

# What does the reordered adjacency matrix look like?

plt.matshow(A_caveman2_ordered)

<matplotlib.image.AxesImage at 0x167fcb2acf0>

Suppose someone told us what the communities are. How would we check?
If the memberships are correct, the reordered adjacency matrix is block-structured.

Note: It doesn't matter whether the larger block or the smaller block comes first.
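The visual check can also be done numerically. Below is a minimal sketch, assuming an exact caveman-style graph with no self-loops: a proposed labelling is correct when every within-community pair is connected and no between-community pair is. The helper `is_block_structured` is a hypothetical name, not something defined in the notebook.

```python
import numpy as np

def is_block_structured(A, labels):
    """True if A is all ones inside each community (off the diagonal) and all zeros between."""
    A = np.asarray(A)
    labels = np.asarray(labels)
    same = labels[:, None] == labels[None, :]      # same[i, j]: i and j share a label
    off_diag = ~np.eye(len(A), dtype=bool)
    within_ok = np.all(A[same & off_diag] == 1)
    between_ok = np.all(A[~same] == 0)
    return bool(within_ok and between_ok)

# Six nodes in two caves, listed in a mixed-up order
cave = np.array([0, 1, 0, 1, 1, 0])
A = (cave[:, None] == cave[None, :]).astype(int)
np.fill_diagonal(A, 0)

good = is_block_structured(A, cave)                        # the true memberships
flipped = is_block_structured(A, 1 - cave)                 # same grouping, labels swapped
bad = is_block_structured(A, np.array([0, 0, 0, 1, 1, 1])) # a wrong guess
print(good, flipped, bad)
```

Swapping the two labels leaves the result unchanged, which matches the note that the order of the blocks does not matter.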

Stochastic Blockmodel: Generalizing the caveman graph

First, we will restate what we did in the caveman graph using new terminology.

n = 10 # number of nodes

# Each node can belong to one of two clubs

clubs = np.random.choice(2, size=n)
clubs

array([1, 1, 1, 0, 0, 1, 1, 1, 0, 1])

# interests = club memberships matrix

interests = np.zeros((n, 2))
interests[np.arange(n), clubs] = 1
interests

array([[0., 1.],
[0., 1.],
[0., 1.],
[1., 0.],
[1., 0.],
[0., 1.],
[0., 1.],
[0., 1.],
[1., 0.],
[0., 1.]])

Each row represents one person

The first number is the person's interest in club #1.
The second number is the interest in club #2.
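The membership matrix is a one-hot encoding of the `clubs` vector. As a short sketch, an equivalent construction indexes the rows of an identity matrix; the hard-coded `clubs` array below copies the values printed above, and `interests_alt` is an illustrative name.

```python
import numpy as np

clubs = np.array([1, 1, 1, 0, 0, 1, 1, 1, 0, 1])   # memberships printed above
n = len(clubs)

# Construction used in the notebook
interests = np.zeros((n, 2))
interests[np.arange(n), clubs] = 1

# Equivalent one-liner: row i of the result is row clubs[i] of the 2x2 identity
interests_alt = np.eye(2)[clubs]

print(np.array_equal(interests, interests_alt))
print(interests_alt.argmax(axis=1))   # recovers the club of each person
```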

From interests to network

Fans of club #1 become friends
Fans of club #2 become friends

club1_fans = interests[:,0] # Everyone's interest in club #1

club1_fans

array([0., 0., 0., 1., 1., 0., 0., 0., 1., 0.])

# A1[i,j] = club1_fans[i] * club1_fans[j]

A1 = np.outer(club1_fans, club1_fans)

plt.matshow(A1)

<matplotlib.image.AxesImage at 0x167fde95e50>

club2_fans = interests[:,1] # fans of club #2

club2_fans

array([1., 1., 1., 0., 0., 1., 1., 1., 0., 1.])

A2 = np.outer(club2_fans, club2_fans)
plt.matshow(A2)

<matplotlib.image.AxesImage at 0x167fdf3c410>

# All friendships together give the adjacency matrix A

A = A1 + A2
plt.matshow(A)

<matplotlib.image.AxesImage at 0x167fca65460>

From interests to network (in one step)

# Same thing, without all the intermediate steps
A = interests @ interests.T # Matrix multiplication
plt.matshow(A)

<matplotlib.image.AxesImage at 0x167fdf3f9b0>
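The one-step product works because entry (i, j) of `interests @ interests.T` is the dot product of two one-hot rows, which is 1 exactly when persons i and j are in the same club. The sketch below checks that the matrix product, the sum of per-club outer products, and a direct "same club" comparison all agree. `Z` is used here as shorthand for the `interests` matrix, and the `clubs` values are copied from the output above.

```python
import numpy as np

clubs = np.array([1, 1, 1, 0, 0, 1, 1, 1, 0, 1])
Z = np.eye(2)[clubs]                      # one-hot membership matrix ("interests")

A_one_step = Z @ Z.T                      # matrix multiplication
A_summed = sum(np.outer(Z[:, k], Z[:, k]) for k in range(Z.shape[1]))   # A1 + A2
same_club = (clubs[:, None] == clubs[None, :]).astype(float)            # direct comparison

print(np.array_equal(A_one_step, A_summed))
print(np.array_equal(A_one_step, same_club))
```

Note that the diagonal is 1 as well, since every person is trivially in the same club as themselves.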

From network to interests

You see the network. How can you figure out the club memberships?

⇒ To find the right memberships, we need to find the ordering that makes the matrix block-structured.

Method #1: Communities via modularity

G = nx.from_numpy_array(A)

# Communities via modularity

communities = nx.community.louvain_communities(G)
communities

[{0, 1, 2, 5, 6, 7, 9}, {3, 4, 8}]
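For reference, the quantity that Louvain tries to maximise can be computed directly. The sketch below uses the standard modularity formula Q = (1/2m) Σ_ij (A_ij − k_i k_j / 2m) δ(c_i, c_j) on a small graph without self-loops. That is an assumption of the sketch: the matrix `A` in the notebook has ones on its diagonal, which NetworkX treats as self-loops. The function name `modularity` and the two-clique example are illustrative.

```python
import numpy as np

def modularity(A, labels):
    """Modularity of a partition of an undirected graph without self-loops (sketch)."""
    A = np.asarray(A, dtype=float)
    labels = np.asarray(labels)
    k = A.sum(axis=1)                  # node degrees
    two_m = A.sum()                    # twice the number of edges
    same = labels[:, None] == labels[None, :]
    # Observed edges minus the expected number under random wiring, within communities only
    return float(((A - np.outer(k, k) / two_m) * same).sum() / two_m)

# Two disjoint 5-cliques
block = np.repeat([0, 1], 5)
A = (block[:, None] == block[None, :]).astype(float)
np.fill_diagonal(A, 0)

q_split = modularity(A, block)                       # the two cliques as communities
q_merged = modularity(A, np.zeros(10, dtype=int))    # everything in one community
print(q_split, q_merged)
```

The partition into the two cliques scores higher than lumping all nodes together, which is why a modularity-based method prefers it.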

# Check if the communities give a block-structured matrix

ordering = np.concatenate([list(x) for x in communities])
plt.matshow(A[ordering][:, ordering])

<matplotlib.image.AxesImage at 0x167fe47d100>


Method #2: Communities via spectral decomposition

eigenvalues, eigenvectors = np.linalg.eigh(A)
eigenvalues

array([-4.88399708e-16, -4.17365745e-16, -4.44899761e-17, -5.41731251e-34,
        3.81435172e-19,  3.71371796e-18,  2.61415001e-16,  4.46874861e-16,
        3.00000000e+00,  7.00000000e+00])

The eigenvalues are returned in ascending order.

The two largest ones are 7 and 3; the rest are pretty much 0
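The values 7 and 3 are not a coincidence: they are the club sizes. For A = Z Zᵀ with a one-hot membership matrix Z, each club's indicator vector is an eigenvector whose eigenvalue is the number of members. A minimal numerical check, with `Z` standing in for `interests` and `clubs` copied from the output above:

```python
import numpy as np

clubs = np.array([1, 1, 1, 0, 0, 1, 1, 1, 0, 1])
Z = np.eye(2)[clubs]
A = Z @ Z.T

eigenvalues, eigenvectors = np.linalg.eigh(A)   # ascending order
sizes = np.bincount(clubs)                      # number of members in club 0 and club 1

print(sizes)
print(eigenvalues[-2:])

# The indicator vector of club 1 is an eigenvector with eigenvalue sizes[1]
indicator = Z[:, 1]
print(np.allclose(A @ indicator, sizes[1] * indicator))
```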

Let's look at the last two eigenvectors

fig, ax = plt.subplots(figsize=(10,5))
ax.matshow(eigenvectors[:,-2:])

<matplotlib.image.AxesImage at 0x167fca66300>

Each row corresponds to one node.

Clearly, the rows are of two types

⇒ K-Means clustering of the rows of the eigenvector matrix

Previously for the caveman: we clustered the rows of the adjacency matrix

from sklearn.cluster import KMeans

model = KMeans(n_clusters=2)
model.fit(eigenvectors[:,-2:]) # K-Means on the eigenvector rows
predicted_clubs = model.labels_ # Clusters found by K-Means
predicted_clubs

C:\Users\deepay\Miniconda\Lib\site-packages\sklearn\cluster\_kmeans.py:1446: UserWarning: KMeans is known to have a memory l

warnings.warn(
array([0, 0, 0, 1, 1, 0, 0, 0, 1, 0])

predicted_club1_members = np.where(predicted_clubs==0)[0]
predicted_club2_members = np.where(predicted_clubs==1)[0]
print('Predicted clubs', predicted_club1_members, 'and', predicted_club2_members)

Predicted clubs [0 1 2 5 6 7 9] and [3 4 8]
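Cluster numbers from K-Means are arbitrary, so a prediction has to be compared with the true clubs up to a relabelling. In the output above, predicted cluster 0 contains exactly the nodes whose true club is 1. A small sketch of a swap-aware comparison for the two-cluster case, using the arrays printed in this notebook (the names `direct`, `swapped`, and `agreement` are illustrative):

```python
import numpy as np

clubs = np.array([1, 1, 1, 0, 0, 1, 1, 1, 0, 1])       # true memberships
predicted = np.array([0, 0, 0, 1, 1, 0, 0, 0, 1, 0])   # K-Means labels from above

direct = np.mean(predicted == clubs)         # agreement with labels taken literally
swapped = np.mean(predicted == 1 - clubs)    # agreement after swapping 0 and 1
agreement = max(direct, swapped)

print(direct, swapped, agreement)
```

With more than two clusters, the same idea requires searching over label permutations rather than a single swap.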

ordering = np.concatenate([predicted_club1_members, predicted_club2_members])

plt.matshow(A[ordering][:, ordering])

<matplotlib.image.AxesImage at 0x16780407fb0>

Generalization
Story so far:

All fans of the same club become friends.
Fans of different clubs do not become friends.

Generalization:

Fans of the same club become friends with probability 0.8 (say)
Fans of different clubs become friends with probability 0.1 (say)

# B = cluster-connection matrix
B = np.array([[0.8, 0.1], [0.1, 0.8]])
sympy.Matrix(B)

Previously: A = interests @ interests.T # Adjacency matrix

This gave us the caveman graph

Now: We create a probability matrix, from which we sample the adjacency matrix

P = interests @ B @ interests.T # Probability matrix depends on the cluster-connection matrix B

array([[0.8, 0.8, 0.8, 0.1, 0.1, 0.8, 0.8, 0.8, 0.1, 0.8],
[0.8, 0.8, 0.8, 0.1, 0.1, 0.8, 0.8, 0.8, 0.1, 0.8],
[0.8, 0.8, 0.8, 0.1, 0.1, 0.8, 0.8, 0.8, 0.1, 0.8],
[0.1, 0.1, 0.1, 0.8, 0.8, 0.1, 0.1, 0.1, 0.8, 0.1],
[0.1, 0.1, 0.1, 0.8, 0.8, 0.1, 0.1, 0.1, 0.8, 0.1],
[0.8, 0.8, 0.8, 0.1, 0.1, 0.8, 0.8, 0.8, 0.1, 0.8],
[0.8, 0.8, 0.8, 0.1, 0.1, 0.8, 0.8, 0.8, 0.1, 0.8],
[0.8, 0.8, 0.8, 0.1, 0.1, 0.8, 0.8, 0.8, 0.1, 0.8],
[0.1, 0.1, 0.1, 0.8, 0.8, 0.1, 0.1, 0.1, 0.8, 0.1],
[0.8, 0.8, 0.8, 0.1, 0.1, 0.8, 0.8, 0.8, 0.1, 0.8]])
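Each entry of the probability matrix is simply a lookup in B: P[i, j] = B[clubs[i], clubs[j]]. The sketch below confirms that the matrix product and the direct lookup give the same result. `Z` stands in for `interests`, and `P_matrix` and `P_lookup` are illustrative names.

```python
import numpy as np

clubs = np.array([1, 1, 1, 0, 0, 1, 1, 1, 0, 1])
B = np.array([[0.8, 0.1],
              [0.1, 0.8]])
Z = np.eye(2)[clubs]

P_matrix = Z @ B @ Z.T                 # construction used in the notebook
P_lookup = B[np.ix_(clubs, clubs)]     # entry (i, j) is B[clubs[i], clubs[j]]

print(np.allclose(P_matrix, P_lookup))
print(P_lookup[0, 1], P_lookup[0, 3])  # same club vs. different clubs
```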

A = np.random.binomial(1, P) # Friendships are random

array([[1, 0, 1, 0, 0, 1, 0, 1, 0, 1],
[1, 1, 0, 0, 0, 1, 1, 1, 0, 1],
[0, 1, 1, 0, 0, 1, 1, 1, 0, 0],
[0, 0, 1, 1, 1, 0, 0, 1, 0, 0],
[0, 0, 1, 1, 1, 0, 0, 0, 1, 0],
[1, 1, 1, 0, 0, 1, 0, 1, 1, 0],
[1, 1, 1, 0, 0, 1, 1, 1, 0, 0],

[0, 1, 1, 0, 0, 1, 1, 1, 0, 0],
[0, 0, 0, 1, 1, 0, 0, 1, 1, 0],
[0, 1, 1, 0, 0, 1, 1, 0, 0, 1]])

plt.matshow(A)

<matplotlib.image.AxesImage at 0x167808be510>
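Drawing every entry independently, as above, produces a matrix that is generally not symmetric (in the sample shown, entry (0, 1) is 0 while entry (1, 0) is 1). If friendships are meant to be mutual, one possible variant is to sample only the pairs with i < j and mirror them. The helper `sample_sbm` below is a hypothetical sketch of that variant, not the notebook's method. It also leaves the diagonal at zero, which is a further assumption.

```python
import numpy as np

def sample_sbm(P, rng):
    """Sample a symmetric 0/1 adjacency matrix with zero diagonal from probabilities P (sketch)."""
    P = np.asarray(P, dtype=float)
    draws = rng.binomial(1, P)       # one Bernoulli draw per ordered pair (i, j)
    upper = np.triu(draws, k=1)      # keep only the draws with i < j
    return upper + upper.T           # mirror them so that A[i, j] == A[j, i]

clubs = np.array([1, 1, 1, 0, 0, 1, 1, 1, 0, 1])
B = np.array([[0.8, 0.1],
              [0.1, 0.8]])
P = B[np.ix_(clubs, clubs)]

rng = np.random.default_rng(0)
A_sym = sample_sbm(P, rng)
print(np.array_equal(A_sym, A_sym.T))   # symmetric by construction
```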

Does the block-structure still apply?

ordering = np.concatenate([np.where(clubs==0)[0], np.where(clubs==1)[0]]) # Actual communities
plt.matshow(A[ordering][:, ordering])

<matplotlib.image.AxesImage at 0x167808e0950>

Still roughly block structured.

In the caveman graph, it was exactly block-structured.
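"Roughly block-structured" can be quantified by averaging the entries inside each block, which gives an empirical estimate of B. The following is a sketch under simplifying assumptions: the helper `block_densities` is hypothetical, and it averages over all entries of a block including the diagonal, matching the way `P` was built above.

```python
import numpy as np

def block_densities(A, labels, k):
    """Average entry of A for every pair of communities (sketch)."""
    Z = np.eye(k)[np.asarray(labels)]     # one-hot memberships
    block_sums = Z.T @ A @ Z              # sum of the entries in each block
    sizes = Z.sum(axis=0)
    return block_sums / np.outer(sizes, sizes)

clubs = np.array([1, 1, 1, 0, 0, 1, 1, 1, 0, 1])
B = np.array([[0.8, 0.1],
              [0.1, 0.8]])
P = B[np.ix_(clubs, clubs)]

# Sanity check: applied to the probabilities themselves, the averages return B
B_from_P = block_densities(P, clubs, 2)

# Applied to one random sample, the averages are only close to B
A_sample = np.random.default_rng(1).binomial(1, P)
B_hat = block_densities(A_sample, clubs, 2)
print(B_from_P)
print(B_hat)
```

With only 10 nodes the estimate from a single sample is noisy. The diagonal blocks should still come out clearly denser than the off-diagonal ones in most draws.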

Finding the Communities

Same ideas as before.

Method #1: Modularity

G = nx.from_numpy_array(A)
communities = nx.community.louvain_communities(G)
communities

[{2}, {3, 4, 8}, {7}, {0, 1, 5, 6, 9}]

The graph got split into too many communities

# Play with the resolution to get the desired number of communities

communities = nx.community.louvain_communities(G, resolution=0.5)
communities

[{3, 4, 8}, {0, 1, 2, 5, 6, 7, 9}]

ordering = np.concatenate([list(x) for x in communities])

plt.matshow(A[ordering][:, ordering])

<matplotlib.image.AxesImage at 0x167809c6510>
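Why does lowering the resolution merge communities? In the generalised modularity Q = Σ_c [ L_c/m − γ (d_c/2m)² ], the resolution γ scales the penalty on a community's total degree d_c, so a smaller γ makes large communities cheaper. The sketch below illustrates this on a toy graph. The function `modularity_gamma` and the example graph (two 4-cliques joined by 5 edges) are assumptions made for illustration, and the sketch does not call NetworkX.

```python
import numpy as np

def modularity_gamma(A, labels, gamma=1.0):
    """Modularity with resolution gamma, for an undirected graph without self-loops (sketch)."""
    A = np.asarray(A, dtype=float)
    labels = np.asarray(labels)
    two_m = A.sum()                  # twice the number of edges
    degrees = A.sum(axis=1)
    q = 0.0
    for c in np.unique(labels):
        members = labels == c
        internal = A[np.ix_(members, members)].sum()   # each internal edge counted twice
        total_degree = degrees[members].sum()
        q += internal / two_m - gamma * (total_degree / two_m) ** 2
    return q

# Two 4-cliques joined by 5 cross edges
A = np.zeros((8, 8))
A[:4, :4] = 1
A[4:, 4:] = 1
np.fill_diagonal(A, 0)
for i, j in [(0, 4), (1, 5), (2, 6), (3, 7), (0, 5)]:
    A[i, j] = A[j, i] = 1

split = np.array([0, 0, 0, 0, 1, 1, 1, 1])   # one community per clique
merged = np.zeros(8, dtype=int)              # a single community

for gamma in (1.0, 0.5):
    print(gamma, modularity_gamma(A, split, gamma), modularity_gamma(A, merged, gamma))
```

At γ = 1 the split partition scores higher, while at γ = 0.5 the merged partition does. This is consistent with the observation above that a lower resolution yields fewer, larger communities.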
