Rec Sys Network

Uploaded by

Fern Itsn


Other types of Recommender Systems
Structural Recommendations in Networks
Example of network based recommendation
• Networks have become ubiquitous as a modeling tool in many
applications, such as social and information networks. Therefore, it
is particularly useful to discuss various structural elements of a
network that can be recommended in different scenarios.
• Network: a collection of entities that are interconnected with links.
• people that are friends
• computers that are interconnected
• web pages that point to each other
• proteins that interact
• Networks are also called graphs, the entities are nodes, and the
links are edges
Types of Structural Recommendations
• Recommending nodes by authority and context
  • The quality and authority of a node are judged by its incoming links.
  • Search engines adopt the PageRank algorithm for this purpose, but plain PageRank is not personalized.
  • Personalized PageRank can be used to add the personal context.
  • Recommendation from reputed nodes
• Recommending nodes by example
  • Nodes are tagged with similar properties.
  • Closely related to neighbourhood-based approaches
  • Target marketing
• Recommending nodes by influence and content
  • Nodes with the potential to disseminate information, based on their connectivity
  • Social influencers and viral marketing
• Recommending links
  • Recommending links to increase the size of a network
  • Friend suggestions
Recommending nodes by authority and context
• A health drinks producer wishes to find a brand ambassador for a newly introduced item. An appropriate person would be someone whom everyone in this domain respects. How can such a person be found in a social network?
• The PageRank algorithm is the basis.
  • It was initially proposed for ranking Web pages.
  • The ranking is achieved using the citation (hyperlink) structure of the Web.
  • A citation can be logically viewed as a vote for the Web page.
  • To provide a more holistic citation-based vote, PageRank generalizes the notion of citation-based ranking in a recursive way.
• The PageRank algorithm is not personalized.
• Personalized PageRank adds the personalization.
Foundations of page rank algorithm
• In-links of page i: the hyperlinks that point to page i from other pages.
• Out-links of page i: the hyperlinks that point out to other pages from page i.
• A hyperlink from a page pointing to another page is an implicit conveyance of authority to the target page. Thus, the more in-links that a page receives, the more prestige the page has.
• Pages that point to page i also have their own prestige scores. A page with a higher prestige score pointing to i is more important than a page with a lower prestige score pointing to i. In other words, a page is important if it is pointed to by other important pages.
Foundations of page rank algorithm: Markov Model
• Treat the Web as a directed graph G = (V, E), where V is the set of vertices or nodes, i.e., the set of all pages, and E is the set of directed edges in the graph, i.e., hyperlinks.
• Let n be the total number of pages and $O_i$ the number of out-links of page i.
• Let A be the state transition probability matrix, where
$$A_{ij} = \begin{cases} \dfrac{1}{O_i} & \text{if } (i, j) \in E \\ 0 & \text{otherwise} \end{cases}$$
• If the process satisfies certain properties (it is ergodic, i.e., irreducible and aperiodic), it converges to a steady state in the long run.
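Foundations of page rank algorithm
• The PageRank score of page i (denoted by $P(i)$) is defined by:
$$P(i) = \sum_{(j, i) \in E} \frac{P(j)}{O_j}$$
where $O_j$ is the number of out-links of page j and n is the total number of pages, so there is one such equation for each of the n pages. In matrix form, with $P = (P(1), \ldots, P(n))^T$, this reads $P = A^T P$.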
In the slide's example, A is not a stochastic matrix because the fifth row is all 0's.
• To convert A to a stochastic transition matrix, add a complete set of outgoing links from each such page i to all the pages on the Web.
• Thus the transition probability of going from i to every page is 1/n, assuming a uniform probability distribution.
• That is, we replace each row containing all 0's with e/n, where e is the n-dimensional vector of all 1's.
A is not irreducible. Irreducible means that the graph is strongly connected.
A directed graph G = (V, E) is strongly connected if and only if, for each pair of nodes u, v
∈ V, there is a path from u to v

A state i is periodic with period k > 1 if k is the smallest number such that all paths leading
from state i back to state i have a length that is a multiple of k. If a state is not periodic (i.e.,
k = 1), it is aperiodic. A Markov chain is aperiodic if all states are aperiodic.
Assumption to deal with the problem:
The random surfer has two options:
1. With probability d, he randomly chooses an out-link to follow.
2. With probability 1 − d, he jumps to a random page without following a link.

With this assumption the improved model is
$$P = \left( (1 - d)\,\frac{E}{n} + d\,A^T \right) P$$
where $E = e e^T$ (e is a column vector of all 1's), and thus E is an n × n square matrix of all 1's. 1/n is the probability of jumping to a particular page. n is the total number of nodes in the graph. It is assumed that A has already been made a stochastic matrix.
The slide's example uses d = 0.9.

The resulting matrix is stochastic, irreducible, and aperiodic, so the process converges to a steady state in the long run. Thus, after scaling P so that $e^T P = n$ and simplifying with the steady-state equations,
$$P = (1 - d)\,e + d\,A^T P$$
Or equivalently
$$P(i) = (1 - d) + d \sum_{j=1}^{n} A_{ji}\,P(j)$$
The power iteration method for PageRank
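The iteration behind this slide can be sketched as follows. This is a minimal illustration rather than the deck's own pseudocode: the function name `pagerank`, the dictionary-based graph format, the stopping tolerance, and the five-page graph are assumptions made for the example. It repeatedly applies the update P = (1 − d)e + d·AᵀP, treating a page without out-links as if it linked to every page (the e/n row replacement).

```python
def pagerank(out_links, d=0.9, tol=1e-10, max_iter=1000):
    """Power iteration for P = (1 - d) * e + d * A^T P, scaled so sum(P) == n.

    out_links maps each page to the list of pages it links to
    (a hypothetical input format chosen for this sketch).
    """
    nodes = sorted(out_links)
    rank = {v: 1.0 for v in nodes}  # start from P = e
    for _ in range(max_iter):
        new_rank = {v: 1.0 - d for v in nodes}  # random-jump term
        for i in nodes:
            # A page with no out-links is treated as linking to every page.
            targets = out_links[i] or nodes
            share = d * rank[i] / len(targets)
            for j in targets:
                new_rank[j] += share
        change = sum(abs(new_rank[v] - rank[v]) for v in nodes)
        rank = new_rank
        if change < tol:  # 1-norm of the update is small enough
            break
    return rank


# Hypothetical five-page Web; page "e" has no out-links.
web = {"a": ["b", "c"], "b": ["c"], "c": ["a"], "d": ["c"], "e": []}
ranks = pagerank(web)
for page in sorted(ranks, key=ranks.get, reverse=True):
    print(page, round(ranks[page], 4))
```

In this toy graph page "c" receives the most in-links and ends up with the highest score, while the scores always sum to n.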
Modifications to page rank for recommending nodes in a personalized setting
• In the personalized PageRank algorithm, a personalization vector is combined with the transition probability matrix: it replaces the uniform random jump, so with probability 1 − d the surfer jumps only to the nodes selected by this vector.
• The personalization vector has one entry per node. If the node is of interest, the corresponding entry in the vector takes the value 1; otherwise it is 0. The vector is then normalized so that its entries sum to 1.
• This enables discovering topic-sensitive authoritative nodes.
• The personalized PageRank approach can also be used to discover the neighborhoods in user-item graphs or user-user graphs in traditional collaborative filtering applications.
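A small sketch of the personalized variant is given below, under stated assumptions: the function name `personalized_pagerank`, the probability scaling (scores sum to 1), the uniform treatment of nodes without out-links, and the two-triangle graph are all choices made for this illustration and do not come from the slides. The only change from plain PageRank is that the random jump lands on the nodes of interest instead of on a uniformly chosen node.

```python
def personalized_pagerank(out_links, interest, d=0.85, tol=1e-12, max_iter=1000):
    """PageRank whose random jump follows a 0/1 personalization vector."""
    nodes = sorted(out_links)
    # Personalization vector: 1 for nodes of interest, 0 otherwise, then normalized.
    v = {u: (1.0 if u in interest else 0.0) for u in nodes}
    total = sum(v.values())
    if total == 0:
        raise ValueError("at least one node of interest is required")
    v = {u: x / total for u, x in v.items()}
    rank = dict(v)
    for _ in range(max_iter):
        new_rank = {u: (1.0 - d) * v[u] for u in nodes}  # biased jump
        for i in nodes:
            targets = out_links[i] or nodes  # assumption: dangling nodes link everywhere
            share = d * rank[i] / len(targets)
            for j in targets:
                new_rank[j] += share
        change = sum(abs(new_rank[u] - rank[u]) for u in nodes)
        rank = new_rank
        if change < tol:
            break
    return rank


# Hypothetical user-user graph: two triangles joined by the bridge c-d.
friends = {
    "a": ["b", "c"], "b": ["a", "c"], "c": ["a", "b", "d"],
    "d": ["c", "e", "f"], "e": ["d", "f"], "f": ["d", "e"],
}
near_a = personalized_pagerank(friends, {"a"})
print(sorted(near_a, key=near_a.get, reverse=True))
```

Personalizing on "a" pushes the scores toward the triangle that contains "a", which is the neighborhood-discovery behaviour described above.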
Recommending nodes by example
Finding the nodes with similar interests
• A manufacturer of golf equipment wishes to target a few nodes for marketing. The manufacturer must select those nodes (users) that are interested in golf. Their interests can be inferred from their own posts, their likes on others' posts, their tags on related items or news, etc.
• Finding such users relies on the concept of homophily in a social network, which says that nodes with similar properties are usually connected.
• Therefore, the profiles, properties, and ratings of neighbouring nodes can be leveraged to make recommendations.
Recommendation by collective classification
• The actors with a specific interest in the network can be specified with the use of labels. Therefore, a subset of the nodes is associated with labels.
• It is desired to use these labels as training data to determine the labels of the other nodes, where they are unspecified. It is assumed that for labeled nodes, the index of the label is drawn from {1 . . . r}.
• Like the collaborative filtering problem, this is also an incomplete data estimation problem, except that it is done in the context of network structures.
Label propagation for classification
(Figure: label propagation example. Source: velog.io)
Challenges
• Convergence is not guaranteed.
• Node feature information is not used, because only node labels and network information are used.
Recommendation by collective classification
• Because nodes with similar properties are usually connected, it is reasonable to assume that this is also true of node labels. A simple solution to this problem is to examine the k labeled nodes in the proximity of a given node and report the majority label.
• This approach is the network analog of a nearest neighbor classifier. However, such an approach is generally not possible in collective classification because of the sparsity of node labels.
• In order to handle sparsity, one must use not only the direct connections to labeled nodes, but also the indirect connections through unlabeled nodes.
• Two widely discussed algorithms in this regard are:
  • Iterative classification algorithm
  • Random walk-based method
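The majority-label idea from the first bullet can be sketched as below. This is only an illustration of the nearest-neighbor analog, not of the two algorithms just listed; the function name `majority_label`, the use of hop distance as the notion of proximity, and the toy graph are assumptions. Note that the search walks through unlabeled nodes, which is the kind of indirect connection the bullets mention.

```python
from collections import Counter, deque


def majority_label(adj, labels, start, k):
    """Majority label among the k closest labeled nodes (breadth-first order).

    adj maps a node to its neighbor list; labels holds the known labels only.
    Returns None if no labeled node is reachable from start.
    """
    seen = {start}
    queue = deque([start])
    found = []
    while queue and len(found) < k:
        u = queue.popleft()
        for v in adj[u]:
            if v in seen:
                continue
            seen.add(v)
            queue.append(v)  # unlabeled nodes are still traversed
            if v in labels:
                found.append(labels[v])
                if len(found) == k:
                    break
    if not found:
        return None
    return Counter(found).most_common(1)[0][0]


# Hypothetical network: "u" and "x" are unlabeled, the others carry interest labels.
adj = {"u": ["x", "g2"], "x": ["u", "g1", "t1"], "g1": ["x"], "g2": ["u"], "t1": ["x"]}
labels = {"g1": "golf", "g2": "golf", "t1": "tennis"}
print(majority_label(adj, labels, "u", k=3))
```

When very few nodes are labeled, the k closest labeled nodes may be far away or may not exist at all, which is the sparsity problem that motivates the iterative and random walk-based methods.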
Iterative classification algorithm
• Network G = (N, A)
• Class labels: drawn from {1 . . . r}.
• The total number of nodes is denoted by n, of which $n_t$ nodes are unlabeled test nodes.
• Each edge (i, j) ∈ A is associated with the weight $w_{ij}$.
• Node i has two types of features: content features and link features.
  • The content $X_i$ is available at node i in the form of a multidimensional feature vector.
  • The ICA algorithm derives a set of link features in addition to the available content features.
  • A link feature is generated for each class, containing the fraction of the node's adjacent nodes belonging to that class. For each node i, its adjacent node j is weighted by $w_{ij}$ when computing its credit to the relevant class.
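The link features described in the last bullet can be computed roughly as follows. This is a sketch under assumptions: the function name `link_features`, the dictionary inputs, and the choice to normalize over currently labeled neighbors only are not specified in the slides.

```python
def link_features(neighbors, weights, labels, num_classes):
    """Weighted fraction of each node's labeled neighbors per class.

    neighbors: node -> list of adjacent nodes
    weights:   (i, j) -> edge weight w_ij
    labels:    node -> class index in 1..num_classes (known labels only)
    """
    features = {}
    for i, adjacent in neighbors.items():
        credit = [0.0] * num_classes
        for j in adjacent:
            if j in labels:
                # Neighbor j contributes its edge weight to its own class.
                credit[labels[j] - 1] += weights[(i, j)]
        total = sum(credit)
        # Assumption: a node with no labeled neighbors gets all-zero features.
        features[i] = [c / total for c in credit] if total > 0 else credit
    return features


# Hypothetical node "i" with three labeled neighbors and r = 2 classes.
neighbors = {"i": ["a", "b", "c"]}
weights = {("i", "a"): 2.0, ("i", "b"): 1.0, ("i", "c"): 1.0}
labels = {"a": 1, "b": 2, "c": 1}
print(link_features(neighbors, weights, labels, num_classes=2))  # {'i': [0.75, 0.25]}
```

In an iterative scheme these features would be recomputed whenever predicted labels change and then fed, together with the content features, to an ordinary classifier.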
Recommending nodes by influence and content
• You want to choose a few nodes (because you may be budget constrained) for viral marketing of your product. How should these few nodes be chosen to ensure maximum coverage in the network?
• (Influence Maximization) Given a social network G = (N, A), determine a set of k seed nodes S whose activation maximizes the overall spread of influence in the network.
• Each model or heuristic can quantify the influence level of a set of nodes S with the use of a function denoted by f(·). This function maps subsets of nodes to real numbers representing influence values. Therefore, after a model has been chosen for quantifying the influence f(S) of a given set S, the optimization problem is that of determining the set S that maximizes f(S).
Recommending nodes by influence and content
• An interesting property of a very large number of influence analysis models is that the optimized function f(S) is submodular.
• Submodularity is a mathematical way of representing the natural law of diminishing returns, as applied to sets. In other words, if S ⊆ T, then the additional influence obtained by adding an individual to set T cannot be larger than the additional influence of adding the same individual to set S.
• Thus, the incremental influence of the same individual diminishes as larger supersets of cohorts are available as seeds.
• Two common approaches for defining the influence function f(S) of a set of nodes S are the Linear Threshold Model and the Independent Cascade Model.
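Written out as formulas, the optimization problem and the diminishing-returns property stated above read as follows. The symbol $S^*$ for the optimal seed set and the name i for the added individual are notation introduced here for illustration.

```latex
S^{*} = \arg\max_{S \subseteq N,\ |S| = k} f(S)

f(S \cup \{i\}) - f(S) \;\ge\; f(T \cup \{i\}) - f(T)
\qquad \text{for all } S \subseteq T \subseteq N,\ i \in N \setminus T
```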
Linear Threshold Model
The algorithm initially starts with an active set of seed nodes S and iteratively increases the number of active nodes based on the influence of neighboring active nodes. Active nodes are allowed to influence their neighbors over multiple iterations throughout the execution of the algorithm until no more nodes can be activated. The influence of neighboring nodes is quantified with the use of a linear function of the edge-specific weights $b_{ij}$. For each node i in the network G = (N, A), the following is assumed to be true:
$$\sum_{j \in N(i)} b_{ij} \le 1$$
where N(i) denotes the set of neighbors of node i.
Each node i is associated with a random threshold $\theta_i \sim U[0, 1]$ that is fixed up front and stays constant over the course of the algorithm. The total influence I(i) of the active neighbors of node i on i, at a given time instant, is computed as the sum of the weights $b_{ij}$ of all active neighbors of i. Node i becomes active once $I(i) \ge \theta_i$.
Example of Linear Threshold Model
(A) Node V is activated and influences W and U by 0.5 and 0.2, respectively;
(B) W becomes activated and influences X and U by 0.5 and 0.3, respectively;
(C) U becomes activated and influences X and Y by 0.1 and 0.2, respectively;
(D) X becomes activated and influences Y by 0.2; no more nodes can be activated; process stops.
Source: https://snap-stanford.github.io/cs224w-notes/network-methods/influence-maximization
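The example can be replayed with a short simulation. The edge weights below are the ones listed in steps (A) to (D); the thresholds are not recoverable from the text, so the values used here are hypothetical and were picked only so that the run follows the same activation order. The function name `linear_threshold` and the input format are likewise assumptions.

```python
import random


def linear_threshold(in_weights, seeds, thresholds):
    """Activate nodes until no inactive node reaches its threshold.

    in_weights[i] maps each neighbor j of node i to the weight b_ij
    (the weights into a node are assumed to sum to at most 1).
    """
    active = set(seeds)
    changed = True
    while changed:
        changed = False
        for i, nbrs in in_weights.items():
            if i in active:
                continue
            # I(i): total weight of the currently active neighbors of i.
            influence = sum(b for j, b in nbrs.items() if j in active)
            if influence >= thresholds[i]:
                active.add(i)
                changed = True
    return active


# Weights from the example above (V -> W 0.5, V -> U 0.2, and so on).
in_weights = {
    "V": {},
    "W": {"V": 0.5},
    "U": {"V": 0.2, "W": 0.3},
    "X": {"W": 0.5, "U": 0.1},
    "Y": {"U": 0.2, "X": 0.2},
}
# Hypothetical thresholds; in the model they are drawn once from U[0, 1],
# e.g. {i: random.random() for i in in_weights}.
thresholds = {"V": 0.0, "W": 0.4, "U": 0.45, "X": 0.55, "Y": 0.5}
print(sorted(linear_threshold(in_weights, {"V"}, thresholds)))  # ['U', 'V', 'W', 'X']
```

With these thresholds Y receives a total influence of 0.4 and stays inactive, matching the end of the example.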
Independent Cascade Model
In the independent cascade model, after a node becomes active, it gets only a single chance to activate each of its neighbors, with propagation probabilities associated with the edges.
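One way to simulate this model is sketched below. The function names `independent_cascade` and `estimate_spread`, the dictionary of edge probabilities, and the idea of averaging repeated runs to approximate f(S) are assumptions made for this illustration.

```python
import random


def independent_cascade(prob, seeds, rng=None):
    """One random run: each newly active node gets a single try per neighbor.

    prob[i][j] is the propagation probability on the edge from i to j.
    """
    if rng is None:
        rng = random.Random()
    active = set(seeds)
    frontier = list(seeds)  # nodes activated in the previous step
    while frontier:
        next_frontier = []
        for i in frontier:
            for j, p in prob.get(i, {}).items():
                # The single activation attempt of i on j.
                if j not in active and rng.random() < p:
                    active.add(j)
                    next_frontier.append(j)
        frontier = next_frontier
    return active


def estimate_spread(prob, seeds, runs=1000, seed=0):
    """Average number of active nodes over repeated runs (approximates f(S))."""
    rng = random.Random(seed)
    return sum(len(independent_cascade(prob, seeds, rng)) for _ in range(runs)) / runs


# Hypothetical network with propagation probabilities on the edges.
prob = {"a": {"b": 0.6, "c": 0.3}, "b": {"d": 0.5}, "c": {"d": 0.5}}
print(estimate_spread(prob, {"a"}))
```

Because a node leaves the frontier after one step, it never retries a neighbor it failed to activate, which is the difference from the linear threshold model.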
Recommending links
• In many social networks, it is desirable to predict future links between
pairs of nodes in the network.
• For example, commercial social networks, such as Facebook, often
recommend users as potential friends.
• Methods for link prediction
• Neighborhood-Based Measures
• Katz Measure
• Random Walk-Based Measures
• Classification based approach
• Matrix factorization based approach
Neighborhood-Based Measures
(Common Neighbor Measure) The common-neighbor measure between nodes i and j is equal to the number of common neighbors between nodes i and j. In other words, if $S_i$ is the neighbor set of node i, and $S_j$ is the neighbor set of node j, the common-neighbor measure is defined as follows:
$$\text{CommonNeighbors}(i, j) = |S_i \cap S_j|$$
In the slide's example graph, this measure equals 4.
The major weakness of the common-neighbor measure is that it does not account for the relative number of common neighbors between the two nodes as compared to the number of their other connections. It may happen that both nodes are either spammers or very popular public figures who are connected to a large number of other actors. In such a case, the nodes may have many neighbors in common just by chance. The Jaccard measure is designed to normalize for varying degree distributions.
Neighborhood-Based Measures
(Jaccard Measure) The Jaccard-based link prediction measure between nodes i and j is equal to the Jaccard coefficient between their neighbor sets $S_i$ and $S_j$, respectively:
$$\text{JaccardPredict}(i, j) = \frac{|S_i \cap S_j|}{|S_i \cup S_j|}$$
In the slide's example graph, this measure equals 4/9.
The Jaccard measure adjusts much better to the variations in the degrees of the nodes between which the link prediction is measured. However, it does not adjust well to the degrees of their intermediate neighbors. All of these common neighbors could be very popular public figures with very high degrees. Such nodes are statistically more likely to occur as common neighbors of many pairs of nodes, which makes them less important in the link prediction measure.
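Neighborhood-Based Measures
(Adamic-Adar Measure) The Adamic-Adar measure between nodes i and j is equal to the weighted number of common neighbors between nodes i and j, where each common neighbor k is weighted by the inverse logarithm of its degree $|S_k|$:
$$\text{AdamicAdar}(i, j) = \sum_{k \in S_i \cap S_j} \frac{1}{\log |S_k|}$$
In the slide's example graph, this measure equals 1/log 4 + 1/log 2 + 1/log 2 + 1/log 4.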
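The three neighborhood-based measures can be compared on one graph with a few lines of code. The slide's example figure is not available in the text, so the edge list below is a hypothetical graph constructed to give the same values as the slides (4 common neighbors, a union of 9 neighbors, and common-neighbor degrees 4, 2, 2, 4). The function names and the natural logarithm are also assumptions.

```python
import math
from collections import defaultdict


def neighbor_sets(edges):
    """Build node -> set of neighbors from an undirected edge list."""
    nbrs = defaultdict(set)
    for u, v in edges:
        nbrs[u].add(v)
        nbrs[v].add(u)
    return nbrs


def common_neighbors(nbrs, i, j):
    return len(nbrs[i] & nbrs[j])


def jaccard(nbrs, i, j):
    union = nbrs[i] | nbrs[j]
    return len(nbrs[i] & nbrs[j]) / len(union) if union else 0.0


def adamic_adar(nbrs, i, j):
    # A common neighbor always has degree >= 2, so log(degree) > 0.
    return sum(1.0 / math.log(len(nbrs[k])) for k in nbrs[i] & nbrs[j])


# Hypothetical graph: a, b, c, d are the common neighbors of i and j.
edges = [
    ("i", "a"), ("i", "b"), ("i", "c"), ("i", "d"), ("i", "e"), ("i", "f"), ("i", "g"),
    ("j", "a"), ("j", "b"), ("j", "c"), ("j", "d"), ("j", "h"), ("j", "k"),
    ("a", "e"), ("a", "h"), ("d", "f"), ("d", "k"),
]
nbrs = neighbor_sets(edges)
print(common_neighbors(nbrs, "i", "j"))  # 4
print(jaccard(nbrs, "i", "j"))           # 4/9
print(adamic_adar(nbrs, "i", "j"))       # 1/log 4 + 1/log 2 + 1/log 2 + 1/log 4
```

The low-degree common neighbors b and c contribute 1/log 2 each, twice as much as the better-connected a and d, which is the down-weighting of popular intermediaries that the Adamic-Adar measure is meant to provide.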
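Walk-Based Measure
While the neighborhood-based measures provide a robust estimation of the likelihood of a link forming between a pair of nodes, they are not quite as effective when the number of shared neighbors between a pair of nodes is small. When there is significant indirect connectivity through longer paths, walk-based measures are more appropriate. The Katz measure is one such metric: it counts the walks of every length t between two nodes and discounts the walks of length t by $\beta^t$.
If A is the symmetric adjacency matrix of an undirected network, then the n × n pairwise Katz coefficient matrix K can be computed as
$$K = \sum_{t=1}^{\infty} \beta^t A^t = (I - \beta A)^{-1} - I$$
where the discount factor β is chosen smaller than the inverse of the largest eigenvalue of A so that the series converges.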
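As a rough illustration, the Katz coefficients can be approximated by truncating the sum over walk lengths. Truncation at `max_len`, the plain list-of-lists matrices, and the function names are assumptions made to keep the sketch free of external libraries; it is not the closed-form matrix inverse.

```python
def matmul(x, y):
    """Product of two square matrices given as lists of rows."""
    n = len(x)
    return [[sum(x[r][k] * y[k][c] for k in range(n)) for c in range(n)]
            for r in range(n)]


def katz_truncated(adj, beta, max_len):
    """Sum of beta**t * A**t for t = 1 .. max_len."""
    n = len(adj)
    power = [row[:] for row in adj]                  # A^1
    katz = [[beta * x for x in row] for row in adj]  # term for t = 1
    factor = beta
    for _ in range(2, max_len + 1):
        power = matmul(power, adj)  # A^t counts the walks of length t
        factor *= beta
        for r in range(n):
            for c in range(n):
                katz[r][c] += factor * power[r][c]
    return katz


# Hypothetical path graph 0 - 1 - 2.
adj = [[0, 1, 0],
       [1, 0, 1],
       [0, 1, 0]]
scores = katz_truncated(adj, beta=0.1, max_len=3)
print(scores[0][1], scores[0][2])
```

Nodes 0 and 2 share no edge, yet they receive a positive score through the length-2 walk via node 1, which is the kind of indirect connectivity the measure is designed to capture.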
Summary
• There are four types of structural recommendations: recommending nodes by authority and context, recommending nodes by example, recommending nodes by influence and content, and recommending links.
• The PageRank algorithm is the basis for recommending nodes by authority and context.
• Finding the nodes with similar interests utilizes the concept of homophily in a social network. The iterative classification algorithm is a classical approach for this.
• Recommending nodes by influence and content is the problem of selecting a few seed nodes to maximize the influence. The Linear Threshold Model and the Independent Cascade Model are two classical approaches in this regard.
• Methods for link prediction include neighborhood-based measures, the Katz measure, random walk-based measures, classification-based approaches, and matrix factorization-based approaches.
