Recommender System:
Algorithms & Architecture
     xiangliang@hulu.com
Outline
    Problem
    Data
•

    Algorithms
•

    Cold start
•

    Architecture
•
•
                         Recommender
                            System
Problem
Recommend items to users to make user, content
partner, websites happy!
Data
• User behaviors data

  Page view         All user        Very Large
  Behavior          User            Size


  Watch video       All user        Large
  Favorite          Register user   Middle
  Vote              Register user   Middle
  Add to playlist   Register user   Small
  Facebook like     Register user   Small
  Share             Register user   Small
  Review            Register user   Small
Data
• Which data is most
  important                     Page view         All user        Very Large
                                Behavior          User            Size



  – Main behavior in the
                                Watch video       All user        Large


    website
                                Favorite          Register user   Middle
                                Vote              Register user   Middle

  – All user can have such      Add to playlist   Register user   Small

    behavior
                                Facebook like     Register user   Small


  – Cost
                                Share             Register user   Small
                                Review            Register user   Small

  – Reflect user interests on
    items
Data
• Data Structure
  – User ID
  – Item ID
  – Behavior Type
  – Behavior Content
  – Context
     • Timestamp
     • Location
     • Mood
                   Sheldon watch Star Trek with his friends at home
Algorithms
                              Recommender
                              System Method



   Collaborative        Content          Social
                                                    ……
     Filtering          Filtering       Filtering


                   Latent Factor
Graph-based                               ……
                      Model


Neighborhood
                       ……
   -based



 User-based         Item-based            ……
Neighborhood-based
• User-based
  – Digg
• Item-based
  – Amazon, Netflix, YouTube, Hulu, …
User-based
• Algorithm
  – For user u, find a set of users S(u) have similar
    preference as u.
  – Recommend popular items among users in S(u)
    to user u.
User-based CF

pui =           ∑
        v∈S ( u , K ) ∩ N ( i )
                                  wuv rvi


            N (u ) ∩ N (v)
w uv =
            N (u ) ∪ N (v)
Item-based
• Algorithm
  – For user u, get items set N(u) this user like
    before.
  – Recommend items which are similar to many
    items in N(u) to user u.
Item-based CF

pui =           ∑
        j∈S ( i , K ) ∩ N ( u )
                                  w ji ruj


           N (i ) ∩ N ( j )
w ij =
           N (i ) ∪ N ( j )
Item-based CF




Why not use w ij =                      ?
                     N (i ) ∩ N ( j )
                          N (i )
Neighborhood-based
• User-based vs. Item-based
                    User-based              Item-based

  Scalability       Bad when user size is   Bad when item size is
                    large                   large
  Explanation       Bad                     Good

  Novelty           Bad                     Good

  Coverage          Bad                     Good

  Cold start        Bad for new users       Bad for new items

  Performance       Need to get many        Only need to get
                    users history           current user’s history
References
• Amazon.com Recommendations item-to-
  item Collaborative Filtering.
• Empirical Analysis of Predictive Algorithms
  for Collaborative Filtering.
Graph-based
• Users’ behaviors on items can be
  represented by bi-part graph.
 A       1   A       1   A       1   A   1

 B       2   B       2   B       2   B   2

 C       3   C       3   C       3   C   3

 D       4   D       4   D       4   D   4
Graph-based
• Two nodes will have high relevance if
  – There are many paths in graph between two
    nodes.
  – Most of paths between two nodes is short.
  – Most paths do not go through nodes with high
    out-degree.
Graph-based
• Advantage
  – Heterogeneous data            A   1
     • Multiple user behaviors
     • Social Network
                                  B   2

     • Context (Time, Location)   C   3

• Disadvantage                    D   4

  – Statistical-based
  – High cost for long path
References
• A Graph-based Recommender System for
  Digital Library.
• Random-walk computation of similarities
  between nodes of a graph with application
  to collaborative recommendation.
Latent Factor Model
• Users and items are connect by latent
  features.

       A                        1
                   a
       B                        2
                   b
       C                        3
                   c
       D                        4
Latent Factor Model
       rui = ∑ puk qik
       ˆ
                        k
Science Fiction   0.5       Science Fiction   0.9


   Universe       0.9          Universe       0.9


   Physical       0.8          Physical       0.5


 Space Travel     0.8        Space Travel     0.7


  Animation       0.3         Animation       0.1


  Romance         0.0         Romance         0.0
Latent Factor Model
• How to get p, q?

 min ∑ (rui − ∑ puk qik ) + λ ( pu       + qi )
                         2           2       2

      ( u ,i )    k

 = α (eui qik − λ puk )
 puk +
= α (eui puk − λ qik )
qik +
Latent Factor Model
• How to define rui
  – Rating prediction
  – Top-N recommendation
     • Implicit feedback data: only have positive samples
       and missing values, how to select negative samples?
Latent Factor Model
    1 (Sci-fi)         2 (Crime)         3 (Family)        4 (Horror)

                                                         The Blair Witch
The invisible Man        Jaws          101 Dalmatians
                                                             Project
 Frankenstein
                                         Back to the
 Meets the Wolf     Lethal Weapon                        Pacific Heights
                                           Future
     Man

    Godzilla          Total Recall     Groundhog Day     Stir of Echoes

  Star Wars VI      Reservoir Dogs         Tarzan          Dead Calm

The Terminator       Donnie Brasco     The Aristocats      Phantasm

                                       The Jungle Book
      Alien           The Fugitive                       Sleepy Hollow
                                              2

     Alien 2        La shou Shen tan        Antz          The Faculty
Latent Factor Model
• Advantage
  – High accuracy in rating prediction
  – Auto group items
  – Scalability is good
  – Learning-based
• Disadvantage
  – Incremental updating
  – Real-time
  – Explanation
References
• http://www.informatik.uni-
  trier.de/~ley/db/indices/a-
  tree/k/Koren:Yehuda.html
Cold Start
• Problems
  – User cold start : new users
  – Item cold start : new items
  – System cold start : new systems
User Cold Start
• How to recommend items to new users?
  – Non-personalization recommendation
     • Most popular items
     • Highly Rated items
  – Using user register profile (Age, Gender, …)
User Cold Start
• Example: Gender and TV shows




 Data comes from IMDB : http://www.imdb.com/title/tt0412142/ratings
User Cold Start

Male
Age : 20-30
Theoretical physicist
Doctor
American
Irreligious
How to get user interest quickly
• When new user comes, his feedback on
  what items can help us better understand
  his interest?
  – Not very popular
  – Can represent a group of items
  – Users who like this item have different
    preference with users who dislike this item
Item Cold Start
• How to recommend new items to user?
  – Do not recommend




                       How to recommend news??
Item Cold Start
• How to recommend new items to user?
  – Using content information
                 Machine
                            Data Mining   Recommendation
                 Learning
System Cold Start
• How to design recommender system when
  there is no user?
  – Pandora : Music Genome Project
  – Jinni : Movie Genome Project
Architecture
• Feature-based recommendation framework:

      A                      1
                  a
      B                      2
                  b
      C                      3
                  c
      D                      4


      User      Feature     Item
Architecture




    Male


   Scientist


   Physics
Architecture
• Advantage:
  – Heterogeneous data
  – Reasonable Explanation
• Disadvantage:
  – Do not support user-based methods
Open Questions
• How to weight multiple behaviors?
• How to improve diversity, novelty?
• How to build feedback loop?
Thanks!

More Related Content

PPTX
Recommendation System
PPTX
Recommender system
PPTX
Recommender system introduction
PDF
Overview of recommender system
PDF
An introduction to Recommender Systems
PPTX
Recommender Systems
PPTX
Recommender systems using collaborative filtering
PPTX
Recommendation Systems Basics
Recommendation System
Recommender system
Recommender system introduction
Overview of recommender system
An introduction to Recommender Systems
Recommender Systems
Recommender systems using collaborative filtering
Recommendation Systems Basics

What's hot (20)

PDF
Boston ML - Architecting Recommender Systems
PDF
How to build a recommender system?
PPTX
Recommendation Systems
PDF
Recommender Systems
PDF
Recommender Systems
PPTX
Recommendation system
PDF
Recommender Systems (Machine Learning Summer School 2014 @ CMU)
PDF
Introduction to Recommendation Systems
PPT
A Hybrid Recommendation system
PPTX
Recommender systems: Content-based and collaborative filtering
PPTX
Recommender Systems
PDF
Recommendation System Explained
PPTX
Collaborative Filtering using KNN
PPTX
Recommendation system
PPTX
Collaborative Filtering Recommendation System
PPTX
Recommender system
PPT
Recommender systems
PPTX
Learning a Personalized Homepage
PDF
Déjà Vu: The Importance of Time and Causality in Recommender Systems
PDF
Recent advances in deep recommender systems
Boston ML - Architecting Recommender Systems
How to build a recommender system?
Recommendation Systems
Recommender Systems
Recommender Systems
Recommendation system
Recommender Systems (Machine Learning Summer School 2014 @ CMU)
Introduction to Recommendation Systems
A Hybrid Recommendation system
Recommender systems: Content-based and collaborative filtering
Recommender Systems
Recommendation System Explained
Collaborative Filtering using KNN
Recommendation system
Collaborative Filtering Recommendation System
Recommender system
Recommender systems
Learning a Personalized Homepage
Déjà Vu: The Importance of Time and Causality in Recommender Systems
Recent advances in deep recommender systems
Ad

Viewers also liked (20)

PDF
Building a Recommendation Engine - An example of a product recommendation engine
PDF
Recommender Systems
PPT
Recommendation system
PDF
Introduction to R Package Recommendation System Competition
PDF
A Data Scientist in the Music Industry
PDF
Jeremy Schiff, Senior Manager, Data Science, OpenTable at MLconf NYC
PDF
Past present and future of Recommender Systems: an Industry Perspective
PDF
Collaborative filtering
PPTX
[Final]collaborative filtering and recommender systems
PDF
Recsys 2016 tutorial: Lessons learned from building real-life recommender sys...
PPTX
How to Build a Recommendation Engine on Spark
PDF
Microservice Architecture with CQRS and Event Sourcing
PPTX
Factorization Machines with libFM
PPT
Matrix factorization
PDF
Nonnegative Matrix Factorization
PDF
Neighbor methods vs matrix factorization - case studies of real-life recommen...
PDF
Collaborative Filtering with Spark
PDF
Intro to Factorization Machines
PDF
آموزش محاسبات عددی - بخش دوم
PPT
Lecture 6 lu factorization & determinants - section 2-5 2-7 3-1 and 3-2
Building a Recommendation Engine - An example of a product recommendation engine
Recommender Systems
Recommendation system
Introduction to R Package Recommendation System Competition
A Data Scientist in the Music Industry
Jeremy Schiff, Senior Manager, Data Science, OpenTable at MLconf NYC
Past present and future of Recommender Systems: an Industry Perspective
Collaborative filtering
[Final]collaborative filtering and recommender systems
Recsys 2016 tutorial: Lessons learned from building real-life recommender sys...
How to Build a Recommendation Engine on Spark
Microservice Architecture with CQRS and Event Sourcing
Factorization Machines with libFM
Matrix factorization
Nonnegative Matrix Factorization
Neighbor methods vs matrix factorization - case studies of real-life recommen...
Collaborative Filtering with Spark
Intro to Factorization Machines
آموزش محاسبات عددی - بخش دوم
Lecture 6 lu factorization & determinants - section 2-5 2-7 3-1 and 3-2
Ad

Similar to Recommender system algorithm and architecture (20)

PDF
Mechanical Librarian
PPTX
Big data - A critical appraisal
PPT
Social Recommender Systems Tutorial - WWW 2011
PDF
Using Social- and Pseudo-Social Networks to Improve Recommendation Quality
PDF
Recommender Systems and Learning Analytics in TEL
PDF
Ronny lempelyahooindiabigthinkerapril2013
PDF
People who liked this talk also liked … Building Recommendation Systems Using...
PDF
RecSysTEL lecture at advanced SIKS course, NL
PDF
Recommendation System Using Social Networking
PPTX
Recommended System.pptx
PDF
Recommandation systems -
PDF
IRJET- Hybrid Recommendation System for Movies
PDF
A Survey Of Collaborative Filtering Techniques
PDF
IntroductionRecommenderSystems_Petroni.pdf
PDF
IRJET- Searching an Optimal Algorithm for Movie Recommendation System
PPTX
SNA - Recommendation in social media.pptx
PDF
Guest lecture Recommender Systems in TEL at RWTH Aachen, Germany
PDF
MOVIE RECOMMENDATION SYSTEM USING COLLABORATIVE FILTERING
PDF
Socially Collaborative Filtering
PDF
Netflix Recommendations - Beyond the 5 Stars
Mechanical Librarian
Big data - A critical appraisal
Social Recommender Systems Tutorial - WWW 2011
Using Social- and Pseudo-Social Networks to Improve Recommendation Quality
Recommender Systems and Learning Analytics in TEL
Ronny lempelyahooindiabigthinkerapril2013
People who liked this talk also liked … Building Recommendation Systems Using...
RecSysTEL lecture at advanced SIKS course, NL
Recommendation System Using Social Networking
Recommended System.pptx
Recommandation systems -
IRJET- Hybrid Recommendation System for Movies
A Survey Of Collaborative Filtering Techniques
IntroductionRecommenderSystems_Petroni.pdf
IRJET- Searching an Optimal Algorithm for Movie Recommendation System
SNA - Recommendation in social media.pptx
Guest lecture Recommender Systems in TEL at RWTH Aachen, Germany
MOVIE RECOMMENDATION SYSTEM USING COLLABORATIVE FILTERING
Socially Collaborative Filtering
Netflix Recommendations - Beyond the 5 Stars

More from Liang Xiang (6)

PPTX
Kddcup2011
PDF
Phd. Thesis : Temporal Recommendation
PDF
动态推荐系统关键技术研究
PPTX
动态推荐系统关键技术研究
PPTX
How to do model ensemble
PPTX
Temporal recommendation on graphs via long and short-term
Kddcup2011
Phd. Thesis : Temporal Recommendation
动态推荐系统关键技术研究
动态推荐系统关键技术研究
How to do model ensemble
Temporal recommendation on graphs via long and short-term

Recently uploaded (20)

PPTX
Rise of the Digital Control Grid Zeee Media and Hope and Tivon FTWProject.com
PPTX
Report in SIP_Distance_Learning_Technology_Impact.pptx
PDF
Human Computer Interaction Miterm Lesson
PDF
ELLIE29.pdfWETWETAWTAWETAETAETERTRTERTER
PDF
Technical Debt in the AI Coding Era - By Antonio Bianco
PDF
CEH Module 2 Footprinting CEH V13, concepts
PDF
Chapter 1: computer maintenance and troubleshooting
PPTX
Blending method and technology for hydrogen.pptx
PDF
NewMind AI Journal Monthly Chronicles - August 2025
PDF
Be ready for tomorrow’s needs with a longer-lasting, higher-performing PC
PPTX
Slides World Game (s) Great Redesign Eco Economic Epochs.pptx
PDF
Altius execution marketplace concept.pdf
PDF
substrate PowerPoint Presentation basic one
PPTX
From Curiosity to ROI — Cost-Benefit Analysis of Agentic Automation [3/6]
PDF
Examining Bias in AI Generated News Content.pdf
PDF
CCUS-as-the-Missing-Link-to-Net-Zero_AksCurious.pdf
PDF
ment.tech-Siri Delay Opens AI Startup Opportunity in 2025.pdf
PDF
EGCB_Solar_Project_Presentation_and Finalcial Analysis.pdf
PDF
Slides World Game (s) Great Redesign Eco Economic Epochs.pdf
PDF
Uncertainty-aware contextual multi-armed bandits for recommendations in e-com...
Rise of the Digital Control Grid Zeee Media and Hope and Tivon FTWProject.com
Report in SIP_Distance_Learning_Technology_Impact.pptx
Human Computer Interaction Miterm Lesson
ELLIE29.pdfWETWETAWTAWETAETAETERTRTERTER
Technical Debt in the AI Coding Era - By Antonio Bianco
CEH Module 2 Footprinting CEH V13, concepts
Chapter 1: computer maintenance and troubleshooting
Blending method and technology for hydrogen.pptx
NewMind AI Journal Monthly Chronicles - August 2025
Be ready for tomorrow’s needs with a longer-lasting, higher-performing PC
Slides World Game (s) Great Redesign Eco Economic Epochs.pptx
Altius execution marketplace concept.pdf
substrate PowerPoint Presentation basic one
From Curiosity to ROI — Cost-Benefit Analysis of Agentic Automation [3/6]
Examining Bias in AI Generated News Content.pdf
CCUS-as-the-Missing-Link-to-Net-Zero_AksCurious.pdf
ment.tech-Siri Delay Opens AI Startup Opportunity in 2025.pdf
EGCB_Solar_Project_Presentation_and Finalcial Analysis.pdf
Slides World Game (s) Great Redesign Eco Economic Epochs.pdf
Uncertainty-aware contextual multi-armed bandits for recommendations in e-com...

Recommender system algorithm and architecture

  • 2. Outline Problem Data • Algorithms • Cold start • Architecture • • Recommender System
  • 3. Problem Recommend items to users to make user, content partner, websites happy!
  • 4. Data • User behaviors data Page view All user Very Large Behavior User Size Watch video All user Large Favorite Register user Middle Vote Register user Middle Add to playlist Register user Small Facebook like Register user Small Share Register user Small Review Register user Small
  • 5. Data • Which data is most important Page view All user Very Large Behavior User Size – Main behavior in the Watch video All user Large website Favorite Register user Middle Vote Register user Middle – All user can have such Add to playlist Register user Small behavior Facebook like Register user Small – Cost Share Register user Small Review Register user Small – Reflect user interests on items
  • 6. Data • Data Structure – User ID – Item ID – Behavior Type – Behavior Content – Context • Timestamp • Location • Mood Sheldon watch Star Trek with his friends at home
  • 7. Algorithms Recommender System Method Collaborative Content Social …… Filtering Filtering Filtering Latent Factor Graph-based …… Model Neighborhood …… -based User-based Item-based ……
  • 8. Neighborhood-based • User-based – Digg • Item-based – Amazon, Netflix, YouTube, Hulu, …
  • 9. User-based • Algorithm – For user u, find a set of users S(u) have similar preference as u. – Recommend popular items among users in S(u) to user u.
  • 10. User-based CF pui = ∑ v∈S ( u , K ) ∩ N ( i ) wuv rvi N (u ) ∩ N (v) w uv = N (u ) ∪ N (v)
  • 11. Item-based • Algorithm – For user u, get items set N(u) this user like before. – Recommend items which are similar to many items in N(u) to user u.
  • 12. Item-based CF pui = ∑ j∈S ( i , K ) ∩ N ( u ) w ji ruj N (i ) ∩ N ( j ) w ij = N (i ) ∪ N ( j )
  • 13. Item-based CF Why not use w ij = ? N (i ) ∩ N ( j ) N (i )
  • 14. Neighborhood-based • User-based vs. Item-based User-based Item-based Scalability Bad when user size is Bad when item size is large large Explanation Bad Good Novelty Bad Good Coverage Bad Good Cold start Bad for new users Bad for new items Performance Need to get many Only need to get users history current user’s history
  • 15. References • Amazon.com Recommendations item-to- item Collaborative Filtering. • Empirical Analysis of Predictive Algorithms for Collaborative Filtering.
  • 16. Graph-based • Users’ behaviors on items can be represented by bi-part graph. A 1 A 1 A 1 A 1 B 2 B 2 B 2 B 2 C 3 C 3 C 3 C 3 D 4 D 4 D 4 D 4
  • 17. Graph-based • Two nodes will have high relevance if – There are many paths in graph between two nodes. – Most of paths between two nodes is short. – Most paths do not go through nodes with high out-degree.
  • 18. Graph-based • Advantage – Heterogeneous data A 1 • Multiple user behaviors • Social Network B 2 • Context (Time, Location) C 3 • Disadvantage D 4 – Statistical-based – High cost for long path
  • 19. References • A Graph-based Recommender System for Digital Library. • Random-walk computation of similarities between nodes of a graph with application to collaborative recommendation.
  • 20. Latent Factor Model • Users and items are connect by latent features. A 1 a B 2 b C 3 c D 4
  • 21. Latent Factor Model rui = ∑ puk qik ˆ k Science Fiction 0.5 Science Fiction 0.9 Universe 0.9 Universe 0.9 Physical 0.8 Physical 0.5 Space Travel 0.8 Space Travel 0.7 Animation 0.3 Animation 0.1 Romance 0.0 Romance 0.0
  • 22. Latent Factor Model • How to get p, q? min ∑ (rui − ∑ puk qik ) + λ ( pu + qi ) 2 2 2 ( u ,i ) k = α (eui qik − λ puk ) puk + = α (eui puk − λ qik ) qik +
  • 23. Latent Factor Model • How to define rui – Rating prediction – Top-N recommendation • Implicit feedback data: only have positive samples and missing values, how to select negative samples?
  • 24. Latent Factor Model 1 (Sci-fi) 2 (Crime) 3 (Family) 4 (Horror) The Blair Witch The invisible Man Jaws 101 Dalmatians Project Frankenstein Back to the Meets the Wolf Lethal Weapon Pacific Heights Future Man Godzilla Total Recall Groundhog Day Stir of Echoes Star Wars VI Reservoir Dogs Tarzan Dead Calm The Terminator Donnie Brasco The Aristocats Phantasm The Jungle Book Alien The Fugitive Sleepy Hollow 2 Alien 2 La shou Shen tan Antz The Faculty
  • 25. Latent Factor Model • Advantage – High accuracy in rating prediction – Auto group items – Scalability is good – Learning-based • Disadvantage – Incremental updating – Real-time – Explanation
  • 26. References • http://www.informatik.uni- trier.de/~ley/db/indices/a- tree/k/Koren:Yehuda.html
  • 27. Cold Start • Problems – User cold start : new users – Item cold start : new items – System cold start : new systems
  • 28. User Cold Start • How to recommend items to new users? – Non-personalization recommendation • Most popular items • Highly Rated items – Using user register profile (Age, Gender, …)
  • 29. User Cold Start • Example: Gender and TV shows Data comes from IMDB : http://www.imdb.com/title/tt0412142/ratings
  • 30. User Cold Start Male Age : 20-30 Theoretical physicist Doctor American Irreligious
  • 31. How to get user interest quickly • When new user comes, his feedback on what items can help us better understand his interest? – Not very popular – Can represent a group of items – Users who like this item have different preference with users who dislike this item
  • 32. Item Cold Start • How to recommend new items to user? – Do not recommend How to recommend news??
  • 33. Item Cold Start • How to recommend new items to user? – Using content information Machine Data Mining Recommendation Learning
  • 34. System Cold Start • How to design recommender system when there is no user? – Pandora : Music Genome Project – Jinni : Movie Genome Project
  • 35. Architecture • Feature-based recommendation framework: A 1 a B 2 b C 3 c D 4 User Feature Item
  • 36. Architecture Male Scientist Physics
  • 37. Architecture • Advantage: – Heterogeneous data – Reasonable Explanation • Disadvantage: – Do not support user-based methods
  • 38. Open Questions • How to weight multiple behaviors? • How to improve diversity, novelty? • How to build feedback loop?