Collective Cognition with Semantic
Mediawiki: Lessons and Experiences

               Jie Bao, Li Ding and James Hendler

                    Tetherless World Constellation,
                   Department of Computer Science
                   Rensselaer Polytechnic Institute,
                             Troy, NY, USA
                   {baojie,dingl,hendler}@cs.rpi.edu

    Network-Enabled Cognition workshop, ACITA, 2009 Sep 22, 2009, UMUC, Maryland
Goal

• To identify a few common pitfalls and
  limitations of Semantic Mediawiki in
  – knowledge modeling,
  – Knowledge organization and context, and
  – Collaboration protocols
• To examine some potential approaches to
  solve these problems.


                                              2
Wiki and Recognition

• Wiki is among the most prominent of forms on
  the Web that harness the distributed, collective
  efforts of users to create content online.
   – Ideas are formed, indentified or evolved and
   – Facts are discovered, refined or removed (for
     “wrong” ones)
   In the course of a never-ending editing process.




                                                      3
Collective Cognition

• A hypothetical example: write an outline
  for a new South Park episode.




                            Picture courtesy of Wikipedia



                                                            4
Collective Cognition
•   9:00pm: show starts
•   9:01pm: Wikipedia page for the episode is created
•   9:01-9:08pm: 10 active users are adding summary when they watch
•   9:08-9:10pm: ad time, some minor reorganization and typo fixing.
•   9:10-9:30pm: repeat the above
•   9:30-10:30pm: a user A is doing major refining, e.g. adding culture
    references
•   10:31pm: a user B disagrees with A, undo A’s edit
•   10:32pm: A undos B’s edit
•   10:33pm: B leaves a message on A’s user page, to avoid further edit wars
•   10:43pm: the two agrees to compromise with wording such that “It is implied
    that Chef is dead at the end of the episode; however, some others believe
    that it is not the case”.
•   The next day: user C adds a citation to this South Park episode’s page on a
    film’s page with that “South Park episode X is a parody of the this film”.

                                                                                  5
Key elements

• Simplicity: least training required to
  contribute.

• AAA: anybody can say anything anywhere

• NPOV: neutral point of view (among other
  collaboration protocols of Wikipedia)



                                             6
Semantic Wiki

• Extensions to Wikis with some Semantic
  Web support
  – Example: Semantic Mediawiki

   Eric Cartman
       [[friend of::Butters]]                  Butters
       [[Category:Boy]]

                                   friend of


                                                 Cartman
  Eric_Cartman friend_of Butters
  Eric_Cartman rdf:type Boy
        (RDF triple)
                                                           7
Semantic Wiki

• Fast-growing adoption
  –   Healthcare
  –   E-Government
  –   Entertainment
  –   Consulting
  –   Database
  –   …
• Inside ITA
  – OWL modeling with controlled natural language
  – Rule modeling

                                                    8
Semantic Wiki

• Can Semantic wiki reproduces the success of
  wiki to be among the most prominent of forms on
  the Web that harness the distributed, collective
  efforts of users to create content knowledge
  online?
• We have seen encouraging success in quite a
  few projects
• However, some issues are identified in our real-
  world experiences.


                                                     9
Knowledge Modeling

• Myth: users can do RDF-style (triple-
  based) modeling on SMW

• Fact: few is able to do this (at least without
  substantial training)




                                                   10
“Big Fat Page” effects
We gave a 3 hour training on SMW with a group of
 undergraduate students (most with no knowledge of
 RDF), and let them do a collective annotation task on TV
 shows. However, the result is not fully satisfactory
• Difference between categories and properties is not that easy to
  understand (see a lot misuse like Category:hug)
• To describe a thing with triples requires “thinking in RDF”, which
  needs some experiences.
• It is a big headache to choose the right vocabulary and it is hard to
  know what vocabulary to reuse.

As a result, many of the testees simply use the wiki as a notepad,
   without adding much semantic annotations, resulting in a long single
   “usual” wiki page.
                                                                          11
Schema or not schema?
• Two common knowledge models on a semantic wiki,
   – “Schema”-based modeling, often represented in the form of pre-
     defined wiki templates, that are used by “common” users of the
     wiki to access data via forms or prebuilt queries.
      • c.f. “infobox” in Wikipedia
      • =>stable, shared knowledge
   – Arbitrary RDF-style semantic markup - heavily used by a
     selected few elite group
      • => less structured, less shared knowledge


• A carefully pre-populated wiki “schema” (template), is as
  important as a schema in a database project.


                                                                      12
Template Example




Template as Schema    Form for the template


                                              13
Organization and Context

• Myth: semantic wiki, like wiki, allows you
  to write things freely.

• Fact: SMW does not support AAA
  – Every “triple” has to be on its subject’s page.
     • E.g., “South Park episode X is a parody of the this
       film” can only be said on X’s page.
  – Each subject and property of a triple must be
    a local page name.

                                                             14
Organization and Context

Why it may be problematic?

• May require the creation of many trivial, small pages.
• Is troublesome to describe things (e.g., an external URL)
  that have no corresponding wiki pages.
• Discourages users due to the difficulty of determining
  where to write knowledge (i.e., the best “subject” pages).
• Many users are confused of query-based pages: they do
  not know how to track the source of the queried results
  when they want to change a query-based page.


                                                               15
Organization and Context

Potential Solution

• Extending the SMW syntax
  – [[Cartman::friend of::Butters]]


• Introducing a context model to SMW
  – Context: Where, Who, When
  – No more need to use the subject to locate a triple



                                                         16
Collaboration Protocol

• Myth: semantic wiki, as wiki always does,
  allows compromises between different
  points of view.

• Fact: Semantic wiki only allows one
  version of the (semantic) “truth”.
  – A triple can not be both true and not true



                                                 17
Ontology War

                                   No! Cartman is
                                   only a Fictional
                                     Character


Cartman is a
    Boy




               http://www.gambling911.com/files/publisher/cat-fight-032609L.jpg



           Collaboration Protocol Support Needed!

                                                                                  18
Collaboration Protocol

• Avoid edit wars in Wikipedia
  – NPOV: allows multiple points
    of view co-exist on one page
    verifiable sources.
  – natural language text can
    accommodate and explain
    multiple points of view on a
    single page



                                   19
Collaboration Protocol
Two possible approaches

• To have categories and typed links optionally
  contextualized by authors, similar to the tag
  contextualizing mechanism in delicious and flickr.
   – http://example.com/author/term (contextualized name)
   – http://example.com/term (non-contextualized name)


• To introduce a context model of SMW knowledge
  statements, so that different versions of truth may be
  formally represented with explicitly given sources.

                                                            20
Conclusions
• Modeling in SMW can be regarded as an evolving
  cognition process and schema-based modeling is useful.

• We showed that a context/provenance model is needed
  for SMW to support better knowledge organization

• Collaboration protocols: to accommodate two versions of
  a fact, provenance of a term and/or triple should be
  traceable.




                                                            21
Solution Summary

• Simplicity: improve user interaction using
  forms and templates (schema).

• AAA: Context model
  – On-going work: “Semantic History”


• NPOV: Enabled by the context model


                                               22

More Related Content

PDF
"What is left to do?", Dublin Core 2012 Keynote
PDF
Blogs And Wikis In Academia
PPTX
Walz "Impacts of OER Flexibility: Understanding, Navigating, and Leveraging t...
PDF
We Need to Talk About Paid Editing: Sorting Out Wikipedia's Most Enduring Arg...
PPTX
Open Knowledge: Wikipedia and Beyond
PPTX
Developing an Ethical Approach to Using Wikipedia as the Front Matter to all ...
PPTX
The Wiki and the Wikipedia: A Comparative Study
PPTX
Editing Wikipedia: Why You Should and How You Can Support Your Users
"What is left to do?", Dublin Core 2012 Keynote
Blogs And Wikis In Academia
Walz "Impacts of OER Flexibility: Understanding, Navigating, and Leveraging t...
We Need to Talk About Paid Editing: Sorting Out Wikipedia's Most Enduring Arg...
Open Knowledge: Wikipedia and Beyond
Developing an Ethical Approach to Using Wikipedia as the Front Matter to all ...
The Wiki and the Wikipedia: A Comparative Study
Editing Wikipedia: Why You Should and How You Can Support Your Users

What's hot (20)

PPTX
ARLIS 2010 RLG Partnership Round Table
PPTX
Ashok "Creating Resilient OER in Times of Crisis"
PPTX
Brian kelly: Open practices for the connected researcher
PDF
Bridging the Web and Digital Publishing: EPUBWEB
PPTX
From Public Pedagogy to Critical Digital Praxis: Learning/Writing as Reflecti...
PDF
Undue Diligence: Seeking Low-risk Strategies for Making Collections of Unpubl...
PDF
Wollongong 090408232854-phpapp01
PDF
Metadata is a Love Note to the Future
PDF
From Frenemies to Friends: Embracing Wikipedia
PDF
[[edit]] this GLAM
PDF
Labour of Love: Why do people contribute, or not contribute, to Wikipedia art...
PPTX
Forms of Innovation: Collaboration, Attribution, Access
PPT
A Survey of the Landscape and State-of-Art in Semantic Wiki
PDF
Khmer Wikipedia's WikiMeetupPP2 - PPT Deck 201100430
PPTX
Cr4 moderna corrigido
PPTX
Open Practices for the Connected Researcher
PPT
Fys presentation 12_aug_2010
PPT
Beyond the Silos of the LAMs - Library, Archive, Museum Collaboration
PPTX
Introduction to social media for researchers
PPTX
Using social media to build your academic career
ARLIS 2010 RLG Partnership Round Table
Ashok "Creating Resilient OER in Times of Crisis"
Brian kelly: Open practices for the connected researcher
Bridging the Web and Digital Publishing: EPUBWEB
From Public Pedagogy to Critical Digital Praxis: Learning/Writing as Reflecti...
Undue Diligence: Seeking Low-risk Strategies for Making Collections of Unpubl...
Wollongong 090408232854-phpapp01
Metadata is a Love Note to the Future
From Frenemies to Friends: Embracing Wikipedia
[[edit]] this GLAM
Labour of Love: Why do people contribute, or not contribute, to Wikipedia art...
Forms of Innovation: Collaboration, Attribution, Access
A Survey of the Landscape and State-of-Art in Semantic Wiki
Khmer Wikipedia's WikiMeetupPP2 - PPT Deck 201100430
Cr4 moderna corrigido
Open Practices for the Connected Researcher
Fys presentation 12_aug_2010
Beyond the Silos of the LAMs - Library, Archive, Museum Collaboration
Introduction to social media for researchers
Using social media to build your academic career

Viewers also liked (7)

PDF
Walk the Talk: Analyzing the relation between implicit and explicit feedback ...
PPTX
The Triplex Approach for Recognizing Semantic Relations from Noun Phrases, Ap...
PPT
PDF
Semantic Relations
PPT
446 cognitivecategories-2017
PPTX
Lexical Relations in Semantic
PPTX
Database : Relational Data Model
Walk the Talk: Analyzing the relation between implicit and explicit feedback ...
The Triplex Approach for Recognizing Semantic Relations from Noun Phrases, Ap...
Semantic Relations
446 cognitivecategories-2017
Lexical Relations in Semantic
Database : Relational Data Model

Similar to Collective Cognition with Semantic Mediawiki: Lessons and Experiences (20)

PDF
The Unbearable Lightness of Wiking
PPTX
Semantic Wiki: Social Semantic Web In Action:
PPTX
Jist tutorial semantic wikis and applications
PDF
A Semantic Wiki To Support Knowledge Sharing In Innovation Activities
PPTX
Pre-SMWCon Spring 2012 meetup (short)
PDF
Applying And Extending Semantic Wikis For Semantic Web Courses
PPTX
Aswc2009 Smw Tutorial Part 1 Intro And Examples
PPTX
Semantic Wiki: Social Semantic Web in Use
PPTX
Msra talk smw+apps
PDF
A Controlled Natural Language Interface for Semantic MediaWiki
PDF
Development of a Controlled Natural Language Interface for Semantic MediaWiki
PDF
Lecture 3: Social Web Data Formats (2012)
PPT
Web 2
PDF
Lecture 3: Data Formats on the Social Web (2013)
PPTX
Semantic Wikis - Social Semantic Web in Action
PPT
A Short Tutorial to Semantic Media Wiki (SMW)
PDF
Bratsas Web Science Semantic Wiki
PPTX
Wiki in Teaching and Learning
PPTX
Tutorial semantic wikis and applications
PPTX
SMWCon Spring 2012 SMW+ Team Dev Update
The Unbearable Lightness of Wiking
Semantic Wiki: Social Semantic Web In Action:
Jist tutorial semantic wikis and applications
A Semantic Wiki To Support Knowledge Sharing In Innovation Activities
Pre-SMWCon Spring 2012 meetup (short)
Applying And Extending Semantic Wikis For Semantic Web Courses
Aswc2009 Smw Tutorial Part 1 Intro And Examples
Semantic Wiki: Social Semantic Web in Use
Msra talk smw+apps
A Controlled Natural Language Interface for Semantic MediaWiki
Development of a Controlled Natural Language Interface for Semantic MediaWiki
Lecture 3: Social Web Data Formats (2012)
Web 2
Lecture 3: Data Formats on the Social Web (2013)
Semantic Wikis - Social Semantic Web in Action
A Short Tutorial to Semantic Media Wiki (SMW)
Bratsas Web Science Semantic Wiki
Wiki in Teaching and Learning
Tutorial semantic wikis and applications
SMWCon Spring 2012 SMW+ Team Dev Update

More from Jie Bao (20)

PDF
python-graph-lovestory
PDF
unix toolbox 中文版
PDF
unixtoolbox.book
PDF
Lean startup 精益创业 新创企业的成长思维
PPT
Towards social webtops using semantic wiki
PPT
Semantic information theory in 20 minutes
PPT
Towards a theory of semantic communication
PPTX
Expressive Query Answering For Semantic Wikis (20min)
PDF
Startup best practices
PDF
Owl 2 quick reference card a4 size
PDF
ISWC 2010 Metadata Work Summary
PPTX
Expressive Query Answering For Semantic Wikis
PDF
PDF
24 Ways to Explore ISWC 2010 Data
PPT
Semantic Web: In Quest for the Next Generation Killer Apps
PDF
Representing financial reports on the semantic web a faithful translation f...
PDF
XACML 3.0 (Partial) Concept Map
PDF
Digital image self-adaptive acquisition in medical x-ray imaging
PPT
Privacy-Preserving Reasoning on the Semantic Web (Poster)
PPT
Privacy-Preserving Reasoning on the Semantic Web
python-graph-lovestory
unix toolbox 中文版
unixtoolbox.book
Lean startup 精益创业 新创企业的成长思维
Towards social webtops using semantic wiki
Semantic information theory in 20 minutes
Towards a theory of semantic communication
Expressive Query Answering For Semantic Wikis (20min)
Startup best practices
Owl 2 quick reference card a4 size
ISWC 2010 Metadata Work Summary
Expressive Query Answering For Semantic Wikis
24 Ways to Explore ISWC 2010 Data
Semantic Web: In Quest for the Next Generation Killer Apps
Representing financial reports on the semantic web a faithful translation f...
XACML 3.0 (Partial) Concept Map
Digital image self-adaptive acquisition in medical x-ray imaging
Privacy-Preserving Reasoning on the Semantic Web (Poster)
Privacy-Preserving Reasoning on the Semantic Web

Recently uploaded (20)

PDF
Build Real-Time ML Apps with Python, Feast & NoSQL
PDF
NewMind AI Journal Monthly Chronicles - August 2025
PDF
Identification of potential depression in social media posts
PDF
Transform-Your-Supply-Chain-with-AI-Driven-Quality-Engineering.pdf
PPTX
AI-driven Assurance Across Your End-to-end Network With ThousandEyes
PDF
AI.gov: A Trojan Horse in the Age of Artificial Intelligence
PDF
Decision Optimization - From Theory to Practice
PDF
The-Future-of-Automotive-Quality-is-Here-AI-Driven-Engineering.pdf
PDF
Examining Bias in AI Generated News Content.pdf
PDF
Transform-Your-Streaming-Platform-with-AI-Driven-Quality-Engineering.pdf
PDF
Advancing precision in air quality forecasting through machine learning integ...
PDF
The AI Revolution in Customer Service - 2025
PDF
Co-training pseudo-labeling for text classification with support vector machi...
PPTX
Build automations faster and more reliably with UiPath ScreenPlay
PDF
Early detection and classification of bone marrow changes in lumbar vertebrae...
PDF
Human Computer Interaction Miterm Lesson
PPTX
Blending method and technology for hydrogen.pptx
PDF
CCUS-as-the-Missing-Link-to-Net-Zero_AksCurious.pdf
PDF
CXOs-Are-you-still-doing-manual-DevOps-in-the-age-of-AI.pdf
PDF
Launch a Bumble-Style App with AI Features in 2025.pdf
Build Real-Time ML Apps with Python, Feast & NoSQL
NewMind AI Journal Monthly Chronicles - August 2025
Identification of potential depression in social media posts
Transform-Your-Supply-Chain-with-AI-Driven-Quality-Engineering.pdf
AI-driven Assurance Across Your End-to-end Network With ThousandEyes
AI.gov: A Trojan Horse in the Age of Artificial Intelligence
Decision Optimization - From Theory to Practice
The-Future-of-Automotive-Quality-is-Here-AI-Driven-Engineering.pdf
Examining Bias in AI Generated News Content.pdf
Transform-Your-Streaming-Platform-with-AI-Driven-Quality-Engineering.pdf
Advancing precision in air quality forecasting through machine learning integ...
The AI Revolution in Customer Service - 2025
Co-training pseudo-labeling for text classification with support vector machi...
Build automations faster and more reliably with UiPath ScreenPlay
Early detection and classification of bone marrow changes in lumbar vertebrae...
Human Computer Interaction Miterm Lesson
Blending method and technology for hydrogen.pptx
CCUS-as-the-Missing-Link-to-Net-Zero_AksCurious.pdf
CXOs-Are-you-still-doing-manual-DevOps-in-the-age-of-AI.pdf
Launch a Bumble-Style App with AI Features in 2025.pdf

Collective Cognition with Semantic Mediawiki: Lessons and Experiences

  • 1. Collective Cognition with Semantic Mediawiki: Lessons and Experiences Jie Bao, Li Ding and James Hendler Tetherless World Constellation, Department of Computer Science Rensselaer Polytechnic Institute, Troy, NY, USA {baojie,dingl,hendler}@cs.rpi.edu Network-Enabled Cognition workshop, ACITA, 2009 Sep 22, 2009, UMUC, Maryland
  • 2. Goal • To identify a few common pitfalls and limitations of Semantic Mediawiki in – knowledge modeling, – Knowledge organization and context, and – Collaboration protocols • To examine some potential approaches to solve these problems. 2
  • 3. Wiki and Recognition • Wiki is among the most prominent of forms on the Web that harness the distributed, collective efforts of users to create content online. – Ideas are formed, indentified or evolved and – Facts are discovered, refined or removed (for “wrong” ones) In the course of a never-ending editing process. 3
  • 4. Collective Cognition • A hypothetical example: write an outline for a new South Park episode. Picture courtesy of Wikipedia 4
  • 5. Collective Cognition • 9:00pm: show starts • 9:01pm: Wikipedia page for the episode is created • 9:01-9:08pm: 10 active users are adding summary when they watch • 9:08-9:10pm: ad time, some minor reorganization and typo fixing. • 9:10-9:30pm: repeat the above • 9:30-10:30pm: a user A is doing major refining, e.g. adding culture references • 10:31pm: a user B disagrees with A, undo A’s edit • 10:32pm: A undos B’s edit • 10:33pm: B leaves a message on A’s user page, to avoid further edit wars • 10:43pm: the two agrees to compromise with wording such that “It is implied that Chef is dead at the end of the episode; however, some others believe that it is not the case”. • The next day: user C adds a citation to this South Park episode’s page on a film’s page with that “South Park episode X is a parody of the this film”. 5
  • 6. Key elements • Simplicity: least training required to contribute. • AAA: anybody can say anything anywhere • NPOV: neutral point of view (among other collaboration protocols of Wikipedia) 6
  • 7. Semantic Wiki • Extensions to Wikis with some Semantic Web support – Example: Semantic Mediawiki Eric Cartman [[friend of::Butters]] Butters [[Category:Boy]] friend of Cartman Eric_Cartman friend_of Butters Eric_Cartman rdf:type Boy (RDF triple) 7
  • 8. Semantic Wiki • Fast-growing adoption – Healthcare – E-Government – Entertainment – Consulting – Database – … • Inside ITA – OWL modeling with controlled natural language – Rule modeling 8
  • 9. Semantic Wiki • Can Semantic wiki reproduces the success of wiki to be among the most prominent of forms on the Web that harness the distributed, collective efforts of users to create content knowledge online? • We have seen encouraging success in quite a few projects • However, some issues are identified in our real- world experiences. 9
  • 10. Knowledge Modeling • Myth: users can do RDF-style (triple- based) modeling on SMW • Fact: few is able to do this (at least without substantial training) 10
  • 11. “Big Fat Page” effects We gave a 3 hour training on SMW with a group of undergraduate students (most with no knowledge of RDF), and let them do a collective annotation task on TV shows. However, the result is not fully satisfactory • Difference between categories and properties is not that easy to understand (see a lot misuse like Category:hug) • To describe a thing with triples requires “thinking in RDF”, which needs some experiences. • It is a big headache to choose the right vocabulary and it is hard to know what vocabulary to reuse. As a result, many of the testees simply use the wiki as a notepad, without adding much semantic annotations, resulting in a long single “usual” wiki page. 11
  • 12. Schema or not schema? • Two common knowledge models on a semantic wiki, – “Schema”-based modeling, often represented in the form of pre- defined wiki templates, that are used by “common” users of the wiki to access data via forms or prebuilt queries. • c.f. “infobox” in Wikipedia • =>stable, shared knowledge – Arbitrary RDF-style semantic markup - heavily used by a selected few elite group • => less structured, less shared knowledge • A carefully pre-populated wiki “schema” (template), is as important as a schema in a database project. 12
  • 13. Template Example Template as Schema Form for the template 13
  • 14. Organization and Context • Myth: semantic wiki, like wiki, allows you to write things freely. • Fact: SMW does not support AAA – Every “triple” has to be on its subject’s page. • E.g., “South Park episode X is a parody of the this film” can only be said on X’s page. – Each subject and property of a triple must be a local page name. 14
  • 15. Organization and Context Why it may be problematic? • May require the creation of many trivial, small pages. • Is troublesome to describe things (e.g., an external URL) that have no corresponding wiki pages. • Discourages users due to the difficulty of determining where to write knowledge (i.e., the best “subject” pages). • Many users are confused of query-based pages: they do not know how to track the source of the queried results when they want to change a query-based page. 15
  • 16. Organization and Context Potential Solution • Extending the SMW syntax – [[Cartman::friend of::Butters]] • Introducing a context model to SMW – Context: Where, Who, When – No more need to use the subject to locate a triple 16
  • 17. Collaboration Protocol • Myth: semantic wiki, as wiki always does, allows compromises between different points of view. • Fact: Semantic wiki only allows one version of the (semantic) “truth”. – A triple can not be both true and not true 17
  • 18. Ontology War No! Cartman is only a Fictional Character Cartman is a Boy http://www.gambling911.com/files/publisher/cat-fight-032609L.jpg Collaboration Protocol Support Needed! 18
  • 19. Collaboration Protocol • Avoid edit wars in Wikipedia – NPOV: allows multiple points of view co-exist on one page verifiable sources. – natural language text can accommodate and explain multiple points of view on a single page 19
  • 20. Collaboration Protocol Two possible approaches • To have categories and typed links optionally contextualized by authors, similar to the tag contextualizing mechanism in delicious and flickr. – http://example.com/author/term (contextualized name) – http://example.com/term (non-contextualized name) • To introduce a context model of SMW knowledge statements, so that different versions of truth may be formally represented with explicitly given sources. 20
  • 21. Conclusions • Modeling in SMW can be regarded as an evolving cognition process and schema-based modeling is useful. • We showed that a context/provenance model is needed for SMW to support better knowledge organization • Collaboration protocols: to accommodate two versions of a fact, provenance of a term and/or triple should be traceable. 21
  • 22. Solution Summary • Simplicity: improve user interaction using forms and templates (schema). • AAA: Context model – On-going work: “Semantic History” • NPOV: Enabled by the context model 22