Pragmatic Approaches to the Semantic Web
or, Why Aren’t We in Hyperland Yet?

Michael K. Bergman

Outline
• Intro to SD and Me
• Summary of Main Thesis
• A Wee Bit of History
• What is Not Working?
• Problems with Linked Data
• What is Working?
• Some Pragmatic Lessons
• SD’s Pragmatic Approach
• Conclusion and Q & A




Structured Dynamics
• Founded 2008; predecessor Zitgist LLC; two principals
• Privately held, revenue funded
• Boutique semantic technology shop
• Services and consulting:
    - Semantic enterprise adoption
    - Ontology development and mapping
    - Tech transfer and training
• Development and software:
    - Open source OSF stack
    - Data conversion and migration
    - Client-specific development


Current Products and OSF Stack
• structWSF: the pivotal product; Web services middleware that provides distributed data access and federation
• conStruct: Drupal-based structured data linkage to structWSF
• irON: spreadsheet, JSON and XML authoring and conversion framework
• UMBEL: reference set of linking subjects and basis for domain vocabularies
• scones: an ontology- and entity-driven information extraction and tagging system


SD Locations




Michael Bergman




Summary of Main Thesis
Main Arguments
• Not against linked data
    - Proponent and explicator since 2006
• But linked data is burdensome and not pivotal to interoperability
• Interoperability requires:
    - Structured data (from any source)
    - Canonical data model (RDF)
    - (Relatively simple) ontologies for world views, schema
    - Curation




A Wee Bit of History
Key Historical Milestones
• 1945: Memex (Vannevar Bush)
• 1963: Hypertext (Ted Nelson)
• 1990: Hyperland (Douglas Adams)
• 2001: Semantic Web (Berners-Lee, Hendler and Lassila)
    - Lack of uptake
• 2006: Linked Data (Berners-Lee)
• 2010: Revisionist Linked Data




Hyperland




Linked Data
“Linked Data is a set of best practices for publishing and deploying instance and class data using the RDF data model, naming the data objects using uniform resource identifiers (URIs), thereby exposing the data for access via the HTTP protocol, while emphasizing data interconnections, interrelationships and context useful to both humans and machine agents.”
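
As a rough illustration of the definition above, the sketch below models instance and class data as RDF-style triples whose parts are named with URIs, plus one link out to another dataset. The `Triple` type, the `http://example.org/id/` namespace and the helper names are hypothetical; the RDF, RDFS and FOAF URIs are the standard ones, and nothing here is actually fetched over HTTP.

```python
from typing import NamedTuple

RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"
RDFS_LABEL = "http://www.w3.org/2000/01/rdf-schema#label"
RDFS_SEE_ALSO = "http://www.w3.org/2000/01/rdf-schema#seeAlso"
FOAF_PERSON = "http://xmlns.com/foaf/0.1/Person"
LOCAL = "http://example.org/id/"  # hypothetical publisher namespace


class Triple(NamedTuple):
    subject: str
    predicate: str
    obj: str
    literal: bool = False  # True when obj is a plain string rather than a URI


TRIPLES = [
    # Class data: the instance is typed with a class from a shared vocabulary.
    Triple(LOCAL + "douglas-adams", RDF_TYPE, FOAF_PERSON),
    # Instance data: a human-readable label.
    Triple(LOCAL + "douglas-adams", RDFS_LABEL, "Douglas Adams", literal=True),
    # Interconnection: a pointer to a description held in another dataset.
    Triple(LOCAL + "douglas-adams", RDFS_SEE_ALSO,
           "http://dbpedia.org/resource/Douglas_Adams"),
]


def to_ntriples(triples):
    """Serialize triples as N-Triples-style lines (simplified escaping)."""
    lines = []
    for t in triples:
        if t.literal:
            escaped = t.obj.replace("\\", "\\\\").replace('"', '\\"')
            obj = f'"{escaped}"'
        else:
            obj = f"<{t.obj}>"
        lines.append(f"<{t.subject}> <{t.predicate}> {obj} .")
    return "\n".join(lines)


def external_links(triples, local_prefix):
    """Return instance-level links that point outside the local namespace."""
    return [t for t in triples
            if not t.literal
            and t.predicate != RDF_TYPE
            and not t.obj.startswith(local_prefix)]


if __name__ == "__main__":
    print(to_ntriples(TRIPLES))
```

The `external_links` helper is where the "interconnections" part of the definition shows up: only one of the three statements ties the local record to someone else's data.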




What is Not Working?
Some Disappointments to Date
• Full semantic Web vision remains unrealized
• No wide-scale adoption of the semantic Web or linked data
• Lack of intelligent agents
• Many aspects of the practice of linked data




Problems with Linked Data
Problems with Linked Data
• Burdensome on publishers
• Naïve linkages:
    - Overuse of owl:sameAs
    - Lack of accurate alignments
• (Often) poor data quality
• Wrong focus: semantic emphasis placed on publication rather than interoperability
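
The cost of overusing sameAs can be sketched with a small union-find. owl:sameAs is an equivalence relation, so chained assertions collapse into a single identity cluster, whereas an approximate predicate such as skos:closeMatch is not declared transitive and can be followed one hop at a time. The identifiers below (`ex:`, `a:`, `b:`, `c:`) are hypothetical, and the hand-rolled clustering only stands in for what an OWL reasoner would infer.

```python
from collections import defaultdict


def same_as_clusters(links):
    """Merge identifiers as an equivalence relation (symmetric, transitive)."""
    parent = {}

    def find(x):
        parent.setdefault(x, x)
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving
            x = parent[x]
        return x

    for a, b in links:
        root_a, root_b = find(a), find(b)
        if root_a != root_b:
            parent[root_a] = root_b

    clusters = defaultdict(set)
    for node in list(parent):
        clusters[find(node)].add(node)
    return sorted(sorted(members) for members in clusters.values())


def close_match_neighbours(links, node):
    """Follow approximate links one hop only: symmetric, not transitive."""
    neighbours = set()
    for a, b in links:
        if a == node:
            neighbours.add(b)
        elif b == node:
            neighbours.add(a)
    return sorted(neighbours)


# Hypothetical chain of naive links: a city, a same-named record elsewhere,
# an administrative region, and the wider region around it.
LINKS = [
    ("ex:Paris_city", "a:Paris"),
    ("a:Paris", "b:Paris_region"),
    ("b:Paris_region", "c:Ile-de-France"),
]

if __name__ == "__main__":
    print(same_as_clusters(LINKS))
    print(close_match_neighbours(LINKS, "ex:Paris_city"))
```

Read as sameAs, the three links make the city identical to the surrounding region. Read as approximate matches, the city is only related to its direct neighbour, and each further hop remains a separate judgment.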




Some Conditions for Interoperability

<Interoperability> <needsMapping> <Predicates>
<Interoperability> <needsReference> <Nouns>




Many Mappings Should be Approximate
   skos:broadMatch
   skos:related
   ore:similarTo
   umbel:isAbout
   vmf:isInVocabulary
   skos:closeMatch
   lvont:nearlySameAs
   umbel:isLike
   umbel:hasCharacteristic
   lvont:somewhatSameAs
   rdfs:seeAlso
   ore:describes
   map:narrowerThan
   skos:narrower
   map:broaderThan
   skos:broader
   dc:subject
   link:uri
   foaf:isPrimaryTopicOf

What is Working?
Successes
• Siri
• Bing (Powerset)
• Google + schema.org
• (Some) linked data




Siri




Bing (Powerset)




Google
• Statistical NLP
• Structured results
• Initial schema (Metaweb)
• schema.org (with Yahoo, Bing and Yandex)




Some Linked Data
• Selected knowledge bases:
    - DBpedia
    - GeoNames
    - Freebase (Google)
• Biomedical community
• LOD-LAM (linked open data in libraries, archives and museums) community




Some Pragmatic Lessons
Some Lessons Learned
• Structure is good in any form
• Keep semantic technology in the background
• Open Web (FYN, "follow your nose") likely to be disappointing
• Ontologies essential for alignments
• NLP an essential contributor to structure
• Metadata an essential contributor to characterization and use
• Linked data is a burden to publishers and places the semantic emphasis on the wrong part of the chain




Seven Pillars




Preserving Existing Assets
• Relational databases (RDBMSs)
• Distributed structured assets
    - spreadsheets
    - lightweight datastores
• Web pages and Web sites
• Existing documents and text
• Web databases and APIs
• Other databases (RDF, OO, etc.)




irON Dataset Exchange Framework
• Simple authoring and dataset creation
• irON includes an abstract notation and vocabulary for instance records
• Notations for:
    - Instance records
    - Schema
    - Datasets and metadata
    - Linkages to other schema
• Serializations available for:
    - XML (irXML)
    - JSON (irJSON)
    - CSV/spreadsheets (commON)




Three irON Serializations
[Figure panels: irXML, irJSON, commON]

Spreadsheet Correspondence to Triples
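
The correspondence can be sketched in a few lines: each spreadsheet row is a subject, each column header a predicate, and each non-empty cell an object. This is a generic illustration and not the commON notation itself; the column names, the `id` key column and the sample rows are assumptions made for the example.

```python
import csv
import io

# Hypothetical sheet: one record per row, one attribute per column.
SHEET = """id,name,type,homepage
rec1,Example Record,Document,http://example.org/rec1
rec2,Second Record,Document,
"""


def rows_to_triples(text, id_column="id"):
    """Turn CSV rows into (subject, predicate, object) triples."""
    triples = []
    for row in csv.DictReader(io.StringIO(text)):
        subject = row[id_column]
        for column, cell in row.items():
            # The key column names the subject; empty cells assert nothing.
            if column == id_column or not cell:
                continue
            triples.append((subject, column, cell))
    return triples


if __name__ == "__main__":
    for triple in rows_to_triples(SHEET):
        print(triple)
```

Skipping empty cells is a deliberate choice in this sketch: a blank in a spreadsheet is treated as "no statement", not as a statement with an empty value.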




More-or-less Interchangeable Formats
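
To illustrate why the formats are more or less interchangeable, the sketch below parses one hypothetical record from JSON and from XML and reduces both to the same set of (subject, attribute, value) triples. The record layout and element names are made up for the example and are much simpler than the actual irJSON and irXML notations.

```python
import json
import xml.etree.ElementTree as ET

# The same hypothetical record, written two ways.
JSON_TEXT = '{"id": "rec1", "name": "Example Record", "type": "Document"}'
XML_TEXT = (
    '<record id="rec1">'
    "<name>Example Record</name>"
    "<type>Document</type>"
    "</record>"
)


def triples_from_json(text):
    """Flat JSON object: the "id" key is the subject, other keys are attributes."""
    record = json.loads(text)
    subject = record["id"]
    return {(subject, key, str(value))
            for key, value in record.items() if key != "id"}


def triples_from_xml(text):
    """Flat XML element: the id attribute is the subject, children are attributes."""
    root = ET.fromstring(text)
    subject = root.attrib["id"]
    return {(subject, child.tag, child.text or "") for child in root}


if __name__ == "__main__":
    print(triples_from_json(JSON_TEXT) == triples_from_xml(XML_TEXT))
```

Once both inputs are reduced to the same triples, the choice of authoring format matters to the person writing the data, not to whatever consumes it.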




SD’s Pragmatic Approach
A Layered Approach




OSF Stack




Conclusion
Summary
• If you can, do linked data; it is a GOOD THING
• In any event, expose your data:
    - Structured (use NLP for unstructured)
    - Metadata
    - Definitions
    - Relations (simple)
    - “Semsets” (synonyms, acronyms, spelling variants)
• Build vocabulary and ontology consortia
• Build trust and curation communities
• Semantics essential at the interoperability level, not necessarily at publication or data transfer
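
A semset can be pictured as a lookup from surface forms to one preferred concept. The following sketch is only an illustration: the concept identifiers, the labels and the naive whole-word matcher are assumptions, and a production tagger would need proper tokenization and disambiguation.

```python
import re

# Hypothetical semsets: each concept lists its synonyms, acronyms and variants.
SEMSETS = {
    "ex:SemanticWeb": ["Semantic Web", "semantic-web", "SemWeb"],
    "ex:RDF": ["RDF", "Resource Description Framework"],
}


def build_index(semsets):
    """Map every lower-cased label to the concept it stands for."""
    index = {}
    for concept, labels in semsets.items():
        for label in labels:
            index[label.lower()] = concept
    return index


def tag(text, index):
    """Return the concepts whose labels occur as whole words in the text."""
    lowered = text.lower()
    found = set()
    for label, concept in index.items():
        if re.search(r"\b" + re.escape(label) + r"\b", lowered):
            found.add(concept)
    return sorted(found)


if __name__ == "__main__":
    index = build_index(SEMSETS)
    print(tag("The Resource Description Framework underpins the SemWeb.", index))
```

Because matching runs against the whole semset, a document that never uses a concept's preferred label can still be tagged with it.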



Takeaways
• James Hendler: “A little bit of semantics goes a long way”
• Leverage linked data, but broaden focus
• Consider adopting the semantic enterprise as the broader focus




Further Information
More Info and Links
• Open Semantic Framework (OSF) stack:
    - http://openstructs.org
• TechWiki (400 detailed OSF how-to articles):
    - http://techwiki.openstructs.org
• Key ontologies:
    - UMBEL: http://umbel.org
    - BIBO: http://bibliontology.org
• Blogs:
    - Mike Bergman: http://mkbergman.com
    - Fred Giasson: http://fgiasson.com/blog
• Structured Dynamics:
    - http://structureddynamics.com
    - http://citizen-dan.org (community indicator systems)


