@shawnmjones @WebSciDL @StormyArchives
SHARI
(StoryGraph Hypercane ArchiveNow Raintale Integration)
Employing the Dark and Stormy Archives Project
For Current Events Storytelling
Shawn M. Jones
Web Science and Digital Libraries Research Group
Old Dominion University
VMASC
May 13, 2020
Thanks to:
@shawnmjones @WebSciDL @StormyArchives@shawnmjones @WebSciDL
The Dark and Stormy Archives (DSA) Project
2
https://oduwsdl.github.io/dsa/
Shawn M. Jones
@shawnmjones
Michael L. Nelson
@phonedude_mln
Michele C. Weigle
@weiglemc
@shawnmjones @WebSciDL @StormyArchives
Researchers create their own web archive collections
3
Archived web pages, or mementos, are used by journalists, sociologists, and historians.
Tucson Shootings2008 OlympicsUniversity of Utah
@shawnmjones @WebSciDL @StormyArchives
Mementos – different versions of the same page –
allow us to see an unfolding news story
4
Memento from
April 19, 2013 17:12
Searching for suspects,
City on lockdown
Memento from
April 19, 2013 17:59
Officer Donahue in hospital,
Lockdown loosened,
Will the Red Sox game be cancelled?
Memento from
April 24, 2013 2:24
Suspect Found,
Office collier lost life,
Obama speaks
@shawnmjones @WebSciDL @StormyArchives
The problem that DSA
works to solve
 There are multiple collections
about the same concept.
 The metadata for each collection is
non-existent, or inconsistently
applied.
 Many collections have
1000s of seeds with multiple
mementos.
 There are more than 8000
collections.
 Human review of these
mementos for collection
understanding is an expensive
proposition.
5
@shawnmjones @WebSciDL @StormyArchives
Social media storytelling uses groups of social cards to
provide a “summary of summaries”
6
2 resources are shown in this Wakelet story6 resources are shown in this Storify story
Each social card summarizes a
web resource.
Each story groups the social
cards, summarizing the topic.
Social cards contain the same
information in the same place on
each card, allowing for easy
comparison.
We want to use this technique to
summarize web archive collections
because users are already
familiar with this visualization
paradigm.
@shawnmjones @WebSciDL @StormyArchives
Our proposal: a “story” made of cards generated from
an intelligent sample of mementos
7
> 23,000 documents collected
and stored with Archive-It
A summary “story” of 36 cards generated by the
Dark and Stormy Archives Toolkit
https://oduwsdl.github.io/dsa-puddles/stories/shari/2020/04/22/archive-
it_collection_13529___novel_coronavirus_(covid-19)/https://archive-it.org/collections/13529
@shawnmjones @WebSciDL @StormyArchives
For Summarizing Collections of
Mementos:
the Dark and Stormy Archives
(DSA) Toolkit
8
OTMT
Hypercane
Raintale MementoEmbed
AIU
Story
Collection of
Mementos
calls
calls
calls
provides
input to
input
output
Thousands of
documents
~28 Representative
Mementos
Visualized as
surrogates
calls
S. M. Jones, M. C. Weigle, and M. L. Nelson. 2018. The Off-Topic Memento Toolkit. In International
Conference on Digital Preservation (iPRES) 2018. https://doi.org/10.17605/OSF.IO/UBW87
S. M. Jones. Social Cards Probably Provide For Better Understanding Of Web Archive Collections.
2019. In ACM Conference on Information and Knowledge Management (CIKM) 2019.
https://doi.org/10.1145/3357384.3358039
S. M. Jones. “Raintale – A Storytelling Tool for Web Archives.” https://ws-dl.blogspot.com/2019/07/2019-
07-11-raintale-storytelling-tool.html, 2019.
Tools for intelligently
sampling mementos
from a collection
Tools for visualizing
mementos as a story
@shawnmjones @WebSciDL @StormyArchives
StoryGraph Hypercane ArchiveNow Raintale Integration
(SHARI)
9
Hypercane RaintaleArchiveNowStoryGraphToolkit
@shawnmjones @WebSciDL @StormyArchives
Integrating StoryGraph with the DSA Toolkit
10
StoryGraph Selects the URLs Hypercane, ArchiveNow, and Raintale
organize and render the Biggest Story of the Day
@shawnmjones @WebSciDL @StormyArchives
The SHARI Process
11
S. M. Jones. “SHARI: StoryGraph Hypercane ArchiveNow Raintale Integration --
Combining WS-DL Tools For Current Events Storytelling.” https://ws-
dl.blogspot.com/2020/04/2020-04-01-shari-storygraph-hypercane.html, 2020.
@shawnmjones @WebSciDL @StormyArchives
Viewing News Over Time
12
2017-08-08
North Korea’s Nuclear Program
2018-08-08
Special Congressional Elections
2019-08-08
Aftermath of Dayton and El Paso Shootings
@shawnmjones @WebSciDL @StormyArchives
Future Research and Resources
 Future research:
 algorithms for creating intelligent samples from different types of collections
 Resources
 Dark and Stormy Archives Project:
 Website: https://oduwsdl.github.io/dsa/
 Twitter: @StormyArchives
 DSA Puddles Web Site: https://oduwsdl.github.io/dsa-puddles/
 Raintale: https://oduwsdl.github.io/raintale/
 Hypercane: Coming in 2020
 Web Science and Digital Libraries Research Group:
 Twitter: @WebSciDL
 Blog: https://ws-dl.blogspot.com/
13

More Related Content

PDF
Uses of digital cultural heritage databases maintained by memory forming inst...
PPTX
Built in the 19th century, rebuilt for the 21st
PPTX
Storytelling With Web Archives
PPTX
Combining Social Media Storytelling With Web Archives
PPTX
Where Can We Post Stories Summarizing Web Archive Collections
PPTX
The Off-Topic Memento Toolkit
PPTX
Improving Understanding of Web Archive Collections Through Storytelling - PhD...
PPTX
The Many Shapes of Archive-It
Uses of digital cultural heritage databases maintained by memory forming inst...
Built in the 19th century, rebuilt for the 21st
Storytelling With Web Archives
Combining Social Media Storytelling With Web Archives
Where Can We Post Stories Summarizing Web Archive Collections
The Off-Topic Memento Toolkit
Improving Understanding of Web Archive Collections Through Storytelling - PhD...
The Many Shapes of Archive-It

Similar to SHARI (StoryGraph Hypercane ArchiveNow Raintale Integration) (20)

PDF
Improving Collection Understanding For Web Archives With Storytelling: Shinin...
PPTX
Nelson, Michael: Summarizing Archival Collections Using Storytelling Techniques
PPTX
Social Cards Probably Provide For Better Understanding Of Web Archive Collect...
PPTX
Summarizing archival collections using storytelling techniques
PPTX
Improving Collection Understanding in Web Archives
PPTX
Combining Storytelling and Web Archives
PPTX
2015-odu-ece-tools-for-past-web
PDF
A Framework for Aggregating Private and Public Web Archives
PDF
A Framework for Aggregating Public and Private Web Archives
PDF
Using Web Archives to Enrich the Live Web Experience Through Storytelling
PPTX
Tools for Managing the Past Web
PDF
JCDL 2016 Doctoral Consortium - Web Archive Profiling
PDF
MementoMap: A Web Archive Profiling Framework for Efficient Memento Routing
PDF
Capturing the ephemeral: Archiving our digital present
PPTX
Storytelling for Summarizing Collections in Web Archives
PDF
TPDL 2016 Doctoral Consortium - Web Archive Profiling
PPT
Profiling Web Archives
PDF
Web Archiving: A Brief Introduction
PPT
Who Will Archive the Archives? Thoughts About the Future of Web Archiving
PPTX
The Memento Protocol and Research Issues With Web Archiving
Improving Collection Understanding For Web Archives With Storytelling: Shinin...
Nelson, Michael: Summarizing Archival Collections Using Storytelling Techniques
Social Cards Probably Provide For Better Understanding Of Web Archive Collect...
Summarizing archival collections using storytelling techniques
Improving Collection Understanding in Web Archives
Combining Storytelling and Web Archives
2015-odu-ece-tools-for-past-web
A Framework for Aggregating Private and Public Web Archives
A Framework for Aggregating Public and Private Web Archives
Using Web Archives to Enrich the Live Web Experience Through Storytelling
Tools for Managing the Past Web
JCDL 2016 Doctoral Consortium - Web Archive Profiling
MementoMap: A Web Archive Profiling Framework for Efficient Memento Routing
Capturing the ephemeral: Archiving our digital present
Storytelling for Summarizing Collections in Web Archives
TPDL 2016 Doctoral Consortium - Web Archive Profiling
Profiling Web Archives
Web Archiving: A Brief Introduction
Who Will Archive the Archives? Thoughts About the Future of Web Archiving
The Memento Protocol and Research Issues With Web Archiving
Ad

More from Shawn Jones (10)

PPTX
Abstract Images Have Different Levels of Retrievability Per Reverse Image Sea...
PPTX
DIRA 2022 Poster -- Abstract Images Have Different Levels of Retrievability P...
PDF
Abstract Images Have Different Levels of Retrievability Per Reverse Image Sea...
PPTX
It’s All About The Cards: Sharing on Social Media Encouraged HTML Metadata G...
PPTX
Automatically Selecting Striking Images for Social Cards
PPTX
Reference Rot
PPTX
Avoiding Spoilers On MediaWiki Fan Sites Using Memento
PPTX
Continuous Integration: Finding problems soonest
PPTX
A Brief Introduction to Test-Driven Development
PPTX
Reconstructing the past with media wiki
Abstract Images Have Different Levels of Retrievability Per Reverse Image Sea...
DIRA 2022 Poster -- Abstract Images Have Different Levels of Retrievability P...
Abstract Images Have Different Levels of Retrievability Per Reverse Image Sea...
It’s All About The Cards: Sharing on Social Media Encouraged HTML Metadata G...
Automatically Selecting Striking Images for Social Cards
Reference Rot
Avoiding Spoilers On MediaWiki Fan Sites Using Memento
Continuous Integration: Finding problems soonest
A Brief Introduction to Test-Driven Development
Reconstructing the past with media wiki
Ad

Recently uploaded (20)

PDF
Dell Pro Micro: Speed customer interactions, patient processing, and learning...
PDF
5-Ways-AI-is-Revolutionizing-Telecom-Quality-Engineering.pdf
PDF
CEH Module 2 Footprinting CEH V13, concepts
PDF
Electrocardiogram sequences data analytics and classification using unsupervi...
PDF
EIS-Webinar-Regulated-Industries-2025-08.pdf
PDF
Decision Optimization - From Theory to Practice
PDF
“The Future of Visual AI: Efficient Multimodal Intelligence,” a Keynote Prese...
PDF
Data Virtualization in Action: Scaling APIs and Apps with FME
PDF
Auditboard EB SOX Playbook 2023 edition.
PDF
Lung cancer patients survival prediction using outlier detection and optimize...
PDF
Transform-Your-Streaming-Platform-with-AI-Driven-Quality-Engineering.pdf
PDF
The-Future-of-Automotive-Quality-is-Here-AI-Driven-Engineering.pdf
PDF
Aug23rd - Mulesoft Community Workshop - Hyd, India.pdf
PPTX
Presentation - Principles of Instructional Design.pptx
PDF
Build Real-Time ML Apps with Python, Feast & NoSQL
PDF
Planning-an-Audit-A-How-To-Guide-Checklist-WP.pdf
PDF
Human Computer Interaction Miterm Lesson
PDF
The-2025-Engineering-Revolution-AI-Quality-and-DevOps-Convergence.pdf
PDF
substrate PowerPoint Presentation basic one
PPTX
Internet of Everything -Basic concepts details
Dell Pro Micro: Speed customer interactions, patient processing, and learning...
5-Ways-AI-is-Revolutionizing-Telecom-Quality-Engineering.pdf
CEH Module 2 Footprinting CEH V13, concepts
Electrocardiogram sequences data analytics and classification using unsupervi...
EIS-Webinar-Regulated-Industries-2025-08.pdf
Decision Optimization - From Theory to Practice
“The Future of Visual AI: Efficient Multimodal Intelligence,” a Keynote Prese...
Data Virtualization in Action: Scaling APIs and Apps with FME
Auditboard EB SOX Playbook 2023 edition.
Lung cancer patients survival prediction using outlier detection and optimize...
Transform-Your-Streaming-Platform-with-AI-Driven-Quality-Engineering.pdf
The-Future-of-Automotive-Quality-is-Here-AI-Driven-Engineering.pdf
Aug23rd - Mulesoft Community Workshop - Hyd, India.pdf
Presentation - Principles of Instructional Design.pptx
Build Real-Time ML Apps with Python, Feast & NoSQL
Planning-an-Audit-A-How-To-Guide-Checklist-WP.pdf
Human Computer Interaction Miterm Lesson
The-2025-Engineering-Revolution-AI-Quality-and-DevOps-Convergence.pdf
substrate PowerPoint Presentation basic one
Internet of Everything -Basic concepts details

SHARI (StoryGraph Hypercane ArchiveNow Raintale Integration)

  • 1. @shawnmjones @WebSciDL @StormyArchives SHARI (StoryGraph Hypercane ArchiveNow Raintale Integration) Employing the Dark and Stormy Archives Project For Current Events Storytelling Shawn M. Jones Web Science and Digital Libraries Research Group Old Dominion University VMASC May 13, 2020 Thanks to:
  • 2. @shawnmjones @WebSciDL @StormyArchives@shawnmjones @WebSciDL The Dark and Stormy Archives (DSA) Project 2 https://oduwsdl.github.io/dsa/ Shawn M. Jones @shawnmjones Michael L. Nelson @phonedude_mln Michele C. Weigle @weiglemc
  • 3. @shawnmjones @WebSciDL @StormyArchives Researchers create their own web archive collections 3 Archived web pages, or mementos, are used by journalists, sociologists, and historians. Tucson Shootings2008 OlympicsUniversity of Utah
  • 4. @shawnmjones @WebSciDL @StormyArchives Mementos – different versions of the same page – allow us to see an unfolding news story 4 Memento from April 19, 2013 17:12 Searching for suspects, City on lockdown Memento from April 19, 2013 17:59 Officer Donahue in hospital, Lockdown loosened, Will the Red Sox game be cancelled? Memento from April 24, 2013 2:24 Suspect Found, Office collier lost life, Obama speaks
  • 5. @shawnmjones @WebSciDL @StormyArchives The problem that DSA works to solve  There are multiple collections about the same concept.  The metadata for each collection is non-existent, or inconsistently applied.  Many collections have 1000s of seeds with multiple mementos.  There are more than 8000 collections.  Human review of these mementos for collection understanding is an expensive proposition. 5
  • 6. @shawnmjones @WebSciDL @StormyArchives Social media storytelling uses groups of social cards to provide a “summary of summaries” 6 2 resources are shown in this Wakelet story6 resources are shown in this Storify story Each social card summarizes a web resource. Each story groups the social cards, summarizing the topic. Social cards contain the same information in the same place on each card, allowing for easy comparison. We want to use this technique to summarize web archive collections because users are already familiar with this visualization paradigm.
  • 7. @shawnmjones @WebSciDL @StormyArchives Our proposal: a “story” made of cards generated from an intelligent sample of mementos 7 > 23,000 documents collected and stored with Archive-It A summary “story” of 36 cards generated by the Dark and Stormy Archives Toolkit https://oduwsdl.github.io/dsa-puddles/stories/shari/2020/04/22/archive- it_collection_13529___novel_coronavirus_(covid-19)/https://archive-it.org/collections/13529
  • 8. @shawnmjones @WebSciDL @StormyArchives For Summarizing Collections of Mementos: the Dark and Stormy Archives (DSA) Toolkit 8 OTMT Hypercane Raintale MementoEmbed AIU Story Collection of Mementos calls calls calls provides input to input output Thousands of documents ~28 Representative Mementos Visualized as surrogates calls S. M. Jones, M. C. Weigle, and M. L. Nelson. 2018. The Off-Topic Memento Toolkit. In International Conference on Digital Preservation (iPRES) 2018. https://doi.org/10.17605/OSF.IO/UBW87 S. M. Jones. Social Cards Probably Provide For Better Understanding Of Web Archive Collections. 2019. In ACM Conference on Information and Knowledge Management (CIKM) 2019. https://doi.org/10.1145/3357384.3358039 S. M. Jones. “Raintale – A Storytelling Tool for Web Archives.” https://ws-dl.blogspot.com/2019/07/2019- 07-11-raintale-storytelling-tool.html, 2019. Tools for intelligently sampling mementos from a collection Tools for visualizing mementos as a story
  • 9. @shawnmjones @WebSciDL @StormyArchives StoryGraph Hypercane ArchiveNow Raintale Integration (SHARI) 9 Hypercane RaintaleArchiveNowStoryGraphToolkit
  • 10. @shawnmjones @WebSciDL @StormyArchives Integrating StoryGraph with the DSA Toolkit 10 StoryGraph Selects the URLs Hypercane, ArchiveNow, and Raintale organize and render the Biggest Story of the Day
  • 11. @shawnmjones @WebSciDL @StormyArchives The SHARI Process 11 S. M. Jones. “SHARI: StoryGraph Hypercane ArchiveNow Raintale Integration -- Combining WS-DL Tools For Current Events Storytelling.” https://ws- dl.blogspot.com/2020/04/2020-04-01-shari-storygraph-hypercane.html, 2020.
  • 12. @shawnmjones @WebSciDL @StormyArchives Viewing News Over Time 12 2017-08-08 North Korea’s Nuclear Program 2018-08-08 Special Congressional Elections 2019-08-08 Aftermath of Dayton and El Paso Shootings
  • 13. @shawnmjones @WebSciDL @StormyArchives Future Research and Resources  Future research:  algorithms for creating intelligent samples from different types of collections  Resources  Dark and Stormy Archives Project:  Website: https://oduwsdl.github.io/dsa/  Twitter: @StormyArchives  DSA Puddles Web Site: https://oduwsdl.github.io/dsa-puddles/  Raintale: https://oduwsdl.github.io/raintale/  Hypercane: Coming in 2020  Web Science and Digital Libraries Research Group:  Twitter: @WebSciDL  Blog: https://ws-dl.blogspot.com/ 13