Social Media Text Analytics Overview
Social Media Text Analytics Overview
● While the steps required for text analytics are largely dependent on the type of
approach and tool employed, a typical social media text analysis includes the
following cyclical steps.
Steps in Text
Analytics
Steps in Text Analytics
IDENTIFICATION AND SEARCHING
● The text analytics process starts with identifying the source of the text that will be
analyzed.
● Text posted on social media is dynamic, huge, diverse, multilingual, and noisy.
Thus, finding the right source for the purpose of text analytics is very crucial for
gaining useful business insights.
● The genre of the source text also will determine the type of tool used to extract and
analyze it. For example, extracting tweets requires different tools and approaches than
analyzing a document or website text.
● Analyzing tweets, for example, requires API-based searching and extraction of data
from the Twitter timeline based on criteria that you specify. You can choose to extract
tweets that include specific keywords, such as your company name.
● The desired business question that needs to be answered with text analytics will serve
as a good starting point.
Steps in Text Analytics
TEXT PARSING AND FILTERING
● The next step is to parse, clean, and filter the text, and create a dictionary
of words using NLP, which is mostly based on machine learning techniques.
● In order for computer and algorithms to extract meanings from the text, the
sentence structures and parts of speech are determined, named entities
extracted (people, organizations, product/service names, etc.), stop
words removed, and spellings are checked.
● Most of these steps are automatic; however, in certain stages, human
intervention is required. For example, in the filtering stage, manually
cleaning (by humans with domain expertise) may be required to remove
unwanted or irrelevant terms.
Steps in Text Analytics
TEXT TRANSFORMATION
● For analytical algorithms to be applied to the text, it should be transformed into a
computer-readable format (e.g., 0s and 1s) for analysis.
● The cleaned text is thus transformed into numerical representations using linear
algebra-based techniques, such as latent semantic analysis and vector space
models.
Steps in Text Analytics
TEXT MINING
● At this step, the text is actually mined to extract the needed business insights.
● Varieties of text mining algorithms are applied to the text, such as clustering,
association, classification, and predictive analysis, and sentiment
analysis.
● Text analysis employs these sophisticated algorithms to extract sentiment and
meanings from the text in a similar manner to the way human do; however, the
process is thousands of times faster.
Steps in Text Analytics
TEXT MINING
Association
● Association or association mining is a data-mining technique used to determine the
probability of the co occurrence of items in a collection of documents.
● The relationships between co occurring items are expressed as association rules.
● In text analytics, for example, social media text can be clustered together based on co
occurrence frequency. Or it can be used, for example, to find that a user who liked a
social media content A and B is 90 percent likely to also like content C.
Steps in Text Analytics
TEXT MINING
Clustering
● Clustering or cluster analysis groups objects based on similarity in
non-overlapping groups.
● Clustering is an important part of data mining and text analytics.
● Social media text (such as tweets or comments), for example, can be clustered into
positive, negative, and natural categories. Or nodes in a social media network can be
clustered based on importance.
Steps in Text Analytics
TEXT MINING
Classification
● From the text analytics perspective, classification or categorization is used to find
similarities in the document and groups them with predefined labels based on the
themes contained in the document.
● For example, an e-mail can be classified as spam based on its contents.
Social Media Text Analysis Tools
A variety of social media text analysis tools are available on the market. Some are
free and others are paid. Below we list some popular text analysis tools.
● Discovertext: Discovertext ([Link] is a powerful platform for
collecting, cleaning, and analyzing text and social media data streams.
● Lexalytics: Lexalytics ([Link] is a social media text and
semantic analysis tool for social media platforms, including Twitter, Facebook, blogs,
etc.
● Tweet Archivist: Tweet Archivist ([Link] is focused on
searching, archiving, analyzing, and visualizing tweets based on a search term or
hashtag (#).
Social Media Text Analysis Tools
● Twitonomy: Twitonomy ([Link] is a Twitter analytics tool for
getting detailed and visual analytics on tweets, retweets, replies, mentions, hashtags,
followers, etc.
● Netlytic: Netlytic ([Link] is a cloud-based text and social network
analytics platform for social media text that discovers social networks from online
conversations on social media sites.
● LIWC: Linguistic Inquiry and Word Count (LIWC) is a text analysis tool for
analyzing emotional, cognitive, structural, and process components present in
individuals’ verbal and written speech samples: [Link]
● Voyant: Voyant ([Link] is a web-based text reading and analysis.
With Voyant, a body of text can be read from a file or directly exported from a
website.
Review Questions
● Some platforms, such as YouTube, allow users to dislike videos. This can help
creators understand how their content is being received by their audience.
Common Social Media Actions
SHARE
● Share or “Share” button or sharing is a feature that allows social media
users to distribute the content posted over social media to other users.
● For example, the Facebook “Share” button lets users add a personal
message and customize who they share the content with.
● The WordPress (a blogging platform) “Share” button, for example, allows
users to share their blog content across a range of social media platforms.
● Companies incorporate share buttons into website to boost their
website traffic by channeling visitors from social media sites.
Common Social Media Actions
VISITORS, VISITS, REVISITS
● A visitor is a person who visits your website or blog. A single visitor
may visit a page or content one or more times (revisits). Visits are
also known as sessions.
● Unique visitor—A person who arrives at your page first time.
● Average bounce rate—the percentage of visitors who visit a website
and leave the site quickly without viewing other pages.
● Session duration—The average duration of a visit or session
Common Social Media Actions
VIEWs
● Views are the number of times social media content (a post, video,
graphic, etc.) is viewed by users.
● A slightly different but related concept is page views, which is
each time a visitor views a page on your company website or blogs
● View data can be used to understand user engagement and
popularity of content.
Common Social Media Actions
CLICKS
● Clicks are the actions performed by users by pressing or clicking on the hyperlinked
content of your website or blog.
● Through clicks, users navigate the web.
● Click data can be harvested for business intelligence purposes, such as, to reduce
bounce rate and improve website traffic.
● A technique called clickstream analysis is used by business managers for a variety of
business intelligence purposes, including website activity, website design analysis, path
optimization, market research, and finding ways to improve visitor experience on the
website.
● The clickstream is the semistructured data trail/log (such as date and time stamp,
IP address, and the URLs of the pages visited) a user leaves while visiting a
website.
Common Social Media Actions
TAGGING
● Tagging is the act of assigning or linking extra pieces of information to social media content
(such as photographs and bookmarks) for identification, classification, and search purposes.
● Tagging lets user classify social media content the way they see it.
● Tagging may take a variety of forms. For example, bloggers can attach descriptive keywords
(tags) to their posts to facilitate classification and searching of content, and Facebook users
can add tags to anything they post on their status, including photos and comments.
● Social bookmarking services (such as Delicious) let users organize their bookmarks flexibly by
adding descriptive tags. This practice of collaborative tagging is commonly known as
folksonomy—a term coined by Thomas Vander Wal.
● These days, almost all prominent companies (e.g., Facebook and Flickr) provide tagging
services to their users. Because the contents are tagged with useful keywords, social tagging
expedites the process of searching and finding relevant content.
Common Social Media Actions
MENTIONS
● Mentions or social mentions are the occurrence of a person, place, or
thing over social media by name.
● For example, a brand name maybe mentioned in a Facebook
comment, blog post, YouTube video, or tweet. Mentions are
important and can indicate popularity of person, place, or thing.
● For example, a social marketer may be able to gauge the popularity of
a product/service/campaign by mining Twitter mentions data. A
Twitter mention is the inclusion of a “@username” in a tweet.
Common Social Media Actions
HOVERING
● Hovering is the act of moving a cursor over social media content. Capturing users’
cursor movement data can help you understand user behavior on a social media
site.
● Cursor movement/hovering over an ad, for example, can be considered as a proxy for
attention.
● Most people who view an ad do not necessary click on it, thus if we are relying on
clicks analytics only, we may lose a vital piece of information (i.e., attention).
● Studies have even suggested a strong correlation between hover time and purchases.
● Traditionally, hovering data has been used in website design and for improvement of
user experience.
Common Social Media Actions
CHECK-IN
● Check-in is a social media feature that allows users to announce
and share their arrival at a location, such as a hotel, airport, city,
or store.
● Many social media services, including Facebook provide
check-in features.
● The location of the user is determined using GPS (global
positioning system) technology. Check-in data can, for example,
be mined to offer location-based services/products.
Common Social Media Actions
PINNING
● Pinning is an action performed by social media users to pin and share
interesting content (such as ideas, products, services, and information)
using a virtual pin- board platform.
● Some famous pinning platforms include Pinterest, Tumblr, StumbleUpon,
or Digg.
● Business can use these virtual pin boards to share information and
connect with and inspire their customers.
● Hotels and Resorts, for example, use Pinterest to curate travel, food, and
luxury lifestyle content to inspire customers.
Common Social Media Actions
EMBEDS
● Embedding is the act of incorporating social media content (e.g., a
link, video, or presentation) into a website or blog. An embed feature
lets users embed interesting content into their personal social media
outlets.
ENDORSEMENT
● Endorsement is a features of social media that lets people endorse and
approve other people, products, and services. For example, LinkedIn
lets user endorse the skills and qualifications of other people in their
network.
Common Social Media Actions
UPLOADING AND DOWNLOADING
● In simple words, uploading is the act of adding new content (e.g., texts, photos, and
videos) to a social media platform.
● The opposite of uploading is downloading; that is, the act of receiving data from a
social media platform.
● All most all social media content is created and uploaded by users, which is better
known as user-generated content.
● For some companies, uploading and downloading is the single most important action to
measure.
● For Instagram and Flickr, which are both photo-sharing platforms, the number of photos
uploaded daily matters more than anything else.
ACTIONS ANALYTICS TOOLS
● Hootsuite: Hootsuite is an easy-to-use online platform that enables you to manage
your social media presence across the most popular social networks. Hootsuite offers
different plans depending on your business needs and budget: free, pro, or enterprise.
● SocialMediaMineR: SocialMediaMineR is a social media analytics tool that takes
one or multiple URLs and returns the information about the popularity and reach of
the URL(s) on social media, including the number of shares, likes, tweets, pins, and
hits on Facebook, Twitter, Pinterest, StumbleUpon, LinkedIn, and Reddit. The tool
can accessed from here: [Link]
[Link]/web/packages/SocialMediaMineR/[Link]
● Lithium: Lithium ([Link] is social media management tool that
provides a variety of products and services, including social media analytics,
marking, crowd-sourcing, and social media marketing.
● Google Analytics: Google Analytics ([Link] is an
analytical tool offered by Google to track and analyze website traffic. It can also be
used to for blogs and wiki analytics.
ACTIONS ANALYTICS TOOLS
● Facebook Insights: Facebook Insights ([Link] helps Facebook
page owners understand and analyze trends within user growth and demographics.
● Klout: Klout ([Link] measures your influence across a range of social media channels
based on how many people interact with your posts. Your Klout score measures your influence on
a scale from one to one hundred.
● Topsy: Topsy ([Link] is similar to Icerocket and Social Mention, with its main focus
around social media, especially multimedia sites and blogs.
● Tweetreach: This tool helps you measure the number of impressions and reach of hashtags. The
tool can be accessed here: [Link] Kred: Kred helps you measure the influence of
a Twitter account: [Link]
● Hashtagify: This tool measures the influence of hashtags: [Link]
● Twtrland: Twtrland is a social intelligence research tool ([Link] for analyzing and
visualizes your social footprints.
● Tweetstats: using your Twitter username, Tweetstats graphs Twitter stats including tweets per
hour, tweets per month, tweet timelines, and reply statistics ([Link]
Questions
● Hyperlink analytics can also reveal, for example, Internet traffic patterns
and sources of the incoming or outgoing traffic to and from a website.
Social Media HyperLink Analytics
● Hyperlinks among websites represent not only a reasonable approximation of
a social relationship, but also serve as a symbolic meaning of validating or
endorsing the linked organization.
● These hyperlinks that exist between two organizational websites reflect a
sense of validation, trust, bonding, authority, and legitimacy. Websites
mostly connect or link to other websites of similar nature, so hyperlinks can
also serves as indicators of content similarity.
● From hyperlink analytics point of view, mainly there are three types of
hyperlinks,
1) in-links, 2) out-links, and 3) co-links.
Types of Hyperlinks
● IN-LINKS
● OUT-LINKS
● CO-LINKS
Types of Hyperlinks
● IN-LINKS
○ In-links are the incoming hyperlinks or links directed toward a
website or originated in other websites
● OUT-LINKS
○ Out-links are hyperlinks generated out of a website
● CO-LINKS
○ Co-links have two dimensions.
Social Media HyperLink Analytics
IN-LINKS
● In-links are the incoming hyperlinks or links directed toward a website or
originated in other websites.
● In-links are of great interest to social markers, because they bring traffic to a
particular website.
● In-links also play an important role in website analytics, as both the quality and number
of in-links can impact the search engine ranking of the website.
● In-links can also impact the popularity of social media contents.
● A study on YouTube viral videos, for instance, found that among other things, in-links
play crucial roles in the viral phenomenon, particularly in increasing views of videos
posted on YouTube. Studies have also shown that in-link counts strongly correlate
with measures describing business performance
Social Media HyperLink Analytics
OUT-LINKS
● Out-links are hyperlinks generated out of a website.
CO-LINKS
● Co-links have two dimensions.
● First, if two websites receive a link from a third website, they are considered to be
connected indirectly.
○ For example, page A links to both pages B and C, therefore B and C are considered to be co-linking, or
connected indirectly
○ Second, if two pages link to a third page, they are also considered to be co- linking. Pages B and C are linking
to page A; therefore, B and C are connected indirectly.
○ Co-links have been used to compare and map competitive similarity among companies.
HYPERLINK ANALYTICS
Hyperlink analysis uses and limitations:
● Uses: Hyperlink analysis has been used to study a variety of topics
○ including ranking of universities, understanding the blogosphere,
scholarly websites , and political networks, and to measure
business competitiveness.
● Limitations: Hyperlink networks does not
○ provide any insight into the type or amount of traffic flowing among websites.
○ does not include hyperlinks within a website between pages.
○ Hyperlinks between pages within a website are created mostly for navigational purposes. Also, search
engine ranking algorithms either ignore or give low importance to hyperlinks within a website.
TYPES OF HYPERLINK ANALYTICS
TYPES OF HYPERLINK ANALYTICS
Hyperlink analytics can take several forms, including: