1.elasticsearch Introduction Slides

Built-in analytics and aggregation capabilities
Elasticsearch is the most popular enterprise search engine
Elasticsearch Architecture
- Cluster: A group of nodes that work together
- Node: A single server that is part of the cluster
- Index: A collection of documents
- Type: Logical grouping of documents (deprecated)
- Shard: Horizontal partition of an index
- Replica: Additional copies of shards for high availability
- Master node: Manages cluster state and operations
- Data node: Stores and retrieves documents
- Client node: Interacts with the cluster via REST API
Elasticsearch is distributed, fault tolerant and scalable by design. Data is partition

Uploaded by

venunaini


Searching and Analyzing Logs with Elasticsearch

Rajesh Kumar
Overview
- A little search engine history and the importance of search
- Basic steps involved in indexing and searching documents
- The inverted index, the heart of a search engine
- An introduction to Elasticsearch and its basic building blocks
- Set up and install Elasticsearch on your local machine and check cluster health
What You Need for Learning Elasticsearch
Prerequisites
- Familiarity with the command line on a Mac, Linux or Windows machine
- Familiarity with using RESTful APIs to perform actions
- A very basic understanding of distributed computing
Install and Setup
- The latest version of Elasticsearch, 7.5.1, requires Java version 8
- A Mac, Linux or Windows machine on which Elasticsearch can be installed
Overview
- Introduction to basic concepts in Elasticsearch, download and install
- Building an index, adding documents to it both individually and in bulk
- Basic text analysis, including tokenization and filtering
- Search queries on an index using the Query DSL
- Aggregations: the faceting and analytics workhorse of Elasticsearch
A Brief History of Search
- 1945: Vannevar Bush first talks of the need to index records
- 1970s: The ARPANet network, which laid the foundation of the modern internet
- 1991: Tim Berners-Lee combined hypertext, TCP and DNS to imagine the WWW
- 1993: Primitive search engines: linear search of URLs, very basic ranking
- 1993: Excite improved search by using statistical analysis of word relationships
- 1994: Yahoo offered a directory of useful webpages, i.e. a portal
- 1994: Lycos provided ranking relevance, prefix matching, a huge catalog
- 1994: Altavista had natural language queries, inbound link checking
- 1996: Inktomi pioneered the paid inclusion model
- 1997: ask.com had natural language search, human editors for queries
- 1998: Google ranked pages based on how many other pages link to them
- Today: Google, Bing, Baidu, Naver, Yahoo
How Does Search Work?
What Is the Objective of Search?

Find the most relevant documents for your search terms
Most Relevant Document for Search Terms

What search needs                    Component that provides it
Know of the document’s existence     Web crawler
Index the document for lookup        Inverted index
Know how relevant the document is    Scoring
Retrieve ranked by relevance         Search
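The scoring and search stages can be illustrated with a toy sketch. It assumes the simplest possible relevance measure: a document's score is the number of query terms it contains. The function name `search` and the sample pages are made up for illustration; real search engines, Elasticsearch included, use more refined relevance formulas.

```python
def search(query, documents):
    """Rank documents by how many query terms they contain (toy scoring)."""
    terms = set(query.lower().split())
    scored = []
    for doc_id, text in documents.items():
        words = set(text.lower().split())
        score = len(terms & words)  # number of query terms found in the document
        if score > 0:
            scored.append((score, doc_id))
    # Highest score first; ties are broken alphabetically by document id.
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [doc_id for _, doc_id in scored]


# Hypothetical pages, invented purely for this illustration.
pages = {
    "page-a": "cheap flights to paris",
    "page-b": "paris travel guide",
    "page-c": "learn to cook",
}

print(search("flights to paris", pages))
```

Documents that match no query term are left out of the result entirely, which mirrors the idea that search retrieves only relevant documents, ranked by relevance.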
Search Is Not Restricted to the Web
Sites have their own search:
- E-commerce
- Video
- E-learning
The Inverted Index
An inverted index consists of a list of all the unique words that appear
in any document, and for each word, a list of the documents in which
it appears. Elasticsearch creates the inverted index from the documents
stored in it.

The inverted index is created using a process called analysis, which consists of:
- Tokenization
- Filtering
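As a rough illustration of the analysis process, the sketch below tokenizes text by splitting on whitespace, then filters each token by lowercasing it and stripping surrounding punctuation. The function name `analyze` is hypothetical, and this is only a minimal approximation: the analyzers that ship with Elasticsearch are configurable and considerably more capable.

```python
import string


def analyze(text):
    """Tokenize text into words, then filter: lowercase and strip punctuation."""
    tokens = []
    for raw in text.split():  # tokenization: split on whitespace
        token = raw.lower().strip(string.punctuation)  # filtering
        if token:  # drop tokens that were punctuation only
            tokens.append(token)
    return tokens


print(analyze("Winter is coming!"))  # ['winter', 'is', 'coming']
```

Keeping tokenization and filtering as separate steps means that further filters, such as stop-word removal, could be added without touching how the text is split.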
Documents Have Content

Document     Content
Stark        Winter is coming
Baratheon    Ours is the fury
Tyrell       Growing Strong
Tokenize Text into Words

The text is split into words, lowercased, and stripped of punctuation:
winter, is, coming, ours, the, fury, growing, strong
Each word is stored with its frequency and the documents in which it appears:

Word       Frequency   Documents
winter     1           Stark
is         2           Stark, Baratheon
coming     1           Stark
ours       1           Baratheon
the        1           Baratheon
fury       1           Baratheon
growing    1           Tyrell
strong     1           Tyrell
Dictionary sorted so lookup is easy

Dictionary   Frequency   Postings
coming       1           Stark
fury         1           Baratheon
growing      1           Tyrell
is           2           Stark, Baratheon
ours         1           Baratheon
strong       1           Tyrell
the          1           Baratheon
winter       1           Stark
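The worked example above can be reproduced with a short sketch. It assumes a plain in-memory dictionary that maps each word to its postings list; the name `build_inverted_index` is hypothetical and this is not how Elasticsearch stores its index internally.

```python
from collections import defaultdict

# The three documents from the slides, keyed by document name.
documents = {
    "Stark": "Winter is coming",
    "Baratheon": "Ours is the fury",
    "Tyrell": "Growing Strong",
}


def build_inverted_index(docs):
    """Map each unique lowercased word to the documents containing it."""
    postings = defaultdict(list)
    for doc_id, text in docs.items():
        # set() ensures a document is listed once per word, even if the word repeats.
        for word in set(text.lower().split()):
            postings[word].append(doc_id)
    # Sort the dictionary of words so that lookup is easy.
    return {word: postings[word] for word in sorted(postings)}


index = build_inverted_index(documents)
for word, doc_ids in index.items():
    print(f"{word:<10} {len(doc_ids)}  {', '.join(doc_ids)}")

print(index["is"])  # ['Stark', 'Baratheon']
```

Finding the documents for a search term is then a single dictionary lookup, which is what makes the inverted index the heart of a search engine.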
Search