0% found this document useful (0 votes)

26 views

Answer

The document describes the MongoDB aggregation framework which allows users to transform data through a pipeline of operations like filtering, grouping, counting, etc. It provides examples of using $match to filter data, $project to modify fields, $group to group and count data, and sorting with aggregation.

Uploaded by

CNTT-2C-18 Vũ Thu hoài

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

26 views

Answer

Uploaded by

CNTT-2C-18 Vũ Thu hoài

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 21

The Mongo Aggregation

Framework
The Mongo Aggregation framework gives you a document query pipeline. You can
pipe a collection into the top and transform it through a series of operations,
eventually popping a result out the bottom (snigger).

For example, you might take a result set, filter it, group by a particular field, then
sum values in a particular group. You could find the total population of Iowa given an
array of postcodes. You could find all the coupons that were used on Monday, and
then count them.

We can compose a pipeline as a set of JSON objects, then run the pipeline on a
collection.

Empty pipeline
If you provide an empty pipeline, Mongo will return all the results in the collection:

Exercise - Create an Empty pipeline

Try out the aggregate pipeline now. Call aggregate on your people collection. You'll
see the result is the same as if you called find.

Filtering the pipeline with

$match
We can use the aggregation pipeline to filter a result set. This is more or less
analogous to find, and is probably the most common thing we want to do.

Say we want to list only people who have cats (where cat is a sub-document), we
would probably do something like this this:

We can get the same result in the aggregation framework using $match, like so:

So why use aggregation over find? In this example they are the same, but the power
comes when we start to chain additional functions as we shall soon see.

Exercise - $match
 Use the people dataset. Match all the people who are 10 years old who have
8 year old cats.

 Match all the people who are over 80 years old, and who's cats are over 15
years old.
When to use match
Matching is quick but not smart. It's designed to limit the result set, so that the rest
of the pipeline can run more quickly. When used with project we can match against
fields that don't exist in our result set. This is a powerful and useful feature.

Modifying a stream with

$project
The find function allowed us to do simple whitelist projection. The aggregate pipeline
gives us many more options.

We can use $project to modify all the documents in the pipeline. We can remove
elements, allow elements through, rename elements, and even create brand new
elements based on expressions.

Say we have a set of voucher codes, like this:

We can use project to restrict the fields that are passed through. We pick the fields
we like and set true to pass them through unchanged.
Removing the id
Remove the id field bay passing _id: false.

This will yield a set something like the following:

Renaming Fields
We can use project to rename fields if we want to. We use an expression: $lastName
to pull out the value from the lastName field, and project it forward into the surname
field.
This will yield something like the following:

Chaining $match and $project

We can chain $match and $project together. Say we have a list of codes, and some
have not yet been used. We want to pull out the names and emails, but only from the
codes which have been used.
We might first $match the codes which have a usedAt field, and then use $project to
pull out the names and emails from the remainder.
Exercise
 First $match people with their cats, then print out result

 After that, use $project to pull out only cat names

Creating dynamic fields with
$project
We can use project to add new fields to our documents based on expressions.

Say we had a list of people, like so:

We can use project to compose a name field, like so:

This will give us results like this:

Exercise - String aggregation operators

We saw here how to use $concat to make a new attribute containing a
concatenated string. Have a look at the other String aggregation operators here:

http://docs.mongodb.org/manual/reference/operator/aggregation-string/

Attempt to use result from last exercise with capitalize cat name. This is useful
because Mongo grouping and matching is case sensitive
Conditional fields with $cond
We can set the value of a field using a boolean expression using $cond. There are a
couple of ways to use $cond. You may wish to review the documentation
here: http://docs.mongodb.org/manual/reference/operator/aggregation/cond/

Say we have a set of customers, and some of them have complaints. We might set a
flag on all of the unhappy customers like so:

Unhappy will either be true or false.

Exercise - Add a hasCat field

 Use projection to add a hasCat field. This might form the basis for a future
grouping or counting.

 Add an isOld field to show if person name is greater than 80

Grouping with $group
$group allows us to group a collection according to criteria. We can group by fields
that exist in the data, or we can group by expressions that create new fields.

Group will operate on a set of documents in the pipeline, and output a new set of
documents out the bottom.

We group using the _id field. This will create a new _id for each group that will be an
object containing the grouping criteria.

The simplest group would look like this:

The id field is empty, so the group contains the whole collection, but we haven't
output anything, so each output document is empty.

Grouping by id
If we just want to group by a single field we can do this easily. The id of each output
document will be the value of the expression, in this case '$name'.

Exercise
 Try this out on your people data set. You should get a list of distinct names.

 The output is untidy, each name output in the id field. Add a $project step
to the pipeline to rename the '_id' field to 'name'.
Grouping by multiple fields
You can group on more than one field by passing an object to _id:

Exercise - Grouping by object

Try out the above. Notice that the _id field is now an object. You now have distinct
names and ages.

Counting with Group

Group has the ability to count. You can count the entries in a group. By chaining
$group commands together you could count the number of groups.

For example, say you have a set of customer records which may contain duplicate
emails. You could group by email and find out who is using your service most often.
You might count the groups, to get the number of distinct emails, you might group by
count, to find how many people used your site once, twice, five times, etc.
We use $group to count, because generally we want to count groups.

Counting everything
We could count the entire collection by grouping everything, then adding a count
field. This is the same as db.collection.find().count()

Exercise - Count everything

 Count each of people grouped by name. How many are there?

Count with $match

 Add a $match step to the start of your pipeline. Count the number of cats
each person has using the aggregate pipeline. How many do you have?
Harder - Count with $project and $cond
 Use $project to create a 'hasCat' field. You will need to use $cond to do
this: http://docs.mongodb.org/manual/reference/operator/aggregation/cond/. Check
that your pipeline now contains the hasCat field.
 Now group by hasCat and count.
Counting Name Popularity
Let's group on name, then count how many people have each name:
Sorting with Aggregation
We can sort records in the aggregation pipeline just as we can with find. Choose the
fields to sort on and pass 1 or -1 to sort or reverse the sort.

Exercise - Stocks
 Sort person by distinct age
Count distinct
Another challenge is to count the number of groups. For example, say you have a
dataset containing duplicate emails, you might want to generate a list of distinct
emails and then count that list.

You have two ways to do this:

The wrong way

Now say we want to pick out all the unique emails, we might use distinct, like so:

This will pop a list out into memory, like this:

We could get the length of the collection just by querying the array, like so:

However, this is bad. Imagine now that we have 15,000,000 records. We now have to
create a massive array just for the purpose of getti ng a single number.
The right way

Instead we can do this entirely in the aggregation framework using two group
commands. First, we group by emails and throw away the rest of the data. We now
have a list of all the unique emails.

We now want to find out how big this set is, so we create a big group that holds
everything (using _id: 1) and count that.

Exercise - Enron
 List all the unique person by name
 Count each the unique person.

 group by person name and count to find out which person has the largest
number of cats
 Rank number of cats in descending order.

The Good Shadowing Guide by Julian Northbrook
67% (3)
The Good Shadowing Guide by Julian Northbrook
8 pages
MongoDB Aggregation Guide
No ratings yet
MongoDB Aggregation Guide
46 pages
05 MongoDB Aggregation Pipeline With Examples
No ratings yet
05 MongoDB Aggregation Pipeline With Examples
41 pages
ADBMS-UNIT-3
No ratings yet
ADBMS-UNIT-3
31 pages
T4 Aggregation PDF
No ratings yet
T4 Aggregation PDF
18 pages
Aggregation Lab Slides
No ratings yet
Aggregation Lab Slides
34 pages
Mongodb-Aggregation-And-Indexing Group-B-Assignment-15batch-1
No ratings yet
Mongodb-Aggregation-And-Indexing Group-B-Assignment-15batch-1
8 pages
Les12 Mongo Part 3 Aggregation
No ratings yet
Les12 Mongo Part 3 Aggregation
9 pages
ADB - Lab Sheet 4
No ratings yet
ADB - Lab Sheet 4
11 pages
Aggregation Tutorial
No ratings yet
Aggregation Tutorial
4 pages
4- MongoDB aggregation framework (1)
No ratings yet
4- MongoDB aggregation framework (1)
38 pages
Aggregation_nates
No ratings yet
Aggregation_nates
12 pages
MongoDB_Indexing_Aggregation
No ratings yet
MongoDB_Indexing_Aggregation
5 pages
MongoDB Aggregation PDF
No ratings yet
MongoDB Aggregation PDF
4 pages
MongoDB Aggregation
No ratings yet
MongoDB Aggregation
11 pages
Notes-Lecture 14 - MongoDB with NodeJS - II-3447
No ratings yet
Notes-Lecture 14 - MongoDB with NodeJS - II-3447
13 pages
BDTT Lab 2023 24 Week9
No ratings yet
BDTT Lab 2023 24 Week9
26 pages
MongoDB ReferenceCards
No ratings yet
MongoDB ReferenceCards
28 pages
Remaining NGD New
No ratings yet
Remaining NGD New
21 pages
MongDb Queries
No ratings yet
MongDb Queries
5 pages
GROUP B - Assign - 2 - Mongodb-Indexing-And-Aggregation-In-Mongodb-2-33
No ratings yet
GROUP B - Assign - 2 - Mongodb-Indexing-And-Aggregation-In-Mongodb-2-33
32 pages
4 - MongoDB Aggregation Framework
No ratings yet
4 - MongoDB Aggregation Framework
38 pages
04 - Aggregation Operations
No ratings yet
04 - Aggregation Operations
68 pages
Big Data
No ratings yet
Big Data
11 pages
MongoDB Reference Card
No ratings yet
MongoDB Reference Card
28 pages
Mongodb Indexing and Aggregation in Mongodb
No ratings yet
Mongodb Indexing and Aggregation in Mongodb
33 pages
NoSQL 14 MONGO 2
No ratings yet
NoSQL 14 MONGO 2
37 pages
Mongo DB Pracs 1to 3-1
No ratings yet
Mongo DB Pracs 1to 3-1
29 pages
M 5
No ratings yet
M 5
4 pages
Lecture 3 FULL Explanation
No ratings yet
Lecture 3 FULL Explanation
32 pages
Mongo DB Operators
No ratings yet
Mongo DB Operators
104 pages
Itvoyagers Aggregation in Mongodb
No ratings yet
Itvoyagers Aggregation in Mongodb
15 pages
NoSQL and MongoDB
No ratings yet
NoSQL and MongoDB
24 pages
Full Final
No ratings yet
Full Final
64 pages
M10A1
No ratings yet
M10A1
3 pages
ITP249 - Lecture - 13 - v2-2
No ratings yet
ITP249 - Lecture - 13 - v2-2
19 pages
MongoDB - Cours 4
No ratings yet
MongoDB - Cours 4
58 pages
Nosql Notes101
No ratings yet
Nosql Notes101
11 pages
MongoDB Aggregate Exercise2
No ratings yet
MongoDB Aggregate Exercise2
10 pages
Mongo DB
No ratings yet
Mongo DB
36 pages
Mongo DB Documentation-2922'11
No ratings yet
Mongo DB Documentation-2922'11
21 pages
Mongodb Notes
100% (1)
Mongodb Notes
14 pages
MongoDB Presentaton
No ratings yet
MongoDB Presentaton
30 pages
Dod Unit4
No ratings yet
Dod Unit4
18 pages
InternalAssignment2
No ratings yet
InternalAssignment2
11 pages
mongoDB Syllabus
No ratings yet
mongoDB Syllabus
3 pages
- Constructing Queries on Databases, Collections, ...
No ratings yet
- Constructing Queries on Databases, Collections, ...
2 pages
DF100 - 03 - Storage and Retrieval Part 2
No ratings yet
DF100 - 03 - Storage and Retrieval Part 2
32 pages
Nosql Query
No ratings yet
Nosql Query
37 pages
UNIT 2 - BDA NOTES
No ratings yet
UNIT 2 - BDA NOTES
37 pages
Ex 9,10,11
No ratings yet
Ex 9,10,11
8 pages
MongoDB Official Cheat Sheet
100% (2)
MongoDB Official Cheat Sheet
15 pages
MEAN 3 L4 Advanced MongoDB With Aggregation
No ratings yet
MEAN 3 L4 Advanced MongoDB With Aggregation
94 pages
Mongo DB Notes
No ratings yet
Mongo DB Notes
5 pages
BDA - MongoDB
No ratings yet
BDA - MongoDB
12 pages
DOC-20250318-WA0005.
No ratings yet
DOC-20250318-WA0005.
145 pages
Lab Sheet 06 - Introduction To NoSQL Databases Using MongoDB
No ratings yet
Lab Sheet 06 - Introduction To NoSQL Databases Using MongoDB
5 pages
MongoDB Aggregation Guide PDF
No ratings yet
MongoDB Aggregation Guide PDF
46 pages
MongoDB Notes For Pros
100% (1)
MongoDB Notes For Pros
73 pages
MongoDB Aggregation
No ratings yet
MongoDB Aggregation
6 pages
JavaScript for Kids: Start Your Coding Adventure
From Everand
JavaScript for Kids: Start Your Coding Adventure
Abdelfattah Ragab
No ratings yet
Luyện Đọc Tiếng Anh - p3
No ratings yet
Luyện Đọc Tiếng Anh - p3
8 pages
Design Document Database
No ratings yet
Design Document Database
62 pages
Design Key-Value Database + Real Usecase
No ratings yet
Design Key-Value Database + Real Usecase
49 pages
Cmrcet - I-Mid Adc (A30407)
No ratings yet
Cmrcet - I-Mid Adc (A30407)
6 pages
1.4.1 Static Routing
No ratings yet
1.4.1 Static Routing
1 page
CCNA Sample Resume 1
No ratings yet
CCNA Sample Resume 1
2 pages
Fpga Homework
100% (1)
Fpga Homework
4 pages
32137020401A - User Manual
No ratings yet
32137020401A - User Manual
124 pages
ICS LectureSlides Week 09 MS PowerPoint
No ratings yet
ICS LectureSlides Week 09 MS PowerPoint
34 pages
Download Fundamentals of Python First Programs 1st Edition Kenneth A. Lambert ebook All Chapters PDF
100% (15)
Download Fundamentals of Python First Programs 1st Edition Kenneth A. Lambert ebook All Chapters PDF
50 pages
Ict Chapter 7
No ratings yet
Ict Chapter 7
47 pages
Group Policy Processing and Precedence - Group Policy PDF
No ratings yet
Group Policy Processing and Precedence - Group Policy PDF
8 pages
Addressing Modes: 4.1. Interpreting Memory Addresses
No ratings yet
Addressing Modes: 4.1. Interpreting Memory Addresses
20 pages
CV Mandeep Thakur
No ratings yet
CV Mandeep Thakur
2 pages
Best Firewall Security Appliance Cisco ASA5506-K8 - New
No ratings yet
Best Firewall Security Appliance Cisco ASA5506-K8 - New
17 pages
Conexiones Establecidas
No ratings yet
Conexiones Establecidas
4 pages
Computer Graphics: Animation Movies Video Game
No ratings yet
Computer Graphics: Animation Movies Video Game
8 pages
Object-Oriented Software Engineering: The Design Workflow
No ratings yet
Object-Oriented Software Engineering: The Design Workflow
8 pages
Basic 8 - Week 2
No ratings yet
Basic 8 - Week 2
26 pages
Geometric Transformations in Augmented Reality Applications - Final
No ratings yet
Geometric Transformations in Augmented Reality Applications - Final
4 pages
Java Programming 3rd Quarter
No ratings yet
Java Programming 3rd Quarter
3 pages
MATLAB 2009b Installation Network Version - Windows
No ratings yet
MATLAB 2009b Installation Network Version - Windows
10 pages
Brksec 2050
No ratings yet
Brksec 2050
191 pages
C++ List (With Examples)
No ratings yet
C++ List (With Examples)
15 pages
An Introduction of Theory of Computation
No ratings yet
An Introduction of Theory of Computation
5 pages
Bartender Maintenance Update Checklist
No ratings yet
Bartender Maintenance Update Checklist
1 page
Learn Python
No ratings yet
Learn Python
5 pages
Manual ILE COBOL Language Reference - English
No ratings yet
Manual ILE COBOL Language Reference - English
752 pages
Unit 4 Notes
No ratings yet
Unit 4 Notes
18 pages
In-Cell Capacitive Touch Panel Structures and Their Readout Circuits
No ratings yet
In-Cell Capacitive Touch Panel Structures and Their Readout Circuits
5 pages
Aws 101 Presentation Deck August 2014 1
No ratings yet
Aws 101 Presentation Deck August 2014 1
47 pages
Android Architecture: Giteswar Goswami 19cs801014
No ratings yet
Android Architecture: Giteswar Goswami 19cs801014
8 pages
Atif Rizvi
No ratings yet
Atif Rizvi
3 pages

Answer

Uploaded by

Answer

Uploaded by

The Mongo Aggregation

Exercise - Create an Empty pipeline

Filtering the pipeline with

Modifying a stream with

Say we have a set of voucher codes, like this:

This will yield a set something like the following:

Chaining $match and $project

 After that, use $project to pull out only cat names

Say we had a list of people, like so:

We can use project to compose a name field, like so:

This will give us results like this:

Exercise - String aggregation operators

Unhappy will either be true or false.

Exercise - Add a hasCat field

 Add an isOld field to show if person name is greater than 80

The simplest group would look like this:

Exercise - Grouping by object

Counting with Group

Exercise - Count everything

Count with $match

You have two ways to do this:

The wrong way

This will pop a list out into memory, like this:

You might also like