Mongo DB

How to run?

For insall jupyter notebook

pip install jupyter notebook

To run type in your terminal

jupyter notebook

Analytics in MongoDB

CRUD                                  Analytics
CREATE ---------> Database  ---------> Group
READ   ---------> (MongoDB) ---------> COUNT
UPDATE --------->           ---------> DERIVE VALUES
DELETE --------->           ---------> FILTER, AVERAGE, SORT

Core Concept : Pipeline

ls -1 ---------> Pipe ---------> wc -l --------->   Terminal
       stdout           stdin            stdout

Ex : ps -ef

grep mongod

What is the Aggregation Pipeline?

A series of Document Transformations

Executed in stages
Original input is a collection
Output as a cursor or a collection

Rich Library of functions

Filter, compute, group, and summarize data
Output of one stage sent to input of next
Operation executed in sequential order

Syntax For an Aggregation

aggreate() method
>db.COLLECTION_NAME.aggregate(AGGREGATE_OPERATION)
>db.foo.aggregate([{ stage1 },{ stage2 },{ stage3 }, .... ])

db - variable pointing to current database
collection name
agggregate - method on collection
array of objects, each a pipeline operator
pipeling operators

Some Popular Pipeline Operators

    $match       ---> Filter documents
    $project     ---> Reshape documents
    $group       ---> Summarize documents
    $unwind      ---> Expand arrays in documents
    $sort        ---> Order documents
    $limit/$skip ---> Paginate documents
    $redact      ---> Restrict documents
    $geoNear     ---> Proximity sort documents
    $let         ---> Define variables
    $map         ---> Define variables