User:Nmshah5/sandbox

MongoDB (from humongous) is a cross-platform document-oriented database. Classified as a NoSQL database, MongoDB eschews the traditional table-based relational database structure in favor of JSON-like documents with dynamic schemas (MongoDB calls the format BSON), making the integration of data in certain types of applications easier and faster. Released under a combination of the GNU Affero General Public License and the Apache License, MongoDB is free and open-source software.

First developed by the software company MongoDB Inc. in October 2007 as a component of a planned platform as a service product, the company shifted to an open source development model in 2009, with MongoDB offering commercial support and other services. Since then, MongoDB has been adopted as backend software by a number of major websites and services, including Craigslist, eBay, Foursquare, SourceForge, Viacom, and The New York Times among others. MongoDB was the most popular NoSQL database system.

History of MongoDB
MongoDB was created by the founders of DoubleClick. Since leaving DoubleClick, they founded a number of startups and kept running into the same scaling problems over and over. They decided to try to create an application stack that would scale out easily, as companies everywhere seemed to be running into the same issues.

In Fall 2007, they founded 10gen and started working on an application platform for the cloud, similar to Google App Engine. The 10gen engine’s main language was server-side JavaScript, so the scalable database they were designing for it (proto-MongoDB) was also JavaScript-y.

The 10gen appengine was called ed (for Eliot and Dwight) and the database was called p (for platform). In the summer of 2008, someone decided that they needed real names, so they came up with Babble for the app engine and MongoDB for the database. The name “Mongo” was, originally, from Blazing Saddles (it was back-named to humongous).

Dwight commented below about the original of the name “Mongo:”

"Actually the name choice really does come from the word humongous. A couple years earlier a naming consultant showed me a list of 50 names for potential companies and consumer web products, and it was in that deck, and the point made was that it connoted “big”. But as you say some folks joked about the movie reference when we gave that name to the db, and I didn’t elaborate on the logic behind the naming at the time I would guess. I certainly didn’t in my mind make a negative association about the name at the time; my last encounter with it before that point was probably in Shrek 2. I knew it was campy but it was just a piece of the tech stack at first, not then a big standalone product and technology as it is today. Of course I now know that in some parts of the world it’s an odd choice — apologies about that to those of you in those locales."

The problem was, no one cared about Google App Engine and certainly no one cared about 10gen’s app engine. Developers would say, “well, the database is cool, but blech, app engine.”

After a year of work and practically no users, they ripped the database out of the app engine and open sourced them. Immediately, MongoDB started getting users.

Later on in the year 2009, MongoDB was introduced in the market as an open source database server that was maintained and supported by the MongoDB Inc. Many large and medium scale organizations such as SourceForge, Foursquare, craigslist and eBay are now using MongoDB in development of their database applications.

Main features
Some of the features include: Instead of taking a business subject and breaking it up into multiple relational structures, MongoDB can store the business subject in the minimal number of documents. For example, instead of storing title and author information in two distinct relational structures, title, author, and other title-related information can all be stored in a single document called Book. MongoDB supports search by field, range queries, regular expression searches. Queries can return specific fields of documents and also include user-defined JavaScript functions. Any field in a MongoDB document can be indexed (indices in MongoDB are conceptually similar to those in RDBMSes). Secondary indices are also available. MongoDB provides high availability with replica sets. A replica set consists of two or more copies of the data. Each replica set member may act in the role of primary or secondary replica at any time. The primary replica performs all writes and reads by default. Secondary replicas maintain a copy of the data of the primary using built-in replication. When a primary replica fails, the replica set automatically conducts an election process to determine which secondary should become the primary. Secondaries can also perform read operations, but the data is eventually consistent by default. MongoDB scales horizontally using sharding. The user chooses a shard key, which determines how the data in a collection will be distributed. The data is split into ranges (based on the shard key) and distributed across multiple shards. (A shard is a master with one or more slaves.) MongoDB can run over multiple servers, balancing the load and/or duplicating data to keep the system up and running in case of hardware failure. Automatic configuration is easy to deploy, and new machines can be added to a running database. MongoDB can be used as a file system, taking advantage of load balancing and data replication features over multiple machines for storing files.
 * Document-oriented
 * Ad hoc queries
 * Indexing
 * Replication
 * Load balancing
 * File storage

This function, called Grid File System, is included with MongoDB drivers and available for development languages (see "Language Support" for a list of supported languages). MongoDB exposes functions for file manipulation and content to developers. GridFS is used, for example, in plugins for NGINX and lighttpd. Instead of storing a file in a single document, GridFS divides a file into parts, or chunks, and stores each of those chunks as a separate document.

In a multi-machine MongoDB system, files can be distributed and copied multiple times between machines transparently, thus effectively creating a load-balanced and fault-tolerant system.

MapReduce can be used for batch processing of data and aggregation operations. The aggregation framework enables users to obtain the kind of results for which the SQL GROUP BY clause is used. JavaScript can be used in queries, aggregation functions (such as MapReduce), and sent directly to the database to be executed. MongoDB supports fixed-size collections called capped collections. This type of collection maintains insertion order and, once the specified size has been reached, behaves like a circular queue.
 * Aggregation
 * Server-side JavaScript execution
 * Capped collections

Criticisms
In some failure scenarios where an application can access two distinct MongoDB processes, but these processes cannot access each other, it is possible for MongoDB to return stale reads. In this scenario it is also possible for MongoDB to acknowledge writes that will be rolled back.

Before version 2.2, concurrency control was implemented on a per-mongod basis. With version 2.2, concurrency control was implemented at the database level. Since version 3.0, pluggable storage engines were introduced, and each storage engine may implement concurrency control differently. With MongoDB 3.0 concurrency control is implemented at the collection level for the MMAPv1 storage engine, and at the document level with the WiredTiger storage engine. With versions prior to 3.0, one approach to increase concurrency is to use sharding. In some situations, reads and writes will yield their locks. If MongoDB predicts a page is unlikely to be in memory, operations will yield their lock while the pages load. The use of lock yielding expanded greatly in 2.2.

Another criticism is related to the limitations of MongoDB when used on 32-bit systems. In some cases, this was due to inherent memory limitations. MongoDB recommends 64-bit systems and that users provide sufficient RAM for their working set. Some users experience increased latency when their working set exceeds available RAM and the system encounters page faults. Compose, a provider of managed MongoDB infrastructure, recommends a scaling checklist for large systems.

Additionally, MongoDB does not support collation-based sorting and is limited to byte-wise comparison via memcmp, which will not provide correct ordering for many non-English languages when used with a Unicode encoding.

Language support
MongoDB has official drivers for a variety of popular programming languages and development environments. There are also a large number of unofficial or community-supported drivers for other programming languages and frameworks.

Third-Party GUI tools
There is an active and growing community of developers building third-party rich GUI tools for the MongoDB. Some relevant examples (listed in alphabetical order):
 * 3T MongoChef, cross-platform MongoDB GUI
 * Database Master, web-based client software, supports RDMS
 * Fang of Mongo, web-based UI, built with Django and jQuery
 * HumongouS.io, Web based GUI for MongoDB
 * mms, Mongo Management Studio, cross-platfrorm and web-based GUI
 * MongoHub, a native OS-X-application for MongoDB management
 * NoSQL Manager for MongoDB, a MS Windows GUI application for MongoDB management with Shell
 * Opricot, browser-based MongoDB-Shell, implemented in PHP
 * Robomongo, Shell-centric cross-platform MongoDB management tool
 * UMongo (JMongoBrowser), cross-platform Management-GUI, implemented in Java

Performance
United Software Associates published a benchmark using YCSB as a basis of all the tests. MongoDB provides greater performance than Couchbase Server or Cassandra in all the tests they ran, in some cases by as much as 25x.

Another benchmark for top NoSQL databases was done by End Point, which compares MongoDB with other NoSQL database solutions.

Popular MongoDB Users
There are many examples of MongoDB users who are pushing the limits to scalability. Here are a few, organized around each scaling dimension.


 * FIFA (video game series): EA Sports FIFA is the world's best-selling sports video game franchise. To serve millions of players, EA's Spearhead development studio selected MongoDB to store user data and game state. Auto-sharding makes it simple to scale MongoDB across EA's 250+ servers with no limits to growth as EA FIFA wins more fans.


 * Yandex: The largest search engine in Russia uses MongoDB to manage all user and metadata for its file sharing service. MongoDB has scaled to support tens of billions of objects and TBs of data, growing at 10 million new file uploads per day.


 * eBay: The world's largest online auction site uses MongoDB to store all media metadata for the site. This includes references to images of every item for sale on eBay. The MongoDB cluster is deployed across multiple data centers and delivers 99.999% availability.


 * Foursquare: Foursquare is used by over 50 million people worldwide, who have checked in over 6 billion times, with millions more added every day. MongoDB is Foursquare's main database, supporting hundreds of thousands of operations per second and storing all check-ins and history, user, and venue data along with reviews.


 * McAfee: MongoDB powers McAfee Global Threat Intelligence (GTI), a cloud-based intelligence service that correlates data from millions of sensors around the globe. Billions of documents are stored and analyzed in MongoDB to deliver real-time threat intelligence to other McAfee end-client products.


 * Adobe: Many of the world's most recognizable brands use Adobe Experience Manager to accelerate development of digital experiences that increase customer loyalty, engagement and demand. Adobe uses MongoDB to store petabytes of data in the large-scale content repositories underpinning the Experience Manager.


 * Craigslist: With 80 million classified ads posted every month, Craigslist needs to archive billions of records in multiple formats, and must be able to query and report on these archives at runtime. Craigslist migrated from MySQL to MongoDB to support its active archive, with continuous availability mandated for regulatory compliance across 700 sites in 70 different countries.

Applications Using MongoDB

 * Bussion	Application for creating dashboards with drilldown and collaboration capabilities.
 * c5t	Content-management using TurboGears and Mongo.
 * Calipso	Content management system built using NodeJS and MongoDB.
 * Cube	Cube is an open-source system for visualizing time series data, built on MongoDB, Node and D3.
 * ErrorApp	ErrorApp tracks errors from your apps. It reports them to you and gathers all information and make reports available to you.
 * Forward	A full-featured, developer centric open source e-commerce platform that makes custom code easy, with powerful templates & expressive syntax.
 * Graylog2	Graylog2 is an open source syslog server implementation that stores logs in MongoDB and provides a Rails frontend.
 * HarmonyApp	Harmony is a powerful web-based platform for creating and managing websites. It helps connect developers with content editors, for unprecedented flexibility and simplicity. For more information, view Steve Smith’s presentation on Harmony at MongoSF (April 2010).
 * Hummingbird	Hummingbird is a real-time web traffic visualization tool developed by Gilt Groupe.
 * KeystoneJS	KeystoneJS is a Node.js content management system and web application platform built on Express.JS and MongoDB.
 * Locomotive	Locomotive is an open source CMS for Rails. It’s flexible and integrates with Heroku and Amazon S3.
 * Mogade	Mogade offers a free and simple to use leaderboard and achievement services for mobile game developers.
 * MongoLantern	MongoLantern is an open source full text search server using MongoDB as index storage, which allows MongoLantern to migrate any changes very easily into account using MongoDB API. *It’s written originally written in PHP can be migrated to any desired language as required using it’s future APIs.
 * Mongoop	Monitor and locate long running operations on MongoDB and automatically trigger specific actions (email, sentry, nsca...) for alerting and performance analysis.
 * MongoPress	A flexible CMS that uses MongoDB and PHP.
 * Mongs	A simple, web-based data browser for MongoDB.
 * Mongeez	Mongeez is an opensource solution allowing you to manage your mongo document changes in a manner that is easy to synchronize with your code changes. Check out mongeez.org.
 * NewsBlur	NewsBlur is an open source visual feed reader that powers http://newsblur.com. NewsBlur is built with Django, MongoDB, Postgres and RabbitMQ.
 * Pics.io	Pics.io is a digital asset management system that helps to centralize, manage and share images, videos, audio, PDFs and other valuable files across organization.
 * Quantum GIS	Plugin for Quantum GIS that lets you plot geographical data stored in MongoDB.
 * Rubedo	Rubedo is a full featured open source Enterprise Content Management System, built on MongoDB and Elastic Search with Zend Framework, Sencha Ext JS and Boostrap. It offers a complete set of back-office tools to easily manage galaxies of responsive, flexible and performant applications or websites.
 * Scribe	Open source image transcription tool.
 * Shapado	Free and open source Q&A software, open source stackoverflow style app written in Ruby, Rails, MongoMapper and MongoDB.
 * Strider	Strider: Open Source Continuous Integration & Deployment Server.
 * Thundergrid	Thundergrid is a simple framework written in PHP that allows you to store large files in your Mongo database in seconds.
 * Websko	Websko is a content management system designed for individual Web developers and cooperative teams.

Licensing and support
MongoDB is available for free under the GNU Affero General Public License. The language drivers are available under an Apache License. In addition, MongoDB Inc. offers proprietary licenses for MongoDB.

Popularity
, MongoDB is the fourth most popular type of database management system, and the most popular for document stores.

Production deployments
Large-scale deployments of MongoDB are tracked by MongoDB Inc. Notable users of MongoDB include:
 * Amadeus IT Group uses MongoDB for its back-end software.
 * The Compact Muon Solenoid at CERN uses MongoDB as the primary back-end for the Data Aggregation System for the Large Hadron Collider.
 * eBay uses MongoDB in the search suggestion and the internal Cloud Manager State Hub.
 * Foursquare deploys MongoDB on Amazon AWS to store venues and user check-ins into venues.
 * LinkedIn uses MongoDB as its backend DB.
 * MetLife uses MongoDB for “The Wall", a customer service application providing a "360-degree view" of MetLife customers.
 * SAP uses MongoDB in the SAP PaaS.
 * Shutterfly uses MongoDB for its photo platform. As of 2013, the photo platform stores 18 billion photos uploaded by Shutterfly's 7 million users.
 * Sophos uses MongoDB in its cloud security technology.
 * Sourceforge uses MongoDB for its back-end storage pages.
 * Talentica Software uses MongoDB in an ad serving platform for one of its clients.