News Stay informed about the latest enterprise technology news and product updates.

Marketing company applies SQL to MongoDB to uncover social media trends

Marketing analytics firm SumAll adopted SQL-enabled Data Warehouse as a Service tools to analyze social media data stored in a MongoDB NoSQL database.

Despite a host of new big data innovations, including Hadoop, MapReduce and NoSQL databases such as MongoDB, some...

familiar and trusted technologies hang on -- one being SQL.

That is because the job of uncovering important business patterns doesn't become easier as companies collect data in more varied formats, especially if those new formats don't have familiar SQL support.

Faced with such an issue, New York-based marketing analytics firm SumAll turned to a Web-based data warehouse that supports SQL queries on non-SQL data. The firm chose Data Warehouse as a Service technology from BitYota Inc. to do rapid interactive analysis on native JavaScript Object Notation (JSON) data stored in a MongoDB NoSQL database.

SumAll provides services that allow its users to tap into social and Web application performance data. According to Korey Lee, SumAll's chief information officer, the company is looking to use SQL to generate reporting dashboards that help customers identify social media trends and measure their social media efforts versus their industry peers. Twitter, Instagram, Facebook -- these are just some of the many Web applications that are supported.

BitYota's software services "build a mapping layer on top of MongoDB that allows us to query against our data and aggregate it," Lee said.

He said SumAll tracks 100s of billions of social impressions over billions of rows. He characterized MongoDB as very scalable, but added that it can be difficult to analyze and aggregate MongoDB data.

"MongoDB has its native query functionality of MapReduce," Lee said, referring to the programming environment often used to process large data sets with Hadoop clusters and NoSQL databases. "But we found that it doesn't do a really good job with aggregations for us, especially across collections [of data] at scale. BitYota solves that problem."

By such means, he said, SumAll can create analytical dashboards that let customers estimate the return on investment and compare results for marketing campaigns across social media platforms.

"We can help our customers figure out if Instagram is more effective than Facebook for them -- and, even more importantly, track if this is a broader trend across tens of thousands of business users," he said.

JSON and the 'NoSQLnauts'

The need to analyze NoSQL data from sources like MongoDB or Hadoop appears to be growing. BitYota CEO Dev Patel said his company, based in Mountain View, Calif., formed around the need to integrate varied types of data from around the Web, and that cloud data warehouses can help enable such applications.

NoSQL databases, supporting data schema creation after the fact, keep users' options open for incorporating all kinds of data as time goes by, he said. Still, SumAll's Lee and others continue to look to SQL for queries on diverse Web-borne NoSQL data, Patel noted.

"We see a lot of interest in NoSQL related to flexibility, especially in the mobile and Web application space," he said. "But there are many areas where SQL is going to continue."

JSON is showing up more often in the modern NoSQL data mix. Riding a wave of JavaScript development, it became a favored alternative to XML in Web services creation.

Meanwhile, the MongoDB database arose, often used as a store for JSON documents. The trend has not been overlooked by established SQL tools vendors such as IBM, Microsoft and Oracle, which have begun to offer various ways of handling JSON. They have customers that, like SumAll, want to apply SQL methods to NoSQL application data.

SQL is a big reason why SumAll went with the BitYota data warehouse service, Lee said: "It lets us use something familiar."

The actual implementation of SumAll's MongoDB version comes via ObjectRocket, a Rackspace U.S. Inc. cloud hosting company that provides MongoDB as a Service. Speed was an important factor in SumAll's software selections, according to Lee, who said the tools employed allow SumAll to query its full store of data on demand at any time.

Jack Vaughan is SearchDataManagement's news and site editor. Email him at, and follow us on Twitter: @sDataManagement.

Next Steps

See how NoSQL DBs vie with RDBMSs

Watch a video Q&A with MongoDB's CEO

Go behind the hype on NoSQL databases

Dig Deeper on Data warehouse software

Join the conversation


Send me notifications when other members comment.

Please create a username to comment.

Are you working with JSON data formats?
We are about to use them for an event management (with lots of huge video driven data and media files) using Oracle, MongoDB and Hadoop / MapReduce with Query router and reporting to get the analytical outputs on ones mobiles, tabs, PCs, Docs etc.

Please advice us if you can suggest a better way to achieve our goal for the end users / stakeholders to view the analytic data over their devices along with development strategies too.

Thanks & Regards