Google BigQuery update aims to entice Hadoop users

BigQuery users can now combine query results from multiple tables

Hoping to lure more Apache Hadoop users to its own data analysis services, Google has outfitted BigQuery with the ability to query multiple data tables.

"Joining terabyte-sized tables has traditionally been a challenging task for data analysts, requiring sophisticated MapReduce development skills, powerful hardware, or a lot of time -- often all three," wrote Ju-kay Kwek, Google BigQuery product manager, in a blog post announcing the update. "Today with BigQuery you can get directly to business insights using SQL-like queries, with far less effort and far greater speed than you could before."

Google also argued that using BigQuery instead of a Hadoop deployment will save users money, because they pay only for the queries that are processed, rather than for the computational costs of running Hadoop's individual supporting components.

Launched in 2010, BigQuery has been marketed by Google as an interactive service for parsing large amounts of data. With BigQuery, a user submits a data set to Google, then queries the data through the BigQuery API (application programming interface).

The new updates expand capabilities BigQuery already has in place. Most notable is a new JOIN clause that combines the results of a query across multiple data sources. Prior to this update, BigQuery's JOIN clause could only work with a data set less than 8MB in size. The new clause, JOIN EACH, has no limit on the size of the data.

As a result, the service can now be used more effectively as a replacement for Hadoop's MapReduce. Many Hadoop jobs are designed to bring together large amounts of data from two or more data sets. To do this, however, developers must write MapReduce processes from scratch, which can be time-consuming. JOIN EACH can produce a single result set from two large database tables that share a common key.
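To make the idea of joining on a common key concrete, here is a minimal sketch in Python. The query string shows roughly what a JOIN EACH query might look like in BigQuery's SQL-like dialect; the dataset, table, and column names are invented for illustration. The small function below it mimics, in memory, what such a join returns. It is not how BigQuery implements the operation.

```python
from collections import defaultdict

# Hypothetical query: the dataset, table, and column names are invented.
JOIN_EACH_QUERY = """
SELECT c.customer_id, c.name, o.total
FROM [mydataset.customers] AS c
JOIN EACH [mydataset.orders] AS o
ON c.customer_id = o.customer_id
"""


def join_on_key(left_rows, right_rows, key):
    """Inner-join two lists of dicts on a shared key (a small hash join)."""
    # Build phase: index the right-hand rows by their key value.
    index = defaultdict(list)
    for row in right_rows:
        index[row[key]].append(row)
    # Probe phase: emit one merged row for every matching pair.
    joined = []
    for left in left_rows:
        for right in index.get(left[key], []):
            joined.append({**left, **right})
    return joined


# Toy stand-ins for two tables that share the customer_id key.
customers = [
    {"customer_id": 1, "name": "Ada"},
    {"customer_id": 2, "name": "Grace"},
]
orders = [
    {"customer_id": 1, "total": 30},
    {"customer_id": 1, "total": 12},
    {"customer_id": 3, "total": 99},
]

result = join_on_key(customers, orders, "customer_id")
```

Only rows whose key appears in both tables survive the join, so the customer with no orders and the order with no matching customer are both dropped.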

"With these capabilities, you will now be able to join and perform aggregate analysis on multi-terabyte datasets using SQL-like queries or integrated [third] party tools, instead of having to initiate complex coding projects," wrote Michael Manoochehri, Google's cloud platform developer programs engineer, in a technical blog post explaining the update.

BigQuery now offers a better way to group query results as well. The GROUP EACH BY clause increases the number of distinct entities that can be grouped in a result set, though at a potential cost to processing performance.
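The grouping feature can be sketched the same way. In BigQuery's SQL-like dialect the keywords are written in the order GROUP EACH BY; the query below is a hypothetical example with invented dataset, table, and column names, and the Python function only imitates the result of such an aggregation on a toy data set.

```python
from collections import Counter

# Hypothetical query: the dataset, table, and column names are invented.
GROUP_EACH_BY_QUERY = """
SELECT customer_id, COUNT(*) AS order_count
FROM [mydataset.orders]
GROUP EACH BY customer_id
"""


def count_by_key(rows, key):
    """Count rows per distinct value of `key`, like COUNT(*) with grouping."""
    counts = Counter(row[key] for row in rows)
    return dict(counts)


# Toy stand-in for an orders table.
orders = [
    {"customer_id": 1, "total": 30},
    {"customer_id": 1, "total": 12},
    {"customer_id": 3, "total": 99},
]

order_counts = count_by_key(orders, "customer_id")
```

Each distinct customer_id becomes one group in the output. The number of such groups is what the new clause is meant to scale up.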

The BigQuery update includes several other new features as well. The service has more support for timestamps: BigQuery can now import timestamps from other systems, as well as query timestamp data. Users can now add columns to existing tables. Users can also bookmark the specific datasets they have access to, as well as receive automated emails when they have been given access to a new dataset.

Joab Jackson covers enterprise software and general technology breaking news for The IDG News Service. Follow Joab on Twitter at @Joab_Jackson. Joab's e-mail address is Joab_Jackson@idg.com
