Oracle hopes to make SQL a lingua franca for big data

Big Data SQL is set to ship within the next couple of months

Oracle is hoping to turn heads in the crowded data analysis market with Big Data SQL, a software tool that can run a single SQL query against Oracle's own database as well as Hadoop and NoSQL data stores.

The software is an option for Oracle's Big Data Appliance, which incorporates Cloudera's Hadoop distribution, said Neil Mendelson, vice president of product development, big data and analytics.

There's a lot of experimentation going on in enterprises around so-called big data, but certain factors are impeding customers from moving these projects into production mode, namely a lack of integration between Hadoop and other systems, difficulty obtaining the right talent and concerns about security, Mendelson said.

Big Data SQL takes advantage of the core skills any Oracle database administrator has, he added. "You get to use the full dialect of SQL."

You also have to buy big into Oracle's technology stack, however.

Big Data SQL's full benefits require an Oracle database to be installed and running on the software company's Exadata database machine. In an implementation, Exadata and the Big Data Appliance would share an interconnect for data exchange, Mendelson said.

In addition, Big Data SQL is only compatible with version 12c of the Oracle database, which was released last year. Most Oracle database customers are still running versions 11g and earlier.

But customers get benefits in exchange for the investments Big Data SQL requires, particularly the ability to use the Oracle database's advanced security features within Hadoop and NoSQL stores, he said. Security rules set for data in 12c are simply "pushed" into those other environments, Mendelson said.

Oracle over time will add support for using Big Data SQL with other hardware systems it sells, according to Mendelson. The software is set for general availability within the next couple of months, with pricing to be announced at that time.

Big Data SQL isn't an attempt to replace the SQL engines already created for Hadoop, such as Hive and Impala, which Oracle will continue to ship with the Big Data Appliance, he said. "We're really solving a wider problem."

One big challenge facing data scientists is simply the overhead of moving data among systems, he said. Big Data SQL allows various information stores to be queried in place with minimal data movement, and queries are made more efficient using Smart Scan technology from Exadata's software stack.

At a quick glance Big Data SQL might appear to simply be another take on federated querying, which has been around for quite some time. It also has its disadvantages, said analyst Curt Monash of Monash Research.

"Federating query across systems involves a network cost, always," he said. "Often it also leads to a query being planned by an optimizer that isn't ideal for all parts of the query," Monash said. "If the performance advantages of moving the data are large enough to outweigh those considerations, it usually would be even better to move the data before you start."

But Big Data SQL "is data federation with some predicate pushdown," Monash said. "A predicate is, for this purpose, part of a SQL statement. Rather than do everything at the central processing location, which can be a cluster itself, you push down some of the predicates to where the data is stored."

"That's the whole point of Exadata," Monash added. "A lot of the filtering is done locally, so that the network impact isn't as miserable as it otherwise could be. This reduces the objections to data federation. It's a good idea, just as Exadata is a good idea."

But it would be an overstatement to call Big Data SQL a breakthrough, Monash said.

"These are all well-known ideas, which Oracle seems to have now implemented for its own particular walled-garden environment."

Oracle is expected to discuss Big Data SQL further during a webcast on Wednesday featuring Andrew Mendelsohn, executive vice president of database server technologies.

Chris Kanaracus covers enterprise software and general technology breaking news for The IDG News Service. Chris' email address is Chris_Kanaracus@idg.com

Join the newsletter!

Error: Please check your email address.
Rocket to Success - Your 10 Tips for Smarter ERP System Selection

Tags open sourceapplication developmentdatabasesapplicationssoftwareOracle

Keep up with the latest tech news, reviews and previews by subscribing to the Good Gear Guide newsletter.

Chris Kanaracus

IDG News Service
Show Comments

Most Popular Reviews

Latest Articles

Resources

PCW Evaluation Team

Ben Ramsden

Sharp PN-40TC1 Huddle Board

Brainstorming, innovation, problem solving, and negotiation have all become much more productive and valuable if people can easily collaborate in real time with minimal friction.

Sarah Ieroianni

Brother QL-820NWB Professional Label Printer

The print quality also does not disappoint, it’s clear, bold, doesn’t smudge and the text is perfectly sized.

Ratchada Dunn

Sharp PN-40TC1 Huddle Board

The Huddle Board’s built in program; Sharp Touch Viewing software allows us to easily manipulate and edit our documents (jpegs and PDFs) all at the same time on the dashboard.

George Khoury

Sharp PN-40TC1 Huddle Board

The biggest perks for me would be that it comes with easy to use and comprehensive programs that make the collaboration process a whole lot more intuitive and organic

David Coyle

Brother PocketJet PJ-773 A4 Portable Thermal Printer

I rate the printer as a 5 out of 5 stars as it has been able to fit seamlessly into my busy and mobile lifestyle.

Kurt Hegetschweiler

Brother PocketJet PJ-773 A4 Portable Thermal Printer

It’s perfect for mobile workers. Just take it out — it’s small enough to sit anywhere — turn it on, load a sheet of paper, and start printing.

Featured Content

Latest Jobs

Don’t have an account? Sign up here

Don't have an account? Sign up now

Forgot password?