Airbnb offers a query tool for unlocking massive data sets

Airbnb is known for its home rental services, but check out its tool for opening data warehouses

Airbnb's Airpal makes data analysis easier with an intuitive graphical user interface

Airbnb has open-sourced a tool that allows its non-technical employees to work like data scientists, and it hopes others will use the software to mine their own data more effectively.

Airbnb's Airpal application, when linked with the PrestoDB open source query engine developed by Facebook, could provide a lower-cost alternative to a data warehouse, or a friendlier interface for accessing large swaths of data stored in a Hadoop cluster.

"We think of Airpal as being a supercharger on top of PrestoDB, which takes Presto from being a command line only for computer scientists to a first class tool for all the employees in a company," said James Mayfield, Airbnb product manager.

Airbnb has been using Airpal for about a year. About a third of its employees have used the tool to query internal data, a far greater proportion than it anticipated, Mayfield said.

The software offers a GUI for exploring the company's operational data, about half a petabyte's worth. The data is managed by the Apache Hive data warehouse software and resides on a Hadoop file system. PrestoDB provides the ability to search this Hive data using SQL (Structured Query Language).

While well known to database administrators and programmers, SQL can confound people without a computer science background. Airpal provides a more intuitive way of searching the data, according to Mayfield.

Data is a big part of the company's business. Airbnb was founded in 2008 to provide a way for people to rent their dwellings or individual rooms to travelers. Using the Web and a set of mobile apps, the company has facilitated 25 million bookings in 34,000 cities.

Airpal has helped Airbnb "democratize its data," Mayfield said, referring to how any Airbnb employee, not just dedicated business analysts, can explore the company's data.

"It is always great to be able to build a product informed by metrics, rather than just one based on introspection or guesses," Mayfield said.

The software can answer questions an employee may have about the company's operations. How many bookings did Airbnb get on a given day? How many people are staying at an Airbnb location tonight? How many listings are there in any given market, city or neighborhood? How many of the users speak German, or Spanish?
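Questions like these map naturally onto short SQL queries. The following is a minimal sketch only: it uses Python's built-in SQLite module as a stand-in for PrestoDB, and the `bookings` and `listings` tables, their columns and the sample rows are all hypothetical, since Airbnb's actual schema is not described here.

```python
import sqlite3

# Hypothetical schema and rows for illustration; SQLite stands in for PrestoDB.
conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE bookings (id INTEGER PRIMARY KEY, city TEXT, booked_on TEXT);
CREATE TABLE listings (id INTEGER PRIMARY KEY, city TEXT, neighborhood TEXT);
INSERT INTO bookings (city, booked_on) VALUES
    ('Berlin', '2015-03-01'), ('Berlin', '2015-03-01'), ('Madrid', '2015-03-02');
INSERT INTO listings (city, neighborhood) VALUES
    ('Berlin', 'Mitte'), ('Berlin', 'Kreuzberg'), ('Madrid', 'Centro');
""")


def bookings_on(day):
    """How many bookings were made on a given day?"""
    row = conn.execute(
        "SELECT COUNT(*) FROM bookings WHERE booked_on = ?", (day,)
    ).fetchone()
    return row[0]


def listings_per_city():
    """How many listings are there in each city?"""
    rows = conn.execute(
        "SELECT city, COUNT(*) FROM listings GROUP BY city ORDER BY city"
    ).fetchall()
    return dict(rows)


print(bookings_on("2015-03-01"))
print(listings_per_city())
```

A graphical front end saves an employee from typing statements like these by hand, but the underlying work is the same kind of counting and grouping.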

Users can submit their queries using a Web browser. Results are returned as a CSV file, so they can be easily digested by a spreadsheet or an internal application. Users can also search for database tables and access a database's metadata, schemas and sample rows. Searches can be saved to be run again, and new tables can be created from queries.
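The two output paths described above, a CSV export and a new table built from a query, can be sketched roughly as follows. This is an illustration under stated assumptions rather than Airpal's own code: SQLite again stands in for PrestoDB, and the `users` table, the `users_by_language` table and the `query_to_csv` helper are hypothetical names.

```python
import csv
import io
import sqlite3

# Hypothetical data; SQLite stands in for the PrestoDB/Hive back end.
conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE users (id INTEGER PRIMARY KEY, language TEXT);
INSERT INTO users (language) VALUES ('German'), ('Spanish'), ('German');
""")

# Hypothetical query: count users by the language they speak.
LANGUAGE_COUNTS = (
    "SELECT language, COUNT(*) AS user_count "
    "FROM users GROUP BY language ORDER BY language"
)


def query_to_csv(connection, sql):
    """Run a query and return its result as CSV text with a header row."""
    cursor = connection.execute(sql)
    buffer = io.StringIO()
    writer = csv.writer(buffer, lineterminator="\n")
    writer.writerow([column[0] for column in cursor.description])
    writer.writerows(cursor)
    return buffer.getvalue()


# Path 1: export the result as CSV, ready for a spreadsheet.
csv_text = query_to_csv(conn, LANGUAGE_COUNTS)
print(csv_text)

# Path 2: keep the result as a new table for later queries.
conn.execute("CREATE TABLE users_by_language AS " + LANGUAGE_COUNTS)
```

CSV is a sensible default here because almost any spreadsheet or internal tool can read it without a database driver.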

For Airbnb administrators, the Airpal/PrestoDB stack eliminated a considerable amount of middleware that would otherwise be needed to execute these complex queries.

Prior to using the Airpal/PrestoDB combo, Airbnb used Amazon Web Services' Redshift service.

Redshift required a set of ETL (extract, transform and load) tools to prepare the data. It also limited how many queries could be run at once, which could have restricted wider use of the data. In addition, Airbnb developers found the Redshift error messages to be obtuse, making it difficult to debug issues, said Andy Kramolisch, an Airbnb software engineer.

Airbnb will explain Airpal in greater detail March 19 at a Presto meetup on the Facebook campus in Menlo Park, California.

Joab Jackson covers enterprise software and general technology breaking news for The IDG News Service. Follow Joab on Twitter at @Joab_Jackson. Joab's e-mail address is Joab_Jackson@idg.com
