Hortonworks brings Hadoop to Windows

Hortonworks expects its Windows version of Hadoop to eventually reach full feature parity with the Linux version of the distribution

Hortonworks is bringing the popular open-source Apache Hadoop data processing platform to Microsoft shops.

The company has released a beta version of its Hortonworks Data Platform (HDP) Hadoop distribution for Windows and expects to release the final, enterprise-ready version in the months to come.

HDP is "the first and only distribution of Hadoop available on both Linux and Windows," said David McJannet, Hortonworks vice president of marketing.

According to McJannet, Hortonworks saw strong demand from potential customers for a Hadoop distribution that would run on the Microsoft platform.

"The real catalyst is, frankly, market demand. The significant majority of the servers running in the enterprise today are running Windows Server," McJannet said. "We've seen significant interest from our customers towards using Hadoop on the platform that they rely on for their critical applications."

Hortonworks and Microsoft have spent the past 18 months porting the software to Windows and testing it for enterprise use, McJannet said. The HDP distribution consists of a set of software components, including HDFS, MapReduce, Hive and Pig, among others. Like the Linux version, the Windows HDP will be available as open source "so others can benefit and extend the work that we have done," McJannet said.

Going forward, Hortonworks will release new versions of HDP for both Linux and Windows. This first Windows beta version is based on the HDP 1.1 codebase.

Initially, the Windows beta does not have feature parity with the Linux version, though it does have all the "core components" needed to run Hadoop, McJannet said. It does not, however, include the Ambari set of management tools. Over time, Hortonworks plans to bring all of the Linux version's features to the Windows version.

Hortonworks expects that the kind of workloads run on the Windows platform will be similar to those run on Linux, in terms of size and scope. "We fully anticipate some of the largest deployments of Hadoop could well be on Windows," McJannet said.

The distribution does not support running a mixture of Windows nodes and Linux nodes in the same deployment. Each deployment should run entirely on one OS or the other. "In practice, we'd expect homogeneity across the infrastructure, though we'd have to wait and see how that pattern emerges," McJannet said.

Over time, Microsoft will provide more support in other software products, most notably System Center, for organizations that want to move Windows Hadoop workloads between their own data centers and a Microsoft Azure cloud service, said Herain Oberoi, Microsoft director of product marketing in the company's server and tools division.

As of press time, Hortonworks hasn't finalized the versions of Windows Server on which HDP will run, though the beta will run on Windows Server 2008 and Windows Server 2012. The product will not run on Windows desktop versions.

Joab Jackson covers enterprise software and general technology breaking news for The IDG News Service. Follow Joab on Twitter at @Joab_Jackson. Joab's e-mail address is Joab_Jackson@idg.com
