Cloudera outfits Hadoop with management tools

Version 3.5 of the Cloudera Enterprise edition has new modules for configuration and process monitoring

Anticipating greater Apache Hadoop use in production environments, Cloudera has outfitted its commercial distribution of the data processing framework with additional configuration and management tools.

"As more and more Hadoop use cases are moving to production, people are starting to care about service level agreements and the quality of service provided to end-users," said Charles Zedlewski, Cloudera vice president of products.

Version 3.5 of Cloudera Enterprise updates the management suite with two modules, each of which helps manage Hadoop deployments.

One new module, called the Activity Monitor, provides "a real-time view into the performance of all the running workloads in a Hadoop system," Zedlewski said.

Performance monitoring is typically found with standard relational databases, though tools for Hadoop remain scarce, he noted. This module displays the status of various Hadoop components, including MapReduce jobs, Hive queries, Pig scripts and Oozie workflows. It tallies about 45 metrics overall, including CPU, memory and network usage, as well as statistics on scheduling.

Cloudera also introduced a service and configuration manager module. This software "provides a single means of managing the Hadoop stack," Zedlewski said. The Service and Configuration Manager automates many of the processes for updating these programs and provides a set of checks that can be used to verify that the changes requested did indeed take place.

A typical Hadoop deployment running across 30 servers may be comprised of about 95 programs, with a configuration file to go along with each program. As a result, any administrative change might require restarting any number of these programs, or making changes to multiple configuration files.

Using this software, "It's much less likely that you will incur downtime due to mis-configuration or mis-setting," Zedlewski said.

With this release, Cloudera also updated a number of existing modules as well. Users can now track historical usage of disk and file space usage within the Resource Manager. Users can also now administer (ACLs) Access Control Lists in the Authorization Manager module.

Although most Hadoop tools are open source, Cloudera's management modules are only available as part of Cloudera's paid edition of Hadoop. However, the company is making available for no cost a version of the Service and Configuration Module, called SCM Express, that will help potential users set up and test a Hadoop deployment.

This version of the module can download all the appropriate programs and set them up on designated servers. "You can be running your first Hadoop workload from a Web client within 10 minutes," Zedlewski said.

Large ISPs such as Yahoo, Facebook and eBay, use Hadoop for analyzing user behavior, and the venture capital investment community anticipates its wider use in the enterprise. Cloudera Enterprise is a pre-integrated stack of Hadoop software and associated support available on a subscription basis.

"You take a look at how you operationalize any platform. People typically want to get some visibility into what is going on," Zedlewski said. "This is the first time people have been able to apply that full lifecycle management to a Hadoop system."

Joab Jackson covers enterprise software and general technology breaking news for The IDG News Service. Follow Joab on Twitter at @Joab_Jackson. Joab's e-mail address is Joab_Jackson@idg.com

Join the Good Gear Guide newsletter!

Error: Please check your email address.

Tags applicationsdata miningsoftwareclouderasystem managementdata warehousingbusiness intelligence

Our Back to Business guide highlights the best products for you to boost your productivity at home, on the road, at the office, or in the classroom.

Keep up with the latest tech news, reviews and previews by subscribing to the Good Gear Guide newsletter.

Joab Jackson

IDG News Service
Show Comments

Most Popular Reviews

Latest News Articles

Resources

PCW Evaluation Team

Azadeh Williams

HP OfficeJet Pro 8730

A smarter way to print for busy small business owners, combining speedy printing with scanning and copying, making it easier to produce high quality documents and images at a touch of a button.

Andrew Grant

HP OfficeJet Pro 8730

I've had a multifunction printer in the office going on 10 years now. It was a neat bit of kit back in the day -- print, copy, scan, fax -- when printing over WiFi felt a bit like magic. It’s seen better days though and an upgrade’s well overdue. This HP OfficeJet Pro 8730 looks like it ticks all the same boxes: print, copy, scan, and fax. (Really? Does anyone fax anything any more? I guess it's good to know the facility’s there, just in case.) Printing over WiFi is more-or- less standard these days.

Ed Dawson

HP OfficeJet Pro 8730

As a freelance writer who is always on the go, I like my technology to be both efficient and effective so I can do my job well. The HP OfficeJet Pro 8730 Inkjet Printer ticks all the boxes in terms of form factor, performance and user interface.

Michael Hargreaves

Windows 10 for Business / Dell XPS 13

I’d happily recommend this touchscreen laptop and Windows 10 as a great way to get serious work done at a desk or on the road.

Aysha Strobbe

Windows 10 / HP Spectre x360

Ultimately, I think the Windows 10 environment is excellent for me as it caters for so many different uses. The inclusion of the Xbox app is also great for when you need some downtime too!

Mark Escubio

Windows 10 / Lenovo Yoga 910

For me, the Xbox Play Anywhere is a great new feature as it allows you to play your current Xbox games with higher resolutions and better graphics without forking out extra cash for another copy. Although available titles are still scarce, but I’m sure it will grow in time.

Featured Content

Latest Jobs

Don’t have an account? Sign up here

Don't have an account? Sign up now

Forgot password?