Informatica rolls out data parser for Hadoop

The data-integration vendor is one of a growing field hoping to capitalize on the open-source programming framework

Informatica has strengthened its hand in the burgeoning market for Hadoop, the open-source programming framework for large-scale data processing, unveiling a new data parser on Wednesday that can transform piles of unstructured information into a more structured form for use in running Hadoop jobs.

The release builds on Informatica's June release of a Hadoop connector, which was aimed at data movement in and out of a Hadoop cluster, rather than data transformation. It also comes amid a wave of announcements from vendors such as Sybase and MarkLogic in the run-up to next week's Hadoop World conference.

Hadoop has emerged as one of the highest-profile technologies associated with "Big Data," an industry buzzword referring to the large amounts of unstructured information generated by websites, sensors and other non-relational sources, as well as the desire by companies to sift through such data for insights about their customers and businesses.

Informatica has been in the data-parsing business for some time. HParser includes a set of libraries for various data types, from standards like XML to industry-specific formats such as HIPAA, which is used in healthcare, and ASN.1 for telcos.

It comes in three editions, including two commercial versions, HParser Industry Standards and HParser for Documents, as well as a community version. The latter is available at no cost but premium services and add-ons are for sale.

Also Wednesday, Informatica announced that the community version of HParser will be available for use and downloadable from the website of Hortonworks, a spinoff of Yahoo which announced a preview version of its own Hadoop distribution this week.

The news drew a pair of thumbs-up from industry analysts.

The parser represents "great news for the Hadoop community," as it gives them "field-proven" technology, said James Kobielus, senior analyst with Forrester Research.

The Hortonworks announcement illustrates the "sorts of vendor partnerships that Hortonworks is building in the Hadoop community that will drive continued development of the fully open-source Apache Hadoop stack," Kobielus added.

One big stumbling block for Hadoop has been that many IT shops don't have the skills to easily adopt it. HParser's graphical development environment could help mitigate this problem, wrote David Menninger, vice president and research director at Ventana Research, in a blog post Wednesday.

"Using a graphical environment to develop these routines should make it easier and faster to create the code necessary to parse the data," he wrote.

Chris Kanaracus covers enterprise software and general technology breaking news for The IDG News Service. Chris's e-mail address is

Join the Good Gear Guide newsletter!

Error: Please check your email address.

Tags applicationsinformaticamiddlewaredata miningsoftwaresybasedata integrationData managementdata warehousingbusiness intelligenceMarkLogicHortonWorks

Our Back to Business guide highlights the best products for you to boost your productivity at home, on the road, at the office, or in the classroom.

Keep up with the latest tech news, reviews and previews by subscribing to the Good Gear Guide newsletter.

Chris Kanaracus

IDG News Service
Show Comments

Cool Tech

Crucial® BX200 SATA 2.5” 7mm (with 9.5mm adapter) Internal Solid State Drive

Learn more >

ASUS ROG Swift PG279Q – Reign beyond virtual world

Learn more >

D-Link TAIPAN AC3200 Ultra Wi-Fi Modem Router (DSL-4320L)

Learn more >

Lexar® Professional 1000x microSDHC™/microSDXC™ UHS-II cards

Learn more >

D-Link PowerLine AV2 2000 Gigabit Network Kit

Learn more >

Gadgets & Things

Lexar Professional 2000x SDHC™/SDXC™ UHS-II cards

Learn more >

Lexar® Professional 1000x microSDHC™/microSDXC™ UHS-II cards

Learn more >


Learn more >

Family Friendly

Lexar Professional 2000x SDHC™/SDXC™ UHS-II cards

Learn more >

ASUS VivoPC VM62 - Incredibly Powerful, Unbelievably Small

Learn more >

Lexar® Professional 1000x microSDHC™/microSDXC™ UHS-II cards

Learn more >

Stocking Stuffer

Lexar Professional 2000x SDHC™/SDXC™ UHS-II cards

Learn more >

Lexar® Professional 1000x microSDHC™/microSDXC™ UHS-II cards

Learn more >

Christmas Gift Guide

Click for more ›

Most Popular Reviews

Best Deals on Good Gear Guide

Latest News Articles


GGG Evaluation Team

Kathy Cassidy


First impression on unpacking the Q702 test unit was the solid feel and clean, minimalist styling.

Anthony Grifoni


For work use, Microsoft Word and Excel programs pre-installed on the device are adequate for preparing short documents.

Steph Mundell


The Fujitsu LifeBook UH574 allowed for great mobility without being obnoxiously heavy or clunky. Its twelve hours of battery life did not disappoint.

Andrew Mitsi


The screen was particularly good. It is bright and visible from most angles, however heat is an issue, particularly around the Windows button on the front, and on the back where the battery housing is located.

Simon Harriott


My first impression after unboxing the Q702 is that it is a nice looking unit. Styling is somewhat minimalist but very effective. The tablet part, once detached, has a nice weight, and no buttons or switches are located in awkward or intrusive positions.


Latest Jobs

Don’t have an account? Sign up here

Don't have an account? Sign up now

Forgot password?