DataTorrent tackles complexity of Hadoop data ingestion

The company designed dtIngest to streamline the collection, aggregation and transfer of data to and from a Hadoop cluster

While the buzz around big data analysis is at a peak, there is less discussion about how to get the necessary data into the systems in the first place, which can involve the cumbersome task of setting up and maintaining a number of data processing pipelines.

To help solve this problem, Santa Clara, California start-up DataTorrent has released what it calls the first enterprise-grade ingestion application for Hadoop, DataTorrent dtIngest.

The application is designed to streamline the process of collecting, aggregating, and moving data onto and off of a Hadoop cluster.

The software is based on Project Apex, an open source software package available under the Apache 2.0 license.

Running as a component within a Hadoop platform, dtIngest can handle both streaming and batch data. It can exchange data across a variety of file systems and protocols, including NFS, FTP, the Hadoop Distributed File System (HDFS), Amazon Web Services' Simple Storage Service (S3), Kafka, and the Java Message Service.

The software is fault tolerant, in that it can resume a file transfer automatically after disruption. It comes with a point-and-click interface, as well as monitoring logs.
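To illustrate what resuming a transfer after a disruption can mean in practice, here is a minimal sketch in Python. It is not dtIngest's implementation, which the company has not detailed here; the function name `resumable_copy` and its parameters are hypothetical, and the sketch assumes a simple local file copy in which the bytes already written to the destination are intact.

```python
import os


def resumable_copy(src, dst, chunk_size=1024 * 1024):
    """Copy src to dst, continuing from the bytes dst already holds.

    Hypothetical illustration only: assumes any existing content in dst
    is a valid prefix of src left behind by an interrupted earlier run.
    Returns the byte offset the copy resumed from.
    """
    # A partial destination file tells us how far the last attempt got.
    offset = os.path.getsize(dst) if os.path.exists(dst) else 0

    # Open the destination in append mode so existing bytes are kept.
    with open(src, "rb") as fin, open(dst, "ab") as fout:
        fin.seek(offset)  # skip what was already transferred
        while True:
            chunk = fin.read(chunk_size)
            if not chunk:
                break
            fout.write(chunk)
    return offset
```

Calling the function again after an interruption picks up at the recorded offset rather than starting over. A production-grade tool would typically also verify the already-copied portion, for example with checksums, before trusting it.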

The company has released dtIngest for free, hoping that users will upgrade to DataTorrent RTS 3, its enterprise Hadoop data ingestion pipeline software. RTS 3 is based on dtIngest/Project Apex and includes additional capabilities for operational management, easy development and data visualization.

DataTorrent was co-founded by Amol Kekre and Phu Hoang, a pair of engineers who used to work at Hadoop pioneer Yahoo. The company has formed partnerships with Hadoop distributors Hortonworks and Pivotal, and has raised nearly $24 million in early-stage funding from investors.

Joab Jackson covers enterprise software and general technology breaking news for The IDG News Service. Follow Joab on Twitter at @Joab_Jackson. Joab's e-mail address is Joab_Jackson@idg.com


