Oracle engineer reveals latency mysteries with heat maps

Heat maps can reveal, though not always explain, system latency, an Oracle engineer argues

While data-center managers have long used heat maps to help determine where to best position racks of servers and cooling units, this mode of visualization can also be handy for better understanding system latency, argues an Oracle engineer in the July issue of Communications of the ACM.

"Presenting latency as a heat map is an effective way to identify subtle characteristics that may otherwise be missed," writes Brendan Gregg, a principal software engineer at Oracle, in the article "Visualizing system latency."

Gregg also cautioned that while such visualization can give us greater overview of what is taking place, it doesn't always provide answers for the behavior being observed. Still, heat maps can provide insight into tackling the next generation of data-center latency issues.

Pinpointing the causes of system sluggishness has long been a frustration for data-center managers and system administrators. Network analysis tools are available to visualize network performance, though other aspects of a system, such as the responsiveness of disks in a storage array, have been harder to quantify.

Sun Microsystems has long offered one tool for its Solaris operating system, called DTrace, that can characterize latency within various parts of a system on a second-by-second basis. The overwhelming data it can produce, however, still needs to be boiled down into a readily understandable form.

Enter Gregg's heat maps. Heat maps are a simple visualization technique in which, on a two-dimensional graph, different values are represented by different colors.

Heat graphs can reveal more than the line graphs on most network analysis tools, because while graphs "would allow average latency to be examined over time, the actual makeup or distribution of that latency cannot be identified beyond a maximum, if provided," he writes.

Heat maps are also good for rapidly identifying outliers, which then can be examined in greater detail, he argued.

For the article, Gregg plotted a variety of unusual workload conditions, using the Oracle Analytics visualization software to visually render data gathered by DTrace. He set the X axis to represent time and the Y axis to represent the time of latency. The darkest colors represented the most input-output.

In many cases, he found simple workloads can produce a variety of complex -- and sometimes unexplainable -- patterns.

In one case, a small amount of data was sequentially written to a pool of disks. Gregg expected to see only "white noise" representing random latency to appear. Instead, the heat map showed latency levels rising and falling in distinct patterns for some unknown reason. "Visualizing latency in this way clearly poses more questions than it provides answers," he said.

Another pattern proved equally mysterious. The test involved sending a stream of data to 44 disks. First, data would be sent to only one disk, then to two disks, and so on, until all 44 disks were receiving data.

Gregg expected disk latency to increase in a linear fashion as the system buses became saturated with data.

Instead the latency would increase, then subside somewhat, before increasing some more.

He called this pattern the rainbow pterodactyl, in that the heat graph resembled the profile of a colorful flying dinosaur.

"To summarize the rainbow pterodactyl: little is known with accuracy, and much more investigation is needed. What this does show is how deep a simple visualization can become," he writes.

Gregg also used a heat map to reveal the shock effects that loud noise has on servers, phenomena that Gregg demonstrated a few years back on YouTube.

Although these heat maps were done on a system running on the Zettabyte File System (ZFS) running over Network File Storage (NFS) protocol, this approach could be used for characterizing the operations of other file systems, and even other components such as CPUs, Gregg writes.

Joab Jackson covers enterpise software and general technology breaking news for The IDG News Service. Follow Joab on Twitter at @Joab_Jackson. Joab's e-mail address is Joab_Jackson@idg.com

Join the newsletter!

Error: Please check your email address.
Rocket to Success - Your 10 Tips for Smarter ERP System Selection

Tags applicationsNetworkingsoftwareIT managementOraclesystem managementbusiness intelligencenetworking hardwaremanagementTraffic management

Keep up with the latest tech news, reviews and previews by subscribing to the Good Gear Guide newsletter.

Joab Jackson

IDG News Service
Show Comments

Most Popular Reviews

Latest Articles

Resources

PCW Evaluation Team

Sarah Ieroianni

Brother QL-820NWB Professional Label Printer

The print quality also does not disappoint, it’s clear, bold, doesn’t smudge and the text is perfectly sized.

Ratchada Dunn

Sharp PN-40TC1 Huddle Board

The Huddle Board’s built in program; Sharp Touch Viewing software allows us to easily manipulate and edit our documents (jpegs and PDFs) all at the same time on the dashboard.

George Khoury

Sharp PN-40TC1 Huddle Board

The biggest perks for me would be that it comes with easy to use and comprehensive programs that make the collaboration process a whole lot more intuitive and organic

David Coyle

Brother PocketJet PJ-773 A4 Portable Thermal Printer

I rate the printer as a 5 out of 5 stars as it has been able to fit seamlessly into my busy and mobile lifestyle.

Kurt Hegetschweiler

Brother PocketJet PJ-773 A4 Portable Thermal Printer

It’s perfect for mobile workers. Just take it out — it’s small enough to sit anywhere — turn it on, load a sheet of paper, and start printing.

Matthew Stivala

HP OfficeJet 250 Mobile Printer

The HP OfficeJet 250 Mobile Printer is a great device that fits perfectly into my fast paced and mobile lifestyle. My first impression of the printer itself was how incredibly compact and sleek the device was.

Featured Content

Latest Jobs

Don’t have an account? Sign up here

Don't have an account? Sign up now

Forgot password?