|
|
|
||
Cornell
|
Georgia Institute
|
Lawrence Livermore
|
In Owl, we are developing a reconfigurable monitoring framework, which will function as one of the fundamental building blocks for such autonomous systems. Owl splits monitoring functionality into two parts: capsules containing reconfigurable logic and analysis modules, which are loaded into the capsules and perform data aggregation and preprocessing. The capsules contain the actual data probes and may be located throughout the system.
Each capsule provides a standardized interface between itself and the reconfigurable logic containing the analysis module. This allows analysis modules to be applied at any system location and thereby enables the reuse of analysis and aggregation techniques. A module's logic may be instantiated from a library of existing modules. Each loaded module may further be configured through memory mapped configuration registers available in each capsule. Once activated, the capsule directs the probed data to the module where it is preprocessed, analyzed, aggregated, or simply compressed. When necessary the module generates output data and this data is injected into the regular system memory traffic and stored in a reserved region of main memory organized as a ring buffer of configurable size.
|
|
| Figure 1 | Figure 2 |
|
Monitoring capsules can potentially located anywhere in the system. |
A standardized interface allows the exchange of monitoring modules. |
First results show that a monitoring system with autonomous data delivery has a relatively small impact on system performance, even in the case of logging individual memory accesses, and that with lower injection rates the overhead becomes negligible. In addition, simple hardware techniques can further reduce system perturbation in the general case. Our feasibility studies demonstrate the viability of the general approach. As the framework was designed as a general monitoring facility, we believe its success in the specific context of memory analysis will extend to more pervasive system-wide monitoring - and towards better understanding system behavior.
|
This work is supported by the National Science Foundation under: NSF Medium ITR/NGS Award (#O325536), Towards Autonomic Computing Platforms: System-wide Hardware/Software Performance Monitoring and Adaptation |
| To the project internal web site |