SpaceStudio Monitoring User Guide

4. SpaceStudio Monitoring User Guide#

4.1. Introduction#

SpaceStudio offers an extensive monitoring infrastructure designed to allow profiling an architecture. When enabled, the monitoring infrastructure produces a database containg events that occurred during the simulation.

SpaceStudio also provides code profiling support (gprof) and analysis of the usage of embedded processors’ caches.

When using the monitoring infrastructure, it is possible to retrieve information about inter-module communications, bus transactions, FIFO usage, and more. For a system designer, it is a useful tool to find communication bottlenecks, analyze system performance and debug faulty bus transactions.

4.1.1. Launching a simulation with monitoring activated#

Open a project
In Tools menu, select Generate…
From the popped window, select Monitoring option and click OK.
Build the project by doing Tools and Build.
Execute the simulation by doing Run and Execute.

4.1.2. Generated monitoring database#

When the simulation has ended, the resulted database is located at <PROJECT_ROOT>\solutions\<SOLUTION_NAME>\architectures\<ARCHITECTURE_NAME>\build\monitoring\monitoring.db3 where :

<PROJECT_ROOT> is the location of the project’s file (.spacestudio)
<SOLUTION_NAME> is the name of the solution
<ARCHITECTURE_NAME> is configuration’s name

The database file can be opened by any SQLite database brwoser (e.g. sqlitebrowser4).

4.2. Monitoring views#

The generated database contains raw data from monitored events in the system. From this database, SpaceStudio presents monitoring views for some of these events such as code profiling, cache analysis, module computation, processor load and FIFO usage.

These views are used to quickly assess the system performance and guide the system designer in the design system exploration (DSE).

4.2.1. Module computation#

The module computation view displays a pie chart of the module time allocation between communication time, communication wait time and computation time.

The communication time is the time needed to read/write data from the SpaceStudio communication API. The communication time accounts for data movement (excluding blocked time). When the communication block for some reason, this will be accounted in the communication wait time (see after)

The communication wait time refers to the time lost because the reader/writer was not ready. When a communication is block, the blocked time will be accounted in the communication wait time. For instance, when reading from an empty FIFO or writing to a full FIFO.

The computation time is the time allocated for everything that is not communication time and communication wait time. This represents the time budget to compute the application function. For hardware-mapped module, the computation time is incremented by calls to hw_compute_latency() function. For software-mapped module, the computation time is incremented by the execution of the application on the processor’s model.

This view is accessible via the Project Explorer, as shown by Figure 4.1, or from the diagram, as shown by Figure 4.2.

_images/monitoring_module_computation_projexp.png — Figure 4.1 Accessing the module computation view from the project explorer.#

_images/monitoring_module_computation_diag.png — Figure 4.2 Accessing the module computation view from the diagram.#

Figure 4.3 shows the module computation view. When hovering over a section of the pie chart, a tooltip will show the percentage of this section.

_images/monitoring_module_computation.png — Figure 4.3 Module computation view.#

Note

The module computation view is available only for hardware modules and software modules running on a bare-metal OS.

4.2.2. Processor load#

The processor load view displays a pie chart of the time consumed (computation + communication) per module on the processor’s core. This view is used to highlight which module consumes most of the processor core’s time. For multi-core processors, there will be a view per core.

This view is accessible via the Project Explorer, as shown by Figure 4.4, or from the diagram, as shown by Figure 4.5.

_images/monitoring_processor_load_projexp.png — Figure 4.4 Accessing the processor load view from the project explorer.#

_images/monitoring_processor_load_diag.png — Figure 4.5 Accessing the processor load view from the diagram.#

Figure 4.6 shows the processor load view. When hovering over a section of the pie chart, a tooltip will show the percentage of this section.

_images/monitoring_processor_load.png — Figure 4.6 Processor load view.#

4.2.3. FIFO usage#

The FIFO usage view displays a pie chart of the slot used in the FIFO. This view is used to adjust the depth of the FIFO. For instance, if the FIFO is empty most of the time, the depth might be decreased without affecting the throughput of the application.

This view is accessible via the Project Explorer, as shown by Figure 4.7, or from the diagram, as shown by Figure 4.8.

_images/monitoring_fifo_usage_projexp.png — Figure 4.7 Accessing the FIFO usage view from the project explorer.#

_images/monitoring_fifo_usage_diag.png — Figure 4.8 Accessing the FIFO usage view from the diagram.#

_images/monitoring_fifo_usage.png — Figure 4.9 FIFO usage view.#

Figure 4.9 shows the fifo usage view. When hovering over a section of the pie chart, a tooltip will show the percentage of this section. The pie chart is divided into several portions {“empty”, “less than 25%”, “25% - 50%”, “50% - 75%”, “more than 75%”, “full”}.

4.2.4. Code profiling#

SpaceStudio automatically invokes gprof to generate code profiling. The Profiling results are shown in the Project Explorer view under the processor node as seen in Figure 4.10.

_images/monitoring_code_profiling_projexp.png — Figure 4.10 Accessing the code profiling view from the project explorer.#

Double-click on the Profiling results node will bring the profiling view for the associated processor as shown by Figure 4.11.

_images/monitoring_code_profiling.png — Figure 4.11 Code profiling view.#

Note

Code profiling is only available for software modules running on Zynq and Zynq Ultrascale+ with Linux.

4.2.5. Cache analysis#

SpaceStudio provides a view that shows which portions of the application code are responsible for suboptimal cache usage (high number of cache misses). In particular, the loops are highlighted according to the percentage of cache hits and misses. Loops colored in green make an optimal usage of the caches, while loops colored in red make a poor usage of the cache. This may be the case for example when traversing a 2D array in an inappropriate order (column after column instead of line after line).

The cache analysis view can be opened by double clicking on Cache analysis nodes in the Project Explorer as shown in Figure 4.12.

_images/monitoring_cache_analysis_projexp.png — Figure 4.12 Accessing the cache analysis view from the project explorer.#

The cache analysis view will open the compiled source code where loops will be highlighted with a red or green color. When hovering over a loop with the mouse, SpaceStudio shows the percentage of cache hits and misses for the loop as seen in Figure 4.13.

_images/monitoring_cache_analysis.png — Figure 4.13 Hovering above a loop display the miss and hit percentage#

Note

The cache analysis view is only available for software modules running on Zynq and Zynq Ultrascale+ with Linux.

4.3. Database#

In this section, we present the complete description of the database schematic and the events monitored by the monitoring infrastructure.

Tables starting with the prefix cache_ contain computed data used by the viewer to speed up database requests.

All timestamps present in the database are SystemC’s timestamps (in picosecond) obtained with the function call sc_core::sc_time_stamp().value().

4.3.1. `bus`#

This table contains a list of id that refers to a bus. For example, if there are 3 buses in the virtual platform, this table will contain 3 entries which refer to an id of bus. The associated bus name can be determined using the table component.

Table 4.1 `bus` schema#
Column	Description
`id`	ID of the bus

4.3.2. `bus_transaction`#

This table traces every bus transaction during the simulation. Each entry of this table represents a master/slave communication (read and write).

Table 4.2 Communication types#
`module_read`
`module_write`
`device_read`
`device_write`
`register_read`
`register_write`

Table 4.3 `bus_transaction` schema#
Column	Description
`bus_id`	Id of the bus
`source_id`	Id of the module initiating the communication
`destination_id`	Id of the destination component
`bytes`	Number of bytes transferred
`destination_address`	Absolute destination address
`communication_type`	Type of the current communication (refer to Table 4.2)
`request_time`	Simulation time when the source request the bus access
`start_transfer_time`	Simulation time when the data transfer starts
`end_transfer_time`	Simulation time when the data transfer ends

4.3.3. `cache_bus_interaction_X`#

This table contains all master and slave components using the bus X to communicate.

Table 4.4 `cache_bus_interaction_X` schema#
Column	Description
`component_id`	Id of the component

4.3.4. `cache_module_interaction_X`#

This table contains a list of components that communicated (read and/or write) with the module (X).

Table 4.5 `cache_module_interaction_X` schema#
Column	Description
`component_id`	Id of the component

4.3.5. `component`#

The table component contains the id and name of all components found in the simulation.

Table 4.6 `component` schema#
Column	Description
`id`	Component id (as defined in `PlatformDefinition.h`)
`instance_name`	Component instance name

4.3.6. `core`#

This table contains a list of processor cores. For example, if there are 3 mono-core processors in the virtual platform, this table will contain 3 entries where each entry refers the id of the processor core. If a processor has two cores, both cores id will be in the table. The associated core name can be determined using the table component.

Table 4.7 `core` schema#
Column	Description
`id`	ID of the core

4.3.7. `end_of_simulation`#

This table contains the simulation time of the end of the simulation.

Table 4.8 `end_of_simulation` schema#
Column	Description
`time`	Simulation time when simulation ended

4.3.8. `fifo`#

This table contains a list fifo. For example, if there are 3 fifos in the virtual platform, this table will contain 3 entries where each entry refers to an id of a fifo. The associated fifo name can be determined using the table component.

Table 4.9 `fifo` schema#
Column	Description
`id`	Id of the fifo
`source_id`	Id of the master module inserting data into the fifo
`destination_id`	Id of the slave module reading data from the fifo

4.3.9. `fifo_usage`#

This table contains information about the number of used slot in a fifo and the operations executed on it. A new entry is added to this table after every read/write fifo operation (blocking put, non-blocking put, blocking get and non-blocking get).

Table 4.10 `fifo_usage` schema#
Column	Description
`fifo_id`	Id of the fifo
`slot_used`	The number of used slots (words) after the action
`action_type`	Type of the action: a string “put” or “get”
`operation_start`	Simulation time when the event started
`operation_end`	Simulation time when the event ended
`operation_delta`	Simulation time elapsed between the beginning and end of the operation

4.3.10. `max_fifo_usage`#

This table is generated at the end of the simulation and presents the maximum number of used slot for each fifo.

Table 4.11 `max_fifo_usage` schema#
Column	Description
`fifo_id`	Id of the fifo
`max_slot_used`	The maximum number of used slots (words)

4.3.11. `memory_access`#

This table contains all memory accesses. Each entry in the table happens when a memory receives an access through the API DeviceRead/DeviceWrite.

The origin_id represents the module id responsible of the access.

Table 4.12 Communication types#
`module_read`
`module_write`
`register_read`
`register_write`

Table 4.13 `memory_access` schema#
Column	Description
`memory_id`	Id of the memory
`origin_id`	Id of the module accessing the memory
`bytes`	Number of bytes transferred
`communication_type`	Type of the current communication (refer to Table 4.12)
`start_address`	Address accessed
`start_time`	Simulation time when the access starts
`end_time`	Simulation time when the access ends

4.3.12. `memory_size`#

This table contains a list of memories id and the size. For example, if there are 3 memories in the virtual platform, this table will contain 3 entries where each entry refers to a memory ID of and its size. The associated memory name can be determined using the table component.

..table:: memory_size schema

Column

Description

id

Id of the memory

size

Memory size (in bytes)

Column	Description
`id`	Id of the memory
`size`	Memory size (in bytes)

module

This table contains a list of modules id. For example, if there are 3 modules in the virtual platform, this table will contain 3 entries where each entry refers to a module id. The associated module name can be determined using the table component.

Table 4.14 `module` schema#
Column	Description
`id`	Id of the module

4.3.13. `module_communication`#

This table contains all communications initiated by a module using the communication API function call (i.e ModuleRead, ModuleWrite, DeviceRead, DeviceWrite, RegisterRead and RegisterWrite). Each entry in the table is triggered by the API call. The source_id is the module executing the API function call.

The status of the communication is one of the following:

success: the communication has been completed successfully
empty: a non-blocking communication could not be completed due to an empty fifo
full: a non-blocking communication could not be completed due to a full fifo
error: a communication could not be completed due to an unknown error

Table 4.15 Communication types#
`module_read`
`module_write`
`device_read`
`device_write`
`register_read`
`register_write`

Table 4.16 `module_communication` schema#
Column	Description
`source_id`	Id of the module initiating the communication
`destination_id`	Id of the destination component
`bytes`	Number of bytes transferred
`communication_type`	Type of the current communication (refer to Table 4.15)
`is_blocking`	“yes” or “no” depending if the communication is blocking or not
`status`	Status of the communication
`start_time`	Simulation time when the communication starts
`end_time`	Simulation time when the communication ends

Note

For a matching communication (ModuleWrite→ModuleRead), there are two entries; one entry for each source module.
For software module, the processor is the source for RegisterRead and RegisterWrite communication.

Warning

All DMA functions calls (Stream2Memory, Memory2Stream and Memory2Memory) are not monitored.
FIFO-based communication functions calls (ModuleRead/ModuleWrite) implemented as software fifo are not monitored.
There is no module communication monitoring support for software module on the SMP simulator.
There is no module communication monitoring support for software module running on RTEMS.

4.3.14. `module_communications_summary`#

This table contains all information from the module_communication table with added time measurements for each communication.

Table 4.17 `module_communications_summary` schema#
Column	Description
`source_id`	Id of the module initiating the communication
`destination_id`	Id of the destination component
`bytes`	Number of bytes transferred
`communication_type`	Type of the current communication (refer to Table 4.15)
`is_blocking`	“yes” or “no” depending if the communication is blocking or not
`status`	Status of the communication
`start_time`	Simulation time when the communication starts
`end_time`	Simulation time when the communication ends
`effective_communication_time`	Simulation time elapsed transferring data
`communication_wait_time`	Simulation time elapsed while the communication is stalled
`total_communication_time`	Simulation time elapsed between start and end of communication (includes both

4.3.15. `module_summary`#

This table contains a summary of computation, communication and wait time for each module.

Table 4.18 `module_summary` schema#
Column	Description
module_id	Id of the module
communication_time	Simulation time elapsed while doing communication
computation_time	Simulation time elapsed while doing computation
communication_wait_time	Simulation time elapsed while waiting
communication_time_pct	Amount of time doing communication relative to the total execution time of the module
computation_time_pct	Amount of time doing computation relative to the total execution time of the module
communication_wait_time_pct	Amount of time spend waiting while doing communications relative to the total execution time of the module

Note

For software module, the waiting time represent time being pre-empted by the OS
For Hardware module, the waiting time represent time being blocked waiting for fifo. Either because it’s empty or full.

4.3.16. `sw_event`#

This table contains all relevant software events. The monitored software events are presented by Table 4.19.

Table 4.19 `event_type` description#
`event_type`	Description
`boot`	Processor boot procedure is running (until the application starts)
`task_idle`	Idle task is running
`task_schedule`	Application task is running (software module)
`irq_isr`	Interruption service routine is running
`os_io`	OS processes is running excluding application specific processes (application task and idling task)

Table 4.20 `sw_event` schema#
Column	Description
`processor_id`	The processor id or the core id.
`event_type`	Software event (refer to Table 4.19)
`module_id`	When event_type Is task_schedule only. The module id that is running on the processor.
`start_time`	The simulation time when the software event starts.
`end_time`	The simulation time when the software event ends.

For task_schedule event, the module_id represents the module id being executed on the processor. For boot, task_idle, irq_isr and os_io event, the module_id is irrelevant (NULL).

Note

There is no boot event for the SMP simulator
There is no irq_isr event for the SMP simulator
The event os_io is only supported for the SMP simulator

Warning

There is no software event when running on RTEMS and FreeRTOS.

SpaceStudio Monitoring User Guide

4. SpaceStudio Monitoring User Guide#

4.1. Introduction#

4.1.1. Launching a simulation with monitoring activated#

4.1.2. Generated monitoring database#

4.2. Monitoring views#

4.2.1. Module computation#

4.2.2. Processor load#

4.2.3. FIFO usage#

4.2.4. Code profiling#

4.2.5. Cache analysis#

4.3. Database#

4.3.1. bus#

4.3.2. bus_transaction#

4.3.3. cache_bus_interaction_X#

4.3.4. cache_module_interaction_X#

4.3.5. component#

4.3.6. core#

4.3.7. end_of_simulation#

4.3.8. fifo#

4.3.9. fifo_usage#

4.3.10. max_fifo_usage#

4.3.11. memory_access#

4.3.12. memory_size#

4.3.13. module_communication#

4.3.14. module_communications_summary#

4.3.15. module_summary#

4.3.16. sw_event#

4.3.1. `bus`#

4.3.2. `bus_transaction`#

4.3.3. `cache_bus_interaction_X`#

4.3.4. `cache_module_interaction_X`#

4.3.5. `component`#

4.3.6. `core`#

4.3.7. `end_of_simulation`#

4.3.8. `fifo`#

4.3.9. `fifo_usage`#

4.3.10. `max_fifo_usage`#

4.3.11. `memory_access`#

4.3.12. `memory_size`#

4.3.13. `module_communication`#

4.3.14. `module_communications_summary`#

4.3.15. `module_summary`#

4.3.16. `sw_event`#