Analyzing Scene Performance and Memory

 
 
 

You can analyze the performance of a scene using the Scene Debugger. This can tell you the processing time and memory consumed by the elements in your scene, and help you to find and correct things that are slowing it down.

To open the Scene Debugger

Do one of the following:

  • Choose View General Scene Debugger.

or

  • Click the Open Scene Debugger button at the bottom of the main Softimage window.

Scene Debugger Overview

A

Switch between Performance and Memory.

When you switch to Memory, you need to click Update. Switching to Memory flushes captured performance data, so you need to re-capture data when you switch back to Performance.

B

Export to a text file, or set preferences.

C

Update the display. Click this after you have stopped or started consolidating below nodes, or changed preferences.

D

Clear the data. This also stops capture.

E

Start or stop capturing performance data. This is required for displaying performance data. (Performance tab only)

F

Controls how operators are grouped for display. (Performance tab only)

  • Consolidate Below Objects collects all the operators in the call stack into a list directly under each object. By default, the operators are grouped by category (Animation, Deform, etc.).

  • Consolidate Below Drawing Steps collects all the operators in the call stack into a list below the associated drawing step (Transform, Visibility Determination, or Draw Primitive) under each object. Again, the operators are grouped by category by default.

  • Custom indicates that you have modified the display by right-clicking in the view below. Do not choose Custom directly from this menu — the last custom state is not stored and cannot be retrieved.

G

Information about performance or memory appears in this pane.

On the Performance tab, you can right-click on certain nodes to control how the operators are consolidated, and then click Update.

H

On the Performance tab:

  • Net Total shows the time taken by a node and its children (all the operators it pulled).

  • Executing shows the time taken by a node itself (not including its children).

  • # of requests shows the number of times a node was called for its output. When a node is called and it hasn't been evaluated yet, an evaluation is triggered.

For information about the Memory tab, see Displaying and Interpreting Memory Data.

Walk-through: Profiling Performance

Here is a quick walk-through that outlines the general steps for profiling the performance of your scene:

  1. Choose View General Scene Debugger and make sure the Performance tab is displayed.

  2. If necessary, choose File Preferences and set the Filtering threshold to filter out nodes that take very little time.

    You can change this later and click Update. However it can take a long time to refresh the display depending on how much data there is to capture in your scene, so it's best to start with the value you want.

  3. Select either Consolidate Below Objects or Consolidate Below Drawing Steps.

    Again, you can change this later and click Update but it's best to start with the value you want.

  4. Start capturing performance data.

  5. Play back the scene, or interact with it (move deformers and so on).

  6. Do one of the following:

    • Stop capturing.

      or

    • If you want to accumulate timing information, you can continue capturing and click Update instead.

    The performance data appears in the Scene Debugger. To understand how to interpret it, see Interpreting Performance Data.

  7. When you find a node that takes a long time to evaluate, you can investigate it further. You can right-click on the following types of node in the lower portion of the window (in the Call Graph section below the Category section):

    • Geometry views, for example, OpenGL Render View B

    • 3D objects, including models

    • Drawing steps: Transform, Visibility Determination, and Draw Primitive

    • Operators

    The context menu contains a couple of mutually exclusive options:

    • Consolidate Below This Node. This option collects the operators called by the node into a list. It is useful when you want to simplify the presentation of the data.

    • Don't Consolidate Below This Node. This option shows you the entire nested call stack from that node down. It is useful for seeing exactly which node calls which.

    These settings are stored per node. When you choose one of these commands, the display changes to Custom but it does not update right away. You can right-click on additional nodes to control how their operator calls are consolidated, and then click Update. Note that Custom settings for consolidation are not kept if you switch to another setting, so you cannot retrieve your previous view by switching back to Custom from the menu.

    Also note that if you choose to consolidate below a node such as an operator, it will have no effect if the nodes are already being consolidated at some point above it in the call stack. You must stop consolidating from the object or drawing step above the operator to see the effect of consolidating below the operator.

  8. Make some changes to the scene, and then repeat as desired:

    • If you don't want to accumulate performance data, click Clear and go to step 4.

    • If you do want to accumulate data, go straight to step 4 (or to step 5 if you didn't stop capturing).

  9. When you have finished debugging, click Clear before closing the Scene Debugger window to discard the performance data.

Interpreting Performance Data

Interpreting the data in the Scene Debugger can be tricky. For example, it may seem like a particular node is taking an unusually long time to evaluate, but that may only be because it is the first node to pull on an expensive operator that must be evaluated anyway. In such a case, modifying the scene so that the node doesn't pull on the expensive operator simply shifts the blame to another node. For another example, computing the transform of the left eye may require pulling on many inputs, but then computing the right eye may require only one since the others have already been evaluated.

How Information Is Laid Out

The information on the Performance tab of the Scene Debugger is divided into 3 sections:

  • The first two lines show the time unit and filtering. You can simplify the data by filtering out nodes that are less than the Filtering threshold you set — see Scene Debugging Preferences [Preference Reference]. Note that you must click Update to refresh the data after you change the threshold.

  • The next section, labelled Category, displays summary information for every category of operator. This section is the same no matter how the data is consolidated in the main section. The categories are shown in descending order of evaluation times, and categories with long evaluation times are automatically expanded.

  • The main section, labelled Call Graph, displays the detailed timing data according to how you have chosen to consolidate the operators. Again, nodes are shown in descending order of evaluation times, and nodes with long evaluation times are automatically expanded.

    Each top-level node represents a separate call stack. There is one for each geometry view, unless views have been muted or minimized. There may be additional ones that represent call stacks executed outside of drawing a view — for example, if you interact with a manipulator in Local mode then Softimage needs to evaluate the transforms of the hierarchy.

Categories

Operators are categorized by type in the summary near the top of the Scene Debugger.

By default, they are also categorized in the consolidated lists in the main section. This is controlled by Show the consolidated categories — see Scene Debugging Preferences [Preference Reference]. When this option is off, the consolidated operators are displayed as an uncategorized flat list.

The following table describes the categories:

Animation Operators

This category includes all animated transforms, including kinematics and constraints.

Deform Operators

This category includes deformations. It does not include ICE trees that are used for deformations.

Drawing

This category is divided into the following drawing steps:

  • Visibility Determination. This represents the time taken to determine whether an object needs to be drawn, which includes evaluating the extent of its bounding box and culling. To compute the bounding box, the object's geometry may need to be evaluated, resulting in some inputs such as deformation operators being pulled. If nodes are consolidated below drawing steps, those inputs are included in this subcategory. If nodes are consolidated below objects, those operators appear under other categories (Deformations, etc.).

  • Transform. This represents the time taken to find the object's center. Although this is also required for visibility determination, it is classified separately to make it easier to see what's really slowing things down. For example, an envelope may be slow because of its rig rather than because of the deformation itself. As with Visibility Determination, if nodes are consolidated below drawing steps then the children include operators like animation and others (for example, a deformation that is pulled because of a cluster constraint). If nodes are consolidated below objects, those operators appear in other categories.

  • Draw Primitive. This represents the time taken to draw objects after Transform and Visibility Detection. When using approximate envelope operators, the time taken to evaluate the point positions and normals appears here.

ICE Operators

This category includes all ICE operators whether they are being used for deformation, simulation, and so on.

You can profile the performance of ICE operators individually as described in Profiling the Performance of ICE Trees [ICE Guide].

Objects

This represents the overhead consumed by objects outside of operator evaluations. The values should be quite small.

OpenGL Render

This represents the time taken to render the updated views.

Custom Operators

This category includes scripted and custom compiled operators.

Scripts

This includes any scripts that you ran while capturing performance data.

Tasks

This includes any commands that you invoked while interacting.

Topology Operators

This includes any operators that add or remove geometry components.

Other Operators

This includes all operators that don't fall into one of the other categories, such as expressions (which may or may not represent animated transforms), cluster operators, and so on.

Events

This includes any time spent processing events during data capture.

Strategies for Optimizing

The techniques for optimizing are very specific to individual scenes. However, there are some general strategies that you can apply:

  • If there is a great deal of time spent in Visibility Determination (usually due to the time to required compute the geometry), key the visibility of objects off while they are out of frame.

  • If much time is spent in Transform, try reducing it by plotting animation, caching geometry, and simplifying rigs.

  • To minimize time spent in Draw Primitive, look for unfrozen texture projections and similar things.

Displaying and Interpreting Memory Data

Compared to the Performance tab, the Memory tab is quite simple. You don't need to capture anything — just click Update. Note that when you switch back to the Performance tab, you need to recapture performance data.

The memory data is divided into two sections:

  • The first section shows summary memory usage by object type.

  • The second section shows detailed memory usage by owner.

This gives you a rough idea of the memory used by the scene, but note that not every byte is accounted for.

If you see that there are data copies listed, this may indicate something that can be frozen to save memory. However, this is not always the case — copies can also represent internal data that will be removed automatically when no longer needed.

Creative Commons License Except where otherwise noted, this work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 3.0 Unported License