A tool for visualizing the Tenstorrent Neural Network (TT-NN) model.
For the latest updates and features, please see releases.
- Comprehensive list of all operations in the model
- Interactive graph visualization of operations
- Detailed and interactive L1, DRAM, and circular buffer memory plots
- Filterable list of tensor details
- Overview of all buffers for the entire model run
- Visualization of input and output tensors with core tiling and sharding details
- Visualize inputs/outputs per tensor or tensor allocation across each core
- Detailed insights into L1 peak memory consumption, with an interactive graph of allocation over time
- Navigate a tree of device operations with associated buffers and circular buffers
- Operation flow graph for a holistic view of model execution
- Load reports via the local file system or through an SSH connection
- Supports multiple instances of the application running concurrently
L1 Summary with Tensor highlight | Operation inputs and outputs |
---|---|
Device operations with memory consumption | DRAM memory allocation |
---|---|
Operation graph view | Model buffer summary |
---|---|
Per core allocation details | Per core allocation details for individual tensors |
---|---|
Tensor details list | Performance report |
---|---|
Performance charts | |
---|---|
How to get started with TT-NN Visualizer.
Use remote querying instead of syncing the report data to your local file system.
You may test the application using the following sample reports.
Unzip the files into their own directories and select them with the local folder selector.
- Segformer encoder report
- Segformer decoder report
- Llama mlp report performance trace
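
If you have downloaded several sample archives, the unzip step can be scripted. The following is a minimal sketch, not part of TT-NN Visualizer itself: the function name `unzip_reports` and the `downloads` and `reports` folder names are assumptions chosen for illustration. It extracts each `.zip` archive into a directory named after the archive, so that every report ends up in its own folder.

```python
import zipfile
from pathlib import Path


def unzip_reports(archive_dir: Path, output_dir: Path) -> list[Path]:
    """Extract every .zip in archive_dir into its own subdirectory of output_dir."""
    extracted = []
    for archive in sorted(archive_dir.glob("*.zip")):
        # Name the target directory after the archive, e.g. report.zip -> report/
        target = output_dir / archive.stem
        target.mkdir(parents=True, exist_ok=True)
        with zipfile.ZipFile(archive) as zf:
            zf.extractall(target)
        extracted.append(target)
    return extracted


# Example call (hypothetical paths; adjust to where the archives were saved):
# for folder in unzip_reports(Path("downloads"), Path("reports")):
#     print(folder)
```

Each returned directory can then be chosen with the local folder selector.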
How to run TT-NN Visualizer from source.