Visual Learning Library

Tensor Core Architecture

Interactive visualizations explaining how NVIDIA and AMD GPUs feed their matrix units — from LDGSTS to TMA to TMEM

18 Visualizations

NVIDIA Coverage: A100→B200

AMD Coverage: MI300X

Featured — Start Here

NVIDIA Evolution: LDGSTS → TMA → TMEM

Complete comparison of A100, H100, and B200 with code examples

HipKittens: AMD Optimization Patterns

8-Wave Ping-Pong, 4-Wave Interleave, and XCD Grouping explained

NVIDIA Deep Dives

NVIDIA Tensor Cores — Deep Architecture Guide

Complete technical breakdown from Volta to Blackwell with PTX examples

Tags: New, NVIDIA, Code, Diagram

NVIDIA Tensor Core Timeline

Volta → Ampere → Hopper → Blackwell evolution

Feeding the Tensor Cores — Complete Guide

Comprehensive breakdown of data movement strategies

Warpgroups & TMEM Explained

128 threads working as one unit, TMEM slice ownership

TMEM & Register Pressure

How TMEM eliminates register pressure for tensor tiles

AMD Deep Dives

AMD Matrix Cores — Deep Architecture Guide

MI300X MFMA instructions, AGPRs, and CDNA3 internals

Tags: New, AMD, Code, Diagram

AMD MFMA Scheduling Deep Dive

Understanding MFMA blocking behavior and scheduling challenges

AMD AGPR Restrictions

Accumulator registers and their unique constraints

Comparisons

NVIDIA vs AMD — 2026 Architecture Comparison

Head-to-head: Blackwell B200 vs MI300X tensor/matrix cores

Tags: New, NVIDIA, AMD, Diagram

Wave Specialization: NVIDIA vs AMD

Why NVIDIA patterns don't work on AMD hardware

Memory Layouts & Swizzling

NVIDIA XOR swizzle vs AMD shape-specific layouts

Tags: NVIDIA, AMD, Diagram

NVIDIA vs AMD Complete Comparison

Side-by-side architectural comparison

Animations & Visuals

Tensor Architecture Visual

Complete visual breakdown of tensor core architecture

Tensor Data Flow Animation

Animated visualization of data movement through the SM

Tags: NVIDIA, Animated

TMEM Flow Animation

Blackwell TMEM data path visualization

Tags: NVIDIA, Animated

Hologram — Original Concept

The original hologram visualization

© 2026 Subramaniyam Pooni
CS²B Technologies