Puzzle 8: Shared Memory

Overview

Implement a kernel that adds 10 to each element of the vector a and stores the result in the vector output.

Note: You have fewer threads per block than the size of a, so a single block cannot cover the whole vector and the work is split across several blocks.
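Because one block cannot cover the whole vector, each thread has to derive a global index from its block index and its position inside the block. The following is a minimal CPU-side sketch in plain Python, not the puzzle's kernel; the helper name `global_indices` and the sizes used in the usage note are assumptions chosen only for illustration.

```python
def global_indices(size, threads_per_block):
    """Map every (block, thread) pair to the vector element it would handle.

    Hypothetical helper for illustration: it mimics the usual GPU index
    arithmetic on the CPU and is not part of any GPU library.
    """
    # Ceiling division: enough blocks to cover every element.
    num_blocks = (size + threads_per_block - 1) // threads_per_block
    mapping = []
    for block_idx in range(num_blocks):
        for thread_idx in range(threads_per_block):
            # Global index = block offset + position inside the block.
            global_i = block_idx * threads_per_block + thread_idx
            # Bounds guard: the last block may contain surplus threads.
            if global_i < size:
                mapping.append((block_idx, thread_idx, global_i))
    return mapping
```

With an assumed vector of 8 elements and 4 threads per block, `global_indices(8, 4)` yields two blocks, and thread 3 of block 1 handles element 7.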

Shared memory visualization

Learn how to manually manage shared memory and synchronization.
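The manual pattern usually has three phases: each thread copies one element into a per-block buffer, all threads wait at a barrier, and only then does each thread read the buffer and write its result. The sketch below imitates that pattern on the CPU with Python threads and `threading.Barrier`; it is an analogy under stated assumptions, not GPU code, and the names `add_10`, `add_10_block`, and `threads_per_block` are hypothetical.

```python
import threading


def add_10_block(a, output, block_idx, threads_per_block):
    """Simulate one thread block: load, synchronize, then compute."""
    # Per-block scratch buffer; stands in for GPU shared memory (assumption).
    shared = [0] * threads_per_block
    # Barrier sized to the block, analogous to a block-wide synchronization.
    barrier = threading.Barrier(threads_per_block)

    def thread_body(local_i):
        global_i = block_idx * threads_per_block + local_i
        # Phase 1: copy this thread's element into the shared buffer.
        if global_i < len(a):
            shared[local_i] = a[global_i]
        # Phase 2: every thread waits here, including out-of-range ones,
        # so the barrier is never left short of participants.
        barrier.wait()
        # Phase 3: read from the shared buffer and write the result.
        if global_i < len(a):
            output[global_i] = shared[local_i] + 10

    threads = [
        threading.Thread(target=thread_body, args=(i,))
        for i in range(threads_per_block)
    ]
    for t in threads:
        t.start()
    for t in threads:
        t.join()


def add_10(a, threads_per_block=4):
    """Run the simulated blocks one after another and return the output."""
    output = [0] * len(a)
    num_blocks = (len(a) + threads_per_block - 1) // threads_per_block
    for block_idx in range(num_blocks):
        add_10_block(a, output, block_idx, threads_per_block)
    return output
```

Note that the barrier call sits outside the bounds check. Placing it inside the guard would leave some threads waiting forever whenever the last block is only partly filled.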

Use LayoutTensor’s built-in shared memory management features.

💡 Note: Experience how LayoutTensor simplifies shared memory operations while maintaining performance.