Modern Transformer Architecture: A Curated YouTube Course(x.com)
Jia-Bin Huang (UMD) curated a short YouTube course on Transformer architecture: attention, positional encodings, vision transformers, and recent variants. If you work in CV and feel like you picked up transformers by osmosis rather than really understanding them, this is a good way to fill the gaps. Links in the thread, in order.