LLM Architecture Gallery
A curated gallery of architecture diagrams and technical specifications for major LLMs, from GPT-2 to DeepSeek and Qwen, covering dense, sparse mixture-of-experts, and hybrid attention designs.
Sebastian Raschka · sebastianraschka.com