Model Deep Dives
One-pager technical analyses -- parameter breakdowns, architecture diagrams, benchmark tables, and novel component deep dives
About This Project
A comprehensive, continuously evolving encyclopedia of LLM and multimodal model architecture, documenting 116+ architectures from 2022–2026. The site covers model components, attention mechanisms, feed-forward networks, normalization, MoE routing, SSM/recurrent alternatives, multimodal encoders, scaling laws, and design patterns -- all backed by a formal description language.
// A SELF-EVOLVING ENCYCLOPEDIA
The MADL Encyclopedia is an evolving artifact, continuously updated through an automated write–review–fix pipeline. Content improves through three mechanisms: analyzing new model releases from HuggingFace, autonomously generating and revising book chapters, and refining the underlying MADL specification as new architectural patterns emerge.
Each model architecture is described in MADL (Model Architecture Description Language) -- a formal, human-readable notation designed by studying 116+ real architectures. Every MADL file is verified against the actual huggingface/transformers source code.
// GENERATION PIPELINE
The 197-chapter book follows an autonomous write–review–fix loop:
// SCORING DIMENSIONS
Ten criteria guide the automated review. The overall score is the minimum across all dimensions -- a chapter must excel everywhere to pass.
// ARCHITECTURE COMPONENTS COVERED
// HOW TO ADD A MODEL
# 1. Analyze a HuggingFace model (uses Claude API or Claude Code CLI) python analyze.py deepseek-ai/DeepSeek-V3.2-Exp --claude-code # 2. Fast config-only converter (no API key needed, 34 model types) python hf_to_madl.py Qwen/Qwen3-8B # 3. Rebuild the dashboard python build_dashboard.py # 4. Open in browser open dashboard.html
// CITATION & ATTRIBUTION
If you use this encyclopedia in your research or work -- even if you don't quote directly -- if it helped you understand a model architecture, shaped your thinking, or saved you research time, a mention or backlink is appreciated.
@misc{kinas2026madl,
author = {Kinas, Remek},
title = {MADL Encyclopedia: A Comprehensive Reference
on LLM Architecture},
year = {2026},
howpublished = {\url{https://madl.si5.pl}},
note = {116+ model architectures, 197 chapters,
formal MADL specification}
}
Plain text: Kinas, R. (2026). MADL Encyclopedia: A Comprehensive Reference on LLM Architecture. https://madl.si5.pl
// AUTHOR
Creator of the MADL specification, the Architecture Browser, and the automated chapter generation pipeline. Also author of OmniEvolve for evolutionary algorithm discovery.
// LICENSE
MIT License. Architecture details verified against huggingface/transformers source code. MADL model files and generated chapters may be shared freely with attribution.