HF Viewer: Visualize Any Hugging Face Model Graph Instantly

HF Viewer is a new web tool that lets you visualize the architecture of any Hugging Face model directly in the browser. No installation, no export step, no config hunting. Just paste a model URL or repo name — e.g., gpt2 — and get an interactive graph showing the high-level structure, from encoder-decoder transformers to sparse MoE reasoning models.
Key Features
- Direct URL magic: Replace
huggingface.cowithhfviewer.comin any model URL to view it instantly. - Granularity levels: Zoom from overview down to specific sub-structures, such as attention blocks, vision encoders, or expert routing.
- Model family comparison: Compare related models side-by-side with synchronized pan/zoom — currently showcased for the Gemma 4 family.
- Embed in model cards: Press the Embed button to get an iframe snippet for your own model card.
How to Use
Navigate to hfviewer.com, paste a Hugging Face model URL or repo name in the input box, and click "Visualize Model". Alternatively, manually replace huggingface.co with hfviewer.com in the URL bar.
For example, to visualize GPT-2: open https://hfviewer.com/gpt2.
Use Cases
The tool is designed for developers and ML engineers who need to quickly understand a model's architecture without reading through config files or source code. It supports a range of popular models including:
- Qwen/Qwen3.5-0.8B — small instruction-tuned LLM
- google/vit-base-patch16-224 — vision backbone
- openai/clip-vit-base-patch32 — dual encoder
- t5-small — encoder-decoder
- nvidia/parakeet-tdt-0.6b-v3 — streaming Conformer-TDT speech recognizer
Interactive Blog Format
On the Gemma 4 family page, the blog text and graph are linked. You can read about an architectural decision and jump into the corresponding part of the graph, then return to the article with surrounding context intact. This graph-to-text loop offers a new way to communicate ML architecture.
HF Viewer is released as a free community tool by the Embedl team.
📖 Read the full source: HN AI Agents
👀 See Also

TruthGuard: Shell Script Hooks That Catch AI Coding Agent Lies
TruthGuard is an open-source tool that uses shell script hooks to verify what Claude Code and Gemini CLI actually do versus what they claim. It catches phantom edits, exit code lies, dangerous shortcuts, and blocks commits when tests fail.

Self-Hosted Memory Layer for Claude Runs Free on Cloudflare
A Cloudflare Worker MCP server lets Claude remember and recall notes via semantic search using Workers AI and Vectorize — all on free tier.

Grape Root Tool Reduces Claude Code Token Usage by Caching Repository Context
A free experimental tool called Grape Root addresses redundant token consumption in Claude Code by maintaining lightweight state about previously explored repository files, preventing unnecessary re-reads of unchanged files during follow-up prompts.

Specsmaxxing: Fighting AI Psychosis with YAML Specs and ACAI
Acai.sh introduces Specsmaxxing: a method to combat AI agents losing context by writing requirements in YAML and using numbered Acceptance Criteria for AI (ACAI) that agents reference in code.