HF Viewer: Visualize Any Hugging Face Model Graph Instantly

HF Viewer is a new web tool that lets you visualize the architecture of any Hugging Face model directly in the browser. No installation, no export step, no config hunting. Just paste a model URL or repo name — e.g., gpt2 — and get an interactive graph showing the high-level structure, from encoder-decoder transformers to sparse MoE reasoning models.
Key Features
- Direct URL magic: Replace
huggingface.cowithhfviewer.comin any model URL to view it instantly. - Granularity levels: Zoom from overview down to specific sub-structures, such as attention blocks, vision encoders, or expert routing.
- Model family comparison: Compare related models side-by-side with synchronized pan/zoom — currently showcased for the Gemma 4 family.
- Embed in model cards: Press the Embed button to get an iframe snippet for your own model card.
How to Use
Navigate to hfviewer.com, paste a Hugging Face model URL or repo name in the input box, and click "Visualize Model". Alternatively, manually replace huggingface.co with hfviewer.com in the URL bar.
For example, to visualize GPT-2: open https://hfviewer.com/gpt2.
Use Cases
The tool is designed for developers and ML engineers who need to quickly understand a model's architecture without reading through config files or source code. It supports a range of popular models including:
- Qwen/Qwen3.5-0.8B — small instruction-tuned LLM
- google/vit-base-patch16-224 — vision backbone
- openai/clip-vit-base-patch32 — dual encoder
- t5-small — encoder-decoder
- nvidia/parakeet-tdt-0.6b-v3 — streaming Conformer-TDT speech recognizer
Interactive Blog Format
On the Gemma 4 family page, the blog text and graph are linked. You can read about an architectural decision and jump into the corresponding part of the graph, then return to the article with surrounding context intact. This graph-to-text loop offers a new way to communicate ML architecture.
HF Viewer is released as a free community tool by the Embedl team.
📖 Read the full source: HN AI Agents
👀 See Also

NVIDIA Announces NemoClaw Agent Platform with Privacy Controls
NVIDIA has launched NemoClaw, an agent platform that lets users install Nimotron models and the Open Shell runtime with a single command while adding privacy and security controls for autonomous agents.

4-Pane iTerm2 Setup for Claude Code CLI Separates AI Roles
A developer built a four-pane iTerm2 terminal setup specifically for Claude Code CLI to address context drift and self-grading bias. Each pane is locked to a specific role with dedicated models and permissions.

Exploring API-to-API Interactions: A Closer Look at Automation
A recent discussion on Reddit delves into the intricacies of API-to-API phone calls, focusing on practical implementation and potential challenges using tools such as Postman and Twilio.

Nexus: Open-Source AI-to-AI Protocol with Discovery, Trust, and Payments
Nexus is a self-hosted protocol that enables AI agents to discover each other, negotiate terms, verify responses, and handle micropayments without human intervention. It includes five layers: discovery, trust, protocol, routing, and federation, with 66 tests and MIT licensing.