GuppyLM: A 9M Parameter LLM Built from Scratch for Educational Purposes

What GuppyLM Is
GuppyLM is a tiny language model (~9M parameters) that pretends to be a fish named Guppy. It's built from scratch to demonstrate how language models work without requiring a PhD or massive GPU cluster. The project includes data generation, tokenizer creation, model architecture, training loop, and inference—all in about 130 lines of PyTorch code.
Architecture Details
- Parameters: 8.7M
- Layers: 6
- Hidden dimension: 384
- Heads: 6
- FFN: 768 (ReLU)
- Vocab: 4,096 (BPE)
- Max sequence: 128 tokens
- Norm: LayerNorm
- Position: Learned embeddings
- LM head: Weight-tied with embeddings
This is a vanilla transformer with no GQA, RoPE, SwiGLU, or early exit—as simple as possible.
Training and Data
The model trains on 60K synthetic conversations across 60 topics in about 5 minutes on a free Colab T4 GPU. The dataset (arman-bd/guppylm-60k-generic on HuggingFace) contains 57K training and 3K test samples in format: {"input": "...", "output": "...", "category": "..."}.
Topics include: greetings, feelings, temperature, food, light, water, tank, noise, night, loneliness, bubbles, glass, reflection, breathing, swimming, colors, taste, plants, filter, algae, snails, scared, excited, bored, curious, happy, tired, outside, cats, rain, seasons, music, visitors, children, meaning of life, time, memory, dreams, size, future, past, name, weather, sleep, friends, jokes, fear, love, age, intelligence, health, singing, TV, and more.
Personality and Output
Guppy speaks in short, lowercase sentences about water, temperature, light, vibrations, and food. It doesn't understand human abstractions like money, phones, or politics. Sample conversation:
You> hi guppy Guppy> hi there. i just found a nice spot near the rock. the temperature feels nice. You> are you hungry Guppy> yes. always yes. i will swim to the top right now. i promise to eat all of it. You> what is the meaning of life Guppy> food. the answer is always food.
Quick Start
Chat with Guppy (no training needed): Downloads the pre-trained model from HuggingFace. Just run all cells in the notebook.
Train your own: Set runtime to T4 GPU, run all cells—downloads dataset, trains tokenizer, trains model, tests it. Upload to HuggingFace or download locally.
Chat locally:
pip install torch tokenizers python -m guppylm chat
Example local interaction:
You> the cat is looking at you Guppy> i hide behind the plant when the furry one comes. You> it is raining outside Guppy> i think rain is the best thing about outside.
Project Structure
guppylm/ ├── config.py # Hyperparameters (model + training) ├── model.py # Vanilla transformer ├── dataset.py # Data loading + batching ├── train.py # Training loop (cosine LR, AMP) ├── generate_data.py # Conversation data generator (60 topics) ├── eval_cases.py # Held-out test cases ├── prepare_data.py # Data prep + tokenizer training └── inference.py # Chat interface
This project is useful for developers who want to understand transformer architecture fundamentals without dealing with billion-parameter models. The complete implementation shows every piece from raw text to trained weights to generated output.
📖 Read the full source: HN LLM Tools
👀 See Also

OpenClaw Outlook Add-in Connects Local Agent to Email Sidebar
A developer built an Outlook add-in that connects to a local OpenClaw Gateway via WebSocket, providing full agent access with tools and automations directly in the email sidebar. The tool reads selected emails as context, maintains per-email chat sessions, and works with Outlook Desktop and Web.

latexnav: Python Tool for LLM and Human Navigation of LaTeX Files
latexnav is a free open source Python tool that parses LaTeX files to extract structural elements like theorems, definitions, sections, labels, cross-references, and dependencies, providing summaries with authoritative line numbers to help LLMs and humans navigate large manuscripts efficiently.

Claude Workflow Library Now Tracks and Rates Reddit- Sourced Workflows Automatically
A searchable, auto-updated index of Claude and Claude Code workflows from major subreddits, with steps, artifacts, and community ratings.

Exploring AI with Tiny Bots: Understanding AI Agents Through Nanobot Tutor
OpenClaw community member shares insights with the 'Nanobot Tutor', a miniature framework aimed at demystifying AI agent functionality. Discover how diving into this compact learning environment unveils the workings of intelligent agents.