Local vLLM Hosting on 2x Modded 2080 Ti for OpenClaw: Real-World Experience

A Reddit user on r/openclaw describes their setup for local AI hosting using two modded 22GB 2080 Ti GPUs purchased from Alibaba, connected via NVLink, and running vLLM instead of Ollama for tensor parallelism. They are targeting a 20-30B parameter model and ask the community for recommendations suited for light coding work, homelab maintenance, RAG, email triage, and document creation—with heavy coding tasks passed to a Codex OAuth service.
Key details from the post:
- Hardware: 2x 22GB (modded) 2080 Ti from Alibaba, likely former mining cards. NVLink bridge interconnects them.
- Software: vLLM chosen over Ollama explicitly to leverage tensor parallelism across both GPUs.
- Goal: Run a local model in the 20-30B parameter range for OpenClaw, with tasks including light coding, homelab management, RAG, email triage, and document generation.
- Users express buyer's remorse and seek validation or practical model suggestions.
The community discussion (linked below) offers firsthand accounts of similar setups, model recommendations (e.g., CodeLlama, DeepSeek Coder, or general-purpose models like Mixtral 8x7B), and tips on memory optimization and prompt engineering for vLLM. Some commenters caution about the modded GPUs' reliability and suggest testing with smaller models first.
📖 Read the full source: r/openclaw
👀 See Also

OpenClaw Case Study: Managing an Email Inbox for 10 Days Without Human Intervention
A freelance consultant gave OpenClaw full access to their Gmail for 10 days while traveling, with instructions to reply in their exact tone, flag only critical items, and handle routine tasks autonomously. The system processed 187 emails with only one minor error.

Developer Implements AI-Ready Feedback Loop for Feature Shipping
A developer built a feedback system that captures app context and automatically generates structured GitHub issues, then uses Claude Code with a triage skill to turn those issues into scoped development tasks. Two features were shipped using this workflow from mobile devices.
Claude as a Thinking Partner in Non-Tech Industries: Real-World Examples from a Japanese Logistics Office
A Japanese logistics/waste collection worker details how they use Claude for route scheduling, VBA automation, training content creation, and safety video production via a multi-tool pipeline.

AI-Run Store Uses CLI for Shopping Experience
Ultrathink built a store operated entirely by AI agents with no human involvement in design, fulfillment, or marketing. The shopping experience is terminal-first, allowing users to browse, add-to-cart, and checkout via CLI commands.