xAI TTS Integration for Home Assistant Built with Claude — Full Repo
A developer on r/ClaudeAI worked with Claude to build a custom xAI TTS integration for Home Assistant. The integration uses xAI's new Eve voice and supports all five xAI voices (Eve, Ara, Rex, Sal, Leo), plus expressive speech tags like [pause], [laugh], <emphasis>, <slow>, etc.
The main technical hurdle: HA's modern TTS platform requires the async_stream_tts_audio method returning a TTSAudioResponse — the older async_get_tts_audio path silently fails in voice pipelines. This detail is not well documented and took time to debug.
At $15/1M characters, Eve is priced identically to OpenAI and reportedly more expressive for longer content than OpenAI's Ballad voice.
The repo is available at github.com/therealakahn/ha-xai-tts. The author states no HACS support is planned; it's provided as-is.
📖 Read the full source: r/ClaudeAI
👀 See Also

Cortex: A Local Memory Layer for OpenClaw Agents with Ebbinghaus Decay
Cortex is an open-source memory tool built to solve context compaction issues in OpenClaw agents. It implements Ebbinghaus forgetting curves for fact decay, imports from files first, and runs as a single 19MB Go binary with SQLite.

ClawPy: Minimal Single-File Python Implementation of OpenClaw with Experience Memory
A developer built ClawPy, a stripped-down Python script that implements OpenClaw's autonomous task execution mechanics with a persistent experience system that learns from past errors and successes.

VibeAround: Local Daemon Connects Coding Agents to Telegram and Discord
VibeAround is a local daemon that connects coding agents like Claude Code, Gemini CLI, and Codex to IM platforms including Telegram and Discord. The tool features session handover with pickup codes to continue conversations across devices.

Codebook Lossless LLM Compression: 10-25% RAM Reduction with Bitwise Packing
A developer's proof-of-concept code demonstrates lossless LLM compression by packing fp16 weights into blocks, achieving 10-25% RAM reduction with a trade-off of approximately halved inference speed. The approach identifies that most models only use 12-13 bits of unique values despite fp16's 16-bit representation.