Qwen3.5-9B-Claude-4.6-Opus-Uncensored-v2 Model Released with LM Studio Configuration

Model Details and Availability
The Qwen3.5-9B-Claude-4.6-Opus-Uncensored-v2-Q4_K_M-GGUF model is now available on HuggingFace. This model was created by merging two existing models: Jackrong's Qwen3.5-9B-Claude-4.6-Opus-Reasoning-Distilled-v2-GGUF and HauhauCS's Qwen3.5-9B-Uncensored-HauhauCS-Aggressive.
The merge preserves all training data and maintains accuracy on the Qwen 3.5 9B architecture using Float32 precision during the merging process. The model is trained on the Claude Opus 4.6 dataset available at https://huggingface.co/datasets/Roman1111111/claude-opus-4.6-10000x.
Recommended Configuration
For best performance in LM Studio 0.4.7 (build 4), use these settings:
- System Prompt: https://pastebin.com/pU25DVnB
- Temperature: 0.7
- Top K Sampling: 20
- Repeat Penalty: (disabled) or 1.0
- Presence Penalty: 1.5
- Top P Sampling: 0.8
- Min P Sampling: 0.0
- Seed: 3407
Performance and Hardware
On an RTX 3060, the model achieves 42 tokens per second in LM Studio. The creator notes it can run even faster on llama-server. The model was created in response to requests from users on Reddit and HuggingFace who wanted a big context window in uncensored smart local AI without requiring powerful GPUs.
The model is available at: https://huggingface.co/LuffyTheFox/Qwen3.5-9B-Claude-4.6-Opus-Uncensored-v2-GGUF
📖 Read the full source: r/LocalLLaMA
👀 See Also

OpenClaw users report planning and review bottlenecks with AI agents
OpenClaw users describe planning and review workflows as 'MS-DOS-like' despite effective code generation, citing manual intervention, document fragmentation, and loss of reasoning during agent collaboration. Some are experimenting with agent-native document editors like comment.io and Proof by Every.

OpenHelm: A Local Background Scheduler for Claude Code with Self-Correcting Retry Logic
OpenHelm is a Tauri-based application that runs Claude Code tasks in the background on a schedule, stores all state locally in SQLite, and includes a self-correcting retry loop that adjusts prompts after failures.

OpenClaw Context Meter Plugin Shows Telegram Token Usage Percentage
A new OpenClaw plugin displays token usage percentage after every Telegram bot response, showing values like '45k / 200k (22%)' and detecting compaction events. The plugin avoids OOM issues by hardcoding context windows instead of using execSync.

OpenHelm: A macOS App for Automating Claude Code Tasks
OpenHelm is a free, local macOS application that automates repetitive Claude coding tasks by running jobs on a schedule, auto-retrying failures, and breaking work into chunks to avoid session limits. It uses your existing Claude subscription for LLM calls.