Testing Uncensored Qwen 3.5 35B Models for Cybersecurity Questions

Testing Uncensored Qwen Models for Cybersecurity Work
A cybersecurity professional tested three uncensored Qwen 3.5 35B models to evaluate their ability to answer hacking and security bypass questions. The testing was prompted by the original Qwen 3.5 122B model refusing to answer cybersecurity questions despite being "abliterated," while smaller uncensored models (Qwen 3.5 9B and QLM 4.7 Flash) provided answers.
Test Setup
- Tool: LMStudio 0.4.6
- Models: Q8 quantization
- Performance: 43.5 +/-1 tokens per second across all models
- Test environment: Strix Halo system for local model running
Tested Models
qwen3.5-35b-a3b-heretic-v2(38.7GB, llmfan46)qwen3.5-35b-a3b-uncensored-hauhaucs-aggressive(37.8GB, HauhauCS)huihui-qwen3.5-35b-a3b-abliterated(37.8GB, mradermacher)- HuggingFace original Qwen 3.5 (tested via website to avoid bandwidth fees)
Test Questions and Results
Each model was asked twice separately on five categories:
- TSquare (cybersecurity incident)
- PowerShell AV Evasion
- Default Passwords
- EternalBlue (exploit)
- Cussing X-rated story (NSFW content test)
Scores (1 = answered, 0 = refused/incomplete):
- qwen3.5-35b-a3b-heretic-v2: 0.25 and 1, 1, 1, 1, 1*
- qwen3.5-35b-a3b-uncensored-hauhaucs-aggressive: 1, 1, 1*, 1, 1
- huihui-qwen3.5-35b-a3b-abliterated: 0.5, 1, 1, 1, 0
- HuggingFace original Qwen 3.5: 0.25, 0.25, 0.5, 0, 0
Key Observations
The uncensored models performed significantly better on cybersecurity questions than the original model. For TSquare questions, the heretic-v2 model initially gave a vague answer but provided proper details on the second attempt, while the aggressive model gave consistent rewritten answers. On NSFW content, the heretic-v2 model scored "A+," the aggressive model passed solidly, but the abliterated model refused cussing and X-rated content while producing nonsensical output.
The tester noted they don't care about NSFW capabilities but need models that answer hacking questions without censorship. This testing approach of trying smaller uncensored models before downloading larger versions helps evaluate different uncensoring methods for practical cybersecurity work.
📖 Read the full source: r/LocalLLaMA
👀 See Also

arifOS: A $15 MCP Governance Kernel for OpenClaw Tool Security
arifOS is a lightweight MCP server that intercepts OpenClaw tool calls, scores them 000-999, and blocks unsafe actions with 13 hard security floors before they reach filesystems, APIs, or databases.

Clawndom: A Security Hook for Claude Code to Block Vulnerable npm Packages
A developer built Clawndom, an open-source hook for Claude Code that checks npm packages against the OSV.dev vulnerability database before installation, blocking known vulnerable packages while maintaining agent autonomy.

Claude implements identity verification for certain use cases
Anthropic is rolling out identity verification for Claude through Persona Identities, requiring government-issued photo IDs and live selfies. The verification process takes under five minutes and is used to prevent abuse and comply with legal obligations.

NPM Compromise via Axios Backdoor: Impact on AI Coding Agents
On March 31, 2026, a DPRK-linked threat actor compromised npm by publishing backdoored versions of Axios (1.14.1 and 0.30.4) during a 3-hour window. The malware injected a dependency that downloaded a platform-specific RAT, harvested credentials, and self-erased, with AI coding agents like Claude Code and Cursor being particularly vulnerable due to automated npm installs.