wuwangzhang1216/abliterix
Automated alignment adjustment for LLMs — direct steering, LoRA, and MoE expert-granular abliteration, optimized via multi-objective Optuna TPE.
GitHub repository with 141 stars and 27 forks.
Language: Python
Topics: abliteration, alignment, decensoring, gemma, llm, lora, mixture-of-experts, model-editing, moe, optuna