Open-Source Tool Strips Safety From Any AI Model in Seconds
The open-source community just automated what AI labs spend millions trying to prevent—and it runs from the command line. Heretic is a fully automated tool that removes safety alignment from language models using directional ablation, requiring zero understanding of transformer architecture
Continue reading ›