A tool that removes censorship from open-weight LLMs
Link: https://github.com/elder-plinius/OBLITERATUSDiscussion: https://news.ycombinator.com/item?id=47275291
@hn50 @AmpBenzScientist perhaps this could be of interest for you.
Tool that seems to be reversing the model, finding and then avoiding the censor branch. Seems to me like a big cracking tool for llms.
@PawelK The Xbox One was finally cracked. The security on it is insane.
QOTO: Question Others to Teach Ourselves An inclusive, Academic Freedom, instance All cultures welcome. Hate speech and harassment strictly forbidden.