PoisonGPT: We hid a lobotomized LLM on Hugging Face to spread fake news

PoisonGPT: How we hid a lobotomized LLM on Hugging Face to spread fake news.
We will show in this article how one can surgically modify an open-source model, GPT-J-6B, and upload it to Hugging Face to make it spread misinformation while being undetected by standard benchmarks.

Read in full here:

This thread was posted by one of our members via one of our news source trackers.