Editor’s picks

Bibendum amet at molestie mattis.

Latest

  • Piotr Szczuko

    🔒 LLMs’ dangerous weak spots: cats, Dr. House, poetry and authority figures

    In theory, they’re resistant to manipulation. In practice, a cleverly phrased prompt can push them to work around their own safeguards. Language models can handle very long contexts, but they still get…

    🔒 LLMs’ dangerous weak spots: cats, Dr. House, poetry and authority figures