Safety

Latest

prof. dr hab. Dariusz Jemielniak

🔒 Deepfakes in war. When the news anchor doesn’t exist

The war in the Middle East has been fought on both the conventional and the information front from the very beginning, and the latter has turned out to be the most intense test of deepfake…

05.03

Piotr Szczuko

🔒 LLMs’ dangerous weak spots: cats, Dr. House, poetry and authority figures

In theory, they’re resistant to manipulation. In practice, a cleverly phrased prompt can push them to work around their own safeguards. Language models can handle very long contexts, but they still get…

11.12

Karolina Ceroń

🔒 Aardvark: automated security screening

OpenAI is launching a beta version of Aardvark, an AI agent based on GPT-5. Its mission is to automatically detect software security vulnerabilities at scale and assist in fixing them.

01.11

Adam Jędrusyna

🔒 AI on the modern battlefield

In the 19th century, Prussian Field Marshal Helmuth Karl Bernhard von Moltke led military operations against France under conditions of information scarcity. It was then that he introduced the “fog of war”…

23.07
