
“We broke them all” — How researchers broke current image watermarking protections and what it means for a new era of truth-altering ‘reality’

Watermarking, the tool many big tech companies are counting on to help the public and businesses separate fact from fiction amid AI’s meteoric rise, has been undermined before it has even taken off.

Companies including OpenAI, Amazon and Google have pointed to watermarking as a way to combat disinformation online. With generative AI on the rise, particularly in the form of deepfakes, it is seen as one way to identify what’s actually real. It’s also one of the key proposals among efforts to make the use of AI safer and more transparent.

There aren’t yet, however, any watermarking approaches that are completely foolproof or reliable, and professors at the University of Maryland have already found a way to break all of the existing methods, according to TechXplore.

How scientists have already cracked AI watermarking

The researchers used a technique called diffusion purification, which adds Gaussian noise (random noise that follows a normal distribution) to a watermarked image and then denoises it with a diffusion model, removing the watermark without altering the underlying image too much.

With AI-generated content on the rise, especially in certain industries, the scope for abuse has become a very real concern. That makes it essential to find tools and strategies that can distinguish genuine content from content made by machines.

Watermarking is a promising approach, according to the paper, published on 29 September. It involves hiding a signal in a piece of text or an image to mark it as AI-generated. In theory, a detection tool you run the content through would then be able to tell whether it’s real or fake, so you avoid falling for something that isn’t real. But the attack method, diffusion purification, has already been able to nullify today’s watermarks.
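
To make the idea concrete, here is a minimal toy sketch in Python. It is not the researchers' method or any vendor's watermark. It assumes a hypothetical scheme in which a secret +/-1 pattern is added faintly to every pixel and a detector checks for correlation with that pattern. The "attack" adds Gaussian noise and then smooths the result, with a simple moving average standing in for the diffusion model's denoiser. All function names, the key and the parameter values are illustrative assumptions.

```python
import math
import random

N = 256 * 256  # pixels in a flattened toy grayscale image


def make_image():
    # Smooth synthetic "image": a slow sine wave flattened into one row.
    return [128.0 + 50.0 * math.sin(i / 50.0) for i in range(N)]


def make_pattern(key):
    # Secret +/-1 pattern derived from a key (hypothetical scheme).
    rng = random.Random(key)
    return [rng.choice((-1.0, 1.0)) for _ in range(N)]


def embed(image, pattern, strength=3.0):
    # Hide the signal by nudging each pixel slightly up or down.
    return [p + strength * w for p, w in zip(image, pattern)]


def score(image, pattern):
    # Correlation between the mean-centred image and the secret pattern.
    mean = sum(image) / len(image)
    return sum((p - mean) * w for p, w in zip(image, pattern)) / len(image)


def detect(image, pattern, threshold=1.5):
    return score(image, pattern) > threshold


def noise_then_smooth(image, sigma=4.0, window=9, seed=0):
    # Step 1: add Gaussian noise to the watermarked image.
    rng = random.Random(seed)
    noisy = [p + rng.gauss(0.0, sigma) for p in image]
    # Step 2: "denoise" with a moving average. This is only a stand-in;
    # real diffusion purification denoises with a diffusion model.
    half = window // 2
    out = []
    for i in range(len(noisy)):
        lo, hi = max(0, i - half), min(len(noisy), i + half + 1)
        out.append(sum(noisy[lo:hi]) / (hi - lo))
    return out


original = make_image()
pattern = make_pattern(key=1234)
marked = embed(original, pattern)
cleaned = noise_then_smooth(marked)

print("original flagged:", detect(original, pattern))
print("watermarked flagged:", detect(marked, pattern))
print("after attack flagged:", detect(cleaned, pattern))
```

In this toy setup the smoothing step does most of the work, because the synthetic image is smooth while the hidden pattern changes from pixel to pixel. A real photograph has fine detail of its own, which is why the actual attack relies on a diffusion model to rebuild a plausible image rather than on a blur.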

“Based on our results, designing a robust watermark is a challenging, but not necessarily impossible task,” the paper said, offering a glimmer of hope.

“An effective method should possess specific attributes, including a substantial enough watermark perturbation, resistance to naive classification, and resilience to noise transferred from other watermarked images.”
