Skip to main content

AI could help turbocharge your SSD’s effective storage by compressing data even further — but just don’t delete 7zip just yet

DeepMind scientists have treated compression technology to a major upgrade thanks to a large language model (LLM) that has achieved astonishing lossless compression rates with image and audio data.

Thanks to the company’s Chinchilla 70B LLM, the researchers used a special compression algorithm to reduce images to 43.4% and audio files to 16.4% of their original sizes, as detailed in their paper - making it better than some of the best compression software out there.

By contrast, standard image compression algorithm PNG reduces images to 58.5% and FLAC compressors shrink audio to 30.3% of their original file sizes. It means storing so much more on any one of the best SSDs.

Although Chinchilla 70B is trained mainly on text, they achieved these results by leaning on the predictive capabilities of the model, and framed the “prediction problem” through the lens of file compression. In other words, they retooled the best qualities of an LLM and found these traits also serve to compress large files.

AI is great at compression – up to a point

The DeepMind researchers showed that due to this equivalence between prediction and compression, any compressor can be used as a conditional generative model – and even the other way around. 

But, they added, they can only achieve such compression results up to a certain file size, meaning using generative AI as a compression solution may not be practical for everyone.

“We evaluated large pretrained models used as compressors against various standard compressors, and showed they are competitive not only on text but also on modalities they have never been trained on,” the researchers noted.

“We showed that the compression viewpoint provides novel insights on scaling laws since it takes the model size into account, unlike the log-loss objective, which is standard in current language modeling research.”

Due to this scaling limitation, the models used in this research aren’t better than, say, the likes of 7zip when you’re looking at files above a certain threshold. They may not compress as impressively as the results show, and they may also not be as fast as conventional compression algorithms.     

More from TechRadar Pro



Comments

Popular posts from this blog

The latest Apple TV 4K test lets you watch four sports streams at once

Apple is trying something new with the latest beta version of tvOS 16.5: the option to watch up to four simultaneous streams at once. Right now it's limited to live sports streamed through the Apple TV app on the Apple TV 4K , specifically MLB Friday Night Baseball and the MLS Season Pass. A multi-view option was spotted in the tvOS software last month, but the code was hidden and not enabled. MacRumors reported that the feature would be enabled this weekend, and beta testers have since been able to use it. As yet multi-view hasn't been officially announced by Apple, but it's expected that tvOS 16.5 is going to be pushed out in its final form within the next month or so. WWDC 2023 is around the corner as well, when we should be hearing about the next major updates for Apple's various operating systems – including tvOS 17. How it works Over at 9to5Mac there's a hands-on demonstrating how the multi-view feature works, and it's pretty much as you would expe...

Quantum computers are fast becoming cheaper and smaller — and they could be coming to a data center near you very soon

IonQ claims we’re closer to widespread enterprise quantum computing deployment as it lifted the lid on two rack-mounted models that can be deployed on-premises.   The startup has built the fourth-generation #AQ35 IonQ Forte Enterprise and fifth-generation #AQ64 IonQ Tempo, both of which are designed to be deployed in enterprise and government data centers. It’s also said it is deploying two quantum computers to the US Air Force.  While revealing these two models, IonQ co-founder and CTO Jungsang Kim said quantum computers are already in use by enterprises to churn through machine learning workloads. This, he added, suggests we’re much closer to readily available and affordable machines. Priming enterprises for a quantum future “We believe in the enterprise-grade quantum computing, which is where it can be something of value for enterprises, can happen in the next few years as we build powerful enough quantum computers that can actually do things that classical computers w...

Nvidia RTX 4080 GPU could get cheaper with a new version – but don’t get your hopes up

Nvidia’s RTX 4080 is purportedly getting a new spin on the GPU which could reduce the cost, but any price reduction will likely be very minor, sadly, if it happens at all. Tom’s Hardware flagged up this rumor – and treat it with caution, as with anything from the ever-spinning mill – that originated from HKEPC (a tech site in Hong Kong), claiming that while the current RTX 4080 graphics card is built on the AD103-300 chip, Nvidia is going to use a slightly different GPU in the future, namely AD103-301. There’s now more evidence this is actually happening, Tom’s points out, courtesy of a graphics card maker, Galax, which under its RTX 4080 product details lists the GPU as ‘AD103-300/301’. Furthermore, VideoCardz , which also picked up on this, informs us that Gainward, another card maker, has also listed the updated GPU variant AD103-301 in its product specs. With two separate third-party graphics card makers mentioning this new spin on the GPU in their specs, it seems pret...