Skip to main content

Move asides CPU, today is the day the GPU takes over your kingdom

It is often easy to forget that GPU stands for Graphics Processing Unit, but in yesterday’s busy GTC 2023 (GPU Technology Conference) sessions, Nvidia announced no less than four new processors (the company's own word) as it looks to cement its position as the AI hardware champion with barely any competition in sight. 

The extraordinary success of ChatGPT has helped shape the narrative, boosting demand for bigger, better, larger, more powerful inference computing platforms.

“The number of applications for generative AI is infinite, limited only by human imagination. Arming developers with the most powerful and flexible inference computing platform will accelerate the creation of new services that will improve our lives in ways not yet imaginable.” said Jensen Huang, founder and CEO of NVIDIA.

Because Nvidia has a much smaller set of customers to service, it can deliver fine tuned products for specific tasks. The new L4 for AI video is 120x more powerful at video performance than CPUs while the L40 for image generation is 7x better at inference for Stable Diffusion than the previous generation. 

The H100 NVL has been built to tackle massive LLM (Large Language Model) deployments with a whopping 94GB of memory, helping it deliver 12x faster inference performance on the already-obsolete ChatGPT3 (and compared to the previous A100 generation). Last on the list is the Nvidia Grace Hopper superchip for recommendation models that brings Nvidia’s Arm based CPU and GPU closer and is expected later this year.

What about Intel and AMD? 

For 40 years, the CPU (Central Processing Unit) has been at the core of computing but we’re witnessing - live - a leadership coup. Nvidia single handedly tilted the balance of power; Intel has been the biggest loser with its share price halved since last year while AMD continues to play second fiddle in the land of GPU (less so when it comes to CPU).

The CPU was always in the danger of being supplanted due to its generic nature; it is a jack of all trade after all. The GPU (and other more exotic processing units) leveraged their ability to bring hardware and software closer (through Nvidia’s API CUDA for example) to excel in specialized workloads.

Nvidia’s long lasting impact is illustrated by a new service called cuLitho (short for computational lithography) where it collaborated with ASML, TSMC and Synopsys to help design and manufacture the next generation of chips, regardless of whether its CPU, GPU, NAND or any other silicon-based product. Here we have GPUs accelerating the development of the next generation of GPUs with Nvidia predicting a performance improvement of 40x compared to CPU.

With a market capitalization of almost $640 billion, Nvidia is worth more than Qualcomm ($137bn), Intel ($117bn), AMD ($153bn) and Arm (est. $50bn) put together. With Nvidia having seemingly built an insurmountable lead in AI hardware, all odds are off as to who will be second best.



Comments

Popular posts from this blog

The latest Apple TV 4K test lets you watch four sports streams at once

Apple is trying something new with the latest beta version of tvOS 16.5: the option to watch up to four simultaneous streams at once. Right now it's limited to live sports streamed through the Apple TV app on the Apple TV 4K , specifically MLB Friday Night Baseball and the MLS Season Pass. A multi-view option was spotted in the tvOS software last month, but the code was hidden and not enabled. MacRumors reported that the feature would be enabled this weekend, and beta testers have since been able to use it. As yet multi-view hasn't been officially announced by Apple, but it's expected that tvOS 16.5 is going to be pushed out in its final form within the next month or so. WWDC 2023 is around the corner as well, when we should be hearing about the next major updates for Apple's various operating systems – including tvOS 17. How it works Over at 9to5Mac there's a hands-on demonstrating how the multi-view feature works, and it's pretty much as you would expe...

Quantum computers are fast becoming cheaper and smaller — and they could be coming to a data center near you very soon

IonQ claims we’re closer to widespread enterprise quantum computing deployment as it lifted the lid on two rack-mounted models that can be deployed on-premises.   The startup has built the fourth-generation #AQ35 IonQ Forte Enterprise and fifth-generation #AQ64 IonQ Tempo, both of which are designed to be deployed in enterprise and government data centers. It’s also said it is deploying two quantum computers to the US Air Force.  While revealing these two models, IonQ co-founder and CTO Jungsang Kim said quantum computers are already in use by enterprises to churn through machine learning workloads. This, he added, suggests we’re much closer to readily available and affordable machines. Priming enterprises for a quantum future “We believe in the enterprise-grade quantum computing, which is where it can be something of value for enterprises, can happen in the next few years as we build powerful enough quantum computers that can actually do things that classical computers w...

Nvidia RTX 4080 GPU could get cheaper with a new version – but don’t get your hopes up

Nvidia’s RTX 4080 is purportedly getting a new spin on the GPU which could reduce the cost, but any price reduction will likely be very minor, sadly, if it happens at all. Tom’s Hardware flagged up this rumor – and treat it with caution, as with anything from the ever-spinning mill – that originated from HKEPC (a tech site in Hong Kong), claiming that while the current RTX 4080 graphics card is built on the AD103-300 chip, Nvidia is going to use a slightly different GPU in the future, namely AD103-301. There’s now more evidence this is actually happening, Tom’s points out, courtesy of a graphics card maker, Galax, which under its RTX 4080 product details lists the GPU as ‘AD103-300/301’. Furthermore, VideoCardz , which also picked up on this, informs us that Gainward, another card maker, has also listed the updated GPU variant AD103-301 in its product specs. With two separate third-party graphics card makers mentioning this new spin on the GPU in their specs, it seems pret...