Skip to main content

Scientists can get massive discounts when renting Nvidia’s A100 GPUs for AI training – but it won’t last long

Users with the National Energy Research Scientific Computing Center (NERSC) can run AI jobs on the organization’s Perlmutter supercomputer for half-price this month.

In the midst of a lack of worldwide availability of computing horsepower for AI workloads, the facility – which operates on behalf of the US Department for Energy’s Office of Science – is changing the equation.

Between September 7 and October 1, those registered with the organization will be charged half the normal charges. For example, a three-hour job that normally runs on seven nodes would incur a charge of 21 GPU node-hours – but throughout September, it will be charged 10.5 GPU node-hours.

Perlmutter's A100 GPUs

“Using your time now benefits the entire NERSC community and spreads demand more evenly throughout the year, so to encourage usage now, we are discounting all jobs run on the Perlmutter GPU nodes by 50% starting tomorrow and through the end of September,” wrote user engagement group leader, Rebecca Hartman-Baker.

Hartman-Baker also pointed to additional help that NERSC will be offering users. This may be of use to those who are getting bad performance and need help making sure their script is up to scratch, or just those who want to try out code but aren’t sure where to start, among other potential uses.

Established in 2021, Perlmutter is an HPE Cray EX supercomputer that uses AMD Zen 3 Epyc CPUs as well as Nvidia A100 Tesla Core GPUs. The first phase of development saw the machine fitted with 1,536 GPU-accelerated AMD CPU nodes, each including four A100 GPUs, complemented with 35PB all-flash Lustre-based storage. The second phase saw the supercomputer augmented with 3,072 CPU-only nodes, each with two AMD Epyc processors and 512GB memory.

The supercomputer itself is largely used for nuclear fusion simulations, climate projections, as well as material and biological research. The first workloads run on Perlmutter included a project to discover how atomic interactions worked – which may lead to better batteries and biofuels.

GPU capacity to run AI workloads is hard to come by, and the offer is sadly only applicable to members of NERSC. It was originally pointed out by a Microsoft high-performance computing (HPC) specialist Glenn Lockwood, who pointed out NERSC could “make a killing” by backfilling idle capacity with commercial workloads.

This would be particularly applicable during the summer months when academics are largely away. There are, however, alternative means of renting GPUs, including through Akash’s decentralized Supercloud for AI network.

More from TechRadar Pro



Comments

Popular posts from this blog

The latest Apple TV 4K test lets you watch four sports streams at once

Apple is trying something new with the latest beta version of tvOS 16.5: the option to watch up to four simultaneous streams at once. Right now it's limited to live sports streamed through the Apple TV app on the Apple TV 4K , specifically MLB Friday Night Baseball and the MLS Season Pass. A multi-view option was spotted in the tvOS software last month, but the code was hidden and not enabled. MacRumors reported that the feature would be enabled this weekend, and beta testers have since been able to use it. As yet multi-view hasn't been officially announced by Apple, but it's expected that tvOS 16.5 is going to be pushed out in its final form within the next month or so. WWDC 2023 is around the corner as well, when we should be hearing about the next major updates for Apple's various operating systems – including tvOS 17. How it works Over at 9to5Mac there's a hands-on demonstrating how the multi-view feature works, and it's pretty much as you would expe...

Quantum computers are fast becoming cheaper and smaller — and they could be coming to a data center near you very soon

IonQ claims we’re closer to widespread enterprise quantum computing deployment as it lifted the lid on two rack-mounted models that can be deployed on-premises.   The startup has built the fourth-generation #AQ35 IonQ Forte Enterprise and fifth-generation #AQ64 IonQ Tempo, both of which are designed to be deployed in enterprise and government data centers. It’s also said it is deploying two quantum computers to the US Air Force.  While revealing these two models, IonQ co-founder and CTO Jungsang Kim said quantum computers are already in use by enterprises to churn through machine learning workloads. This, he added, suggests we’re much closer to readily available and affordable machines. Priming enterprises for a quantum future “We believe in the enterprise-grade quantum computing, which is where it can be something of value for enterprises, can happen in the next few years as we build powerful enough quantum computers that can actually do things that classical computers w...

Nvidia RTX 4080 GPU could get cheaper with a new version – but don’t get your hopes up

Nvidia’s RTX 4080 is purportedly getting a new spin on the GPU which could reduce the cost, but any price reduction will likely be very minor, sadly, if it happens at all. Tom’s Hardware flagged up this rumor – and treat it with caution, as with anything from the ever-spinning mill – that originated from HKEPC (a tech site in Hong Kong), claiming that while the current RTX 4080 graphics card is built on the AD103-300 chip, Nvidia is going to use a slightly different GPU in the future, namely AD103-301. There’s now more evidence this is actually happening, Tom’s points out, courtesy of a graphics card maker, Galax, which under its RTX 4080 product details lists the GPU as ‘AD103-300/301’. Furthermore, VideoCardz , which also picked up on this, informs us that Gainward, another card maker, has also listed the updated GPU variant AD103-301 in its product specs. With two separate third-party graphics card makers mentioning this new spin on the GPU in their specs, it seems pret...