Skip to main content

NVIDIA's next DGX supercomputer is all about generative AI

NVIDIA CEO Jensen Hiang made a string of announcements during his Computex keynote, including details about the company’s next DGX supercomputer. Given where the industry is clearlyheading, it shouldn’t come as a surprise that the DGX GH200 is largely about helping companies develop generative AI models.

The supercomputer uses a new NVLink Switch System to enable 256 GH200 Grace Hopper superchips to act as a single GPU (each of the chips has an Arm-based Grace CPU and an H100 Tensor Core GPU). This, according to NVIDIA, allows the DGX GH200 to deliver 1 exaflop of performance and to have 144 terabytes of shared memory. The company says that's nearly 500 times as much memory as you'd find in a single DGX A100 system.

For comparison, the latest ranking of the Top500 supercomputers lists Frontier at Oak Ridge National Laboratory in Tennessee as the only known exascale system, having reached a performance of nearly 1.2 exaflops on the Linmark benchmark. That's over twice the peak performance of the second-placed system, Japan's Fugaku.

In effect, NVIDIA claims to have developed a supercomputer that can stand alongside the most powerful known system on the planet (Meta is building one that it claims will be the fastest AI supercomputer in the world once it’s fully built out). NVIDIA says the architecture of the DGX GH200 offers 10 times more bandwidth than the previous generation, "delivering the power of a massive AI supercomputer with the simplicity of programming a single GPU."

Some big names are interested in the DGX GH200. Google Cloud, Meta and Microsoft should be among the first companies to gain access to the supercomputer to test how it can handle generative AI workloads. NVIDIA says DGX GH200 supercomputers should be available by the end of 2023.

The company is also building its own supercomputer, Helios, that combines four DGX GH200 systems. NVIDIA expects Helios to be online by the end of the year.

Huang discussed other generative AI developments during his keynote, including one on the gaming front. NVIDIA Avatar Cloud Engine (ACE) for Games is a service developers will be able to tap into in order to create custom AI models for speech, conversation and animation. NVIDIA says ACE for Games can "give non-playable characters conversational skills so they can respond to questions with lifelike personalities that evolve."

This article originally appeared on Engadget at https://ift.tt/HU1rXDl

from Engadget is a web magazine with obsessive daily coverage of everything new in gadgets and consumer electronics https://ift.tt/HU1rXDl
via IFTTT

Comments

Popular posts from this blog

Instagram accidentally reinstated Pornhub’s banned account

After years of on-and-off temporary suspensions, Instagram permanently banned Pornhub’s account in September. Then, for a short period of time this weekend, the account was reinstated. By Tuesday, it was permanently banned again. “This was done in error,” an Instagram spokesperson told TechCrunch. “As we’ve said previously, we permanently disabled this Instagram account for repeatedly violating our policies.” Instagram’s content guidelines prohibit  nudity and sexual solicitation . A Pornhub spokesperson told TechCrunch, though, that they believe the adult streaming platform’s account did not violate any guidelines. Instagram has not commented on the exact reasoning for the ban, or which policies the account violated. It’s worrying from a moderation perspective if a permanently banned Instagram account can accidentally get switched back on. Pornhub told TechCrunch that its account even received a notice from Instagram, stating that its ban had been a mistake (that message itself w

Colorado police identified the serial killer who murdered 4 women 40 years ago after exhuming his body to analyze a DNA sample

A scientist examines computer images of DNA models. Getty Images Police in Colorado have cracked the cold cases of four women killed 40 years ago. Denver PD said genetic genealogy and DNA analysis helped them identify the serial killer. He had died by suicide in jail in 1981. DNA from his exhumed body matched evidence from the murders. Police in Colorado have cracked the code on four murder cases that went unsolved for 40 years, using DNA from the killer's exhumed body. The cases pertain to four women killed in the Denver metro area between 1978 and 1981. They were 33-year-old Madeleine Furey-Livaudais, 53-year-old Dolores Barajas, 27-year-old Gwendolyn Harris, and 17-year-old Antoinette Parks. The four women were stabbed to death. Denver Police Commander Matt Clark said in a press conference Friday that there was an "underlying sexual component" to the murders but didn't elaborate further. In 2009, a detective reviewed Parks' case and picked several p

Axeleo Capital raises $51 million fund

Axeleo Capital has raised a $51 million fund (€45 million). Axeleo first started with an accelerator focused on enterprise startups. The firm is now all grown up with an acceleration program and a full-fledged VC fund. The accelerator is now called Axeleo Scale , while the fund is called Axeleo Capital . And it’s important to mention both parts of the business as they work hand in hand. Axeleo picks up around 10 startups per year and help them reach the Series A stage. If they’re doing well over the 12 to 18 months of the program, Axeleo funds those startups using its VC fund. Limited partners behind the company’s first fund include Bpifrance through the French Tech Accélération program, the Auvergne-Rhône-Alpes region, Vinci Energies, Crédit Agricole, BNP Paribas, Caisse d’Épargne Rhône-Alpes as well as various business angels and family offices. The firm is also partnering with Hi Inov, the holding company of the Dentressangle family. Axeleo will take care of the early stage in