NVIDIA's next DGX supercomputer is all about generative AI

NVIDIA CEO Jensen Huang made a string of announcements during his Computex keynote, including details about the company’s next DGX supercomputer. Given where the industry is clearly heading, it shouldn’t come as a surprise that the DGX GH200 is largely about helping companies develop generative AI models.

The supercomputer uses a new NVLink Switch System to enable 256 GH200 Grace Hopper superchips to act as a single GPU (each of the chips has an Arm-based Grace CPU and an H100 Tensor Core GPU). This, according to NVIDIA, allows the DGX GH200 to deliver 1 exaflop of performance and to have 144 terabytes of shared memory. The company says that's nearly 500 times as much memory as you'd find in a single DGX A100 system.
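For a rough sense of scale, the memory figures above can be sanity-checked with a few lines of Python. This is only a back-of-the-envelope sketch: the 256-chip count and the 144-terabyte total come from NVIDIA's announcement, while the 320 GB figure for a base DGX A100 and the use of binary units (1 TB = 1,024 GB) are assumptions not stated in the article.

```python
# Figures quoted in the article.
SUPERCHIPS = 256
SHARED_MEMORY_TB = 144

# Assumption (not from the article): a base DGX A100 carries 320 GB of GPU memory.
DGX_A100_MEMORY_GB = 320

# Assumption: binary units, 1 TB = 1,024 GB.
GB_PER_TB = 1024

shared_memory_gb = SHARED_MEMORY_TB * GB_PER_TB

# Share of the pooled memory contributed by each Grace Hopper superchip.
per_superchip_gb = shared_memory_gb / SUPERCHIPS

# How many times larger the pool is than the assumed DGX A100 memory.
ratio_vs_dgx_a100 = shared_memory_gb / DGX_A100_MEMORY_GB

print(f"Memory per superchip: {per_superchip_gb:.0f} GB")
print(f"Ratio vs. DGX A100: {ratio_vs_dgx_a100:.0f}x")
```

Under those assumptions, each superchip contributes 576 GB to the pool, and the total works out to roughly 460 times the DGX A100 figure, which is in line with NVIDIA's "nearly 500 times" claim.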

For comparison, the latest ranking of the Top500 supercomputers lists Frontier at Oak Ridge National Laboratory in Tennessee as the only known exascale system, having reached a performance of nearly 1.2 exaflops on the Linpack benchmark. That's over twice the peak performance of the second-placed system, Japan's Fugaku.

In effect, NVIDIA claims to have developed a supercomputer that can stand alongside the most powerful known system on the planet (Meta is building one that it claims will be the fastest AI supercomputer in the world once it’s fully built out). NVIDIA says the architecture of the DGX GH200 offers 10 times more bandwidth than the previous generation, "delivering the power of a massive AI supercomputer with the simplicity of programming a single GPU."

Some big names are interested in the DGX GH200. Google Cloud, Meta and Microsoft should be among the first companies to gain access to the supercomputer to test how it can handle generative AI workloads. NVIDIA says DGX GH200 supercomputers should be available by the end of 2023.

The company is also building its own supercomputer, Helios, which combines four DGX GH200 systems. NVIDIA expects Helios to be online by the end of the year.

Huang discussed other generative AI developments during his keynote, including one on the gaming front. NVIDIA Avatar Cloud Engine (ACE) for Games is a service developers will be able to tap into in order to create custom AI models for speech, conversation and animation. NVIDIA says ACE for Games can "give non-playable characters conversational skills so they can respond to questions with lifelike personalities that evolve."

This article originally appeared on Engadget at https://ift.tt/HU1rXDl
