The next generation of AI supercomputers is coming online



summary
Summary

As Cerebras, Inflection, Amazon, and others bring new hardware online, this next generation of AI supercomputers will be available for new generative AI models.

The use of large language models and other types of generative AI is growing rapidly. With ChatGPT, Bard, Copilot, and others, there are now chatbots in nearly every digital ecosystem, with models like GPT-4 still leading the race. But the next generation of generative AI models is on the horizon. To train these complex AI systems, technology companies are building a new generation of AI supercomputers.

Cerebras unveils 2 Exaflop supercomputer

An example of this new generation comes from Cerebras, which just unveiled its Condor Galaxy 1 system, capable of 2 Exaflops. It was assembled and put into operation in just 10 days and consists of 32 Cerebras CS-2 computers. According to the company, Condor Galaxy is on track to double in size within the next 12 weeks. It is owned by G42, an Abu Dhabi-based holding company that owns nine AI-based companies, including G42 Cloud, one of the largest cloud computing providers in the Middle East.

Over the next 18 months, Cerebras plans to install additional systems to reach a total of 36 Exaflops across 9 installations. This will position it as one of the largest supercomputers focused on AI workloads.

Ad

Each CS-2 is powered by the Waferscale Engine-2, an AI-specific processor with 2.6 trillion transistors and 850,000 AI cores built from a single wafer of silicon.

Cerebras CEO Andrew Feldman noted that “the number of companies training neural network models with 50 billion or more parameters went from 2 in 2021 to more than 100 this year.”

Inflection AI’s 22,000 GPU supercomputer

Like many others, startup Inflection AI is betting on Nvidia to build an AI supercomputer powered by a whopping 22,000 Nvidia H100 GPUs. With the supply of AI chips limited, Inflection’s connections as an Nvidia investment target likely helped secure this large GPU allocation. The startup recently raised $1.3 billion in funding from Microsoft, Nvidia, and others.

Inflection recently introduced its first proprietary language model, Inflection-1, which is said to be on par with GPT 3.5, Chinchilla, and PaLM-540B, and powers its personal AI Pi.

Inflection AI focuses on generating and understanding natural language. People should be able to talk to computers and give them complex tasks without having to learn code, says CEO Mustafa Suleyman, a co-founder of DeepMind. He expects a breakthrough in conversational AI in the next five years, which should open up “a whole new suite of things that we can do in the product space.”

Recommendation

Leave a Comment

Your email address will not be published. Required fields are marked *

Scroll to Top