AI research

MetaDreamer generates 3D models from text in record time

Summary Generative AI for 3D models is making steady progress. The latest system comes from researchers in China and is the fastest yet. Researchers from MetaApp AI Research and several Chinese universities have developed MetaDreamer, a new tool for rapidly creating 3D models from text descriptions. The method is designed to overcome common problems in …

MetaDreamer generates 3D models from text in record time Read More »

Open Empathic project aims to make AI more emotionally intelligent

Summary The nonprofit LAION has launched Open Empathic, an open-source project to equip AI systems with empathy and emotional intelligence. Emotional intelligence is critical to understanding and responding to human emotions in an increasingly AI-driven world, the team writes. The goal of its new project is to “revolutionize” the way AI interacts with humans and …

Open Empathic project aims to make AI more emotionally intelligent Read More »

Meta AI’s Emu Video and Emu Edit generate and edit video using only text

Summary Meta AI introduces Emu Video and Emu Edit for text-based image and video editing. The model is based on the Emu image model. Meta’s new video model can generate four-second videos from text and images. In terms of quality, the researchers say it is superior to commercial offerings such as Runway Gen-2 and Pika …

Meta AI’s Emu Video and Emu Edit generate and edit video using only text Read More »

Google’s Mirasol pushes the boundaries of AI video understanding

Summary Google and Google Deepmind unveil Mirasol, a small AI model that can answer questions about video and set new records. To understand video, AI models need to integrate information from different modalities, such as video, audio, and text. However, today’s AI systems struggle to process diverse data streams and large amounts of data. In …

Google’s Mirasol pushes the boundaries of AI video understanding Read More »

global scientists join forces for AI breakthroughs

Summary The Trillion Parameter Consortium aims to train massive, interdisciplinary AI models for science. Founding members include leading research institutions, national supercomputing centres, and companies. A global consortium of scientists from federal laboratories, research institutes, academia, and industry has come together to advance AI models for scientific discovery – with a particular focus on giant …

global scientists join forces for AI breakthroughs Read More »

GPT-4 Turbo’s best new feature doesn’t work very well

Summary OpenAI’s GPT-4 Turbo can process 16 times more tokens simultaneously than the original GPT-4 in ChatGPT. This feature comes with a big “but” though. GPT-4 Turbo can process up to 100,000 words (128,000 tokens) or 300 pages of a standard book at once. The previous GPT-4 model in ChatGPT could only handle 8,000 tokens, …

GPT-4 Turbo’s best new feature doesn’t work very well Read More »

Microsoft’s XOT improves LLM’s ability to generalize

Summary Microsoft introduces ‘Everything of Thought’, which can integrate external domain knowledge and produce much more reliable reasoning in language models. Complex prompt engineering methods generally aim to make large language models more reliable in their reasoning. From simpler methods such as chain-of-thought prompting to more complex methods such as tree-of-thought prompting, they attempt to …

Microsoft’s XOT improves LLM’s ability to generalize Read More »

Can it improve autonomous driving?

Summary Can OpenAI’s GPT-4 Vision improve autonomous driving? Chinese researchers have put the vision language model on the road, so to speak. If companies like Nvidia have their way, vision-language models like OpenAI’s GPT-4 Vision (GPT-4V) could become a key building block for computer vision in industrial applications, robotics, and autonomous driving in the future. …

Can it improve autonomous driving? Read More »

Scroll to Top