AI research

For Microsoft’s bGPT, the world is just bytes

Summary Byte instead of token: A new paper from researchers at Microsoft Research Asia, the Central Conservatory of Music, China, and Tsinghua University introduces bGPT, a transformer model that relies on byte prediction instead of classical token prediction. Similar attempts have been made before, but unlike other models, which are usually limited to specific formats …

For Microsoft’s bGPT, the world is just bytes Read More »

New foundation model “Evo” unlocks sequence modeling and design at the genomic scale

Summary A team from TogtherAI and the Arc Institute presents Evo, an AI model for biological research that can interpret DNA, RNA, and proteins and enable generative design at the molecular and genomic level. Developed by a team of experts consisting of Eric Nguyen, Michael Poli, Matthew Durrant, Patrick Hsu and Brian Hie, the model …

New foundation model “Evo” unlocks sequence modeling and design at the genomic scale Read More »

How DeepMind’s Genie AI could reshape robotics by generating interactive worlds from images

Summary Researchers at DeepMind have developed Genie, a model that creates worlds from images and moves video game characters around in them on their own. It sounds like a gimmick, but it could be the basis for something much bigger. “What if, given a large corpus of videos from the Internet, we could not only …

How DeepMind’s Genie AI could reshape robotics by generating interactive worlds from images Read More »

DeepMind has found a simple way to make language models reason better

Summary Logical reasoning is still a major challenge for language models. DeepMind has found a way to support reasoning tasks. A study by Google’s AI division DeepMind shows that the order of the premises in a task has a significant impact on the logical reasoning performance of language models. They work best when the premises …

DeepMind has found a simple way to make language models reason better Read More »

YOLOv9 improves real-time object recognition accuracy with less computation

Summary YOLOv9 sets a new standard for real-time object recognition. It offers greater accuracy with less computation than previous models. YOLO, short for “You Only Look Once,” is an open-source image analysis AI that recognizes objects in real time. The software enables machines to “see” like humans and identify a wide variety of objects in …

YOLOv9 improves real-time object recognition accuracy with less computation Read More »

Montreal’s AI system aims to prevent subway suicides by analyzing passenger behavior

Summary Montreal is currently testing an AI system to prevent suicides on the subway. The software uses video surveillance footage to analyze passenger behavior and sounds an alarm when warning signals are detected. The AI system scans CCTV footage for signs of mental distress among passengers, according to the Société de transport de Montréal (STM), …

Montreal’s AI system aims to prevent subway suicides by analyzing passenger behavior Read More »

Google Deepmind goes open source with Gemini-based Gemma models

Summary Google has introduced Gemma, a new generation of open AI models that builds on the experience of the Gemini models and aims for responsible AI development. Google DeepMind and other Google teams created Gemma to provide developers and researchers around the world with accessible, capable models, the company said. The model comes in two …

Google Deepmind goes open source with Gemini-based Gemma models Read More »

Meta’s Aria smart glasses dataset helps shape the future of AI conversations

Meta has released the MMCSG (Multi-Modal Conversations in Smart Glasses) dataset, featuring two-sided conversations recorded using Aria glasses. The dataset includes multi-channel audio, video, accelerometer, and gyroscope data, and is aimed at supporting research in areas such as automatic speech recognition, activity detection, and speaker diarization. The glasses capture video and audio with seven microphones, …

Meta’s Aria smart glasses dataset helps shape the future of AI conversations Read More »

Meta’s chief AI researcher says OpenAI’s “world simulator” Sora is a dead end

Summary Sora is widely perceived primarily as a text and video-to-video model. However, the real research goal of OpenAI is a world simulator. But according to Yann LeCun, head of Meta’s AI department, Sora is not suited for that. The renowned AI researcher has harsh words for OpenAI’s simulator theory: “Modeling the world for action …

Meta’s chief AI researcher says OpenAI’s “world simulator” Sora is a dead end Read More »

Can LLMs take on the role of human experts in data analysis?

Summary Can we use the large language models as a mechanism for quantitative knowledge retrieval to aid data analysis tasks? A guest post by Kai Spriestersbach. In data science, researchers often face the challenge of working with incomplete data sets. Many established algorithms simply cannot process incomplete data series. Traditionally, data scientists have turned to …

Can LLMs take on the role of human experts in data analysis? Read More »

Scroll to Top