🚨 AI News | TestingCatalog

Latest AI News on AI Agents, Model Releases, Tools, Leaks, and Rumors 🗞

Posts Archive
GOOGLE 🔥: A new Troubleshooting mode has been spotted on Gemini. In this mode, Gemini explains the troubleshooting process via text responses and interactive widgets. Even though it is working and available, it still looks like an unintended release and might get reverted soon. Models for Troubleshooting 👀 h/t @AntiIkindaloveGemini

NVIDIA 🔥: Nemotron 3 Ultra has been released on Huggingface with 5x faster inference and 30% lower costs compared with other open models.
> Nemotron-3-Ultra-550B-A55B-NVFP4 is a frontier-scale large language model (LLM) trained by NVIDIA, designed to deliver strong agentic, reasoning, and conversational capabilities.

OPENAI 🔥: A new "more capable and scalable system for synthesizing memory" is being rolled out to Plus and Pro users in the US.
> Today, we are launching a significantly more capable and compute-efficient memory architecture built on top of dreaming.
> From the memory summary, you can quickly glean the highlights of what ChatGPT knows about you.
> This update will roll out to additional countries and Free and Go users over the coming weeks.

ANTHROPIC 🔥: New internal research has been published, highlighting accelerated AI development and a potential path to recursive self-improvement.
> Claude Mythos Preview could work for “at least” 16 hours and was “at the upper end of what [METR] can measure.”
> Today, Anthropic engineers on average ship 8x as much code per quarter as they did compared to 2021-2025.
Do you feel it? 👀

ANTHROPIC 🔥: A new "claude-oceanus-v1-p" has been made available to Red Teams. This appearance may signal an upcoming release of newer Mythos models, referenced earlier by Anthropic. Soon? 👀

Reve 2.0 is now available, and it landed in second place in the text-to-image arena, outranking Nano Banana 2.
> We invented a new way to generate and edit any image using precise layouts. For the first time, it’s possible to create images you can touch.
> Images are represented as code, so every part of an image becomes addressable, editable, and manipulable.
> Every image in Reve is segmented and labeled, giving you precise control over every region and element.

Microsoft Build 2026 recap, from Windows to Copilot, all AI
Microsoft used Build 2026 to push MAI as the in-house model stack for reasoning, coding, image, voice, and transcription. The key shift is strategic: less reliance on OpenAI, tighter integration across Copilot, Windows, and enterprise agents. 🗞 #microsoftcopilot @testingcatalog

GOOGLE 🔥: A new Dreambeans experiment is now available in Google Labs for US-based Google AI Ultra users on the waitlist. This experiment uses Personal Intelligence to deliver daily stories based on the user's data context. Not a testing time for most 👀

Ideogram announced Ideogram 4.0, a new SOTA open image generation model.
> Ideogram 4.0 lands in the 8th spot on LM Arena and the 5th spot on Design Arena in the text-to-image category, and is getting close to Nano Banana Pro's performance.
> Ideogram 4.0 features dense, accurate text rendering, native 2K resolution, active background transparency, and precise layout control.

GOOGLE 🔥: A new Gemma 4 12B is now available on Huggingface under Apache 2.0 license!
> Built with the same multimodal functionality as Gemma 4 E2B and E4B (text, audio, image, and video inputs), it brings native audio and vision understanding directly to local environments without the need for separate encoders.
> This unified approach to multimodality makes the model encoder-free, offering a deployment size that is perfect for consumer devices and streamlined local execution.

Perplexity Personal Computer is now available to Max and Enterprise Max users on Windows! Waitlist below 👀

ICYMI 👀: Claude Code CLI can now operate Claude Platform, including the Messages API and Claude Managed Agents. One CLI to rule them all 🤖

OpenAI makes its next hardware move with Opal Electronics
OpenAI is backing Opal Electronics to expand from premium webcams into AI-native creative devices, likely focused on vision and voice, as part of its broader push into ambient hardware while its flagship device remains delayed. 🗞 #chatgpt @testingcatalog

Perplexity Computer will soon be able to dynamically split compute between local models and cloud models! If that drives Perplexity Computer costs down, it would be huge, since cost is one of the top blockers for many users right now. Soon 👀

HERMES 🔥: A new Hermes Desktop app is now available on macOS, Windows, and Linux! Testing time 👀

Google tests Planning Mode for NotebookLM Video Overviews
Google is testing a planning mode for NotebookLM Video Overviews that lets users review and edit a draft outline before generation. The feature points to tighter editorial control and a possible shift from Veo to Gemini Omni. 🗞 #notebooklm @testingcatalog

MICROSOFT 🔥: A new Copilot super app has been announced! It introduces Autopilots: long-running, always-on agents. Scout is the first Autopilot agent available out of the box, and more will be added later.

MAI Thinking 1 Benchmarks 👀

MICROSOFT 🔥: New MAI Code 1 Flash and MAI Thinking 1 models have been revealed on the official MAI website! MAI Image 2.5, MAI Voice 2, and MAI Transcribe 1.5 are listed there too.
> MAI-Code-1-Flash plans and reasons through complex coding tasks from start to finish, so you spend less time debugging and more time building.
> MAI-Thinking-1 (35B active, ~1T total parameters, MoE) has a smaller inference footprint than much larger models, yet is competitive with Claude Opus 4.6 on SWE-Bench Pro.

Microsoft ❤️ OpenClaw
Microsoft is launching the OpenClaw Companion app, a built-in, always-on OpenClaw agent, deeply integrated into the Windows ecosystem.