π¨ AI News | TestingCatalog
ΠΡΠΊΡΡΡΡ Π² Telegram
Latest AI News on AI Agents, Model Releases, Tools, Leaks, and Rumors π
ΠΠΎΠ»ΡΡΠ΅Π‘ΡΡΠ°Π½Π° Π½Π΅ ΡΠΊΠ°Π·Π°Π½Π°Π’Π΅Ρ
Π½ΠΎΠ»ΠΎΠ³ΠΈΠΈ ΠΈ ΠΏΡΠΈΠ»ΠΎΠΆΠ΅Π½ΠΈΡ16 330
6 543
ΠΠΎΠ΄ΠΏΠΈΡΡΠΈΠΊΠΈ
+924 ΡΠ°ΡΠ°
+927 Π΄Π½Π΅ΠΉ
+43530 Π΄Π΅Π½Ρ
ΠΡΡ
ΠΈΠ² ΠΏΠΎΡΡΠΎΠ²
NVIDIA π₯: Nemotron 3 Ultra has been released on Huggingface with 5x faster inference and 30% lower costs in comparison to other open models.
> Nemotron-3-Ultra-550B-A55B-NVFP4 is a frontier-scale large language model (LLM) trained by NVIDIA, designed to deliver strong agentic, reasoning, and conversational capabilities.
OPENAI π₯: A new "more capable and scalable system for synthesizing memory" is being rolled out to Plus and Pro users in the US.
> Today, we are launching a significantly more capable and compute-efficient memory architecture built on top of dreaming.
> From the memory summary, you can quickly glean the highlights of what ChatGPT knows about you.
> This update will roll out to additional countries and Free and Go users over the coming weeks.
ANTHROPIC π₯: A new internal research has been published, highlighting an accelerated AI development and a potential path to recursive self-improvement.
> Claude Mythos Preview could work for βat leastβ 16 hours and was βat the upper end of what [METR] can measure.β
> Today, Anthropic engineers on average ship 8x as much code per quarter as they did compared to 2021-2025.
Do you feel it? π
ANTHROPIC π₯: A new "claude-oceanus-v1-p" has been made available to Red Teams.
This appearance may signal an upcoming release of newer Mythos models, referenced earlier by Antropic.
Soon? π
Reve 2.0 is now available, and it landed in second place in the text-to-image arena, outranking Nano Banana 2.
> We invented a new way to generate and edit any image using precise layouts. For the first time, itβs possible to create images you can touch.
> Images are represented as code, so every part of an image becomes addressable, editable, and manipulable.
> Every image in Reve is segmented and labeled, giving you precise control over every region and element.
Microsoft Build 2026 recap, from Windows to Copilot, all AI
Microsoft used Build 2026 to push MAI as the in-house model stack for reasoning, coding, image, voice, and transcription. The key shift is strategic: less reliance on OpenAI, tighter integration across Copilot, Windows, and enterprise agents.
π #microsoftcopilot @testingcatalog
GOOGLE π₯: A new Dreambeans experiment is now available in Google Labs for US-based Google AI Ultra users on the waitlist.
This experiment uses Personal Intelligence to deliver daily stories based on the user's data context.
Not a testing time for the most π
Ideogram announced Ideogram 4.0, a new SOTA open image generation model.
> Ideogram 4.0 lands in the 8th spot on LM Arena and the 5th spot on Design Arena in the text-to-image category, and is getting close to Nano Banana Pro's performance.
> Ideogram 4.0 features dense, accurate text rendering, native 2K resolution, active background transparency, and precise layout control
GOOGLE π₯: A new Gemma 4 12B is now available on Huggingface under Apache 2.0 license!
> Built with the same multimodal functionality as Gemma 4 E2B and E4B (text, audio, image, and video inputs), it brings native audio and vision understanding directly to local environments without the need for separate encoders.
> This unified approach to multimodality makes the model encoder-free, offering a deployment size that is perfect for consumer devices and streamlined local execution.
Perplexity Personal Computer is now available to Max and Enterprise Max users on Windows!
Waitlist below π
ICYMI π: Claude Code CLI can now operate Claude Platform, including the Messages API and Claude Managed Agents.
One CLI to rule them all π€
OpenAI makes its next hardware move with Opal Electronics
OpenAI is backing Opal Electronics to expand from premium webcams into AI-native creative devices, likely focused on vision and voice, as part of its broader push into ambient hardware while its flagship device remains delayed.
π #chatgpt @testingcatalog
Perplexity Computer will soon be able to dynamically split compute power between local models and cloud models!
If that would drive Perplexity Computer costs down, it would be huge, since it is one of the top blockers for many at this moment.
Soon π
HERMES π₯: A new Hermes Desktop app is now available on macOS, Windows, and Linux!
Testing time π
Google tests Planning Mode for NotebookLM Video Overviews
Google is testing a planning mode for NotebookLM Video Overviews that lets users review and edit a draft outline before generation. The feature points to tighter editorial control and a possible shift from Veo to Gemini Omni.
π #notebooklm @testingcatalog
MICROSOFT π₯: A new Copilot super app has been announced!
It arrives with a concept of Autopilots, long-running, always-on agents, with Scout being the first Agent coming out of the box. More Autopilot Agents will be added later.
MICROSOFT π₯: New MAI Code 1 Flash and MAI Thinking 1 models have been revealed on the official MAI website!
Also, MAI Image 2.5, MAI Voice 2, and MAI Transcribe 1.5 are there too.
> MAI-Code-1-Flash plans and reasons through complex coding tasks from start to finish, so you spend less time debugging and more time building.
> MAI-Thinking-1 (35B active, ~1T total parameters, MoE) has a smaller inference footprint than much larger models, yet is competitive with Claude Opus 4.6 on SWE-Bench Pro.
Microsoft β€οΈ OpenClaw
Microsoft is launching the OpenClaw Companion app, a built-in, always-on OpenClaw agent, deeply integrated into the Windows ecosystem.
TinyFish Bigset turns text prompts into live datasets from web
TinyFish launched Bigset, an open-source, self-hosted multi-agent system that turns plain-language prompts into live web datasets. It infers schema, verifies sources, deduplicates rows, supports scheduled refreshes, and exports CSV/XLSX.
π #sponsored @testingcatalog
Π£ΠΆΠ΅ Π΄ΠΎΡΡΡΠΏΠ½ΠΎ! ΠΡΡΠ»Π΅Π΄ΠΎΠ²Π°Π½ΠΈΠ΅ Telegram 2025 β ΠΊΠ»ΡΡΠ΅Π²ΡΠ΅ ΠΈΠ½ΡΠ°ΠΉΡΡ Π³ΠΎΠ΄Π° 
