uk
Feedback
🚨 AI News | TestingCatalog

🚨 AI News | TestingCatalog

Відкрити в Telegram

Latest AI News on AI Agents, Model Releases, Tools, Leaks, and Rumors 🗞

Показати більше
Країна не вказанаТехнології та додатки15 644
6 887
Підписники
+324 години
+977 днів
+47330 день
Архів дописів
ClickUp rolls out Brain² AI with deep workspace context ClickUp relaunched Brain² as a context-aware AI coworker that acts across workspace data, picks models per task, creates deliverables and agents, cites sources, and supports secure, permission-aware work management. 🗞 #sponsored @testingcatalog

Anthropic launched Claude Tag for Team and Enterprise users. Claude Tag works in Slack and can tackle more complex tasks, bre
Anthropic launched Claude Tag for Team and Enterprise users. Claude Tag works in Slack and can tackle more complex tasks, break them down into smaller milestones, and integrate with connected tools. A new AI coworker 👀

Mistral AI launched OCR 4 👀 > Win rates averaging 72%, alongside the top overall score on OlmOCRBench (85.20). > Alongside t
Mistral AI launched OCR 4 👀 > Win rates averaging 72%, alongside the top overall score on OlmOCRBench (85.20). > Alongside the extracted text, OCR 4 returns bounding boxes, typed-block classification, and inline confidence scores. > OCR 4 is an ingestion component of Search Toolkit, Mistral's open-source, composable search framework. > Support for 170 languages across 10 language groups. > OCR 4 is compact enough to run in a single container.

Latitude launches open-source platform to monitor AI agents Latitude released an MIT-licensed open-source platform for monitoring AI agents in production, clustering live conversations, detecting recurring failures, creating tests from real sessions, and routing fixes into developer tools. 🗞 #sponsored @testingcatalog

OPENAI 🔥: Bidi 1, an upcoming voice model from OpenAI, can sing and generate different sounds too. A rap sample 👀

OPENAI 🔥: An upcoming Bidi 1 voice model will be able to translate in real-time! This will unlock a huge pile of use cases to be built on top of when it lands on the APIs.

OpenAI prepares bidirectional voice mode for rollout on ChatGPT OpenAI is testing Bidi 1, a bidirectional voice model for ChatGPT that can listen and speak at once, handle interruptions, retain context, and reduce unwanted cut-ins. A wider web and mobile rollout may begin soon. 🗞 #chatgpt @testingcatalog

BREAKING 🔥: First tests of "Bidi 1", an upcoming bidirectional voice model from OpenAI. This upgrade will arrive in ChatGPT and, potentially, in Codex soon as well. > Bidi 1 can speak over while you are talking and keep listening. > Bidi 1 can switch between tasks back and force mid-sentence. > Bidi 1 is much better at handling interruptions and pauses. > Bidi 1 can better keep and memorize the context while you speak. There is still a cap on how long it can keep speaking, which is expected, but it easily counted to 23 without pausing. * Bidi 1 is not available yet, but given all the recent preparations, we will get it very, very soon.

BYTEDANCE 🔥: Seedance 2.5 has been officially announced, along with an updated Seedance 2.0. - Seedance 2.0 now supports 4k output - Seedance 2.5 will be able to generate 30-second videos in one go - ByteDance also announced a new AI copyright commercialization platform This video ad is stunning 👀

BREAKING 🔥: OpenAI is preparing "Bidi 1" for the upcoming web release! > A new voice model will be available in settings, al
BREAKING 🔥: OpenAI is preparing "Bidi 1" for the upcoming web release! > A new voice model will be available in settings, alongside standard and advanced options. > Voice mode bubble will have a Yellow color instead of blue. How soon? 👀

Anthropic prepares Cowork support for mobile apps Anthropic’s iOS app hints at Cowork shifting from desktop-tethered Dispatch to cloud and web, enabling mobile task runs and a unified scheduled-actions view. App code also points to a selectable voice model, suggesting a coming refresh. 🗞 #claude @testingcatalog

Google tests literature review matrix tool for NotebookLM Google is developing a NotebookLM “Lit Review” artifact that turns uploaded sources into a literature review matrix. Aimed at research-heavy reading, it may connect with Play Books, but launch timing and citation reliability remain unclear. 🗞 #notebooklm @testingcatalog

Flashcards are now editable on NotebookLM 👀 Users can adjust the text of questions and answers, plus add new cards to the st
Flashcards are now editable on NotebookLM 👀 Users can adjust the text of questions and answers, plus add new cards to the stack. FlashcardLM ⚡️

OpenAI launches new security tools and updates GPT-5.5-Cyber OpenAI expands Daybreak from bug discovery to patch delivery with Codex Security, limited GPT-5.5-Cyber access, partner distribution, and Patch the Planet for open-source projects, targeting validated fixes across enterprise, government, and OSS. 🗞 #chatgpt @testingcatalog

Sakana AI releases Fugu Ultra system to rival top AI labs Sakana AI launched Fugu Ultra, a public OpenAI-compatible orchestration model for complex engineering, research, cybersecurity, and data analysis. It rivals top models while reducing vendor dependence and export-control exposure. 🗞 #ai @testingcatalog

OpenAI announces GPT-5.5-Cyber (new) model update, which scores 85.6% on CyberGym benchmark in comparison to 81.9% in its ear
OpenAI announces GPT-5.5-Cyber (new) model update, which scores 85.6% on CyberGym benchmark in comparison to 81.9% in its early version. Codex got a new Security plugin too 👀

ANTHROPIC 🔥: Claude for mobile is getting Cowork support soon! Users will be able to trigger Cowork tasks on mobile and view
+1
ANTHROPIC 🔥: Claude for mobile is getting Cowork support soon! Users will be able to trigger Cowork tasks on mobile and view scheduled tasks in the app. > Keep Cowork going when you are on the go > Start and steer tasks directly from your phone > Check in from your phone, browser, or Claude desktop app > Work continues in the background, even when you close the app h/t DevMode

BREAKING 🔥: Sakana AI announced the Sakana Fugu and Sakana Fugu Ultra systems, which perform on par with Claude Fable 5 and
BREAKING 🔥: Sakana AI announced the Sakana Fugu and Sakana Fugu Ultra systems, which perform on par with Claude Fable 5 and Mythos 5 across many benchmarks. > Sakana AI is an AI lab from Japan, and Fugu is an orchestration model trained to operate other LLMs. > It is available as an API but not yet accessible in the EEA region. That's a natural evolution. Orchestration multi-model systems will outperform single-model systems, and they will become much more accessible for smaller labs and companies to build. Big players will have to consider building orchestrating systems that rely on models built by competitors. It is already happening at Meta, Apple, and Microsoft, and will likely catch Google, Anthropic, and OpenAI as well eventually.

Which AI labs are you rooting for?
Anonymous voting

ICYMI 👀: Cursor got a new /automate Skill Automation your toil got insanely simpler over the past few years with AI. Even Automation is Automated now 🤖