ru
Feedback
Data Science, Machine Learning, AI & IOT

Data Science, Machine Learning, AI & IOT

Открыть в Telegram

Posts from world's largest datascientists community and latest trends learning articles in Machine learning, deep learning, AI, IOT and tools Part of @nuggetsnetwork Instagram: kdnuggets Chat @datasciencechats Admin: @LordAdminBot

Больше

📈 Аналитический обзор Telegram-канала Data Science, Machine Learning, AI & IOT

Канал Data Science, Machine Learning, AI & IOT (@kdnuggets) языкового сегмента Английский является активным участником. Сейчас сообщество объединяет 23 756 подписчиков, занимая 5 676 место в категории Технологии и приложения и 17 865 место в регионе Индия.

📊 Показатели аудитории и динамика

С момента создания невідомо проект демонстрирует стремительный рост, собрав аудиторию из 23 756 подписчиков.

Согласно последним данным от 30 июня, 2026, канал показывает стабильную активность. За последние 30 дней изменение числа участников составило -242, а за последние 24 часа — -1, при этом общий охват остаётся высоким.

  • Статус верификации: Не верифицирован
  • Уровень вовлечённости (ER): Средний показатель вовлечённости аудитории составляет 4.30%. В первые 24 часа после публикации контент обычно набирает 1.47% реакций от общего числа подписчиков.
  • Охват публикаций: В среднем каждый пост получает 1 021 просмотров. В течение первых суток публикация набирает 350 просмотров.
  • Реакции и взаимодействия: Аудитория активно поддерживает контент: среднее количество реакций на один пост — 1.

📝 Описание и контентная политика

Автор описывает ресурс как площадку для выражения субъективного мнения:
Posts from world's largest datascientists community and latest trends learning articles in Machine learning, deep learning, AI, IOT and tools Part of @nuggetsnetwork Instagram: kdnuggets Chat @datasciencechats Admin: @LordAdminBot

Благодаря высокой частоте обновлений (последние данные получены 01 июля, 2026) канал поддерживает актуальность и высокий уровень охвата публикаций. Аналитика показывает, что аудитория активно взаимодействует с контентом, что делает его важной точкой влияния в категории Технологии и приложения.

23 756
Подписчики
-124 часа
-517 дней
-24230 день
Архив постов
🔬 AI Research Digest 📅 Week of Jun 24–Jun 30, 2026 ━━━━━━━━━━━━━━━━━━━━━━━━ 1. 🤖 ARIS: Autonomous Research via Adversarial Multi-Agent Collaboration Authors/Org: Ruofeng Yang, Yongcan Li, Shuai Li (Shanghai Jiao Tong University) | arXiv: 2605.03042 Bottleneck solved: Long-horizon AI research agents that fabricate or silently inherit unsupported claims. A cross-model adversarial setup (executor + reviewer from different model families) enforces evidence-gated claim auditing across 65+ reusable skills, making fully autonomous ML research pipelines reliably self-correcting. 🔗 ARIS – arXiv 2605.03042 ━━━━━━━━━━━━━━━━━━━━━━━━ 2. 🦾 D-VLA: Distributed Async RL for Vision-Language-Action Models Authors/Org: Yucheng Guo, Yongjian Guo, Zhong Guan et al. (Tsinghua / Peking / Tianjin Universities) | arXiv: 2605.13276 Bottleneck solved: RL training throughput for large-scale embodied foundation models is crippled by interference between simulation data and weight updates. D-VLA's "Plane Decoupling" physically isolates high-frequency simulation from low-frequency optimization, enabling high-concurrency training of VLA robots at previously infeasible scale. 🔗 D-VLA – arXiv 2605.13276 ━━━━━━━━━━━━━━━━━━━━━━━━ 3. 🦞 OpenClaw: Local-First Personal AI Assistant Authors/Org: Peter Steinberger / openclaw team | GitHub: openclaw/openclaw Bottleneck solved: AI assistants require cloud infrastructure, exposing data and creating latency for everyday workflows. Running entirely on-device, OpenClaw connects any AI model to 20+ messaging channels (WhatsApp, Telegram, Slack, etc.) without a cloud dependency — now the most-starred active project on GitHub at 375K+ stars. 🔗 openclaw/openclaw – GitHub ━━━━━━━━━━━━━━━━━━━━━━━━ 💡 Stay curious. Read the papers. For More: @kdnuggets @datasciencechats

🤖 AI & Data Science Weekly Digest 📅 Week of Jun 23–Jun 29, 2026 ━━━━━━━━━━━━━━━━━━━━━━━━ 1. 🧠 Databricks Launches Genie One: Agentic Coworker for Every Data Team Databricks unveiled Genie One at the Data + AI Summit 2026 — a fully agentic AI coworker that understands structured and unstructured data, now natively embedded in Microsoft Teams, M365 Copilot, and Excel. Data teams can tag Genie in Teams threads for live lakehouse queries, build low-code apps, and design natural-language pipelines — all governed by Unity Catalog. 🔗 Databricks Launches Genie One ━━━━━━━━━━━━━━━━━━━━━━━━ 2. 💸 GLM-5.2 Beats GPT-5.5 on Coding Benchmarks at 1/6th the Cost Z.ai (formerly Zhipu AI) released GLM-5.2, a 753B-parameter open-weight Mixture-of-Experts model under the MIT license with a 1M-token context window, priced at $1.40/$4.40 per million tokens. It outperforms GPT-5.5 on SWE-bench Pro and MCP-Atlas multi-tool agent benchmarks — making it a compelling drop-in for cost-conscious developer teams running agentic coding workflows. 🔗 GLM-5.2 Beats GPT-5.5 at a Sixth of the Cost – VentureBeat ━━━━━━━━━━━━━━━━━━━━━━━━ 3. 📈 OpenAI Files Confidential IPO S-1 with the SEC OpenAI submitted a confidential S-1 to the SEC, initiating what would be one of the most anticipated IPOs in tech history, following a $852 billion private valuation in March 2026 despite projected annual losses of $14 billion. For developer and data teams, this signals accelerating enterprise adoption pressure and potential shifts in OpenAI's API pricing and product roadmap as it transitions to a public company. 🔗 OpenAI IPO & June 2026 AI News – devFlokers ━━━━━━━━━━━━━━━━━━━━━━━━ 4. ❄️ Snowflake Drops June 2026 AI Pulse: New Data + AI Product Releases Snowflake's June 2026 AI Pulse recap includes a suite of new product launches spanning AI-ready data pipelines, expanded Cortex AI capabilities, and deeper integrations for ML teams working within the Snowflake platform. Engineers and data scientists building on Snowflake can expect faster model serving, native vector search improvements, and new governance tooling for AI workloads. 🔗 Snowflake AI Pulse – June 2026 ━━━━━━━━━━━━━━━━━━━━━━━━ 5. 🔬 Unity Catalog Extends AI Governance Across Clouds, Regions & Accounts Databricks expanded Unity Catalog at DAIS 2026 to govern an organization's entire Databricks footprint — across accounts, regions, and multi-cloud deployments — with SecureConnect enabling zero-copy cross-cloud data sharing. New Domain Marketplace features let data and AI assets be browsed and queried by agents, making Unity Catalog the central governance layer for the emerging agentic enterprise stack. 🔗 What's New with Unity Catalog at Data + AI Summit 2026 ━━━━━━━━━━━━━━━━━━━━━━━━ 💡 Stay ahead. Stay curious. For More: @kdnuggets @datasciencechats

🤖 AI & Data Science Weekly Digest 📅 Week of Jun 16–Jun 22, 2026 ━━━━━━━━━━━━━━━━━━━━━━━━ 1. ⚡ MiniMax M3: 1M-Token Context at 15x Faster Decoding MiniMax released M3, a multimodal model using Sparse Attention that cuts per-token compute to just 1/20th of previous models, delivering 9x faster prefilling and 15x faster decoding at 1M token context lengths. Data teams working on long-document pipelines and RAG systems can now process massive corpora at a fraction of the compute cost. 🔗 MiniMax M3 – LLM News Today ━━━━━━━━━━━━━━━━━━━━━━━━ 2. 🏗️ Databricks Data + AI Summit 2026: Lakehouse Meets Microsoft 365 Azure Databricks launched Genie for Microsoft Teams and M365 Copilot (Beta), letting users tag Genie in Teams threads to get context-aware answers directly from their Unity Catalog-governed lakehouse. Data teams using Databricks can now surface insights without leaving their collaboration tools, reducing friction between analysis and action. 🔗 Azure Databricks at Data + AI Summit 2026 ━━━━━━━━━━━━━━━━━━━━━━━━ 3. 🧠 Microsoft Ships Phi-4-reasoning-vision-15B Open-Weight Model Microsoft released Phi-4-reasoning-vision-15B, a 15B-parameter open-weight multimodal model purpose-built for math and science reasoning with strong computational efficiency. Developers can self-host a capable vision-reasoning model without the cost of frontier APIs, making it practical for on-prem and edge deployments. 🔗 AI Model Releases June 2026 – devFlokers ━━━━━━━━━━━━━━━━━━━━━━━━ 4. 🔧 Qualcomm in $8–10B Talks to Acquire Tenstorrent Qualcomm is in early-stage acquisition talks for Tenstorrent, the RISC-V AI chip startup, in a deal valued between $8 and $10 billion. This signals a major push to compete in the AI accelerator market with open-standard silicon, which could expand hardware options beyond NVIDIA for ML teams building custom inference infrastructure. 🔗 AI News Briefs June 2026 – Radical Data Science ━━━━━━━━━━━━━━━━━━━━━━━━ 5. 📦 Google Cloud Releases Open Knowledge Format (OKF) v0.1 Google Cloud introduced OKF v0.1, an open specification for packaging organizational knowledge as directories of Markdown files with YAML frontmatter, designed to be vendor-neutral and agent-friendly. Software teams building AI agents and RAG pipelines now have a standardized, portable format for bundling internal docs, runbooks, and knowledge bases. 🔗 AI News June 2026 – dentro.de ━━━━━━━━━━━━━━━━━━━━━━━━ 💡 Stay ahead. Stay curious. For More: @kdnuggets @datasciencechats

🔬 AI Research Digest 📅 Week of Jun 9–Jun 16, 2026 ━━━━━━━━━━━━━━━━━━━━━━━━ 1. 🧮 MaxProof: Scaling Mathematical Proof Generation Beyond Human Gold-Medal Authors/Org: Jiacheng Chen et al. | arXiv: 2606.13473 Bottleneck solved: LLMs could not reliably generate and verify competition-level mathematical proofs end-to-end without external scaffolding. MaxProof trains a single M3 model to generate, verify, and repair proofs via generative-verifier RL, then applies population-level test-time scaling — enabling it to score 35/42 on IMO 2025 and 36/42 on USAMO 2026, surpassing the human gold-medal threshold on both. 🔗 MaxProof on arXiv ━━━━━━━━━━━━━━━━━━━━━━━━ 2. 🛠️ nanochat: Train a Full ChatGPT Clone for Under $100 Authors/Org: Andrej Karpathy | GitHub: karpathy/nanochat Bottleneck solved: Full LLM training pipelines (tokenization → pretraining → RLHF → inference) were scattered across multiple large, hard-to-understand codebases. nanochat packs the entire pipeline into a single readable repo — a single --depth flag auto-tunes all hyperparameters, and you can train a GPT-2-level model on 8×H100s for ~$15 on spot instances, making it the definitive hands-on LLM learning resource for practitioners. 🔗 nanochat on GitHub ━━━━━━━━━━━━━━━━━━━━━━━━ 3. 🕸️ LLMs+Graphs: Toward Graph-Native, Synergistic AI Systems Authors/Org: arXiv contributors | arXiv: 2606.11560 Bottleneck solved: LLMs hallucinate and lose factual consistency because their parametric memory lacks structured relational grounding. This survey/position paper argues for making graph computation a first-class citizen in LLM architectures — using knowledge graphs for semantic constraints and retrieval, and LLMs to enrich graph reasoning — pointing toward systems where structured and neural memory work in tandem rather than in isolation. 🔗 LLMs+Graphs on arXiv ━━━━━━━━━━━━━━━━━━━━━━━━ 💡 Stay curious. Read the papers. For More: @kdnuggets @datasciencechats

🤖 AI & Data Science Weekly Digest 📅 Week of Jun 9–Jun 15, 2026 ━━━━━━━━━━━━━━━━━━━━━━━━ 1. 🚨 Anthropic Hit with US Export Controls — Models Suspended Globally The Trump administration imposed export controls on Anthropic following tense exchanges between Dario Amodei and officials, prompting Anthropic to suspend worldwide access to its Fable 5 and Mythos 5 models. Teams relying on Claude APIs for production workloads should assess fallback options and monitor the situation closely, as European leaders have called the episode a "wake-up call" about US AI dependency. 🔗 Anthropic Suspends Access to Fable 5 & Mythos 5 (TechCrunch) ━━━━━━━━━━━━━━━━━━━━━━━━ 2. 🧠 Claude Fable 5 Launches with 95% SWE-bench Verified Score Anthropic released Claude Fable 5 on June 10, achieving a remarkable 95% on SWE-bench Verified and 80% on SWE-bench Pro — setting a new bar for AI coding performance before access was suspended by export controls. For data and engineering teams, Fable 5's benchmark results confirm that frontier models are now genuinely capable of resolving real-world GitHub issues autonomously. 🔗 Claude Fable 5: Review, Benchmarks and Pricing (LLM Stats) ━━━━━━━━━━━━━━━━━━━━━━━━ 3. 🏗️ Microsoft Ships MAI-Thinking-1: A Transparent Frontier Reasoning Model At Build, Microsoft unveiled a family of seven MAI models including MAI-Thinking-1 — a sparse Mixture-of-Experts reasoning model with 35B active parameters (1T total), a 256K context window, and a 109-page technical report trained from scratch on commercially licensed data. For development teams, its transparency and licensing terms make it a compelling alternative to models with restrictive usage policies, and it puts Microsoft in direct competition with OpenAI and Anthropic on frontier reasoning. 🔗 Introducing MAI-Thinking-1 (Microsoft AI) ━━━━━━━━━━━━━━━━━━━━━━━━ 4. 🛠️ OpenAI Expands Codex Beyond Developers to All Business Roles OpenAI added six role-specific plugins connecting Codex to 62 business applications with 110 pre-built skills, plus a new "Codex Sites" feature that builds and deploys internal apps from a prompt — noting that non-developers are now ~20% of Codex users and growing 3x faster than developers. Data teams and analysts can now use Codex as a general work automation platform, not just a code assistant, though teams should establish governance policies to avoid ungoverned tool sprawl. 🔗 Codex for Every Role: Tool & Workflow (OpenAI) ━━━━━━━━━━━━━━━━━━━━━━━━ 5. 📈 GitHub Hit by 14x Commit Surge from AI Agents, Forcing Infrastructure Rewrites GitHub COO Kyle Daigle revealed that AI coding agents have driven commits to ~275 million per week — up 14x — causing outages and forcing rewrites of decade-old infrastructure including a single database handling permissions for 200 million accounts. Open source maintainers are overwhelmed by the volume and uneven quality of AI-generated pull requests, signaling that code review pipelines and governance frameworks need to scale well beyond human-paced contribution assumptions. 🔗 GitHub's AI Commit Surge & Infrastructure Crisis (Latent Space) ━━━━━━━━━━━━━━━━━━━━━━━━ 💡 Stay ahead. Stay curious. For More: @kdnuggets @datasciencechats

🤖 AI & Gaming Digest 📅 June 10, 2026 ━━━━━━━━━━━━━━━━━━━━━━━━ 1. 🕹️ Fable 5 Announcement: AI-Powered Storytelling in a New Fantasy World Microsoft and Playground Games unveiled Fable 5 with a focus on AI-driven narrative systems, dynamic characters, and branching quest logic that adapts to player choices. The announcement highlights how in-game NPCs will use generative AI to personalize dialogue, react to emergent events, and create a more responsive fantasy world. 🔗 Fable 5 announcement and AI narrative update ━━━━━━━━━━━━━━━━━━━━━━━━ 2. 🤖 Claude Integration: Smarter Game Dialogue and Assistants The Fable 5 update also references Claude-powered AI assistants for content design and in-game help. Claude's announcement link shows how the model can be used to generate coherent story beats, write quest summaries, and help world builders scale immersive game text safely. 🔗 Claude announcement for AI game and narrative tools ━━━━━━━━━━━━━━━━━━━━━━━━ 3. 💡 Benefits for AI, Game, and Data Teams Fable 5’s AI-first approach offers major benefits: faster story iteration, more varied player interactions, richer procedural quests, and lower writing overhead. For AI teams, this demonstrates how Claude-like models can be integrated into entertainment pipelines while retaining oversight over tone, consistency, and brand voice. ━━━━━━━━━━━━━━━━━━━━━━━━ 💡 AI games are becoming co-authored experiences. Stay ahead. For More: @kdnuggets @datasciencechats

✅ Native reactions test 🔥👍 This is a temporary test of Telegram's built-in reaction bar. For More: @kdnuggets @datasciencechats

✅ Native reactions test 🔥👍 This is a temporary test of Telegram's built-in reaction bar. For More: @kdnuggets @datasciencechats

🤖 AI & Gaming Digest 📅 June 10, 2026 ━━━━━━━━━━━━━━━━━━━━━━━━ 1. 🕹️ Fable 5 Announcement: AI-Powered Storytelling in a New Fantasy World Microsoft and Playground Games unveiled Fable 5 with a focus on AI-driven narrative systems, dynamic characters, and branching quest logic that adapts to player choices. The announcement highlights how in-game NPCs will use generative AI to personalize dialogue, react to emergent events, and create a more responsive fantasy world. 🔗 Fable 5 announcement and AI narrative update ━━━━━━━━━━━━━━━━━━━━━━━━ 2. 🤖 Claude Integration: Smarter Game Dialogue and Assistants The Fable 5 update also references Claude-powered AI assistants for content design and in-game help. Claude's announcement link shows how the model can be used to generate coherent story beats, write quest summaries, and help world builders scale immersive game text safely. 🔗 Claude announcement for AI game and narrative tools ━━━━━━━━━━━━━━━━━━━━━━━━ 3. 💡 Benefits for AI, Game, and Data Teams Fable 5’s AI-first approach offers major benefits: faster story iteration, more varied player interactions, richer procedural quests, and lower writing overhead. For AI teams, this demonstrates how Claude-like models can be integrated into entertainment pipelines while retaining oversight over tone, consistency, and brand voice. ━━━━━━━━━━━━━━━━━━━━━━━━ 💡 AI games are becoming co-authored experiences. Stay ahead. For More: @kdnuggets @datasciencechats

🔬 AI Research Digest 📅 Week of Jun 3–Jun 9, 2026 ━━━━━━━━━━━━━━━━━━━━━━━━ 1. 🤖 OpenClaw — Local-First Personal AI Assistant Authors/Org: openclaw | GitHub: openclaw/openclaw Bottleneck solved: Removes cloud dependency by running AI entirely on your own devices while connecting to 50+ messaging platforms (Telegram, Slack, WhatsApp, Discord, and more). With 377K+ stars and explosive growth since January 2026, OpenClaw is becoming the go-to local AI gateway — ideal for developers who want privacy-first automation across every chat surface they already use. 🔗 OpenClaw on GitHub ━━━━━━━━━━━━━━━━━━━━━━━━ 2. ⏱️ TimeMaster — Time-Series Reasoning via Reinforcement Learning Authors/Org: Feng Lang et al. | arXiv: 2506.13705 Bottleneck solved: Enables multimodal LLMs to reason accurately over visualized time-series data (ECG, EMG, HAR) using a composite RL reward that balances format, accuracy, and insight quality. A 3B-parameter TimeMaster model beats GPT-4o and Qwen2.5-7B on time-series benchmarks — huge win for data teams working with sensor, financial, or health signal data. 🔗 TimeMaster on arXiv ━━━━━━━━━━━━━━━━━━━━━━━━ 3. 🔄 Bridging Offline and Online RL for LLMs Authors/Org: Jack Lanchantin, Angelica Chen, Janice Lan et al. (Meta) | arXiv: 2506.21495 Bottleneck solved: Clarifies when to use offline vs. online RL fine-tuning for LLMs, showing online/semi-online methods consistently outperform offline across both verifiable math and open-ended instruction following. The key practical finding: multi-tasking with verifiable and non-verifiable rewards jointly boosts performance across both task types — a recipe developers can apply directly to RLHF pipelines. 🔗 Bridging Offline and Online RL for LLMs on arXiv ━━━━━━━━━━━━━━━━━━━━━━━━ 💡 Stay curious. Read the papers. For More: @kdnuggets @datasciencechats

🤖 AI & Data Science Weekly Digest 📅 Week of Jun 2–Jun 8, 2026 ━━━━━━━━━━━━━━━━━━━━━━━━ 1. 🚀 MiniMax M3 Slashes Multimodal Compute by 20x MiniMax M3 is a new multimodal model supporting up to 1 million tokens while cutting per-token compute requirements to just 1/20th of previous models, with 9x faster prefilling and 15x faster decoding at 1M context. For data teams processing long documents, codebases, or large datasets, this makes million-token context practically affordable for production use. 🔗 LLM Stats – AI Model Releases June 2026 ━━━━━━━━━━━━━━━━━━━━━━━━ 2. 💸 Orion-100B: 100B-Parameter Model Trained at $1.25/Hour Orion-100B demonstrated that training a 100-billion-parameter model can now cost as little as $1.25/hour, a dramatic drop that fundamentally changes the economics of large-scale AI development. This opens the door for mid-sized engineering teams and startups to fine-tune or replicate frontier-scale models without enterprise budgets. 🔗 AI News June 2026 – AI Startup Edge ━━━━━━━━━━━━━━━━━━━━━━━━ 3. 🧠 GPT-5.5 Instant, Gemini 3.5 Flash & Claude Opus 4.8 Set New Benchmarks OpenAI, Google, and Anthropic each released updated frontier models this week — GPT-5.5 Instant, Gemini 3.5 Flash, and Claude Opus 4.8 — all pushing new performance ceilings on reasoning, coding, and multimodal tasks. Developers building AI-powered applications should evaluate which model best fits their latency, cost, and capability tradeoffs with these new baselines. 🔗 LLM Updates June 2026 – LLM Stats ━━━━━━━━━━━━━━━━━━━━━━━━ 4. 🔐 Prompt Injection Attacks Officially Classified as CVE Category Prompt injection vulnerabilities have been formally recognized as a CVE category, and AI-generated code CVEs are up nearly 6x compared to 2025 — a signal that AI-assisted development carries real security debt. Software teams should integrate prompt injection testing into their security review pipelines and audit any LLM-integrated endpoints for input sanitization gaps. 🔗 AI News Briefs – Radical Data Science June 2026 ━━━━━━━━━━━━━━━━━━━━━━━━ 5. 📊 Databricks 2026 Data + AI Summit: 30,000 Professionals Descend on SF Databricks announced the full agenda for its 2026 Data + AI Summit, set for June 15–18 at the Moscone Center in San Francisco, with over 30,000 data and AI professionals expected to attend. The summit will cover the latest in data lakehouses, MLOps, real-time AI, and enterprise AI governance — essential viewing for data engineers and ML teams. 🔗 Databricks 2026 Data + AI Summit Announcement ━━━━━━━━━━━━━━━━━━━━━━━━ 💡 Stay ahead. Stay curious. For More: @kdnuggets @datasciencechats

🔬 AI Research Digest 📅 Week of May 27–Jun 2, 2026 ━━━━━━━━━━━━━━━━━━━━━━━━ 1. 📱 MobileGym: Verifiable & Parallel Mobile GUI Agent Simulation Authors/Org: Chinese Academy of Sciences, Peking University, CUHK | arXiv: 2605.26114 Bottleneck solved: Training mobile GUI agents at scale is blocked by slow, non-deterministic simulators with no reliable reward signal. MobileGym runs 256 parallel Android instances in-browser, uses JSON state for bit-exact reproducibility, and ships 416 task templates with sub-millisecond judges — lifting Qwen3-VL-4B real-device pass rate from 32% → 73% with GRPO fine-tuning on a single 3×RTX node. 🔗 arXiv 2605.26114 ━━━━━━━━━━━━━━━━━━━━━━━━ 2. 🦞 AutoResearchClaw: Self-Reinforcing Autonomous Research with Human-AI Collaboration Authors/Org: Aiming Lab | arXiv: 2605.20025 Bottleneck solved: Fully autonomous research pipelines hallucinate results and lack a principled way to incorporate human oversight without defeating the purpose of automation. AutoResearchClaw combines structured multi-agent debate, a self-healing executor with Pivot/Refine loops, and seven human-in-the-loop intervention modes — outperforming AI Scientist v2 by 54.7% on ARC-Bench while preventing fabricated citations via live literature grounding. 🔗 arXiv 2605.20025 ━━━━━━━━━━━━━━━━━━━━━━━━ 3. 🧠 nanochat: Full-Stack LLM Training Pipeline for ~$100 Authors/Org: Andrej Karpathy | GitHub: karpathy/nanochat Bottleneck solved: End-to-end LLM training (pretraining → RLHF → chat UI) has no minimal, hackable reference implementation that a single developer can run affordably. Unlike nanoGPT which stops at pretraining, nanochat covers tokenization, SFT, evaluation, inference, and a ChatGPT-like UI in one dependency-minimal codebase — reaching GPT-2 capability in ~$48 of cloud GPU time. 🔗 github.com/karpathy/nanochat ━━━━━━━━━━━━━━━━━━━━━━━━ 💡 Stay curious. Read the papers. For More: @kdnuggets @datasciencechats

🤖 AI & Data Science Weekly Digest 📅 Week of May 26–June 1, 2026 ━━━━━━━━━━━━━━━━━━━━━━━━ 1. 🧠 Anthropic Releases Claude Opus 4.8 with Faster, Cheaper Inference Anthropic launched Claude Opus 4.8, featuring stronger benchmarks, improved honesty, dynamic workflows in Claude Code, and a fast mode that runs at 2.5× the speed of previous models at one-third the cost. Data and engineering teams get a meaningfully cheaper path to high-capability agentic workflows without sacrificing quality. 🔗 Anthropic Release Notes – May 2026 ━━━━━━━━━━━━━━━━━━━━━━━━ 2. 🔧 Anthropic Acquires Stainless, the Dev-Tools Startup Behind OpenAI and Google SDKs Anthropic acquired Stainless, whose tooling auto-generates idiomatic SDKs and is already used by OpenAI, Google, and Cloudflare to ship client libraries. This gives Anthropic direct control over the developer experience layer, signaling a deeper push to own the full API toolchain. 🔗 Anthropic has acquired the dev tools startup used by OpenAI, Google, and Cloudflare – TechCrunch ━━━━━━━━━━━━━━━━━━━━━━━━ 3. 🤖 Claude Managed Agents Now Support Private MCP Servers and Custom Sandboxes Anthropic's Claude Managed Agents can now run inside a developer-controlled sandbox while connecting to private Model Context Protocol (MCP) servers, with the agent loop remaining on Anthropic's infrastructure. This hybrid architecture lets teams integrate proprietary data and tools into production agents without exposing sensitive resources externally. 🔗 Google IO 2026 and Anthropic advance agentic AI for businesses ━━━━━━━━━━━━━━━━━━━━━━━━ 4. ⚡ AMD EPYC "Venice" Enters Production on TSMC 2nm — First HPC Chip at This Node AMD began production ramp of its 6th Gen EPYC "Venice" processor on TSMC's 2nm process, making it the first high-performance computing product at this fabrication node. Better performance-per-watt and higher transistor density directly benefit data center operators running AI inference and large-scale data pipelines under tight power budgets. 🔗 AMD Announces Production Ramp of Next-Generation AMD EPYC Processor "Venice" on TSMC 2nm ━━━━━━━━━━━━━━━━━━━━━━━━ 5. 💻 OpenAI Codex Reaches 4 Million Active Users Powered by GPT-5.5 OpenAI's Codex coding agent, now running on GPT-5.5, has hit 4 million active users — reflecting rapid enterprise adoption of AI-assisted software development. For developer and data teams, this signals that agentic coding tools are moving from novelty to standard workflow infrastructure. 🔗 OpenAI and Anthropic news from the past week (Week 20, 2026) ━━━━━━━━━━━━━━━━━━━━━━━━ 💡 Stay ahead. Stay curious. For More: @kdnuggets @datasciencechats

🤖 AI & Data Science Weekly Digest 📅 Week of May 25–May 31, 2026 ━━━━━━━━━━━━━━━━━━━━━━━━ 1. ⚡ Google Launches Gemini 3.5 Flash at I/O 2026 Google released Gemini 3.5 Flash — a frontier model that's 4x faster than comparable models and optimized for agentic workflows, coding, and multimodal tasks. Developers can access it via the Gemini API with 1M token context at $1.50/$9 per 1M tokens, and the new Managed Agents feature lets a single API call spin up a full reasoning agent with tool use and code execution. 🔗 Google Introduces Gemini 3.5 Flash at I/O 2026 ━━━━━━━━━━━━━━━━━━━━━━━━ 2. 📢 OpenAI Opens ChatGPT Ad Platform to All Businesses OpenAI launched a self-serve Ads Manager at ads.openai.com, eliminating the previous $50,000 minimum spend and introducing CPC bidding, a Conversions API, and pixel-based tracking. This shifts ChatGPT from a subscription-only product toward a major ad-supported platform — opening new acquisition channels for data teams and developer-focused SaaS companies. 🔗 OpenAI launches self-serve ad platform ━━━━━━━━━━━━━━━━━━━━━━━━ 3. 🧬 PolyU Researchers Store Data in Engineered Proteins The Hong Kong Polytechnic University pioneered a method to store and retrieve digital data using de novo designed unnatural proteins, offering a radically denser and more stable storage medium as AI-generated data volumes explode. For data teams, this signals a new frontier in long-term archival storage that could complement or eventually compete with DNA and tape-based cold storage. 🔗 Data explosion in AI era: PolyU leads breakthroughs in protein-based data storage ━━━━━━━━━━━━━━━━━━━━━━━━ 4. 💡 Light-Based Computing Could Slash AI Energy Costs Penn researchers created a hybrid light-matter particle (polariton) system that dramatically accelerates AI inference while consuming far less energy than silicon-based hardware. This matters for data teams running large-scale model inference, where energy cost and latency are growing operational constraints. 🔗 Breakthroughs in AI Innovations for May 2026 ━━━━━━━━━━━━━━━━━━━━━━━━ 5. 🛡️ Five Nations Release Agentic AI Security Guidance Cybersecurity agencies from the US, UK, Australia, Canada, and New Zealand jointly published guidance on "Careful Adoption of Agentic AI Services," covering risks like prompt injection, over-permissioned agents, and supply chain attacks. Software teams deploying AI agents in production should treat this as a practical security checklist before rollout. 🔗 7 Explosive AI Updates in May 2026 ━━━━━━━━━━━━━━━━━━━━━━━━ 💡 Stay ahead. Stay curious. For More: @kdnuggets @datasciencechats

🤖 *AI & Data Science Weekly Digest* 📅 Week of May 25–May 31, 2026 ━━━━━━━━━━━━━━━━━━━━━━━━ **1. ⚡ Google Launches Gemini 3.5 Flash at I/O 2026** Google released Gemini 3.5 Flash — a frontier model that's 4x faster than comparable models and optimized for agentic workflows, coding, and multimodal tasks. Developers can access it via the Gemini API with 1M token context at $1.50/$9 per 1M tokens, and the new Managed Agents feature lets a single API call spin up a full reasoning agent with tool use and code execution. 🔗 [Google Introduces Gemini 3.5 Flash at I/O 2026](https://www.marktechpost.com/2026/05/20/google-introduces-gemini-3-5-flash-at-i-o-2026-a-faster-and-cheaper-model-for-ai-agents-and-coding/) ━━━━━━━━━━━━━━━━━━━━━━━━ **2. 📢 OpenAI Opens ChatGPT Ad Platform to All Businesses** OpenAI launched a self-serve Ads Manager at ads.openai.com, eliminating the previous $50,000 minimum spend and introducing CPC bidding, a Conversions API, and pixel-based tracking. This shifts ChatGPT from a subscription-only product toward a major ad-supported platform — opening new acquisition channels for data teams and developer-focused SaaS companies. 🔗 [OpenAI launches self-serve ad platform](https://www.axios.com/2026/05/05/openai-self-serve-ad-platform) ━━━━━━━━━━━━━━━━━━━━━━━━ **3. 🧬 PolyU Researchers Store Data in Engineered Proteins** The Hong Kong Polytechnic University pioneered a method to store and retrieve digital data using de novo designed unnatural proteins, offering a radically denser and more stable storage medium as AI-generated data volumes explode. For data teams, this signals a new frontier in long-term archival storage that could complement or eventually compete with DNA and tape-based cold storage. 🔗 [Data explosion in AI era: PolyU leads breakthroughs in protein-based data storage](https://www.eurekalert.org/news-releases/1129790) ━━━━━━━━━━━━━━━━━━━━━━━━ **4. 💡 Light-Based Computing Could Slash AI Energy Costs** Penn researchers created a hybrid light-matter particle (polariton) system that dramatically accelerates AI inference while consuming far less energy than silicon-based hardware. This matters for data teams running large-scale model inference, where energy cost and latency are growing operational constraints. 🔗 [Breakthroughs in AI Innovations for May 2026](https://www.aiandnews.com/blog/latest-ai-news-may-2026-3/) ━━━━━━━━━━━━━━━━━━━━━━━━ **5. 🛡️ Five Nations Release Agentic AI Security Guidance** Cybersecurity agencies from the US, UK, Australia, Canada, and New Zealand jointly published guidance on "Careful Adoption of Agentic AI Services," covering risks like prompt injection, over-permissioned agents, and supply chain attacks. Software teams deploying AI agents in production should treat this as a practical security checklist before rollout. 🔗 [7 Explosive AI Updates in May 2026](https://imfounder.com/science-tech/ai/ai-updates-may-2026/) ━━━━━━━━━━━━━━━━━━━━━━━━ 💡 *Stay ahead. Stay curious.* For More: @kdnuggets @datasciencechats

🤖 *AI & Data Science Weekly Digest* 📅 Week of May 25–May 31, 2026 ━━━━━━━━━━━━━━━━━━━━━━━━ **1. ⚡ Google Launches Gemini 3.5 Flash at I/O 2026** Google released Gemini 3.5 Flash — a frontier model that's 4x faster than comparable models and optimized for agentic workflows, coding, and multimodal tasks. Developers can access it via the Gemini API with 1M token context at $1.50/$9 per 1M tokens, and the new Managed Agents feature lets a single API call spin up a full reasoning agent with tool use and code execution. 🔗 [Google Introduces Gemini 3.5 Flash at I/O 2026](https://www.marktechpost.com/2026/05/20/google-introduces-gemini-3-5-flash-at-i-o-2026-a-faster-and-cheaper-model-for-ai-agents-and-coding/) ━━━━━━━━━━━━━━━━━━━━━━━━ **2. 📢 OpenAI Opens ChatGPT Ad Platform to All Businesses** OpenAI launched a self-serve Ads Manager at ads.openai.com, eliminating the previous $50,000 minimum spend and introducing CPC bidding, a Conversions API, and pixel-based tracking. This shifts ChatGPT from a subscription-only product toward a major ad-supported platform — opening new acquisition channels for data teams and developer-focused SaaS companies. 🔗 [OpenAI launches self-serve ad platform](https://www.axios.com/2026/05/05/openai-self-serve-ad-platform) ━━━━━━━━━━━━━━━━━━━━━━━━ **3. 🧬 PolyU Researchers Store Data in Engineered Proteins** The Hong Kong Polytechnic University pioneered a method to store and retrieve digital data using de novo designed unnatural proteins, offering a radically denser and more stable storage medium as AI-generated data volumes explode. For data teams, this signals a new frontier in long-term archival storage that could complement or eventually compete with DNA and tape-based cold storage. 🔗 [Data explosion in AI era: PolyU leads breakthroughs in protein-based data storage](https://www.eurekalert.org/news-releases/1129790) ━━━━━━━━━━━━━━━━━━━━━━━━ **4. 💡 Light-Based Computing Could Slash AI Energy Costs** Penn researchers created a hybrid light-matter particle (polariton) system that dramatically accelerates AI inference while consuming far less energy than silicon-based hardware. This matters for data teams running large-scale model inference, where energy cost and latency are growing operational constraints. 🔗 [Breakthroughs in AI Innovations for May 2026](https://www.aiandnews.com/blog/latest-ai-news-may-2026-3/) ━━━━━━━━━━━━━━━━━━━━━━━━ **5. 🛡️ Five Nations Release Agentic AI Security Guidance** Cybersecurity agencies from the US, UK, Australia, Canada, and New Zealand jointly published guidance on "Careful Adoption of Agentic AI Services," covering risks like prompt injection, over-permissioned agents, and supply chain attacks. Software teams deploying AI agents in production should treat this as a practical security checklist before rollout. 🔗 [7 Explosive AI Updates in May 2026](https://imfounder.com/science-tech/ai/ai-updates-may-2026/) ━━━━━━━━━━━━━━━━━━━━━━━━ 💡 *Stay ahead. Stay curious.* For More: @kdnuggets @datasciencechats

Sample PNG image
Sample PNG image

Direct PNG file post @beingman
Direct PNG file post @beingman

<b>GPT-5.5 Update</b> Introducing GPT-5.5 — our smartest model yet, faster and built for complex tasks. Tap the button below to like this post. For More: @kdnuggets @datasciencechats

<b>📰 KDnuggets Update</b> <i>Google is expanding the Gemini 3 model family with the release of Gemini 3 Flash, which offers frontier intelligence built for speed at a fraction of the cost.</i> 🔗 Read more on the blog For More: @kdnuggets @datasciencechats