Architecture Weekly
The Architecture Weekly newsletter originated at https://blog.vvsevolodovich.dev. ~10 articles or videos on solution architecture and system design every week!
Subscribers: 2 986 (-1 in 24 hours, +1 in 7 days, +4 in 30 days)
ScyllaDB Clusters at Discord, Zero Trust for AI Agents, CloudFlare Data Platform and many more in Architecture Weekly #198
https://www.youtube.com/watch?v=71AXRNKlg5c
Zero Trust for AI Agents 🍼
Every new technology carries security risks, and agentic systems are no exception. Familiar vulnerabilities such as supply chain attacks and excessive permissions remain, and new ones arrive: direct and indirect prompt injection, RAG poisoning and others. Follow the Anthropic guide on applying Zero Trust to your agentic systems.
#security #ai
Migrating Data Ingestion Systems at Meta Scale 👨💼
Big migrations are scary and effortful. Imagine one involving the biggest MySQL deployment in the world, like the one behind Meta’s data ingestion system.
The team ran both legacy and new systems in a shadow testing mode, diffing outputs for correctness so individual pipelines could be validated and migrated independently; partition-level metadata flags automatically halted new delta landings and forced merges with known-good partitions whenever a partition was flagged as bad, bounding data quality risk during the transition.
At @boltapp we followed the same shadow testing approach during our migration from MySQL to TitaniumDB.
#db #architecture #distributed #engineering
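The shadow testing idea described above can be sketched in a few lines. This is a minimal illustration, not Meta's or Bolt's implementation: `shadow_compare`, `legacy_pipeline` and `new_pipeline` are hypothetical names, and the pipelines are toy functions. The legacy output stays the source of truth while the candidate runs on the same input and any difference is recorded.

```python
from dataclasses import dataclass, field
from typing import Callable, Iterable


@dataclass
class ShadowResult:
    matched: int = 0
    # Each mismatch is (input, legacy_output, candidate_output_or_error).
    mismatches: list = field(default_factory=list)


def shadow_compare(inputs: Iterable[dict],
                   legacy: Callable[[dict], dict],
                   candidate: Callable[[dict], dict]) -> ShadowResult:
    """Run both systems on the same inputs and diff their outputs."""
    result = ShadowResult()
    for item in inputs:
        expected = legacy(item)  # legacy output remains the source of truth
        try:
            actual = candidate(item)
        except Exception as exc:  # a crash in the candidate counts as a mismatch
            result.mismatches.append((item, expected, repr(exc)))
            continue
        if actual == expected:
            result.matched += 1
        else:
            result.mismatches.append((item, expected, actual))
    return result


# Hypothetical pipelines: the new one mishandles negative quantities.
def legacy_pipeline(row: dict) -> dict:
    return {"id": row["id"], "total": row["qty"] * row["price"]}


def new_pipeline(row: dict) -> dict:
    return {"id": row["id"], "total": abs(row["qty"]) * row["price"]}


if __name__ == "__main__":
    rows = [{"id": 1, "qty": 2, "price": 5}, {"id": 2, "qty": -1, "price": 3}]
    report = shadow_compare(rows, legacy_pipeline, new_pipeline)
    print(report.matched, len(report.mismatches))
```

A pipeline would only be cut over once its mismatch list stays empty over a representative window of traffic.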
How Discord Automates ScyllaDB Clusters at Scale 🤓
A small infrastructure team operating 20+ ScyllaDB clusters with nearly 500 nodes cannot afford purely manual runbooks for cluster-wide operations. Discord’s Scylla Control Plane (SCP) encodes operations — rolling OS upgrades, cluster expansion, shadow cluster provisioning, node recovery — as YAML-defined workflows with explicit retry counts, parallelism controls, and abort-on-failure semantics; idempotency is a hard requirement for every task so that any retry is safe. Shadow clusters — temporary production replicas receiving real traffic — let the team validate new ScyllaDB versions before they touch live data; SCP automates the full shadow cluster lifecycle, cutting what once required more than a day of continuous engineer attention to largely unattended runs.
#db #distributed #architecture #engineering
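To make the "retry counts, abort-on-failure, idempotent tasks" combination concrete, here is a small sketch. It is not Discord's SCP and does not parse YAML: `Task`, `run_workflow`, `drain_node` and `make_flaky_upgrade` are hypothetical names, and parallelism controls are left out. The point is that a task which sets a desired state (rather than toggling it) can be retried safely.

```python
from dataclasses import dataclass
from typing import Callable


class WorkflowAborted(Exception):
    """Raised when a task exhausts its retries; later tasks never run."""


@dataclass
class Task:
    name: str
    run: Callable[[dict], None]  # must be idempotent so that a retry is safe
    retries: int = 2             # extra attempts after the first one


def run_workflow(tasks: list, state: dict) -> list:
    """Run tasks in order, retrying each one, and abort on the first hard failure."""
    completed = []
    for task in tasks:
        attempts = task.retries + 1
        for attempt in range(1, attempts + 1):
            try:
                task.run(state)
                completed.append(task.name)
                break
            except Exception as exc:
                if attempt == attempts:
                    raise WorkflowAborted(
                        f"{task.name} failed after {attempts} attempts"
                    ) from exc
    return completed


def drain_node(state: dict) -> None:
    # Idempotent: sets the desired state instead of flipping a flag.
    state["drained"] = True


def make_flaky_upgrade(failures: int) -> Callable[[dict], None]:
    """Build an upgrade step that fails `failures` times before succeeding."""
    remaining = {"n": failures}

    def upgrade(state: dict) -> None:
        if remaining["n"] > 0:
            remaining["n"] -= 1
            raise RuntimeError("transient failure")
        state["os_version"] = "new"

    return upgrade


if __name__ == "__main__":
    state = {}
    steps = [Task("drain", drain_node), Task("upgrade", make_flaky_upgrade(2))]
    print(run_workflow(steps, state), state)
```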
Decided to give video versions of Architecture Weekly another shot. Check it out and leave me a comment if you like this format!
https://youtu.be/oSkLhBRqWIY
What's Easy Now? What's Hard Now? 👨💼
AI coding agents are fundamentally feedback loops built around LLMs, and the quality of available feedback determines where they succeed or fail. Tasks with fast, clear, objective feedback, like building a UI, are much easier for agents than tasks where feedback is delayed, silent, or subjective: writing concurrent code (where bugs manifest as silent data corruption at runtime), or making architecture decisions (where feedback is inherently contextual and often never arrives). Brooker frames this as the most important axis for evaluating agent capability: whether the problem domain supports tight feedback loops.
#ai #softwareengineering #engineering #llm
5 Ways to make CDC with Postgres
A short article that covers not only the CDC blueprint from Pinterest but also 5 different ways Change Data Capture can be implemented in PostgreSQL. I knew only half of them!
#db #cdc
RAG in Production
We faced the problem of choosing data sources for our AI agents, and naturally we started weighing retrieval-augmented generation. It turns out RAG is surprisingly complex: the tokenization process, committing to a model, reindexing cost and much more. Grab a great article on the topic.
Your AI wants to nuke your database 👷♂️
An AI deleting your production database is no longer just a nightmare: it is a reality that happened to PocketOS, which runs its systems on Railway. To be fair to the latter company, they learned from the incident and implemented a 48-hour window for soft deletion and backups for backups. Learn the full story in the article.
#ai #resilience
Structured prompt driven development 👷♂️
Everybody who hasn’t spent the last 2 years in a cave has come up with their own way of working in the era of AI for software development. Thoughtworks is no exception. Their core idea is that a prompt should become a first-class citizen, just like code: saved, versioned and reviewed. Based on this idea they developed a structured prompt process and tools, and they show how to implement a feature with it. To my taste the approach is a bit naive, because creating holistic, complete and correct test cases is much more complicated than “make me test cases, avoid duplicates” as shown in the article, but it is an interesting approach anyway.
#softwaredevelopment
Harness Design for Long Running Apps 👷♂️
A long-running AI agent can fail in several ways: building the wrong thing, building it incorrectly, or building it with suboptimal quality. Anthropic experimented with long-running agents extensively and converged on a Planner - Executor - Reviewer architecture. Feel free to steal the approach.
#ai #aiagents
Practical Lessons From the Claude Code Leak 🍼
While Axios was compromised on purpose, the Claude Code sources leaked accidentally as source maps when an update was published. It is a great chance, though, to learn best practices on CLAUDE.md, multi-agent orchestration, permissions and much more.
#security #ai
Why fakes beat mocks and testcontainers 👷♂️
Mocks and Testcontainers are the two tools most developers reach for. Both have fundamental limitations. Testcontainers are binary: a dependency is either up or down, so they fail to provide partial failure modes. Mocks test implementation rather than results. Fakes fix both issues: they are in-memory implementations that let you test results while also simulating partial failures. Grab the best practice guide!
#testcontainers #qa
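A fake with partial failures might look like the sketch below. It is an illustration under assumptions, not code from the linked guide: `FakeBlobStore`, `StorageError`, `fail_on` and `upload_all` are hypothetical names. The fake keeps real in-memory state, so the test asserts on results, and it can be told to fail for one key only, which is the partial-failure case.

```python
class StorageError(Exception):
    pass


class FakeBlobStore:
    """In-memory stand-in for a blob store, with per-key failure injection."""

    def __init__(self):
        self._blobs = {}
        self._failing_keys = set()

    def fail_on(self, key: str) -> None:
        # Partial failure: only writes to this key will raise.
        self._failing_keys.add(key)

    def put(self, key: str, data: bytes) -> None:
        if key in self._failing_keys:
            raise StorageError(f"injected failure for {key}")
        self._blobs[key] = data

    def get(self, key: str) -> bytes:
        return self._blobs[key]

    def keys(self) -> list:
        return sorted(self._blobs)


def upload_all(store, files: dict) -> list:
    """Code under test: upload every file and return the keys that failed."""
    failed = []
    for key, data in files.items():
        try:
            store.put(key, data)
        except StorageError:
            failed.append(key)
    return failed


if __name__ == "__main__":
    store = FakeBlobStore()
    store.fail_on("b.txt")
    print(upload_all(store, {"a.txt": b"1", "b.txt": b"2"}), store.keys())
```

The test checks what ended up in the store and what was reported as failed, rather than which methods were called in which order.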
Compromised Axios 👷♂️
Another week, another compromised npm package. Axios - the most popular js library for HTTP requests - got infected with a malicious dependency via… right, social engineering. Make sure to check out the infection signals and apply all the remediation and prevention steps from the article!
#security
Redis Cluster 👷♂️
Redis is single-threaded and, despite that, can handle hundreds of thousands of requests per second while guaranteeing atomicity. This is enough for the majority of systems, but not for Stripe. They moved from a single hot node to a 10-node cluster and explained how Redis Cluster operates in this well-written article.
#redis #performance
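As background on how Redis Cluster spreads keys across nodes, the sketch below computes a key's hash slot the way the Redis Cluster specification describes it: CRC16 (XMODEM variant) of the key modulo 16384, hashing only the hash tag between the first `{` and the next `}` when that tag is non-empty. It is not taken from the article, `key_slot` is a hypothetical helper name, and the assignment of slots to the 10 nodes is not modeled.

```python
import binascii

HASH_SLOTS = 16384  # fixed number of hash slots in Redis Cluster


def key_slot(key: bytes) -> int:
    """Return the cluster hash slot for a key, honoring {hash tags}."""
    start = key.find(b"{")
    if start != -1:
        end = key.find(b"}", start + 1)
        # Use the tag only if a closing brace exists and the tag is non-empty.
        if end != -1 and end != start + 1:
            key = key[start + 1:end]
    # binascii.crc_hqx with an initial value of 0 is CRC16/XMODEM.
    return binascii.crc_hqx(key, 0) % HASH_SLOTS


if __name__ == "__main__":
    # Keys sharing a hash tag land in the same slot, so multi-key
    # operations on them can be served by a single node.
    print(key_slot(b"{user:42}:cart"), key_slot(b"{user:42}:profile"))
    print(key_slot(b"user:42:cart"))
```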
First of all, writing code was never the bottleneck: product understanding, architecture and quality control were. Second, AI will generate the code, but why has the engineer not reviewed it? How did it pass peer review with such grave architecture problems? Those are my questions to the article, but it is good anyway: whatever entropy you have in your codebase, AI will only amplify it, not solve it.
https://ctosub.com/p/the-ctos-entropy-war
At Supplied we're using GitHub Actions, and it is mostly fine... until it isn't. Slow builds, tricky YAML, an awful log explorer and the (un)availability all start to bite at once. Apparently, we're not the only ones experiencing these troubles. Ian Duncan digs into each issue and actually suggests a couple of alternatives. #ci
https://www.iankduncan.com/engineering/2026-02-05-github-actions-killing-your-team
A year ago I honestly had no idea what AI agents were. I believe a lot of people are still confused about AI-powered workflows and AI agents. That's why I wrote an article explaining when an LLM becomes truly agentic and what the foundational building blocks of an AI agent are. Grab a read, leave a like and of course share!
https://softwarearchitectureweekly.substack.com/p/the-ai-agent-blueprint
Time at a startup in 2025 flies 10 times faster: what previously took a month is now achieved in 2 days. At Supplied we went from CSV file upload as the single data ingestion mechanism to an enterprise-grade system in no time. Find our journey of securely connecting to 700+ sources here:
https://open.substack.com/pub/softwarearchitectureweekly/p/enterprise-grade-data-source-integration
