CatOps

Kanalga Telegram’da o‘tish

DevOps and other issues by Yurii Rochniak (@grem1in) - SRE @ Preply && Maksym Vlasov (@MaxymVlasov) - Engineer @ Star. Opinions on our own. We do not post ads including event announcements. Please, do not bother us with such requests!

Ko'proq ko'rsatish

Ukraina9 694 Texnologiyalar & Aralashmalar18 650

5 085

Obunachilar

+224 soatlar

+167 kunlar

+2430 kunlar

1 334

Post ko'rishlar

~ 70124 soatlar

~ 81248 soatlar

26.23%

Muloqot nisbati

Ma'lumot yo'q

Kuniga postlar

Ads index

beta

Postlar arxiv

5 085

Multi-Agent System Reliability is an article by Alex Ewerlöf that touches on some practical aspects of working with agentic AI systems day-to-day. It covers patterns you can use to improve the output, and probably the main point: > stop treating LLMs like magic chatbots. Start treating them like unreliable components in a distributed system. #ai #sre

5 085

My friend raises money for a pickup truck for the 423rd battalion. Right now, there are 200 out of 350 k UAH raised. You can donate on a supportive jar: https://send.monobank.ua/jar/2aMqtZT592 Or on her jar: https://send.monobank.ua/jar/8oTyqJUjPV #donations #Ukraine

5 085

A new issue of the CatOps Digest is here! https://newsletter.catops.dev/p/catops-digest-2026-07-11 #digest #newsletter

5 085

Some book bundles on Humble Bundle: - AI usage and practices - Linux things Just remember to always check, if you have the books already, because these bundles repeat from time to time. #books

5 085

Yesterday Flux turned 10 years old! 🎉 In this article they reflect on this journey and recall some pivotal moments from the past. Plus, highlight what are they doing now. P.S. Do not forget to update your CV 😁 #kubernetes #gitops #flux

5 085

A post from Cloudflare about a low-level race condition they tracked down and fixed in the Rust Hyper library. I like reading such detective stories. Also, I recall times, when people would regularly ask about strace on the interviews. I am not sure if this is still the case. At least, I wasn’t asked about strace for a long time. #programming #postmortem

5 085

Finally, easy AWS EKS rollbacks to previous K8s version! Now you can trust EKS upgrade even to your AI agent (please don't) https://aws.amazon.com/blogs/aws/upgrade-amazon-eks-clusters-with-confidence-using-kubernetes-version-rollbacks/ #kubernetes #eks #aws

5 085

The four horsemen behind thousands of Postgres outages is a self-promotion article, but it can teach you some things about Postgres, so I allow it. A few corrections, though. Postgres does have a pg_hint_plan extension that allows you to modify the plan yourself. However, if you need to use that, there may be something odd with your queries in the first place. The second thing is JSON. Postgres works with it and a lot of people use JSON fields, but this database was not created for JSON in the first place. So, if you need to work mostly with JSON, you would probably be better with another storage, or you could deserialize JSON fields into columns and work with data as usual. #databases

5 085

Term “gateway” is super-widespread in the Kubernetes world. One of the recent additions is, of course, the AI Gateway. But what is that? Obviously, a reverse proxy, but what else? This article aims to answer this question. The most important part here is that it doesn’t try to tell you what gateway is the best, but rather outlines subtle differences between the flavors. So, you could choose responsibly, if you need such a gateway. #ai #kubernetes

5 085

A big fundraiser from DOU for the 2nd separate corps of the National Guard of Ukraine «Хартія» is still ongoing. The goal of this fundraiser is to buy heavy bomber drones "Vampire" for the Kupiansk direction. Monobank jar: https://send.monobank.ua/jar/26mrQPQ3PZ #donations #Ukraine

5 085

Save a list of Kubernetes defaults in one place, so you don't forget, and don't need to find them every time. #kubernetes

5 085

An article about optimizing the symbolicator - the part of the observability stack that translates stack traces of minified code into human-readable ones. It’s an interesting read about what a design optimization can achieve. In the discussion on Reddit, commentators rightfully pointed out that the drastic difference between this new symbolicator and the baseline is due to the approach that author uses, and that a C/Rust version would still perform batter compared to the example in Go. Yet, this is kinda the point: by designing your application in a clever way, you can achieve better performance with “slower” technologies compared to brute-forcing the solution using “faster” technologies. #programming

5 085

Not all index scans are equal is an article by Datadog, where they describe the idea of targeted DB indices and when to use those. There is also some praise for their database monitoring tooling, but this is a vendor article after all. The only thing is that they didn't mention that too many indices also comes with a price: you need to store and update them. So, always evaluate the performance for some period of time after adding indices. #databases #observability

5 085

We continue supporting DOU with their fundraiser for the 2nd separate corps of the National Guard of Ukraine «Хартія». The goal of this fundraiser is to buy heavy bomber drones "Vampire" for the Kupiansk direction. Monobank jar: https://send.monobank.ua/jar/26mrQPQ3PZ #donations #Ukraine

5 085

So, that's for AI in the companies, but what about AI in the wild i.e. in open source? We have cases like curl, that had to take down their bug bounty program due to the influx of slop bug reports. Yet, the industry adapts. Here's a study by Redmonk on the stance of various foundations and standalone open source projects on AI, including their major concerns, and openness to AI-generated contributions. #ai #open_source

5 085

Continuing with our AI week. AI in SRE: What's Actually Coming in 2026 is telling a story of AI coming for help with incident response. The article suggests trying an AI tool for real investigation or data collection for postmortems. To clarify this, in my experience, you don’t need to have a dedicated tool, a general purpose AI agent with some harness (skills and scripts) would do. You should try it! AI does the job of data gathering incredibly well. Yet, the results are indeed not perfect. Another good point in this article is data quality. AI results are as good as context you provide. I witnessed two prominent failure modes so far: 1. Inference on incomplete data: a person with limited access (typically a developer) asks their agent to investigate an alert. The agent comes to some conclusion. At the same time, a person with elevated access (typically a systems engineer) asks their agent to investigate the same alert and gets a different result, likely because some data is only available via kubectl events, etc. The fix for that is not to allow everyone to do everything, the fix is to revisit your observability pipelines and ensure that you ship all the relevant data, which is easier said than done. 2. Agent that cries "wolves": if you have a pollutant in your logs, or simply an event that happens very often, agents like to correlate it with everything. If your clusters are elastic, an agent could blame node count fluctuations for every error. The problem here is that once node count fluctuation actually causes a problem, you will be the one to ignore this hint from an agent, because it suggests it every single time. If you are ready to share more AI failure modes specifically related to SRE in Ukrainian, welcome to our chat. #ai #sre

5 085

Harness engineering for coding agent users is a new guest article in Martin Fowler's blog that summarizes approaches to improve AI output and make it more manageable. If you're actively using AI agents day-to-day, things described in this article won't be news to you, but it helps to structure one's thoughts. #ai

5 085

I will post AI-related articles this week, because why not? The first one is from Charity Majors called AI demands more engineering discipline. Not less, in which she follows up on her another article. This one is on technical aspects of moving to the disposable code. It also has a lot of links to other articles, which is also cool. #ai

5 085

For today's Donations Monday, I'd like to share with you a fundraiser that our friends at DOU started for the 2nd separate corps of the National Guard of Ukraine «Хартія». The goal of this fundraiser is to buy heavy bomber drones "Vampire" for the Kupiansk direction. Monobank jar: https://send.monobank.ua/jar/26mrQPQ3PZ #donations #Ukraine

5 085

A new issue of the CatOps Digest is here! https://newsletter.catops.dev/p/catops-digest-2026-06-13 #digest #newsletter