DevOps & SRE notes


Helpful articles and tools for DevOps&SRE WhatsApp: https://whatsapp.com/channel/0029Vb79nmmHVvTUnc4tfp2F For paid consultation (RU/EN), contact: @tutunak All ways to support https://telegra.ph/How-support-the-channel-02-19

📈 Analytical overview of Telegram channel DevOps & SRE notes

The channel DevOps & SRE notes (@devops_sre_notes) is active in the English-language segment. It currently has 12 684 subscribers and ranks 10 040 in the Technologies & Applications category and 2 960 in the USA region.

📊 Audience metrics and dynamics

Since its creation (date unknown), the project has grown to an audience of 12 684 subscribers.

According to the latest data from 15 June 2026, the channel shows stable activity. The subscriber count changed by 232 over the last 30 days and by 5 over the last 24 hours, and overall reach remains high.

  • Verification status: Not verified
  • Engagement rate (ER): The average audience engagement rate is 15.80%. Within the first 24 hours after publication, content typically collects 4.81% reactions from the total number of subscribers.
  • Post reach: On average, each post receives 2 004 views. Within the first day, a publication typically gains 610 views.
  • Reactions and interaction: The audience actively supports content: the average number of reactions per post is 5.
  • Thematic interests: Content focuses on key topics such as Kubernetes, cluster, author, engineering, monitoring.
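The percentages in the list above appear to be plain ratios of views to subscribers. This is an assumption inferred from the numbers themselves, not a documented formula of the analytics page. A minimal sketch of that arithmetic:

```python
# Figures taken from the metrics above.
subscribers = 12_684
avg_views_per_post = 2_004
views_first_24h = 610


def share_of_subscribers(views, total_subscribers):
    """Return views as a percentage of subscribers, rounded to two decimals.

    Assumption: the page derives its rates as views / subscribers.
    """
    return round(100 * views / total_subscribers, 2)


# Matches the reported 15.80% engagement rate.
print(share_of_subscribers(avg_views_per_post, subscribers))

# Matches the reported 4.81% for the first 24 hours.
print(share_of_subscribers(views_first_24h, subscribers))
```

If that assumption holds, the 4.81% figure describes first-day views rather than reactions, since the average number of reactions per post is only 5.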

๐Ÿ“ Description and content policy

The author describes the channel as follows:
“Helpful articles and tools for DevOps&SRE WhatsApp: https://whatsapp.com/channel/0029Vb79nmmHVvTUnc4tfp2F For paid consultation (RU/EN), contact: @tutunak All ways to support https://telegra.ph/How-support-the-channel-02-19”

The channel is updated frequently (latest data received on 16 June 2026) and maintains a high level of publication reach. Analytics show that the audience actively interacts with the content, making the channel a point of influence in the Technologies & Applications category.

12 684 subscribers
+5 in 24 hours
+49 in 7 days
+232 in 30 days
Posts Archive
RHEL-compatible distributions are in danger: Red Hat has changed its policy and license agreements. https://www.jeffgeerling.com/blog/2023/dear-red-hat-are-you-dumb

This is a Helm plugin that maps deprecated or removed Kubernetes APIs in a release to supported APIs. https://github.com/helm/helm-mapkubeapis

Debug a target container in a Kubernetes cluster by automatically creating a new, non-invasive, 'debug' container in the same pid, network, user, and ipc namespace as the target container without disrupting the target container. https://github.com/JamesTGrant/kubectl-debug

Adrien "ZeratoR" Nougaret's annual charity event, Zevent, returned this year with a new addition called Zevent Place. Inspired by Reddit's r/place, Zevent Place is a collaborative canvas where donors can draw pixels based on the amount they donate. Developers William Traoré and Alexandre Moghrabi created the platform with several features, such as a Pixel Upgrade system and real-time updates, to protect community creations and enhance user experience. The team utilized various technologies like GraphQL, NestJS, Redis, and MinIO, and managed to handle massive amounts of updates while maintaining a low CPU and bandwidth footprint. Although there were challenges, such as unexpected rate limit errors with Cloudflare, the event achieved 98.4% uptime, with the downtime being addressed and resolved promptly. Overall, Zevent Place was a successful project, and valuable lessons were learned throughout its development and implementation. https://medium.com/@alexmogfr/zevent-place-how-we-handled-100k-ccu-on-a-real-time-collective-canvas-71d3d346e0ab

In this post, the author explores various load balancing algorithms, including round robin, weighted round robin, dynamic weighted round robin, and least connections. The simulations demonstrate how these algorithms perform in different scenarios, highlighting their strengths and weaknesses. Round robin performs well in terms of median latency but struggles with higher percentiles. Least connections offer a good balance between simplicity and performance but may not be optimal in terms of latency. The PEWMA algorithm, which combines techniques from dynamic weighted round robin and least connections, shows significant improvements across all latency percentiles but has additional complexity and may not handle dropped requests as well as least connections. Ultimately, the choice of load balancing algorithm depends on the specific requirements of a workload and the performance characteristics that need to be optimized. https://samwho.dev/load-balancing/
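To make two of the algorithms named above concrete, here is a minimal sketch of round robin and least connections. The class and method names are hypothetical and are not taken from the linked post; a real balancer would also need health checks, weights, and thread safety.

```python
import itertools


class RoundRobin:
    """Hand out servers in a fixed rotating order, ignoring their load."""

    def __init__(self, servers):
        self._cycle = itertools.cycle(servers)

    def pick(self):
        return next(self._cycle)


class LeastConnections:
    """Pick the server with the fewest in-flight requests."""

    def __init__(self, servers):
        # Number of requests currently being handled by each server.
        self.active = {server: 0 for server in servers}

    def pick(self):
        # Ties go to the server listed first.
        server = min(self.active, key=self.active.get)
        self.active[server] += 1
        return server

    def release(self, server):
        # Call when a request finishes so the counter reflects real load.
        self.active[server] -= 1


rr = RoundRobin(["a", "b", "c"])
print([rr.pick() for _ in range(4)])

lc = LeastConnections(["a", "b"])
first, second = lc.pick(), lc.pick()
lc.release(second)   # the request on the second server finishes first
print(lc.pick())     # the now idle server is chosen again
```

Round robin needs no feedback from the servers, while least connections depends on knowing when each request completes. That feedback is what lets it steer traffic away from a slow server.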

macOS and Linux VMs on Apple Silicon to use in CI and other automations https://github.com/cirruslabs/tart

Have you ever heard of a company migrating from a microservice architecture to a monolith? Moving our service to a monolith reduced our infrastructure cost by over 90%. It also increased our scaling capabilities. Today, we're able to handle thousands of streams and we still have capacity to scale the service even further. Moving the solution to Amazon EC2 and Amazon ECS also allowed us to use the Amazon EC2 compute saving plans that will help drive costs down even further. https://www.primevideotech.com/video-streaming/scaling-up-the-prime-video-audio-video-monitoring-service-and-reducing-costs-by-90

rustic - fast, encrypted, deduplicated backups powered by Rust https://github.com/rustic-rs/rustic

Open-Source Tracing Platform https://github.com/teletrace/teletrace

Roadmapper - A Roadmap as Code (RaC) Python library. Generate professional roadmap diagrams using Python code. https://github.com/csgoh/roadmapper

Efficient GPU utilization is crucial for minimizing infrastructure expenses, especially in large Kubernetes clusters running AI and HPC workloads. NVIDIA MIG enables partitioning GPUs into smaller slices, but using MIG in Kubernetes through the NVIDIA GPU Operator alone has limitations due to static configurations. Dynamic MIG Partitioning addresses these limitations by automating the creation and deletion of MIG profiles based on real-time workload requirements, ensuring optimal GPU utilization. The nos module works alongside the NVIDIA GPU Operator to implement dynamic MIG partitioning, simplifying the management of MIG configurations and reducing operational costs. https://towardsdatascience.com/dynamic-mig-partitioning-in-kubernetes-89db6cdde7a3

If you like this channel, you can support it. If you don't have a DO account and want a simple cloud platform for developing and running your project, you can register with my referral link, https://m.do.co/c/0f8bec835d26 . With this link, you get $100 in credit over 60 days.

Pipedrive Infra manages numerous Kubernetes clusters across different clouds, including AWS and on-premise OpenStack. They had been experiencing intermittent failing pod health checks, which became more frequent over time. After an extensive investigation, the team discovered that Kubelet was initiating TCP sessions to pods using random source ports within the same range reserved by Kubernetes nodeports. This caused the TCP SYN-ACK to be redirected to other pods, leading to failed health checks. The solution was to disallow the use of the nodeport range as the source port for TCP sessions with a single line of code, effectively resolving the issue. https://medium.com/pipedrive-engineering/solving-the-mystery-of-pods-health-checks-failures-in-kubernetes-55b375493d03
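The failure described above depends on the node's ephemeral source-port range overlapping the NodePort range. The sketch below checks for such an overlap. It assumes the Kubernetes default NodePort range of 30000-32767 and the Linux `ip_local_port_range` file format, and the widened range in the example is hypothetical rather than Pipedrive's actual setting.

```python
# Default NodePort range in Kubernetes (--service-node-port-range).
NODEPORT_RANGE = (30000, 32767)


def parse_port_range(text):
    """Parse the whitespace-separated 'low high' format of ip_local_port_range."""
    low, high = (int(part) for part in text.split())
    return low, high


def overlap(a, b):
    """Return the overlapping (low, high) of two inclusive ranges, or None."""
    low, high = max(a[0], b[0]), min(a[1], b[1])
    return (low, high) if low <= high else None


def read_local_port_range(path="/proc/sys/net/ipv4/ip_local_port_range"):
    """Read the ephemeral port range from a Linux node."""
    with open(path) as f:
        return parse_port_range(f.read())


# Hypothetical widened ephemeral range that swallows the NodePort range.
print(overlap(parse_port_range("1024 65535"), NODEPORT_RANGE))

# The stock Linux range starts just above the NodePort range, so no overlap.
print(overlap(parse_port_range("32768\t60999"), NODEPORT_RANGE))
```

On Linux, the `net.ipv4.ip_local_reserved_ports` sysctl is one way to exclude a port range from ephemeral source-port selection. Whether that is the exact one-line fix the article applied is not stated in the summary above.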