DevOps&SRE Library

Open in Telegram

Библиотека статей по теме DevOps и SRE. Реклама: @ostinostin Контент: @mxssl РКН: https://www.gosuslugi.ru/snet/67704b536aa9672b963777b3

Network:DevOps&SRE Library Russia34 727 Technologies & Applications6 932...

📈 Analytical overview of Telegram channel DevOps&SRE Library

Channel DevOps&SRE Library (@devopslibrary) in the English language segment is an active participant. Currently, the community unites 19 414 subscribers, ranking 6 932 in the Technologies & Applications category and 34 727 in the Russia region.

📊 Audience metrics and dynamics

Since its creation on невідомо, the project has demonstrated rapid growth, gathering an audience of 19 414 subscribers.

According to the latest data from 19 June, 2026, the channel demonstrates stable activity. Although there has been a change in the number of participants by 123 over the last 30 days and by -3 over the last 24 hours, overall reach remains high.

Verification status: Not verified
Engagement rate (ER): The average audience engagement rate is 14.85%. Within the first 24 hours after publication, content typically collects 7.26% reactions from the total number of subscribers.
Post reach: On average, each post receives 2 883 views. Within the first day, a publication typically gains 1 409 views.
Reactions and interaction: The audience actively supports content: the average number of reactions per post is 1.
Thematic interests: Content is focused on key topics such as kubernete, cluster, infrastructure, storage, configuration.

📝 Description and content policy

The author describes the resource as a platform for expressing subjective opinions:
“Библиотека статей по теме DevOps и SRE. Реклама: @ostinostin Контент: @mxssl РКН: https://www.gosuslugi.ru/snet/67704b536aa9672b963777b3”

Thanks to the high frequency of updates (latest data received on 20 June, 2026), the channel maintains relevance and a high level of publication reach. Analytics show that the audience actively interacts with content, making it an important point of influence in the Technologies & Applications category.

19 414

Subscribers

-324 hours

+67 days

+12330 days

2 883

Post views

~ 1 40924 hours

~ 1 76648 hours

14.85%

Engagement rate

~ 2

Posts per day

Ads index

beta

Posts Archive

19 412

Kubernetes CPU Limits and Go https://www.ardanlabs.com/blog/2024/02/kubernetes-cpu-limits-go.html

19 412

Distributed Tracing: A Whistle Stop Tour

Know enough to be dangerous in 10 minutes

https://metoro.io/blog/distributed-tracing-whistle-stop-tour

19 412

Мы запустили профессиональную сертификацию по облачным технологиям! Наша программа сертификации ориентирована на международные стандарты, поэтому теперь специалисты по облачным технологиям смогут официально подтвердить свои компетенции. Это поможет им получить конкурентное преимущество при трудоустройстве, ускорить развитие карьеры и претендовать на более высокую оплату. А для тех, кто работает с заказчиками напрямую, — получать более выгодные контракты. Экзамен на сертификат Yandex Cloud Certified Engineer Associate проверяет знания и навыки в шести областях: • Базовые облачные технологии • Хранение и обработка данных • DevOps и автоматизация • Бессерверные вычисления • Информационная безопасность • Биллинг 🔍 О том, как устроена сертификация, что нужно сделать для подготовки и участия в первом экзамене, читайте по ссылке.

19 412

Different Ways to Aggregate Nines

While working on SLOs, SLAs and SLIs I have found that there are only so many ways to aggregate service metrics. I have not yet found somewhere that attempts to review the different aggregation methods and what their relative strengths and weaknesses are.

https://hross.substack.com/p/different-ways-to-aggregate-nines

19 412

👉 Изучите возможности балансировки нагрузки в Nginx и Angie и прокачайте скиллы администратора Linux 🎁 Приходите на бесплатный практический урок от OTUS, где вы вместе с опытным экспертом: 1. изучите варианты балансировки нагрузки в веб-серверах Nginx и Angie; 2. научитесь их использовать; 3. разберёте различие продуктов и их особенности. ⏰ Занятие пройдёт 9 апреля в 19:00 мск в рамках курса «Инфраструктура высоконагруженных систем». Доступна рассрочка на обучение! 👉 Пройдите короткий тест прямо сейчас, чтобы посетить бесплатный урок и получить запись: https://otus.pw/5soB/ Реклама. ООО «Отус онлайн-образование», ОГРН 1177746618576, www.otus.ru, erid: 2VtzqxjX9bc

19 412

How to deal with alert fatigue head-on

Everyone experiences stress at work—thankfully, it’s a topic folks aren’t shying away from anymore. But for on-call engineers, alert fatigue is a phenomenon closer to home. Unfortunately, like stress, it can be just as insidious and drastically impact those it affects. First discussed in the context of hospital settings, this phrase later entered engineering circles. Alert fatigue is when an excessive number of alerts overwhelms the individuals responsible for answering them, often over a prolonged period, resulting in missed or delayed responses, or them being ignored altogether The impact of this fatigue can have an effect beyond the individual and can create significant risks for your organization. But, if you approach on-call the right way, you can mitigate the impacts of alert fatigue or, better yet, avoid it altogether. Here, we'll dive into the tactics teams can implement to address alert fatigue and its underlying causes.

https://incident.io/hubs/on-call/dealing-with-alert-fatigue-head-on

19 412

erid: 2Vtzqx6QSDc Хотите улучшить свои навыки в разработке программного обеспечения и принимать решения на основе данных? Тогда этот открытый урок для вас! На вебинаре вы узнаете, как использовать ArgoCD — инструмент для управления конфигурациями, который позволяет получать информацию через API и анализировать динамику системы. Мы рассмотрим различные метрики, такие как DORA, Engineering и MTT, которые помогут вам понять узкие места и аргументированно предлагать изменения, основываясь на данных. Урок будет полезен всем, кто хочет применять подход «решения на основе данных» в своей работе. По итогам урока вы получите готовый фреймворк «Как начать работать с метриками». Встречаемся 10 апреля в 20:00 МСК в рамках курса «SRE практики и инструменты». Регистрация на бесплатный урок по ссылке: https://clck.ru/39qRHt

19 412

Service Level Agreement

Introduction to the SLA in relation to SLI and SLO

https://blog.alexewerlof.com/p/sla

19 412

Documentation as code: Principles, workflow, and challenges

Core principles of documentation-as-code tools - Treating documentation with the same rigor as code - Storing documentation in version control - Automation of documentation generation and deployment - Peer review processes for documentation updates

https://www.tabnine.com/blog/documentation-as-code-principles-workflow-and-challenges

19 412

Best practices for monitoring software testing in CI/CD

A key challenge of monitoring your CI/CD system is understanding how to optimize your workflows and create best practices that help you minimize pipeline slowdowns and better respond to CI issues. In addition to monitoring CI pipelines and their underlying infrastructure, your organization also needs to cultivate effective relationships between platform and development teams. Fostering collaboration between these two teams is a critical and equally valuable aspect of improving the reliability and performance of your CI. In this post, we’ll explore how platform teams can help developers visualize trends in CI test performance and notify them of new flaky tests, test failures, and performance regressions with dashboards and monitors. We’ll also detail best practices that can help developers identify, investigate, and remediate flaky tests.

https://www.datadoghq.com/blog/best-practices-for-monitoring-software-testing

19 412

Properly Running Kubernetes Jobs with Sidecars in 2024 (K8s 1.28+)

Kubernetes has been a great orchestrator of Jobs and CronJobs for over half a decade now, but if you had a need for running proxy containers or other secondary containers alongside the job, running things properly took a bit of work and decision-making to handle gracefully. This article introduces the easiest way to run Jobs with sidecars using the latest Kubernetes features, and has a complementary repository with complete example manifests you can try in your own cluster. The repository contains all the examples for earlier versions of K8s as well, so make sure to focus on the cronjob.sidecar.*.yaml examples.

https://medium.com/teamsnap-engineering/properly-running-kubernetes-jobs-with-sidecars-in-2024-k8s-1-28-ad9b51d17d50

19 412

Fine-grained RBAC for GitHub Action workflows With GitHub OIDC and HashiCorp Vault https://www.digitalocean.com/blog/fine-grained-rbac-for-github-action-workflows-hashicorp-vault

19 412

Устали тушить пожары на пайплайне? Давайте разбираться, как их избежать! На онлайн-митапе «CI/CD и SRE: Архитектура безупречного деплоя» от Сбера при поддержке JUG Ru Group. Митап пройдет 9 апреля в 17:00 (МСК, GMT+3). Ссылку на трансляцию отправим за 1 час до начала. От экспертов из Сбера и VK вы узнаете: ✔️ Какие правила помогут надежно подготовиться к выходу в Production. ✔️ Как не сломать observability своими руками. И что делать, если все же сломали. ✔️ Почему энтерпрайзу больше подходит распределенная система. ✔️ Как сделать Jenkins стабильным в крупных проектах. ✔️ Зачем делать свои инструменты, когда есть open source. ✔️ Как за один день настроить конвейер от CI до PROD. Регистрируйтесь на сайте митапа. Реклама. ПАО Сбербанк. ИНН 77070838

19 412

garnet

Garnet is a remote cache-store from Microsoft Research that offers strong performance (throughput and latency), scalability, storage, recovery, cluster sharding, key migration, and replication features. Garnet can work with existing Redis clients.

https://github.com/microsoft/garnet

19 412

Облачные технологии повышают гибкость инфраструктуры на 22% А где гибкость, там чаще деплои, меньше время внесения изменений и восстановления работоспособности. Знакомьтесь, VK Cloud — безопасная и технологичная платформа с широким набором облачных сервисов для эффективной разработки и работы с данными. 🔹 Все, что нужно для разработки: виртуальные машины, базы данных, GPU, Kubernetes, S3-хранилище, бэкапы, решения для машинного обучения и работы с Big Data. 🔹 Аудит, миграция, мониторинг и другие лучшие практики VK от команды опытных инженеров. 🔹 Комплексная защита веб-сервисов от атак и взломов. Зарегистрируйтесь в VK Cloud и получите 3 000 ₽ для тестирования облачных сервисов в течение 60 дней!

19 412

How Figma’s databases team lived to tell the scale

Our nine month journey to horizontally shard Figma’s Postgres stack, and the key to unlocking (nearly) infinite scalability. Figma’s database stack has grown almost 100x since 2020. This is a good problem to have because it means our business is expanding, but it also poses some tricky technical challenges. Over the past four years, we’ve made a significant effort to stay ahead of the curve and avoid potential growing pains. In 2020, we were running a single Postgres database hosted on AWS’s largest physical instance, and by the end of 2022, we had built out a distributed architecture with caching, read replicas, and a dozen vertically partitioned databases. We split groups of related tables—like “Figma files” or “Organizations”—into their own vertical partitions, which allowed us to make incremental scaling gains and maintain enough runway to stay ahead of our growth.

https://www.figma.com/blog/how-figmas-databases-team-lived-to-tell-the-scale

19 412

❓Как создавать и настраивать различные типы сервисов в Kubernetes? Эта тема актуальна, так как играет ключевую роль в развертывании масштабируемых и надежных приложений в контейнерах. 👨‍🎓Освойте ее на бесплатном практическом уроке от OTUS. На вебинаре вы узнаете, как создавать и настраивать различные типы сервисов в Kubernetes: ✔️ClusterIP для внутренних связей; ✔️ ExternalService для внешнего доступа; ✔️NodePort для открытия порта на уровне узла; ✔️LoadBalancer для балансировки нагрузки. 📆Занятие пройдёт 11 апреля в 20:00 (мск) в рамках набора на онлайн-курс «Инфраструктурная платформа на основе Kubernetes». 💥Спикер — преподаватель курса и действующий Senior DevOps Engineer. Также на вебинаре вы сможете задать эксперту вопросы о самом курсе и перспективах выпускников. 👉Пройдите короткий тест прямо сейчас, чтобы посетить бесплатный урок https://vk.cc/cvW9v2 🔥Для всех, кто пройдет вступительный тест и запишется на бесплатный вебинар этого курса, будет доступна спец.цена на курс — обсудите свое обучение с менеджерами OTUS! Реклама. ООО «Отус онлайн-образование», ОГРН 1177746618576, www.otus.ru, erid: 2VtzqvEGzwY

19 412

excalidraw

An open source virtual hand-drawn style whiteboard.

https://github.com/excalidraw/excalidraw

19 412

🔝 Сбер, Островок.ру, B2Broker, Яндекс, Иннотех, Andersen и многие другие уже используют для проверки своих систем Chaos Engineering. Крупные компании ищут сотрудников, которые умеют тестировать системы. А мы запускаем видеокурс, который поможет вам расширить стек технологий и получить новый полезный навык. В результате курса вы: ✅ поймете, зачем разбираться в Chaos Engineering и какие эксперименты существуют; ✅ узнаете, с помощью каких инструментов можно реализовать эксперименты, и как выбрать подходящий; ✅ получите навык тестирования нескольких гипотез в рамках нескольких экспериментов; ✅ научитесь объяснять результаты экспериментов руководству; ✅ разберетесь, как генерить гипотезы; ✅ сможете научить коллег этому подходу. Релиз курса — 22 апреля, но уже сейчас мы проводим конкурс на 3 бесплатных места для тех, кто хочет научиться управлять хаосом! ПОДРОБНОСТИ 📌

19 412

gritql

GritQL is a declarative query language for searching and modifying source code.

https://github.com/getgrit/gritql