Kubernetes at Mercado Libre
https://medium.com/mercadolibre-tech/kubernetes-at-mercado-libre-ec331bea1866
At Mercado Libre, managing over 30,000 microservices and supporting 16,000 developers requires a robust and scalable solution. To address these challenges, we adopted Kubernetes (K8s) as the core engine of our internal platform, Fury. This article explores how Kubernetes streamlined our infrastructure management, accelerated software delivery, and improved cost efficiency, enabling our developers to focus on innovation and product creation.
https://medium.com/mercadolibre-tech/kubernetes-at-mercado-libre-ec331bea1866
From Autopilot to Standard GKE: The Key to 15x Cheaper Istio
https://medium.com/google-cloud/from-autopilot-to-standard-gke-the-key-to-15x-cheaper-istio-d755c1d3866d
TL;DR Istio proxy costs for 10-nodes GKE cluster reduced from $3065 to $185 per month.
https://medium.com/google-cloud/from-autopilot-to-standard-gke-the-key-to-15x-cheaper-istio-d755c1d3866d
All My DevOps Pipelines from GitLab Commit to ArgoCD got beaten by FTP
https://medium.com/@davidpech_39825/all-my-devops-pipelines-from-gitlab-commit-to-argocd-got-beaten-by-ftp-205b34f61441
https://medium.com/@davidpech_39825/all-my-devops-pipelines-from-gitlab-commit-to-argocd-got-beaten-by-ftp-205b34f61441
When is read-only not read-only?
https://raesene.github.io/blog/2024/11/11/When-Is-Read-Only-Not-Read-Only
Kubernetes RBAC has some tricky areas where the behaviour you get might not be exactly what you expect, and sometimes as in this case, those unexpected behaviours are not very apparent!
https://raesene.github.io/blog/2024/11/11/When-Is-Read-Only-Not-Read-Only
Managing Kubernetes CEPH cluster monitor: easier than it looks
https://blog.devgenius.io/managing-kubernetes-ceph-cluster-monitor-easier-than-it-looks-56ee000e68ff
In this article I will explain how you can safely fail over a rook-ceph Monitor to a new Kubernetes node.
I hope I will save you some time and if you find the content useful please leave a clap.Also let me know in the comments of the article if you need professional help with a common topic.
https://blog.devgenius.io/managing-kubernetes-ceph-cluster-monitor-easier-than-it-looks-56ee000e68ff
How to Troubleshoot OOM Issues in Go Applications Running on Kubernetes
https://medium.com/@csepulvedab/how-to-troubleshoot-oom-issues-in-go-applications-running-on-kubernetes-149e8bb104ac
https://medium.com/@csepulvedab/how-to-troubleshoot-oom-issues-in-go-applications-running-on-kubernetes-149e8bb104ac
Securing Argo CD in a Multi-Tenant Environment with Application Projects
https://medium.com/containers-101/securing-argo-cd-in-a-multi-tenant-environment-with-application-projects-39ad30893fd5
https://medium.com/containers-101/securing-argo-cd-in-a-multi-tenant-environment-with-application-projects-39ad30893fd5
How to mount secrets as files or environment variables in Kubernetes
https://itnext.io/how-to-mount-secrets-as-files-or-environment-variables-in-kubernetes-f03d545dcd89
https://itnext.io/how-to-mount-secrets-as-files-or-environment-variables-in-kubernetes-f03d545dcd89
Connecting K8S/Cilium cluster and K8S/Calico cluster using Submariner
https://medium.com/@yboaron/connecting-k8s-cilium-cluster-and-k8s-calico-cluster-using-submariner-d56d7c39f0cb
https://medium.com/@yboaron/connecting-k8s-cilium-cluster-and-k8s-calico-cluster-using-submariner-d56d7c39f0cb
container-startup-autoscaler
https://github.com/ExpediaGroup/container-startup-autoscaler
container-startup-autoscaler (CSA) is a Kubernetes controller that modifies the CPU and/or memory resources of containers depending on whether they're starting up, according to the startup/post-startup settings you supply. CSA works at the pod level and is agnostic to how the pod is managed; it works with deployments, statefulsets, daemonsets and other workload management APIs.
https://github.com/ExpediaGroup/container-startup-autoscaler
1
kubectl.nvim
https://github.com/Ramilito/kubectl.nvim
Processes kubectl outputs to enable vim-like navigation in a buffer for your cluster.
https://github.com/Ramilito/kubectl.nvim
falco
https://github.com/falcosecurity/falco
Falco is a cloud native runtime security tool for Linux operating systems. It is designed to detect and alert on abnormal behavior and potential security threats in real-time.
https://github.com/falcosecurity/falco
dice
https://github.com/dicedb/dice
DiceDB is an open-source, fast, reactive, in-memory database optimized for modern hardware. Commonly used as a cache, it offers a familiar interface while enabling real-time data updates through query subscriptions. It delivers higher throughput and lower median latencies, making it ideal for modern workloads.
https://github.com/dicedb/dice
stu
https://github.com/lusingander/stu
STU is the TUI explorer application for Amazon S3 (AWS S3) written in Rust using ratatui.
https://github.com/lusingander/stu
xan
https://github.com/medialab/xan
xan is a command line tool that can be used to process CSV files directly from the shell.
https://github.com/medialab/xan
openproject
https://github.com/opf/openproject
OpenProject is the leading open source project management software.
https://github.com/opf/openproject
Beyond “5 Whys”: A Better Way to Learn from Incidents
https://uptimelabs.io/beyond-5-whys-a-better-way-to-learn-from-incidents
We all can agree that the most important purpose of a post-incident review (or post-mortem) is to learn from incidents. Implied in this learning is improving the system (people, processes, technology, and their interactions). All my reflections on the “5 Whys” technique refer back to how the technique enhances our learning (or not) from incidents.
https://uptimelabs.io/beyond-5-whys-a-better-way-to-learn-from-incidents
Anatomy of a Throttler
Part 1: https://planetscale.com/blog/anatomy-of-a-throttler-part-1
Part 2: https://planetscale.com/blog/anatomy-of-a-throttler-part-2
Part 3: https://planetscale.com/blog/anatomy-of-a-throttler-part-3
Part 1: https://planetscale.com/blog/anatomy-of-a-throttler-part-1
Part 2: https://planetscale.com/blog/anatomy-of-a-throttler-part-2
Part 3: https://planetscale.com/blog/anatomy-of-a-throttler-part-3
Systematically Terraforming a Brownfield of Cloud Infrastructure
https://www.evalapply.org/posts/systems-approach-to-infrastructure-as-code
Some thinking, trade-offs, theory building, and method-making one might ended up doing, in the course of bringing Infrastructure as Code (IaC) discipline to brownfield (and greenfield) services, at a small regulated fintech company, having a smaller engineering team that serves several business units, including one of India's largest national tax gateways. Only somewhat easier than reading a long compound sentence without pausing for breath. Phew.
https://www.evalapply.org/posts/systems-approach-to-infrastructure-as-code
The Infra to handle 10M Requests in 10 Minutes for $0.0116
https://tonywang.io/blog/infra-10m-requests-10-minutes-0.0116
In this article, we'll break down the infrastructure required to achieve a target of 10 million requests in 10 minutes, all for around $0.0116. This guide goes beyond basic setup and explores practical considerations for production-ready systems, balancing cost efficiency and high availability.
https://tonywang.io/blog/infra-10m-requests-10-minutes-0.0116