bregman-arie
DevOps Interview Questions: Linux, Jenkins, AWS, SRE, Prometheus, Docker, Python, Ansible, Git, Kubernetes, Terraform, OpenStack, SQL, NoSQL, Azure, GCP, DNS, Elastic, Network, Virtualization.
awesome-foss
A curated list of amazingly awesome open-source sysadmin resources.
milanm
DevOps Roadmap for 2026, with learning resources.
dastergon
A curated list of Site Reliability and Production Engineering resources.
kubeshark
eBPF-powered network observability for Kubernetes. Indexes L4/L7 traffic with full K8s context, decrypts TLS without keys. Queryable by AI agents via MCP and humans via dashboard.
upgundecha
A curated collection of publicly available resources on how technology and tech-savvy organizations around the world practice Site Reliability Engineering (SRE)
coroot
Coroot is an open-source observability and APM tool with AI-powered Root Cause Analysis. It combines metrics, logs, traces, continuous profiling, and SLO-based alerting with predefined dashboards and inspections.
jaegertracing
Web UI for Jaeger
cloudprober
Active monitoring software that detects failures before your customers do.