Author: Ramadan Khalifa

  • How to Pass the AWS Certified Solution Architect Associate Exam in Two Months: A Practical Guide

    How to Pass the AWS Certified Solution Architect Associate Exam in Two Months: A Practical Guide

    Amazon Web Services (AWS) is a cloud computing platform that provides a wide range of services, including computing, storage, and databases, to name a few. As demand for cloud computing continues to grow, so does the need for certified professionals who can manage these services efficiently. The AWS Certified Solution Architect Associate Exam is an associate-level certification that validates a candidate’s knowledge of AWS architectural principles and services. In this blog post, we will discuss how to pass the exam in two months with a practical guide. But first, we need to understand why.

    Why should you get the AWS Certified Solution Architect Associate certification?

    Passing the AWS Certified Solution Architect Associate Exam can benefit your career in several ways. Here are a few reasons why you should consider getting certified:

    • Increased Career Opportunities: AWS is the leading cloud computing platform, and companies are increasingly adopting it for their infrastructure needs. As a certified AWS Solution Architect Associate, you will have a competitive advantage over non-certified professionals in the job market. This certification can help you land job roles such as AWS Solutions Architect, Cloud Engineer, and Cloud Infrastructure Architect.
    • Enhanced Credibility: AWS Solution Architect Associate certification is a globally recognized and respected credential. It demonstrates your knowledge and skills in designing and deploying scalable, highly available, and fault-tolerant systems on AWS. This certification can enhance your credibility and increase your professional reputation.
    • Higher Salary: Certified professionals generally earn higher salaries than their non-certified counterparts. According to a survey conducted by Global Knowledge, AWS Solution Architect Associate certified professionals earn an average salary of $130,883 per year. This certification can help you negotiate a higher salary or secure a job with a higher pay scale.
    • Continuous Learning: AWS regularly updates its services and features, and certified professionals are required to stay up-to-date with these changes. This certification requires you to continue learning and improving your skills, which can help you stay relevant in the industry.

    The AWS Certified Solution Architect Associate Exam is a challenging exam that requires dedication, commitment, and a solid understanding of AWS services and architecture principles.

    While the exam is challenging, it is not impossible to pass. With the right study plan, practice, and dedication, you can increase your chances of passing the exam. The practical guide outlined in this blog post can help you prepare for the exam in two months and increase your chances of passing.

    Step 1: Understanding the Exam

    The AWS Certified Solution Architect Associate Exam tests the candidate’s knowledge of AWS architectural principles and services, as well as their ability to design and deploy scalable, highly available, and fault-tolerant systems on AWS. The exam consists of 65 multiple-choice questions that need to be answered within 130 minutes. The exam fee is $150, and the passing score is 720 out of 1000.

    Step 2: Setting a Study Plan

    To pass the AWS Certified Solution Architect Associate Exam, you need to create a study plan that works for you. Since the exam covers a wide range of topics, it is essential to set realistic study goals and stick to them. A two-month study plan should be sufficient for most candidates.

    Week 1-2: AWS Fundamentals

    Week 1-2 of preparing for the AWS Certified Solution Architect Associate Exam focuses on learning the fundamentals of AWS. In this section, you will learn about AWS core services, cloud computing basics, and AWS architecture principles. Here are some topics to focus on during this period:

    • AWS Core Services: You should start by learning about the core services of AWS, including Elastic Compute Cloud (EC2), Simple Storage Service (S3), and Relational Database Service (RDS). These services are fundamental to most AWS applications and are essential for designing scalable and highly available systems.
    • Cloud Computing Basics: You should also learn about the basics of cloud computing, including the different deployment models (Public, Private, and Hybrid Clouds) and service models (Infrastructure as a Service, Platform as a Service, and Software as a Service).
    • AWS Architecture Principles: You should learn about AWS architecture principles, including designing for availability, scalability, and fault tolerance. This includes understanding the different AWS regions and availability zones and how to design your application for maximum availability and resilience.
    • AWS Security: You should learn about AWS security best practices, including identity and access management, network security, and data encryption. You should also understand how to secure your AWS infrastructure against common security threats.
    • Hands-on Practice: In addition to studying the theory, you should also practice using AWS services through the AWS Free Tier. This will help you become familiar with the AWS console and give you practical experience with using AWS services.

    During this period, you should aim to complete the AWS Certified Cloud Practitioner Exam (if you haven’t already done so). This exam covers the fundamentals of AWS and will give you a solid foundation for preparing for the AWS Certified Solution Architect Associate Exam.

    Week 1-2 of preparing for the AWS Certified Solution Architect Associate Exam focuses on learning the fundamentals of AWS, including core services, cloud computing basics, AWS architecture principles, security, and hands-on practice. By mastering these topics, you will be well-prepared for the rest of your study plan.

    Week 3-4: Compute Services

    Week 3-4 of preparing for the AWS Certified Solution Architect Associate Exam focuses on learning about AWS compute services. In this section, you will learn about the different compute services offered by AWS, including Elastic Compute Cloud (EC2), Elastic Container Service (ECS), Elastic Kubernetes Service (EKS), and Lambda. Here are some topics to focus on during this period:

    • Elastic Compute Cloud (EC2): You should start by learning about EC2, which is a scalable and highly available compute service that allows you to launch and manage virtual machines (instances) in the cloud. You should learn about the different types of instances, instance purchasing options, storage options, and networking options available in EC2.
    • Elastic Container Service (ECS) and Elastic Kubernetes Service (EKS): You should also learn about containerization and how to deploy and manage containerized applications on AWS using ECS and EKS. You should learn about the different deployment options, load balancing, and scaling options available in these services.
    • Lambda: You should learn about serverless computing and how to use AWS Lambda to run your code without provisioning or managing servers. You should learn about the different trigger options available in Lambda, including API Gateway, S3, and CloudWatch Events.
    • Autoscaling: You should also learn about autoscaling and how to automatically adjust the number of instances running in your application based on demand. You should learn about the different types of autoscaling policies available in AWS and how to configure them.
    • Hands-on Practice: As with the previous week, you should also practice using these services through the AWS Free Tier. This will help you become familiar with the AWS console and give you practical experience with using these services.
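
    The scale-out arithmetic behind a target-tracking autoscaling policy is worth internalizing for the exam. The sketch below illustrates only the core idea — scale capacity in proportion to how far the observed metric is from the target — and is not the actual service implementation (AWS adds cooldowns, warm-up periods, and min/max bounds on top of this):

```go
package main

import (
	"fmt"
	"math"
)

// desiredCapacity sketches the core of a target-tracking scaling policy:
// scale the fleet proportionally to how far the observed metric is from
// the target, rounding up so the metric stays at or below the target.
func desiredCapacity(current int, observed, target float64) int {
	if current <= 0 || target <= 0 {
		return current
	}
	return int(math.Ceil(float64(current) * observed / target))
}

func main() {
	// 4 instances at 80% average CPU, targeting 50% -> scale out to 7.
	fmt.Println(desiredCapacity(4, 80, 50))
	// 10 instances at 20% average CPU, targeting 50% -> scale in to 4.
	fmt.Println(desiredCapacity(10, 20, 50))
}
```

    Rounding up on scale-out is the important detail: it keeps the per-instance metric at or below the target rather than oscillating above it.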

    During this period, you should also start practicing for the AWS Certified Solution Architect Associate Exam by taking practice tests and reviewing sample questions. This will help you become familiar with the exam format and the types of questions you can expect to see on the actual exam.

    Week 3-4 of preparing for the AWS Certified Solution Architect Associate Exam focuses on learning about AWS compute services, including EC2, ECS, EKS, Lambda, and autoscaling. By mastering these topics, you will be well-prepared for the compute-related questions that may appear on the exam.

    Week 5-6: Storage Services

    Week 5-6 of preparing for the AWS Certified Solution Architect Associate Exam focuses on learning about AWS storage services. In this section, you will learn about the different storage services offered by AWS, including Simple Storage Service (S3), Elastic Block Store (EBS), Glacier, and Elastic File System (EFS). Here are some topics to focus on during this period:

    • Simple Storage Service (S3): You should start by learning about S3, which is a highly scalable and durable object storage service. You should learn about the different storage classes available in S3, including S3 Standard, S3 Intelligent-Tiering, S3 Standard-Infrequent Access, and S3 One Zone-Infrequent Access. You should also learn about S3 security, including access control policies, encryption options, and bucket policies.
    • Elastic Block Store (EBS): You should also learn about EBS, which provides block-level storage volumes for use with EC2 instances. You should learn about the different volume types available in EBS, including General Purpose SSD (GP2), Provisioned IOPS SSD (IO1), and Throughput Optimized HDD (ST1).
    • Glacier: You should learn about Glacier, which is a low-cost archival storage service. You should learn about its retrieval options, including Expedited, Standard, and Bulk retrievals, which trade retrieval speed against cost.
    • Elastic File System (EFS): You should also learn about EFS, which provides scalable file storage for use with EC2 instances. You should learn about the different performance modes available in EFS, including General Purpose and Max I/O. You should also learn about EFS security, including access control policies and encryption options.
    • Hands-on Practice: As with the previous weeks, you should also practice using these services through the AWS Free Tier. This will help you become familiar with the AWS console and give you practical experience with using these services.
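
    A related feature worth knowing for the exam is S3 lifecycle rules, which transition objects to cheaper storage classes as they age. The decision such a rule encodes can be sketched like this (the age thresholds here are illustrative choices, not AWS defaults):

```go
package main

import "fmt"

// storageClass mimics what an S3 lifecycle rule decides: move objects to
// cheaper storage classes as they age. Thresholds are illustrative only.
func storageClass(ageDays int) string {
	switch {
	case ageDays >= 90:
		return "GLACIER"
	case ageDays >= 30:
		return "STANDARD_IA"
	default:
		return "STANDARD"
	}
}

func main() {
	for _, age := range []int{5, 45, 180} {
		fmt.Printf("%d days -> %s\n", age, storageClass(age))
	}
}
```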

    During this period, you should also continue practicing for the AWS Certified Solution Architect Associate Exam by taking practice tests and reviewing sample questions. This will help you become more familiar with the exam format and the types of questions you can expect to see on the actual exam.

    Week 5-6 of preparing for the AWS Certified Solution Architect Associate Exam focuses on learning about AWS storage services, including S3, EBS, Glacier, and EFS. By mastering these topics, you will be well-prepared for the storage-related questions that may appear on the exam.

    Week 7-8: Network Services and Security

    Week 7-8 of preparing for the AWS Certified Solution Architect Associate Exam focuses on learning about AWS network services and security. In this section, you will learn about the different networking services offered by AWS, including Virtual Private Cloud (VPC), Route 53, Direct Connect, and Elastic Load Balancing (ELB). You will also learn about security-related topics, including Identity and Access Management (IAM), Key Management Service (KMS), and AWS Organizations. Here are some topics to focus on during this period:

    • Virtual Private Cloud (VPC): You should start by learning about VPC, which is a logically isolated section of the AWS Cloud that allows you to launch AWS resources in a virtual network. You should learn about VPC components, including subnets, security groups, and network ACLs. You should also learn about VPC peering, VPC endpoints, and NAT gateways.
    • Route 53: You should learn about Route 53, which is a scalable and highly available Domain Name System (DNS) service. You should learn about how to create and manage DNS records, including A records, CNAME records, and MX records.
    • Direct Connect: You should also learn about Direct Connect, which provides dedicated network connections between your on-premises data center and AWS. You should learn about the different connection options available in Direct Connect, including dedicated connections and hosted connections.
    • Elastic Load Balancing (ELB): You should learn about ELB, which distributes incoming traffic across multiple targets, such as EC2 instances or containers. You should learn about the different types of load balancers available in ELB, including Application Load Balancer, Network Load Balancer, and Classic Load Balancer.
    • Identity and Access Management (IAM): You should also learn about IAM, which provides centralized control of AWS resources. You should learn about IAM users, groups, and roles, and how to use IAM policies to control access to AWS resources.
    • Key Management Service (KMS): You should learn about KMS, which provides managed encryption keys that you can use to encrypt your data stored in AWS. You should learn about the different types of keys available in KMS, including customer master keys (CMKs) and data encryption keys (DEKs).
    • AWS Organizations: You should also learn about AWS Organizations, which allows you to manage multiple AWS accounts centrally. You should learn about how to create and manage AWS accounts, and how to use service control policies (SCPs) to control access to AWS services.
    • Hands-on Practice: As with the previous weeks, you should also practice using these services through the AWS Free Tier. This will help you become familiar with the AWS console and give you practical experience with using these services.
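
    The CMK/data-key split in KMS is easier to remember once you see envelope encryption in action: the data key encrypts the payload, and only the small data key itself is wrapped under the CMK. The following is a purely local sketch using Go's standard crypto packages; in real KMS the CMK never leaves the service, and wrapping is done by the KMS API rather than locally:

```go
package main

import (
	"crypto/aes"
	"crypto/cipher"
	"crypto/rand"
	"fmt"
)

// seal encrypts plaintext with AES-256-GCM under key, prepending the nonce.
func seal(key, plaintext []byte) ([]byte, error) {
	block, err := aes.NewCipher(key)
	if err != nil {
		return nil, err
	}
	gcm, err := cipher.NewGCM(block)
	if err != nil {
		return nil, err
	}
	nonce := make([]byte, gcm.NonceSize())
	if _, err := rand.Read(nonce); err != nil {
		return nil, err
	}
	return gcm.Seal(nonce, nonce, plaintext, nil), nil
}

// open reverses seal: split off the nonce, then decrypt and authenticate.
func open(key, sealed []byte) ([]byte, error) {
	block, err := aes.NewCipher(key)
	if err != nil {
		return nil, err
	}
	gcm, err := cipher.NewGCM(block)
	if err != nil {
		return nil, err
	}
	ns := gcm.NonceSize()
	if len(sealed) < ns {
		return nil, fmt.Errorf("ciphertext too short")
	}
	return gcm.Open(nil, sealed[:ns], sealed[ns:], nil)
}

func main() {
	cmk := make([]byte, 32) // stand-in for the CMK held inside KMS
	dataKey := make([]byte, 32)
	rand.Read(cmk)
	rand.Read(dataKey)

	// Envelope encryption: the data key encrypts the payload,
	// while the CMK encrypts (wraps) only the small data key.
	ciphertext, _ := seal(dataKey, []byte("secret payload"))
	wrappedKey, _ := seal(cmk, dataKey)

	// Decryption path: unwrap the data key first, then open the payload.
	unwrapped, _ := open(cmk, wrappedKey)
	plaintext, _ := open(unwrapped, ciphertext)
	fmt.Println(string(plaintext))
}
```

    The wrapped data key is stored alongside the ciphertext, so decrypting requires only one call to KMS (to unwrap the key), no matter how large the payload is.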

    Week 7-8 of preparing for the AWS Certified Solution Architect Associate Exam focuses on learning about AWS network services and security, including VPC, Route 53, Direct Connect, ELB, IAM, KMS, and AWS Organizations. By mastering these topics, you will be well-prepared for the networking and security-related questions that may appear on the exam.

    Step 3: Practice, Practice, Practice

    One of the most critical steps in preparing for the AWS Certified Solution Architect Associate Exam is practicing. You should take as many practice exams as possible to get a feel for the exam format and the types of questions that will be asked. AWS provides a free practice exam on their website, which you should take before the actual exam. Additionally, there are many third-party practice exams available, such as those from Whizlabs, Udemy, and A Cloud Guru.

    Step 4: Stay Up-to-Date with AWS Services

    AWS regularly releases new services and features, so it is essential to stay up-to-date with these changes. You should subscribe to AWS newsletters and blogs to keep up with the latest news and updates. Additionally, you should regularly review the AWS documentation to ensure that you are familiar with the latest features and services.

    Step 5: Exam Day

    Exam day is the final step in your journey to becoming an AWS Certified Solution Architect Associate. Here are some tips to help you prepare for and succeed on exam day:

    • Review your notes and study materials: On the day before the exam, take some time to review your notes and study materials. This will help refresh your memory on the topics you have been studying and help you identify any areas where you may need to focus your attention.
    • Get a good night’s sleep: It’s important to be well-rested on exam day, so make sure to get a good night’s sleep. Try to go to bed early, and avoid consuming caffeine or alcohol before bedtime.
    • Eat a healthy breakfast: On the morning of the exam, make sure to eat a healthy breakfast. This will help give you the energy you need to stay focused and alert during the exam.
    • Arrive early: Plan to arrive at the testing center at least 30 minutes before your scheduled exam time. This will give you plenty of time to check in, review the exam rules, and get settled before the exam begins.
    • Bring the necessary materials: Make sure to bring a valid form of government-issued identification, such as a passport or driver’s license, to the testing center. Personal pen and paper are generally not allowed; the testing center will provide an erasable board or similar note-taking materials if note-taking is permitted.
    • Stay calm and focused: During the exam, it’s important to stay calm and focused. If you encounter a question that you don’t know the answer to, don’t panic. Take a deep breath, and move on to the next question. You can always come back to difficult questions later.
    • Pace yourself: The AWS Certified Solution Architect Associate Exam consists of 65 questions, and you have 130 minutes to complete the exam. This means you have an average of two minutes per question. Make sure to pace yourself, and don’t spend too much time on any one question.
    • Review your answers: After you have answered all of the questions, take some time to review your answers. Make sure you have answered every question, and double-check your answers to ensure they are accurate.
    • Celebrate your success: After you have completed the exam, take some time to celebrate your success. Becoming an AWS Certified Solution Architect Associate is a significant accomplishment, and you should be proud of your hard work and dedication.

    Conclusion

    Passing the AWS Certified Solution Architect Associate Exam requires dedication, commitment, and a solid understanding of AWS services and architecture principles. It also requires a significant amount of time and effort. By following the practical guide outlined in this blog post, you can prepare for the exam in two months and increase your chances of passing. Remember to set a realistic study plan, practice, stay up-to-date with AWS services, and arrive early on exam day. Good luck with your exam!

  • Why you should learn Golang in 2025

    Why you should learn Golang in 2025

    Golang (or Go) is an open-source, statically typed, compiled programming language created at Google in 2007. It was built to address the shortcomings of C++ and Java that Google encountered while working on its servers and distributed systems.

    It is easy to learn, concise, expressive, and readable. It compiles quickly and delivers high runtime performance. Incorrect type usage is caught at compile time. It can be used for both high- and low-level programming, supports multiple programming paradigms, and includes a built-in garbage collector.

    Since its release, Go has gained popularity among developers due to its simplicity, efficiency, and concurrency capabilities. In this article, we will provide you with practical details on why you should learn Golang in 2025.

    Growing Popularity

    Golang is gaining popularity rapidly among developers, making it one of the top programming languages in demand. According to the TIOBE Index, Go has been steadily rising in popularity, currently ranking around 12th position. With its growing popularity, learning Golang in 2025 can help you stay ahead of the curve in the competitive tech industry.

    High Performance

    Golang is a compiled language that provides fast and efficient performance. The language is designed to optimize the use of system resources and is suitable for building high-performance applications. Go is especially useful in developing microservices, network programming, and concurrent programming.

    Concurrency

    Concurrency is a critical aspect of modern software development, and Golang is designed to handle it well. The language has built-in features such as goroutines and channels, making it easy to write concurrent programs. Goroutines are lightweight threads that allow developers to perform multiple tasks simultaneously, while channels are used for communication and synchronization between goroutines.
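
    A minimal illustration of both features together — goroutines doing work concurrently and a channel collecting their results — looks like this (the task names are made up for the example):

```go
package main

import "fmt"

// fetch simulates a unit of work and sends its result on the channel.
func fetch(name string, results chan<- string) {
	results <- "done: " + name
}

func main() {
	tasks := []string{"users", "orders", "billing"}
	results := make(chan string, len(tasks))

	// Launch one goroutine per task; they run concurrently.
	for _, t := range tasks {
		go fetch(t, results)
	}

	// Receive exactly one result per task; the channel both carries the
	// data and synchronizes main with the goroutines.
	for range tasks {
		fmt.Println(<-results)
	}
}
```

    Note that the results can arrive in any order — the goroutines are scheduled independently, and the channel is what makes that safe.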

    Scalability

    Go is designed to support scalability in software development. With its efficient memory management and garbage collection, Go can handle large-scale applications with ease. Golang’s built-in features also make it easy to write modular, reusable, and maintainable code, making it easier to scale applications as they grow.

    Job Opportunities

    The demand for Golang developers is increasing, and it is expected to continue to rise in the coming years. Many companies, including Google, Uber, and Dropbox, are using Go for their software development. Learning Golang in 2025 can provide you with job opportunities in various industries and fields, including finance, healthcare, e-commerce, and more.

    How to Learn Golang in 2025

    Now that you know why you should learn Golang in 2025, here are some practical steps you can take to get started:

    • Get familiar with Golang basics – Start by understanding the basics of Golang, such as variables, functions, and data types.
    • Practice writing Golang code – Practice writing Golang code and implementing different programming concepts. You can use online coding platforms or Golang-specific coding platforms like Go Playground to get started.
    • Learn Golang libraries and frameworks – Golang has several libraries and frameworks that can help you build efficient applications. Get familiar with popular libraries like Gin, Echo, and Beego, and frameworks like Revel and Buffalo.
    • Join Golang communities – Join Golang communities, attend meetups and conferences, and network with other Golang developers. You can find Golang communities on platforms like Reddit, Slack, and Discord.
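
    As a taste of the first step, here is a small program touching the basics mentioned above — variables, functions, and data types (the names are purely illustrative):

```go
package main

import "fmt"

// greet demonstrates typed parameters, a return value, and a basic loop.
func greet(name string, times int) string {
	msg := "" // := declares a variable and infers its type (string)
	for i := 0; i < times; i++ {
		msg += "Hello, " + name + "! "
	}
	return msg
}

func main() {
	var year int = 2025 // explicit type declaration
	fmt.Println(greet("Gopher", 2), year)
}
```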

    Conclusion

    Learning Golang in 2025 can provide you with several benefits, including high performance, concurrency, scalability, and job opportunities. Golang’s growing popularity and demand make it a valuable skill to have in the tech industry. To get started with learning Golang, get familiar with the basics, practice writing Golang code, learn popular libraries and frameworks, and join Golang communities. Good luck!

  • How to Pass the CKAD Exam in One Month: A Practical Guide

    How to Pass the CKAD Exam in One Month: A Practical Guide

    The Certified Kubernetes Application Developer (CKAD) exam is designed to test your skills in developing and deploying applications on Kubernetes. If you are planning to take the CKAD exam, you may be wondering how to best prepare for it in a short amount of time. In this article, we will provide you with practical details on how to pass the CKAD exam in one month.

    Understand the Exam Objectives

    Before starting your preparation for the CKAD exam, it is crucial to understand the exam objectives. The CKAD exam tests your knowledge and skills in the following areas:

    • Core Kubernetes Concepts
    • Configuration
    • Multi-Container Pods
    • Observability
    • Pod Design
    • Services & Networking
    • State Persistence

    Understanding the exam objectives will help you to focus your study efforts and create a study plan.

    Create a Study Plan

    To pass the CKAD exam in one month, you need to create a study plan that covers all the exam objectives. Here’s an example study plan:

    Week 1:

    • Study Kubernetes core concepts, including Pods, Deployments, and Services.
    • Practice creating and managing Kubernetes objects.

    Week 2:

    • Study Configuration and Multi-Container Pods, including ConfigMaps and Secrets.
    • Practice creating and managing Kubernetes objects.

    Week 3:

    • Study Pod Design and Observability, including Liveness Probes and Logging.
    • Practice creating and managing Kubernetes objects.

    Week 4:

    • Study Services & Networking, including Service Discovery and Network Policies.
    • Study State Persistence, including Persistent Volumes and Persistent Volume Claims.
    • Practice creating and managing Kubernetes objects.

    Remember to schedule your study time around your work and personal commitments. It is also essential to take regular breaks to avoid burnout.

    Practice, Practice, Practice

    The key to passing the CKAD exam is practice. You need to practice creating and managing Kubernetes objects, troubleshooting common issues, and developing and deploying applications on Kubernetes.

    There are several ways to practice for the CKAD exam:

    • Use the Kubernetes documentation – The Kubernetes documentation is an excellent resource for learning Kubernetes concepts and commands.
    • Use online labs – There are many online labs available that provide a Kubernetes environment for practicing.
    • Use practice exams – Practice exams can help you to familiarize yourself with the exam format and test your knowledge.
    • Join a study group – Joining a study group can provide you with support, motivation, and additional resources.

    Useful Tips for the Exam Day

    On the day of the exam, there are several things you can do to help you pass:

    • Get a good night’s sleep – Being well-rested will help you to stay focused during the exam.
    • Read the instructions carefully – Make sure you understand the instructions and requirements of each task.
    • Manage your time – The CKAD exam is a time-limited exam, so manage your time wisely.
    • Don’t panic – If you get stuck on a task, take a deep breath, and try to think logically about how to proceed.
    • Use the Kubernetes documentation – The Kubernetes documentation is available during the exam, so make use of it.

    Conclusion

    Passing the CKAD exam in one month is achievable with the right study plan and practice. Understanding the exam objectives, creating a study plan, and practicing regularly will help you to succeed. Remember to take regular breaks and use resources such as the Kubernetes documentation, online labs, and practice exams. On the day of the exam, stay calm, manage your time wisely, and use the available resources. Good luck!

  • How to Pass the CKA Exam in One Month: A Practical Guide

    How to Pass the CKA Exam in One Month: A Practical Guide

    The Certified Kubernetes Administrator (CKA) exam is a challenging certification that validates your Kubernetes skills and knowledge. If you’re preparing to take the CKA exam, you may be wondering how to best prepare for it in a short amount of time. In this article, we’ll provide you with practical details on how to pass the CKA exam in one month.

    Understand the Exam Objectives

    Before you start studying, it’s essential to understand the exam objectives. The CKA exam tests your knowledge and skills in the following areas:

    • Kubernetes core concepts
    • Kubernetes networking
    • Kubernetes scheduling
    • Kubernetes security
    • Kubernetes cluster maintenance
    • Kubernetes troubleshooting

    Understanding the exam objectives will help you to focus your study efforts and create a study plan.

    Create a Study Plan

    To pass the CKA exam in one month, you’ll need to create a study plan that covers all the exam objectives. Here’s an example study plan:

    Week 1:

    • Study Kubernetes core concepts, including Pods, Deployments, Services, and ConfigMaps.
    • Practice creating and managing Kubernetes objects.

    Week 2:

    • Study Kubernetes networking, including Services, Ingress, and NetworkPolicies.
    • Practice creating and managing Kubernetes networking objects.

    Week 3:

    • Study Kubernetes scheduling, including Nodes, Pods, and the Kubernetes Scheduler.
    • Practice creating and managing Kubernetes scheduling objects.

    Week 4:

    • Study Kubernetes security, including Authentication, Authorization, and Admission Control.
    • Practice creating and managing Kubernetes security objects.
    • Study Kubernetes cluster maintenance and troubleshooting.
    • Practice troubleshooting common Kubernetes issues.

    Remember to schedule your study time around your work and personal commitments. It’s also essential to take regular breaks to avoid burnout.

    Practice, Practice, Practice

    The key to passing the CKA exam is practice. You’ll need to practice creating and managing Kubernetes objects, troubleshooting common issues, and securing your Kubernetes cluster.

    There are several ways to practice for the CKA exam:

    • Use the Kubernetes documentation – The Kubernetes documentation is an excellent resource for learning Kubernetes concepts and commands.
    • Use online labs – There are many online labs available that provide a Kubernetes environment for practicing.
    • Use practice exams – Practice exams can help you to familiarize yourself with the exam format and test your knowledge.
    • Join a study group – Joining a study group can provide you with support, motivation, and additional resources.

    Useful Tips for the Exam Day

    On the day of the exam, there are several things you can do to help you pass:

    • Get a good night’s sleep – Being well-rested will help you to stay focused during the exam.
    • Read the instructions carefully – Make sure you understand the instructions and requirements of each task.
    • Manage your time – The CKA exam is a time-limited exam, so manage your time wisely.
    • Don’t panic – If you get stuck on a task, take a deep breath, and try to think logically about how to proceed.
    • Use the Kubernetes documentation – The Kubernetes documentation is available during the exam, so make use of it.

    Conclusion

    Passing the CKA exam in one month is achievable with the right study plan and practice. Understanding the exam objectives, creating a study plan, and practicing regularly will help you to succeed. Remember to take regular breaks and use resources such as the Kubernetes documentation, online labs, and practice exams. On the day of the exam, stay calm, manage your time wisely, and use the available resources. Good luck!

  • Book Summary: SRE, Part 4, Best Practices for Building Monitoring and Alerting

    Book Summary: SRE, Part 4, Best Practices for Building Monitoring and Alerting

    Monitoring is a crucial aspect of Site Reliability Engineering (SRE) because it allows teams to detect, diagnose, and resolve issues in distributed systems. In this article, we’ll explore the principles of monitoring and best practices for monitoring distributed systems.

    First principle: Measure what matters

    Teams should identify key performance indicators (KPIs) that directly impact user experience and business outcomes. These KPIs should be tracked over time, and teams should establish service level objectives (SLOs) that define acceptable levels of performance.
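
    One way to make an SLO concrete is to turn it into an error budget: the number of failures the objective permits over a window, and how much of that allowance has already been spent. A minimal sketch (the numbers are illustrative):

```go
package main

import "fmt"

// errorBudget returns how many failed requests an SLO permits over a window,
// and what fraction of that budget the observed failures have consumed.
func errorBudget(slo float64, total, failed int) (allowed int, consumed float64) {
	allowed = int(float64(total) * (1 - slo))
	if allowed == 0 {
		return 0, 1 // no budget at all: treat any failure as fully spent
	}
	consumed = float64(failed) / float64(allowed)
	return allowed, consumed
}

func main() {
	// A 99.9% availability SLO over 1,000,000 requests allows 1,000 failures.
	allowed, consumed := errorBudget(0.999, 1_000_000, 250)
	fmt.Printf("budget=%d consumed=%.0f%%\n", allowed, consumed*100)
}
```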

    Second principle: Understand dependencies

    Distributed systems are composed of many components, and it’s essential to understand how they interact with each other. Teams should create dependency diagrams that show the relationships between components and use them to prioritize monitoring efforts.

    Third principle: Define actionable alerts

    Teams should create alerts that trigger when KPIs deviate from acceptable levels. Alerts should be designed to be actionable, meaning they should provide enough context to help teams diagnose and resolve issues quickly. It’s also essential to ensure that alerts are not too noisy, so teams don’t become desensitized to them.
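
    One widely used way to keep alerts both actionable and quiet is burn-rate alerting, popularized by Google's SRE material: page only when errors are consuming the error budget fast enough to exhaust it well before the window ends. A simplified sketch (the 14.4 threshold is a commonly cited example for a 30-day window, used here as an illustration):

```go
package main

import "fmt"

// burnRate says how many times faster than "budget-neutral" the service is
// consuming its error budget. 1.0 means the budget lasts exactly the window.
func burnRate(errorRate, slo float64) float64 {
	return errorRate / (1 - slo)
}

// shouldPage fires only on a fast burn, keeping the alert actionable
// instead of paging on every transient blip.
func shouldPage(errorRate, slo, threshold float64) bool {
	return burnRate(errorRate, slo) >= threshold
}

func main() {
	slo := 0.999 // 99.9% success target -> 0.1% error budget
	fmt.Println(shouldPage(0.02, slo, 14.4))   // 2% errors: burn rate ~20x, page
	fmt.Println(shouldPage(0.0005, slo, 14.4)) // 0.05% errors: ~0.5x, stay quiet
}
```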

    Fourth principle: Automation

    Manual monitoring is error-prone, time-consuming, and difficult to scale. Teams should invest in automated monitoring tools that can detect issues in real-time and provide insights into the root cause of the problem.

    Fifth principle: End-to-End monitoring

    Monitoring should cover the entire system, from the user interface to the backend infrastructure. Teams should use synthetic monitoring to simulate user interactions and track performance from the user’s perspective.

    Sixth principle: Perform post-incident analysis (postmortem)

    After an incident, teams should conduct a post-incident analysis to understand what happened, why it happened, and how it can be prevented in the future. This analysis should involve all stakeholders, including developers, operators, and business owners.

    To implement these principles effectively, teams should use a monitoring framework that provides a consistent approach to monitoring. The monitoring framework should define monitoring goals, identify KPIs, establish SLOs, create alerts, and automate monitoring tasks. It should also integrate with other tools and systems, such as incident management tools, log analysis tools, and dashboards.

    In conclusion, monitoring is essential to maintaining the reliability and performance of distributed systems. By following these principles and best practices, teams can develop effective monitoring strategies that help them detect, diagnose, and resolve issues quickly, ultimately improving the user experience and business outcomes.

  • Semantic Conventions in OpenTelemetry

    Semantic Conventions in OpenTelemetry

    In this article, we’re going to learn about semantic conventions in OpenTelemetry and how they are used to make data processing much easier. We’ll also discuss the different types of semantic conventions. Without further ado, let’s get started.

    What Are Semantic Conventions?

    Semantic conventions in general are the agreed-upon meaning of words and phrases within a particular language or culture. They help us communicate with each other by providing a shared understanding of our symbols.

    For example, the “thumbs up” gesture is a convention that means “good job” or “I agree” in many cultures.

    Without these conventions, communication would be much more difficult; we would constantly have to explain the meaning of every single word we use.

    What Do Semantic Conventions Mean In OpenTelemetry?

    Semantic conventions are important in OpenTelemetry because they help to define the meaning of resources and metrics. Semantic conventions provide a common language for all users of the system. This allows for a more accurate interpretation of data and helps to ensure that everyone is on the same page when it comes to resource usage and performance.

    In this article, we will take a quick look at the different kinds of semantic conventions provided by OpenTelemetry. Let’s start with metric semantic conventions.

    What Are Metric Semantic Conventions?

    The OpenTelemetry project has published a set of metric semantic conventions that can be used by any software that collects or displays metrics data.

    The metric semantic conventions define a set of core dimensions that should be used when recording metric data. These dimensions include name, description, unit, and type. In addition, the conventions define a set of recommended labels that can be used to further describe the data. By following these standards, it is possible to create easily understood metrics that can be effectively compared.

    A quick example of the naming conventions is limit, which means the known total amount of something. For example, system.memory.limit is the total amount of memory on a system.

    utilization means the fraction of usage out of its limit and should be called entity.utilization. For example, system.memory.utilization is the fraction of memory in use. Utilization values are in the range [0, 1].
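    A small sketch of these naming rules in plain Python (dictionaries standing in for a metrics pipeline, not the OpenTelemetry SDK):

```python
# Record memory metrics under their conventional names:
# `system.memory.limit` is the known total, and
# `system.memory.utilization` is the fraction in use, always in [0, 1].

def memory_metrics(used_bytes: int, total_bytes: int) -> dict:
    return {
        "system.memory.limit": total_bytes,
        "system.memory.utilization": used_bytes / total_bytes,
    }

metrics = memory_metrics(used_bytes=4 * 1024**3, total_bytes=16 * 1024**3)
assert metrics["system.memory.utilization"] == 0.25  # 4 GiB of 16 GiB
```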

    For more information about metric semantic conventions, please check the official OpenTelemetry documentation.

    In addition to the metric semantic conventions, the OpenTelemetry team has also published standards for logging and tracing data. By using these standards, it is possible to create software that can generate consistent results regardless of the underlying implementation.

    Let’s take a closer look at semantic conventions for spans and traces.

    Semantic Conventions for Spans and Traces

    It’s recommended to use attributes to describe the dimensions of the telemetry data collected. For example, when dealing with network data, attributes might describe the source and destination IP address, the port numbers, etc. Attributes can also be used to describe metadata about the data itself. For example, when dealing with log data, attributes might describe the timestamp, the logging level, etc.

    Attributes are stored in so-called AttributeMaps. An AttributeMap is a map from attribute keys to attribute values. The keys are strings, and the values can be any data type that can be represented as a JSON value.

    One of the benefits of using attributes is that they provide a way to add additional information to the data without changing the data itself. This is especially useful when dealing with legacy systems that cannot be modified.

    Another benefit of attributes is that they can be used to filter and group data. For example, if one has a log file that contains messages from multiple sources, they can use attributes to filter out messages from certain sources. Or, if they want to group all messages with the same logging level, they can use attributes.
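    Both uses, filtering and grouping, can be sketched with plain Python dictionaries standing in for attribute maps (the record shape below is illustrative, not an OpenTelemetry API):

```python
# Filter records by one attribute and group them by another.
from collections import defaultdict

records = [
    {"message": "disk full",  "attributes": {"source": "db",  "level": "ERROR"}},
    {"message": "user login", "attributes": {"source": "web", "level": "INFO"}},
    {"message": "slow query", "attributes": {"source": "db",  "level": "WARN"}},
]

# Filter out messages from a certain source:
db_only = [r for r in records if r["attributes"]["source"] == "db"]

# Group all messages with the same logging level:
by_level = defaultdict(list)
for r in records:
    by_level[r["attributes"]["level"]].append(r["message"])

assert len(db_only) == 2
```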

    Here is an example of manually defining attributes for a service

    # Resource attributes that describe a service.
    namespace = Company.Shop
    service.name = shoppingcart


    Events

    Events are one of the core concepts in OpenTelemetry, providing a way to record significant moments or states in the system that can be used for monitoring and analysis. They can be generated manually by operators or automatically by the OpenTelemetry SDK. OpenTelemetry events contain metadata about the event and any relevant data that was collected at the time of the event.

    Events can be used to track the progress of a system through its lifecycle or to identify changes in state that may indicate an issue. They can also be used to record performance data, such as response times or resource utilization. By analyzing events, it is possible to understand how a system is functioning and where potential problems may lie.

    Here is an example event in a span

    {
      "name": "Hello",
      "context": {
        "trace_id": "0x5b8aa5a2d2c872e8321cf37308d69df2",
        "span_id": "0x051581bf3cb55c13"
      },
      "parent_id": null,
      "start_time": "2023-01-19T18:52:58.114201Z",
      "end_time": "2023-01-19T18:52:58.114687Z",
      "attributes": {
        "namespace": "Company.Shop",
        "service.name": "shoppingcart"
      },
      "events": [
        {
          "name": "Guten Tag!",
          "timestamp": "2023-01-19T18:52:58.114561Z",
          "attributes": {
            "event_attributes": 12
          }
        }
      ]
    }

    The event here has a name, a timestamp, and some attributes.

    Conclusion

    Semantic conventions are important in OpenTelemetry because they provide a common language for all users of the system. The OpenTelemetry project has published a set of metric semantic conventions that can be used by any software that collects or displays metrics data. In addition, the OpenTelemetry team has also published standards for logging and tracing data. By using these standards, it is possible to create software that can generate consistent results regardless of the underlying implementation.

    To learn more about semantic conventions, please check the official OpenTelemetry documentation:

  • Root Cause Analysis (RCA) Using Distributed tracing

    Root Cause Analysis (RCA) Using Distributed tracing

    Distributed tracing is a method of tracking the propagation of a single request as it’s handled by various services that make up an application. Tracing in that sense is “distributed” because in order to fulfill its function, a single request must often traverse process, machine and network boundaries.

    Once we have instrumented our application and exported our telemetry data to an observability backend (like Sumo Logic or New Relic), it’s time to use this data to debug our production system efficiently. In this article, we will explore debugging techniques applied to observability data and what separates them from traditional techniques used to debug production applications.

    To learn more about tracing and what it means to instrument an application and export telemetry data, please check this article.

    Before we start with how we can use traces and spans to debug our production applications during incidents, it’s important to take a brief look at how we used to do it using logs and metrics, the old way.

    Old way of debugging an application using logs and metrics

    Prior to distributed tracing, system and application debugging mostly occurred by building upon what you know about a system. This can be observed in the way the most senior members of an engineering team approach troubleshooting. It can seem magical when they know the right questions to ask and instinctively know the right places to look. That magic is born from intimate familiarity with the application.

    To pass this magic to other team members, managers usually ask senior engineers to write detailed runbooks in an attempt to identify and solve every possible problem (Root Cause) they might encounter. But that time spent creating runbooks and dashboards is largely wasted, because modern systems rarely fail in precisely the same way twice.

    Anyone who has ever written or used a runbook can tell you a story about just how woefully inadequate they are. Perhaps they work to temporarily address technical debt: there’s one recurring issue, and the runbook tells other engineers how to mitigate the problem until the upcoming sprint when it can finally be resolved. But more often, especially with distributed systems, a long thin tail of problems that almost never happen are responsible for cascading failures in production. Or, five seemingly impossible conditions will align just right to create a large-scale service failure in ways that might happen only once every few years.

    Yet engineers typically embrace that dynamic as just the way that troubleshooting is done—because that is how the act of debugging has worked for decades. First, you must intimately understand all parts of the system—whether through direct exposure and experience, documentation, or a runbook. Then you look at your dashboards and then you…intuit the answer? Or maybe you make a guess at the root cause, and then start looking through your dashboards for evidence to confirm your guess.

    Even after instrumenting your applications to emit observability data, you might still be debugging from known conditions. For example, you could take that stream of arbitrarily wide events and pipe it to tail -f and grep it for known strings, just as troubleshooting is done today with unstructured logs. Or you could take query results and stream them to a series of infinite dashboards, as troubleshooting is done today with metrics. You see a spike on one dashboard, and then you start flipping through dozens of other dashboards, visually pattern-matching for other similar shapes.

    But what happens when you don’t know what’s wrong or where to start looking? What happens when the debugging conditions are completely unknown to you?

    The real power of observability is that you don’t have to know so much in advance of debugging an issue. You should be able to systematically and scientifically take one step after another, to methodically follow the clues to find the answer, even when you are unfamiliar (or less familiar) with the system. The magic of instantly jumping to the right conclusion by inferring an unspoken signal, relying on past scar tissue, or making some leap of familiar brilliance is instead replaced by methodical, repeatable, verifiable process.

    Debugging a production application using traces

    Debugging a production application using traces and spans is different. It doesn’t require much experience with the application itself; you just need to be curious to learn more about what’s actually happening with the application in the production environment. It simply works like this:

    1. Start with the overall view of what prompted your investigation: what did the customer or alert tell you?
    2. Then verify that what you know so far is true: is a notable change in performance happening somewhere in this system? Data visualizations can help you identify a change of behaviour as a change in a curve somewhere in the graph.
    3. Search for dimensions that might drive that change in performance. Approaches to accomplish that might include: examining sample rows from the area that shows the change (are there any outliers in the columns that might give you a clue?); slicing those rows across various dimensions looking for patterns (do any of those views highlight distinct behaviour across one or more dimensions? Try an experimental group by on commonly useful fields, like status_code); and filtering for particular dimensions or values within those rows to better expose potential outliers.
    4. Do you now know enough about what might be occurring? If so, you’re done! If not, filter your view to isolate this area of performance as your next starting point. Then return to step 3.
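    Step 3 of this loop can be sketched as a brute-force scan: count how often each attribute value appears among the slow spans and see which dimension stands out. The span fields below are hypothetical:

```python
# Count attribute (key, value) pairs across slow spans; the pair that
# dominates the list is the dimension correlated with the slowness.
from collections import Counter

def slow_span_dimensions(spans, latency_threshold_ms=500):
    counts = Counter()
    for span in spans:
        if span["duration_ms"] > latency_threshold_ms:
            for key, value in span["attributes"].items():
                counts[(key, value)] += 1
    return counts.most_common()

spans = [
    {"duration_ms": 900, "attributes": {"az": "us-east-1a", "instance": "m5.large"}},
    {"duration_ms": 950, "attributes": {"az": "us-east-1a", "instance": "m5.xlarge"}},
    {"duration_ms": 120, "attributes": {"az": "us-east-1b", "instance": "m5.large"}},
]
# ("az", "us-east-1a") appears in every slow span, pointing at that AZ.
assert slow_span_dimensions(spans)[0] == (("az", "us-east-1a"), 2)
```

    No prior knowledge of the system is needed here: the scan simply surfaces whichever dimension correlates with the outliers.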

    You can use this loop as a brute-force method to cycle through all available dimensions to identify which ones explain or correlate with the outlier graph in question, with no prior knowledge or wisdom about the system required.

    Example

    For example, let’s say we have a spike in request latency of some APIs for different users. If we isolated those slow requests, we would easily see that the slow-performing events mostly originate from one particular availability zone (AZ) of our cloud infrastructure provider (assuming we have the AZ information in the spans). After digging deeper, we might notice that one particular virtual machine instance type appears to be more affected than others.

    This information has been tremendously helpful: we now know the conditions that appear to be triggering slow performance. A particular type of instance in one particular AZ is much more prone to very slow performance than other infrastructure we care about. In that situation, the glaring difference pointed to what turned out to be an underlying network issue with our cloud provider’s entire AZ.

    Another Example

    Here’s another example of root cause analysis using spans, to make sure it’s clear. Let’s assume that after deploying a new version of our application, we noticed that some APIs are getting slower. To investigate this issue, we follow the same systematic debugging loop with distributed tracing. We started by taking a deeper look at the slow APIs, searching for dimensions that might drive that change in performance. After diving deeper, we found out that all those APIs call a payment_service. After diving into the spans related to payment_service, we found out that it fetches data from a PostgreSQL database, specifically from a table called user_payments_history. Comparing those spans with similar spans from the same API calls before that deployment, we found that the queries to the user_payments_history table are new and that they actually take some time to get the required data.

    The problem here might be a missing index that causes the query to be slow, or the user_payments_history table might simply have too many records. There is no way to be sure of the exact root cause yet, but at least we know for sure that something is wrong with the user_payments_history table in the payment_service.

    Not all issues are as immediately obvious as this underlying infrastructure issue. Often you may need to look at other surfaced clues to triage code-related issues. The process remains the same, and you may need to slice and dice across dimensions until one clear signal emerges, similar to the preceding example.

    Conclusion

    With complex distributed systems, it has become really hard to figure out what is actually going on in a production application. That’s why metrics and logs alone are not enough to debug those apps and find the root cause of an incident.

    Traces and spans can help in that situation. With high-cardinality events, we can collect lots of information about our system that will be really handy when dealing with incidents under time pressure. We get a systematic approach to finding the root cause of incidents, assuming we are collecting enough information (dimensions) in the spans.

    To learn more about observability, please check:


  • Sampling Traces In OpenTelemetry

    Sampling Traces In OpenTelemetry

    At scale, the cost to collect, process, and store traces can dramatically outweigh the benefits, because many of these events are virtually identical and successful. The point of debugging is to search for patterns or examine failed events during an outage. That’s why it’s wasteful to transmit 100% of all events to the observability backend.

    To debug effectively, we just need a representative sample of successful events which can be compared to bad events.

    We can sample events by using the strategies outlined in this article and still provide granular visibility into system state. Unlike pre-aggregated metrics that collapse all events into one coarse representation of system state over a given period of time, sampling allows us to make informed decisions about which events can help us surface unusual behaviour, while still optimizing for resource constraints. The difference between sampled events and aggregated metrics is that full cardinality is preserved on each dimension included in the representative event.

    In OpenTelemetry, there are two approaches to sampling: head-based sampling and tail-based sampling. Let’s review both approaches and see when to use them.

    Head-based sampling

    As the name suggests, head-based sampling means to make the decision to sample or not at the beginning of the trace.

    This is the most common way of doing sampling today because of its simplicity, but since we don’t know everything in advance, we’re forced to make arbitrary decisions (like a random percentage of all spans to sample) that may limit our ability to understand everything.

    A disadvantage of head-based sampling is that you can’t decide to sample only spans with errors: the decision to sample or not is made before any error happens, so you can’t know it in advance.

    Built-in samplers include AlwaysOn, AlwaysOff, TraceIDRatioBased, and ParentBased.

    “AlwaysOn” (AlwaysSample) sampler

    As the name suggests, it samples all events and takes 100% of the spans. In a perfect world, we would use only this sampler, without any cost considerations.

    “AlwaysOff” (NeverSample) sampler

    Also as the name suggests, the AlwaysOff sampler samples 0% of the spans. This means that no data will be collected whatsoever. You probably won’t be using this one much, but it could be useful in certain cases. For example, when you run load tests and don’t want to store the traces created by them.

    ParentBased Sampler

    This is the most popular sampler and is the one recommended by the official OpenTelemetry documentation. When a trace begins, we make a decision whether to sample it or not. Whatever the decision is, the child spans will follow it.

    The main advantage of the ParentBased sampler is that you always get the complete picture: a trace is either sampled in full or not at all.

    How does this work? For the root span, we decide whether it will be sampled or not. The decision is sent to the rest of the child spans in the trace via context propagation, so each child knows whether it needs to be sampled.

    It is important to understand that this is a composite sampler, which means it does not live on its own but it lets us define how to sample for each use case. For example, we can define what to do when we have no parent by using the root sampler.

    ParentBased(root=TraceIDRatioBased)

    It’s recommended to use the parent-based sampler with the TraceIDRatioBased sampler as the root sampler.

    The TraceIDRatioBased sampler uses the trace ID to calculate whether the trace should be sampled, with respect to the sample rate we choose.
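    Conceptually, the decision is a pure function of the trace ID and the chosen ratio, which is why every span of a trace can arrive at the same answer. A simplified sketch, not the actual SDK implementation:

```python
# Map the low 64 bits of the trace ID onto [0, 2**64) and compare with
# the configured ratio; trace IDs below the bound are sampled.
def should_sample(trace_id: int, ratio: float) -> bool:
    bound = int(ratio * (1 << 64))
    return (trace_id & ((1 << 64) - 1)) < bound

# The same trace ID always yields the same decision for a given ratio,
# so parent and children agree without any extra coordination.
decision = should_sample(0x5B8AA5A2D2C872E8321CF37308D69DF2, ratio=0.25)
assert should_sample(0x5B8AA5A2D2C872E8321CF37308D69DF2, ratio=0.25) == decision
```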

    Tail-based sampling

    Contrary to head-based sampling, in tail-based sampling we make the decision to sample or not at the collector level. This is useful when the decision depends on information only available at the end of a trace. For example, to sample based on latency we must know the exact start and end times, which cannot be known in advance.

    Also, what was a disadvantage of head-based sampling becomes an advantage for tail-based sampling: the ability to sample only the spans with errors.
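    Here is a toy illustration of that advantage: buffer the spans of each trace at the collector, then keep only the traces that contain at least one error (the span shape is made up for the example):

```python
# Group buffered spans by trace, then keep whole traces that contain
# an error span; everything else is dropped.
from collections import defaultdict

def keep_error_traces(spans):
    """spans: dicts with 'trace_id' and 'status' keys."""
    traces = defaultdict(list)
    for span in spans:
        traces[span["trace_id"]].append(span)
    kept = []
    for trace_spans in traces.values():
        if any(s["status"] == "ERROR" for s in trace_spans):
            kept.extend(trace_spans)  # keep the whole trace, not just the bad span
    return kept

spans = [
    {"trace_id": "a", "status": "OK"},
    {"trace_id": "b", "status": "OK"},
    {"trace_id": "b", "status": "ERROR"},
]
assert {s["trace_id"] for s in keep_error_traces(spans)} == {"b"}
```

    The buffering in `traces` is exactly the overhead mentioned below: the collector must hold spans in memory until the whole trace has arrived.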

    So where should sampling be implemented?

    Well, that depends on your specific use case so there is no one solution that fits all.

    If you choose to do it at the OTel distro level (head-based sampling), you remove redundant data at the source and never need to worry about it again. You also minimize the data transported over the network. However, when you need to update the sample rate, you have to redeploy your services each time.

    If you implement it in the collector, you have a centralized place that controls sampling, so you don’t need to redeploy your services when you change the sample rate. However, making the sampling decision requires buffering the data until a decision can be made, which adds overhead.

    Conclusion

    Sampling traces and spans is almost always a good idea, since it will save a lot of money and most likely won’t affect the debugging process using spans and traces in production. There are different approaches to implementing it in OpenTelemetry. Head-based sampling is simpler to implement, but it requires redeploying services for each change. Tail-based sampling is a little harder to implement, but it gives us the ability to sample only the traces with errors.

  • Book Summary: SRE, Part 3, Service Level Indicators (SLIs), Objectives (SLOs), and Agreements (SLAs)

    Book Summary: SRE, Part 3, Service Level Indicators (SLIs), Objectives (SLOs), and Agreements (SLAs)

    In this article, we are going to learn about Site Reliability Engineering (SRE) core terminologies. It’s important to understand these terms because they are used a lot nowadays in the software industry. I know that learning terminology might sound boring or complex, but I will try to make it simple and as practical as possible. We will use the Shakespeare service explained in part one as an example service, so please make sure you check that first. It’s also important to check part 2, where we talked about error budgets, if you haven’t already. Without further ado, let’s start with Service Level Indicators (SLIs).

    SLI or Service Level Indicator

    SLI or Service Level Indicator is a metric (a number) that helps us define how our service is performing. For example:

    • Request Latency: how long it takes to return a response to a request.
    • Error Rate: the fraction of requests with errors (e.g., an API returning 500).
    • System Throughput: how many requests we get per second.
    • Availability: the fraction of well-formed requests that succeed. 100% availability is impossible, but near-100% availability is achievable. We express high-availability values in terms of the number of “nines” in the availability percentage. For example, availability of 99.99% can be referred to as “4 nines” availability.
    • Durability: the likelihood that data will be retained over a long period of time. It’s especially important for data storage systems.
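    Two of these indicators, error rate and availability, can be computed directly from request outcomes. A minimal sketch, with invented status codes:

```python
# Derive error rate and availability SLIs from a list of request outcomes.
def sli_report(requests):
    total = len(requests)
    errors = sum(1 for r in requests if r["status"] >= 500)
    return {
        "error_rate": errors / total,
        "availability": (total - errors) / total,
    }

requests = [{"status": 200}] * 9998 + [{"status": 500}] * 2
report = sli_report(requests)
assert report["availability"] == 0.9998
```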

    There are more metrics we can collect to give us more insight into our system’s health, but the question here is: how can we actually identify which metrics are meaningful to our system? The answer is simple: “It depends!” It depends on what you and your users care about.

    We shouldn’t use every metric we can track in our monitoring system as an SLI. Choosing too many indicators makes it hard to pay the right level of attention to the indicators that matter, while choosing too few may leave significant behaviors of our system unexamined. We typically find that a handful of representative indicators are enough to evaluate and reason about a system’s health. Services tend to fall into a few broad categories in terms of the SLIs they find relevant:

    • User-facing serving systems, such as the Shakespeare search frontends, generally care about availability, latency, and throughput. In other words: Could we respond to the request? How long did it take to respond? How many requests could be handled?
    • Storage systems often emphasize latency, availability, and durability. In other words: How long does it take to read or write data? Can we access the data on demand? Is the data still there when we need it?
    • Big data systems, such as data processing pipelines, tend to care about throughput and end-to-end latency. In other words: How much data is being processed? How long does it take the data to progress from ingestion to completion?

    To use those metrics as SLI, we need to collect and aggregate them on the server side, using a monitoring system such as Prometheus. However, some systems should be instrumented with client-side collection, because not measuring behavior at the client can miss a range of problems that affect users but don’t affect server side metrics. For example, concentrating on the response latency of the Shakespeare search backend might miss poor user latency due to problems with the page’s JavaScript: in this case, measuring how long it takes for a page to become usable in the browser is a better proxy for what the user actually experiences.

    SLO or Service Level Objective

    SLO or Service Level Objective is a target value or range of values for a service level that is measured by an SLI. For example, we can set the SLOs for the Shakespeare service as follows:

    • average search request latency should be less than 100 milliseconds
    • availability should be 99.99% which means error rate should be 0.01%

    SLOs should specify how they’re measured and the conditions under which they’re valid. For instance, we might say the following:

    • 99% (averaged over 1 minute) of Get requests will complete in less than 300 ms (measured across all the backend servers).

    If the shape of the performance curve is important, then you can specify multiple SLO targets:

    • 90% of Get requests will complete in less than 100 ms.
    • 99% of Get requests will complete in less than 300 ms.
    • 99.9% of Get requests will complete in less than 500 ms.
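    Checking a set of latency targets like these amounts to comparing percentiles of the observed latencies against each threshold. A rough sketch using the nearest-rank percentile, with invented sample data:

```python
# Nearest-rank percentile: the value below which p% of observations fall.
def percentile(sorted_values, p):
    index = max(0, int(round(p / 100 * len(sorted_values))) - 1)
    return sorted_values[index]

latencies_ms = sorted([80] * 90 + [250] * 9 + [450])  # 100 sample requests
targets = [(90, 100), (99, 300), (99.9, 500)]          # (percentile, max ms)

# Every target is met for this sample window:
for p, max_ms in targets:
    assert percentile(latencies_ms, p) < max_ms
```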

    It’s both unrealistic and undesirable to insist that SLOs will be met 100% of the time: doing so can reduce the rate of innovation and deployment, require expensive, overly conservative solutions, or both. Instead, it is better to allow an error budget.

    So, how can we actually choose targets (SLOs)? Here are a few lessons from Google that can help:

    • Keep it simple. Complicated aggregations in SLIs can obscure changes to system performance, and are also harder to reason about.
    • Avoid absolutes. While it’s tempting to ask for a system that can scale its load “infinitely” without any latency increase and that is “always” available, this requirement is unrealistic.
    • Have as few SLOs as possible. Choose just enough SLOs to provide good coverage of your system’s attributes. If you can’t ever win a conversation about priorities by quoting a particular SLO, it’s probably not worth having that SLO.
    • Perfection can wait. You can always refine SLO definitions and targets over time as you learn about a system’s behavior. It’s better to start with a loose target that you tighten than to choose an overly strict target that has to be relaxed when you discover it’s unattainable.

    SLOs should be a major driver in prioritizing work for SREs and product developers, because they reflect what users care about. A poorly thought-out SLO can result in wasted work if a team makes extreme efforts to meet it, or in a bad product if it is too loose.

    SLA or Service Level Agreement

    SLA or Service Level Agreement is an explicit or implicit contract with your users that includes consequences of meeting (or missing) the SLOs it contains.

    SRE doesn’t typically get involved in constructing SLAs, because SLAs are closely tied to business and product decisions. SRE helps to avoid triggering the consequences of missed SLOs. They can also help to define the SLIs.

    Conclusion

    SLI is a metric that helps us define how our service is performing, for example request latency or error rate. SLO is a target value for a service level that is measured by an SLI, for example “request latency should be less than 100 milliseconds” or “availability should be 99.99%, which means the error rate should be 0.01%”. SLA is an explicit or implicit contract with the users that includes consequences of meeting (or missing) the SLOs it contains.

    Next, we are going to learn more about how to automate boring and repetitive tasks.

  • Book Summary: Site Reliability Engineering, Part 2, Error Budgets and Service Level Objectives (SLOs)

    Book Summary: Site Reliability Engineering, Part 2, Error Budgets and Service Level Objectives (SLOs)

    It would be nice to build 100% reliable services, ones that never fail, right? Absolutely not. Attempting such a thing would actually be a really bad idea, because it’s very expensive and it limits how fast new features can be developed and delivered to users. Also, users typically won’t notice the difference between high reliability and extreme reliability in a service, because the user experience is dominated by less reliable components like the cellular network or the device they are working with. With this in mind, rather than simply maximizing uptime, Site Reliability Engineering seeks to balance the risk of unavailability with the goals of rapid innovation and efficient service operations, so that users’ overall happiness with features, service, and performance is optimized.

    Here is how we measure availability for a service:

    Aggregate availability

    aggregate availability = successful requests / total requests
    For example, a system that serves 2.5M requests in a day with a daily availability target of 99.99% can serve up to 250 errors and still hit its target for that given day.
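    The worked example above is just this formula rearranged; in code:

```python
# With 2.5M daily requests and a 99.99% availability target, compute how
# many errors the service can serve and still hit its target for the day.
daily_requests = 2_500_000
availability_target = 0.9999

allowed_errors = daily_requests * (1 - availability_target)
assert round(allowed_errors) == 250
```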

    Why Error Budgets

    There is always tension between product development teams and SRE teams, given that they are generally evaluated on different metrics. Product development performance is largely evaluated on product velocity, which creates an incentive to push new code as quickly as possible. Meanwhile, SRE performance is evaluated based upon reliability of a service, which implies an incentive to push back against a high rate of change.

    For example, let’s say we want to define the push frequency for a service. Given that every push is risky, SRE will push for fewer deployments. On the other side, the product development team will push for more deployments because they want their work to reach the users.

    Our goal here is to define an objective metric, agreed upon by both sides, that can be used to guide the negotiations in a reproducible way. The more data-based the decision can be, the better.

    How to define Your Error Budget?

    In order to base these decisions on objective data, the two teams jointly define a quarterly error budget based on the service’s service level objective, or SLO. The error budget provides a clear, objective metric that determines how unreliable the service is allowed to be within a single quarter. This metric removes the politics from negotiations between the SREs and the product developers when deciding how much risk to allow.

    Our practice is then as follows:

    • Product Management defines an SLO, which sets an expectation of how much uptime the service should have per quarter.
    • The actual uptime is measured by our monitoring/observability system.
    • The difference between these two numbers is the “budget” of how much “unreliability” is remaining for the quarter.
    • As long as the uptime measured is above the SLO—in other words, as long as there is error budget remaining—new releases can be pushed.

    For example, imagine that a service’s SLO is to successfully serve 99.999% of all queries per quarter. This means that the service’s error budget is a failure rate of 0.001% for a given quarter. If a problem causes us to fail 0.0002% of the expected queries for the quarter, the problem spends 20% of the service’s quarterly error budget.
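    The bookkeeping above can be expressed as a small sketch (function names are illustrative, not taken from any real tooling):

```python
def budget_remaining(measured_uptime: float, slo: float) -> float:
    """Unreliability still available this quarter; negative means overspent."""
    return measured_uptime - slo

def can_release(measured_uptime: float, slo: float) -> bool:
    """Releases may continue while measured uptime stays above the SLO."""
    return budget_remaining(measured_uptime, slo) > 0

def budget_spent_by_incident(slo: float, failed_fraction: float) -> float:
    """Fraction of the quarterly error budget consumed by one incident."""
    return failed_fraction / (1 - slo)

# With a 99.999% SLO, an incident that fails 0.0002% of the expected
# queries consumes 20% of the quarter's error budget.
spent = budget_spent_by_incident(0.99999, 0.000002)
```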

    The Benefits of Error Budgets

    The main benefit of an error budget is that it provides a common incentive that allows both product development and SRE to focus on finding the right balance between innovation and reliability.

    Many products use this control loop to manage release velocity: as long as the system’s SLOs are met, releases can continue. If SLO violations occur frequently enough to expend the error budget, releases are temporarily halted while additional resources are invested in system testing and development to make the system more resilient, improve its performance, and so on. More subtle and effective approaches are available than this simple on/off technique, for instance, slowing down releases or rolling them back when the SLO-violation error budget is close to being used up.
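    This control loop, including the subtler slow-down variant, could look something like the following toy policy function (the threshold and the returned strings are purely illustrative; real policies are negotiated per service):

```python
def release_decision(budget_remaining: float) -> str:
    """Toy gate for the release-velocity control loop.

    budget_remaining: fraction of the quarterly error budget left (0.0 to 1.0).
    """
    if budget_remaining <= 0.0:
        # Budget exhausted: halt launches, invest in resilience and testing.
        return "halt releases"
    if budget_remaining < 0.1:
        # Budget nearly drained: slow down rather than a hard stop.
        return "slow down releases"
    return "release normally"
```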

    For example, if product development wants to skimp on testing or increase push velocity and SRE is resistant, the error budget guides the decision. When the budget is large, the product developers can take more risks. When the budget is nearly drained, the product developers themselves will push for more testing or slower push velocity, as they don’t want to risk using up the budget and stall their launch. In effect, the product development team becomes self-policing. They know the budget and can manage their own risk. (Of course, this outcome relies on an SRE team having the authority to actually stop launches if the SLO is broken.)

    What happens if a network outage or datacenter failure reduces the measured SLO? Such events also eat into the error budget. As a result, the number of new pushes may be reduced for the remainder of the quarter. The entire team supports this reduction because everyone shares the responsibility for uptime.

    The budget also helps to highlight some of the costs of overly high reliability targets, in terms of both inflexibility and slow innovation. If the team is having trouble launching new features, they may elect to loosen the SLO (thus increasing the error budget) in order to increase innovation.

    Conclusion

    • Managing service reliability is largely about managing risk, and managing risk can be costly.
    • 100% is probably never the right reliability target: not only is it impossible to achieve, it’s typically more reliability than a service’s users want or notice. Match the profile of the service to the risk the business is willing to take.
    • An error budget aligns incentives and emphasizes joint ownership between SRE and product development. Error budgets make it easier to decide the rate of releases, help defuse discussions about outages with stakeholders, and allow multiple teams to reach the same conclusion about production risk without conflict.