Branch Prediction

Branch Prediction Nədir?

Branch Prediction - CPU-nun branch təlimatının hansı yolla gedəcəyini əvvəlcədən proqnozlaşdırmasıdır. Pipeline-da control hazard-ları minimizə etmək üçün istifadə olunur.

Problem: Control Hazards

if (x > 0) {
    // Path A
    y = x + 5;
} else {
    // Path B
    y = x - 5;
}

CPU-ya branch qərarı verilənədək gözləmək lazımdır, bu da pipeline-ı dayanmağa məcbur edir.

Branch Misprediction Penalty

Modern CPU-da 15-20 cycle itki!

Cost = Pipeline Depth × Misprediction Rate

Nümunə:

Pipeline depth: 20 stages
Misprediction rate: 5%
Branch hər 5 təlimatdan bir

Penalty = 20 × 0.05 × (1/5) = 0.2 cycles per instruction
IPC reduction: ~20%

Branch Növləri

1. Conditional Branches

if (condition) { ... }
while (condition) { ... }
for (...) { ... }

cmp rax, 0
je label          ; Jump if equal
jne label         ; Jump if not equal
jg label          ; Jump if greater

2. Unconditional Branches

goto label;
break;
continue;

jmp label         ; Always jump

3. Function Calls/Returns

function();
return value;

call function     ; Call function
ret               ; Return

Static Branch Prediction

Compiler və ya hardware tərəfindən sabit qərar.

1. Predict Not Taken

Hər zaman branch alınmaz fərz edilir.

Üstünlük: Sadə implementasiya Çatışmazlıq: Loop-lar üçün pis (çox taken)

2. Predict Taken

Hər zaman branch alınır fərz edilir.

Üstünlük: Loop-lar üçün yaxşı Çatışmazlıq: Sequential kod üçün pis

3. Backward Taken, Forward Not Taken (BTFNT)

// Backward branch (loop) - predict taken
for (int i = 0; i < 100; i++) {
    // Loop body
}  // Branch back - usually taken

// Forward branch - predict not taken
if (unlikely_condition) {
    // Rare case
}
// Continue - usually not taken

Accuracy: ~60-70%

Dynamic Branch Prediction

Runtime-da branch behavior-a əsasən proqnoz.

1-Bit Predictor

Ən sadə dynamic predictor.

Problem: Loop-da 2 dəfə səhv edir!

// Loop iterations: TTTTTTTTN (7 taken, 1 not taken)
for (int i = 0; i < 8; i++) {
    // Body
}

Mispredictions:

Son iterasiyada: T → N (səhv)
Növbəti loop-da ilk dəfə: N → T (səhv)

2-Bit Saturating Counter

Daha stabil - iki dəfə səhv olmalıdır ki, dəyişsin.

States:

00: Strongly Not Taken - predict not taken
01: Weakly Not Taken - predict not taken
10: Weakly Taken - predict taken
11: Strongly Taken - predict taken

Loop nümunəsi:

Iterations: TTTTTTTTN
Predictor:  11→11→11→11→11→11→11→10→01 (yalnız 1 səhv!)

Pattern History Table (PHT)

Branch address-ə görə predictor.

Ölçü: 4K-64K entries (tipik)

Əks tərəfi: Aliasing - müxtəlif branch-lər eyni entry-ə map ola bilər

Branch Target Buffer (BTB)

Branch-in target address-ini cache edir.

BTB Entry:

Branch PC
Target Address
Branch Type
Prediction bits

BTB Structure

Branch PC	Target Address	Prediction	Valid
0x1000	0x2000	Taken	1
0x1100	0x1200	Not Taken	1
0x1500	0x3000	Taken	1

Two-Level Adaptive Predictor

Branch history pattern istifadə edir.

Global History

Bütün branch-lərin ümumi tarixi.

Branch History Register (BHR):

Last 8 branches: 10110101

Correlation nümunəsi:

if (a > 0) {      // Branch 1
    x = 1;
}
if (a > 0) {      // Branch 2 - same as Branch 1!
    y = 1;
}

Branch 2-nin davranışı Branch 1-dən asılıdır!

Local History

Hər branch-in öz tarixi.

Üstünlük: Müxtəlif branch-lərin tarixi qarışmır

Tournament Predictor

Bir neçə predictor-dan ən yaxşısını seçir.

Selector: Hansı predictor daha yaxşı işləyir?

Intel Core və AMD Ryzen istifadə edir.

Speculative Execution

Branch proqnozuna əsasən əvvəlcədən təlimatları icra edir.

Speculative Execution States

1. Fetch: Predicted path-dən təlimatlar gətir 2. Execute: Speculatively icra et 3. Verify: Həqiqi branch qərarını yoxla 4. Commit/Flush: Düzgündürsə commit, yoxsa flush

Misprediction Recovery

Cost: 15-20 cycles modern CPU-da

Return Address Stack (RAS)

Function return address-lərini predict edir.

Stack structure:

Top  → 0x2000  (most recent call)
       0x1500
       0x1000
Bottom → 0x500   (oldest call)

Accuracy: ~95-99% (çox yüksək)