Cilium 1.20: evaluate datapath performance knobs alongside the upgrade (bandwidthManager+BBR, bpf.masquerade; netkit later)

> 🤖 Generated by the Daily AI Assistant

## Context — why these are deferred to the 1.20 upgrade

A 2026-06-11 performance survey of the platform concluded the Cilium datapath is **not the measured bottleneck** today: CPU throttling is ~0% across workloads, user-facing slowness traced to scheduling/capacity (autoscaler thrash, memory-bound control planes), not packet processing. Meanwhile the cilium HelmRelease sets `rollOutCiliumPods: true`, so **every values change rolls every agent**, on a cluster whose worst incidents were all datapath regressions (nodeEncryption black-hole → #1975, strict-mode drops → #1944, SPIRE wedges → #1809/#1818, firewall 4250 → #1859).

The Cilium **1.20 upgrade is already planned** (gate for replacing auth-proxy with the Gateway API ExternalAuth filter, GEP-1494 / cilium/cilium#45739). That upgrade rolls all agents anyway — the right moment to pay the roll cost for datapath knobs **once** instead of per-change.

Current baseline (`k8s/bases/infrastructure/controllers/cilium/helm-release.yaml` + hetzner patch): tunnel routing (vxlan), `kubeProxyReplacement: true`, WireGuard pod-to-pod with strict egress mode (`nodeEncryption: false`, deliberate), SPIRE mutual auth enforced, Hubble + relay (2) + UI (KEDA 0/1), Gateway API + ALPN, agents request-only (no CPU limits — per Cilium guidance), monitor aggregation at chart default.

## Recommended — adopt with/after 1.20, one knob per PR

### 1. `bandwidthManager: { enabled: true, bbr: true }`
- Cilium's own performance-tuning recommendation: fq/EDT pacing + BBR congestion control improves tail latency and throughput fairness on egress (gateway responses toward Cloudflare).
- Kernel requirement (≥ 5.18 for BBR) comfortably met by the Talos 6.x kernel.
- Caveat: replaces the node qdisc — soak in local/CI **with WireGuard enabled** before prod.

### 2. `bpf.masquerade: true`
- Moves pod→external SNAT from iptables to eBPF, removing per-packet iptables traversal on all egress (Cloudflare-bound traffic, registry pulls, webhooks).
- Requires kube-proxy replacement (already on).
- Caveat: verify the WireGuard interplay in local/CI first; changes the NAT path for all egress.

**Rollout discipline for both:** one knob per PR; local/CI soak first; low-traffic window; after each node rolls, check `cilium status --verbose` and watch Hubble for unexpected drops. `rollOutCiliumPods: true` means each PR is a full one-node-at-a-time agent roll.

## Revisit later (not at 1.20)

- **netkit device mode** — the largest upstream datapath win (replaces veth), but still bleeding-edge for this cluster's risk tolerance; re-evaluate once 1.20.x has matured and after the two knobs above have soaked.

## Evaluated and rejected (rationale recorded so it isn't re-litigated)

| Knob | Verdict | Why |
| --- | --- | --- |
| `routingMode: native` / autoDirectNodeRoutes | ❌ | Needs L2 adjacency or route programming Hetzner's private network doesn't naturally provide; high effort/risk, modest gain at this scale. |
| DSR | ❌ | Requires switching the tunnel to Geneve, and most user traffic is L7-proxied through Envoy/Gateway anyway — DSR's benefit barely applies. |
| CiliumEndpointSlice | ⏸️ | Cuts apiserver watch load (fits the memory-bound CPs) but the benefit scales with endpoint count — small at a few hundred pods — and it changes ipcache propagation timing on a cluster sensitized to exactly that. |
| Disabling Hubble | ❌ | Would free the 2 relay pods + agent flow processing, but Hubble is the only visibility into Cilium policy verdicts/drops (Coroot cannot see CNI drops), and drop-debugging is this cluster's recurring incident mode. |
| Monitor aggregation tuning | ✅ already fine | Chart default (`medium`) is the sane setting. |
| Agent resource changes | ✅ already done | Requests right-sized in #1713; limits deliberately absent per Cilium guidance. |

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Cilium 1.20: evaluate datapath performance knobs alongside the upgrade (bandwidthManager+BBR, bpf.masquerade; netkit later) #2029

Context — why these are deferred to the 1.20 upgrade

Recommended — adopt with/after 1.20, one knob per PR

1. `bandwidthManager: { enabled: true, bbr: true }`

2. `bpf.masquerade: true`

Revisit later (not at 1.20)

Evaluated and rejected (rationale recorded so it isn't re-litigated)

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Knob	Verdict	Why
`routingMode: native` / autoDirectNodeRoutes	❌	Needs L2 adjacency or route programming Hetzner's private network doesn't naturally provide; high effort/risk, modest gain at this scale.
DSR	❌	Requires switching the tunnel to Geneve, and most user traffic is L7-proxied through Envoy/Gateway anyway — DSR's benefit barely applies.
CiliumEndpointSlice	⏸️	Cuts apiserver watch load (fits the memory-bound CPs) but the benefit scales with endpoint count — small at a few hundred pods — and it changes ipcache propagation timing on a cluster sensitized to exactly that.
Disabling Hubble	❌	Would free the 2 relay pods + agent flow processing, but Hubble is the only visibility into Cilium policy verdicts/drops (Coroot cannot see CNI drops), and drop-debugging is this cluster's recurring incident mode.
Monitor aggregation tuning	✅ already fine	Chart default (`medium`) is the sane setting.
Agent resource changes	✅ already done	Requests right-sized in #1713; limits deliberately absent per Cilium guidance.

Cilium 1.20: evaluate datapath performance knobs alongside the upgrade (bandwidthManager+BBR, bpf.masquerade; netkit later) #2029

Description

Context — why these are deferred to the 1.20 upgrade

Recommended — adopt with/after 1.20, one knob per PR

1. bandwidthManager: { enabled: true, bbr: true }

2. bpf.masquerade: true

Revisit later (not at 1.20)

Evaluated and rejected (rationale recorded so it isn't re-litigated)

Metadata

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Issue actions

1. `bandwidthManager: { enabled: true, bbr: true }`

2. `bpf.masquerade: true`