Relay System

The WireKube relay server tunnels WireGuard UDP packets over TCP for peers that cannot establish a direct P2P connection, for example because they sit behind symmetric NAT or a restrictive firewall.

Design

flowchart LR
    subgraph NodeA["Node A (Symmetric NAT)"]
        WG1[WireGuard wire_kube]
        P1[UDP Proxy 127.0.0.1:random]
        WG1 <--> P1
    end
    subgraph RelayPool["Relay Pool"]
        R1[relay-0 :3478]
        R2[relay-1 :3478]
    end
    subgraph NodeB["Node B"]
        P2[UDP Proxy 127.0.0.1:random]
        WG2[WireGuard wire_kube]
        P2 <--> WG2
    end
    P1 <-->|TCP| RelayPool
    RelayPool <-->|TCP| P2

Protocol

Frame Format

All messages are framed with a length prefix:

Field	Size	Description
Length	4 bytes (uint32)	Total message length
Type	1 byte	Message type code
Body	variable	Message payload
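The framing above can be sketched in Go as follows. This is a minimal illustration, not the WireKube implementation: the byte order (big-endian), the convention that Length counts the type byte plus the body, and the `maxFrameLen` limit are all assumptions, since the table does not specify them. The `roundTrip` helper exists only to exercise the two functions in memory.

```go
package main

import (
	"bytes"
	"encoding/binary"
	"errors"
	"io"
)

// maxFrameLen is a hypothetical sanity limit, not taken from the WireKube source.
const maxFrameLen = 64 * 1024

// writeFrame emits: 4-byte length | 1-byte type | body.
// Assumption: big-endian length that counts the type byte plus the body.
func writeFrame(w io.Writer, msgType byte, body []byte) error {
	buf := make([]byte, 4+1+len(body))
	binary.BigEndian.PutUint32(buf[:4], uint32(1+len(body)))
	buf[4] = msgType
	copy(buf[5:], body)
	_, err := w.Write(buf)
	return err
}

// readFrame reads exactly one frame and returns its type and body.
func readFrame(r io.Reader) (byte, []byte, error) {
	var hdr [4]byte
	if _, err := io.ReadFull(r, hdr[:]); err != nil {
		return 0, nil, err
	}
	n := binary.BigEndian.Uint32(hdr[:])
	if n < 1 || n > maxFrameLen {
		return 0, nil, errors.New("invalid frame length")
	}
	msg := make([]byte, n)
	if _, err := io.ReadFull(r, msg); err != nil {
		return 0, nil, err
	}
	return msg[0], msg[1:], nil
}

// roundTrip encodes a frame into an in-memory buffer and decodes it again.
func roundTrip(msgType byte, body []byte) (byte, []byte, error) {
	var buf bytes.Buffer
	if err := writeFrame(&buf, msgType, body); err != nil {
		return 0, nil, err
	}
	return readFrame(&buf)
}
```

Reading with `io.ReadFull` matters on a TCP stream, because a single `Read` may return only part of a frame.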

Message Types

Type	Code	Body	Description
`MsgRegister`	`0x01`	32-byte WireGuard public key	Agent registers itself with the relay
`MsgData`	`0x02`	32-byte peer pubkey + UDP payload	Forward a WireGuard packet to a peer. The key is the destination when an agent sends and the source when the relay delivers
`MsgKeepalive`	`0x03`	(empty)	Keep TCP connection alive (30s interval)
`MsgError`	`0xFF`	Error message string	Relay reports an error
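As an illustration of the table above, the type codes and the `MsgData` body layout could be expressed like this. The constant values come from the table; the helper names `encodeData` and `decodeData` are hypothetical and are not taken from the WireKube source.

```go
package main

import "errors"

// Type codes from the message table.
const (
	MsgRegister  byte = 0x01
	MsgData      byte = 0x02
	MsgKeepalive byte = 0x03
	MsgError     byte = 0xFF
)

// keyLen is the size of a WireGuard public key in bytes.
const keyLen = 32

// encodeData builds a MsgData body: the 32-byte peer key followed by the UDP payload.
func encodeData(key [keyLen]byte, payload []byte) []byte {
	body := make([]byte, 0, keyLen+len(payload))
	body = append(body, key[:]...)
	return append(body, payload...)
}

// decodeData splits a MsgData body back into the peer key and the payload.
func decodeData(body []byte) (key [keyLen]byte, payload []byte, err error) {
	if len(body) < keyLen {
		return key, nil, errors.New("MsgData body shorter than 32 bytes")
	}
	copy(key[:], body[:keyLen])
	return key, body[keyLen:], nil
}
```

The returned payload aliases the input slice, so a caller that reuses its read buffer would need to copy it first.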

Connection Lifecycle

sequenceDiagram
    participant Agent
    participant Relay
    Agent->>Relay: TCP connect
    Agent->>Relay: MsgRegister(myPubKey)
    Note over Relay: maps pubkey → conn
    Relay-->>Agent: ready
    Agent->>Relay: MsgData(destPubKey, payload)
    Note over Relay: lookup destPubKey → forward
    Relay->>Agent: MsgData(srcPubKey, payload)
    Agent->>Relay: MsgKeepalive (every 30s)
    Note over Relay: TCP drop detected
    Agent->>Relay: reconnect (exponential backoff)
    Agent->>Relay: MsgRegister(myPubKey)
    Note over Relay: re-maps pubkey → new conn
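The relay-side bookkeeping in this sequence (map a public key to a connection, look up the destination, deliver with the sender's key) might look roughly like the sketch below. All names here (`registry`, `deliverFunc`, `forward`) are hypothetical, and `deliverFunc` stands in for writing a `MsgData` frame to an agent's TCP connection.

```go
package main

import (
	"errors"
	"sync"
)

type pubKey [32]byte

// deliverFunc stands in for "write a MsgData frame to this agent's TCP connection".
type deliverFunc func(src pubKey, payload []byte) error

// registry is a hypothetical pubkey-to-connection table guarded by a mutex.
type registry struct {
	mu    sync.RWMutex
	conns map[pubKey]deliverFunc
}

func newRegistry() *registry {
	return &registry{conns: make(map[pubKey]deliverFunc)}
}

// register maps key to a connection. Registering the same key again
// replaces the old mapping, which is what a reconnecting agent needs.
func (r *registry) register(key pubKey, d deliverFunc) {
	r.mu.Lock()
	defer r.mu.Unlock()
	r.conns[key] = d
}

// forward looks up dest and delivers the payload tagged with the sender's key.
func (r *registry) forward(src, dest pubKey, payload []byte) error {
	r.mu.RLock()
	d, ok := r.conns[dest]
	r.mu.RUnlock()
	if !ok {
		return errors.New("destination not registered")
	}
	return d(src, payload)
}
```

Whether the real relay answers an unknown destination with `MsgError` or silently drops the packet is not stated here, so the sketch simply returns an error.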

Auto-Reconnect

The relay client implements automatic reconnection with exponential backoff:

Backoff range: 1 second (initial) to 30 seconds (max)
Trigger: Any read/write error on the TCP connection, or connection close
Behavior: Sets connected=false, closes the old connection, signals reconnect
Registration: On reconnect, re-sends MsgRegister to re-associate the public key
Proxy persistence: Existing UDP proxies are preserved across reconnections

The connected state is tracked via atomic.Bool for lock-free access from the agent's main sync loop.
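A backoff schedule matching the stated range can be sketched as below. The 1 second and 30 second bounds come from the text; the doubling factor, the function names, and the injected `wait` function are assumptions made for illustration. `simulate` only exists to show the resulting delays without real sleeping.

```go
package main

import (
	"errors"
	"time"
)

const (
	initialBackoff = 1 * time.Second
	maxBackoff     = 30 * time.Second
)

// nextBackoff doubles the delay and caps it at maxBackoff.
// Assumption: a factor of 2; the text only gives the range.
func nextBackoff(d time.Duration) time.Duration {
	d *= 2
	if d > maxBackoff {
		return maxBackoff
	}
	return d
}

// reconnect retries dial until it succeeds, waiting between failed attempts.
// wait is injected so the loop can run without sleeping; real code would
// pass time.Sleep and re-send MsgRegister once dial succeeds.
func reconnect(dial func() error, wait func(time.Duration)) {
	delay := initialBackoff
	for dial() != nil {
		wait(delay)
		delay = nextBackoff(delay)
	}
}

// simulate returns the waits performed when dial fails the given number of times.
func simulate(failures int) []time.Duration {
	var waits []time.Duration
	attempt := 0
	reconnect(func() error {
		attempt++
		if attempt <= failures {
			return errors.New("dial failed")
		}
		return nil
	}, func(d time.Duration) { waits = append(waits, d) })
	return waits
}
```

With seven consecutive failures, `simulate(7)` yields waits of 1s, 2s, 4s, 8s, 16s, 30s, and 30s.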

Local UDP Proxy

Each relayed peer gets a dedicated UDP proxy running on localhost.

Why a Local Proxy?

WireGuard is a kernel-level interface that speaks UDP only. It cannot directly use a TCP connection. The proxy bridges this gap:

flowchart LR
    WG[WireGuard kernel] -->|UDP| P[Proxy 127.0.0.1:random]
    P -->|TCP| R[Relay Pool]

Socket Strategy

The proxy uses net.DialUDP to create a connected UDP socket:

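localAddr := &net.UDPAddr{IP: net.IPv4(127, 0, 0, 1), Port: 0} // port 0: the kernel picks a free port
remoteAddr := &net.UDPAddr{IP: net.IPv4(127, 0, 0, 1), Port: wgPort}
conn, err := net.DialUDP("udp4", localAddr, remoteAddr)
if err != nil {
    return fmt.Errorf("dial WireGuard port %d: %w", wgPort, err)
}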

This gives the proxy a stable local address (e.g., 127.0.0.1:54321) that WireGuard uses as the peer's endpoint. Because the socket is connected to the WireGuard port, conn.Write() issues write(2) rather than sendto(2), which matters for Cilium compatibility (see Adaptive Write).

Adaptive Write

The proxy implements a two-tier write strategy:

Standard: conn.Write(payload) — uses Go's net.UDPConn
Fallback: syscall.Write(dupFD, payload) — raw syscall on a duplicated fd

If conn.Write() returns EPERM (e.g., from Cilium BPF hooks), the proxy switches to syscall.Write mode for all subsequent writes. This is tracked via an atomic.Bool for lock-free access.
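The two-tier strategy can be sketched with the write paths injected as functions, which keeps the example free of real sockets. The type and field names are hypothetical. The sketch also assumes that the packet that hit EPERM is retried on the fallback path; the text only says that subsequent writes use it. `errBlocked` is a helper for simulating the Cilium case.

```go
package main

import (
	"errors"
	"sync/atomic"
	"syscall"
)

// writeFunc abstracts one write path, e.g. conn.Write or a raw syscall.Write wrapper.
type writeFunc func(p []byte) (int, error)

// errBlocked simulates the EPERM that a BPF hook might return.
var errBlocked error = syscall.EPERM

// adaptiveWriter tries primary first and switches to fallback for good
// after the first EPERM. atomic.Bool requires Go 1.19 or newer.
type adaptiveWriter struct {
	primary     writeFunc // standard path, e.g. (*net.UDPConn).Write
	fallback    writeFunc // fallback path, e.g. syscall.Write on a duplicated fd
	useFallback atomic.Bool
}

func (w *adaptiveWriter) Write(p []byte) (int, error) {
	if w.useFallback.Load() {
		return w.fallback(p)
	}
	n, err := w.primary(p)
	if errors.Is(err, syscall.EPERM) {
		// Remember the switch so later writes skip the primary path.
		w.useFallback.Store(true)
		// Assumption: retry the failed packet instead of dropping it.
		return w.fallback(p)
	}
	return n, err
}
```

`errors.Is` is used rather than a direct comparison because the `net` package wraps the errno in `*net.OpError` and `*os.SyscallError`.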

Sender Interface

The proxy sends data through a Sender interface:

type Sender interface {
    SendToPeer(destPubKey [32]byte, payload []byte) error
}

This abstraction allows the proxy to work with either a single Client or the Pool, making the relay layer pluggable.

Relay Pool

The relay pool manages connections to multiple relay server instances for scalability and high availability.

Architecture

flowchart TB
    subgraph Agent
        Pool[Relay Pool]
        Pool --> C1[Client relay-0]
        Pool --> C2[Client relay-1]
        Pool --> C3[Client relay-2]
    end
    subgraph K8s["Headless Service"]
        R1[relay-0 Pod]
        R2[relay-1 Pod]
        R3[relay-2 Pod]
    end
    C1 <-->|TCP| R1
    C2 <-->|TCP| R2
    C3 <-->|TCP| R3

How It Works

DNS Discovery: The pool resolves the relay address (typically a Kubernetes Headless Service) to get all pod IPs.
Full Registration: Agents connect to and register on all discovered relay instances. This ensures any relay can deliver packets to any agent.
Send Strategy: When sending a packet, the pool tries each connected relay in order until one succeeds.
Periodic Re-resolution: Every 30 seconds, the pool re-resolves DNS to detect scale-up/scale-down events. New replicas get connected; stale entries are removed.
Per-Client Reconnect: Each client in the pool has its own auto-reconnect loop, so individual relay failures don't affect the rest.
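The send strategy described above (try each connected relay in order until one succeeds) could be sketched as follows. `Sender` mirrors the interface shown in the Sender Interface section; `pool` and `stubClient` are simplified, hypothetical stand-ins rather than the real types. `errors.Join` requires Go 1.20 or newer.

```go
package main

import "errors"

// Sender mirrors the interface from the Sender Interface section.
type Sender interface {
	SendToPeer(destPubKey [32]byte, payload []byte) error
}

// pool is a simplified stand-in holding one client per connected relay.
type pool struct {
	clients []Sender
}

// The pool itself satisfies Sender, so a proxy can use it like a single client.
var _ Sender = (*pool)(nil)

// SendToPeer tries each client in order and stops at the first success.
func (p *pool) SendToPeer(dest [32]byte, payload []byte) error {
	var errs []error
	for _, c := range p.clients {
		err := c.SendToPeer(dest, payload)
		if err == nil {
			return nil
		}
		errs = append(errs, err)
	}
	if len(errs) == 0 {
		return errors.New("no relay connected")
	}
	return errors.Join(errs...)
}

// stubClient is a test double that either fails or counts successful sends.
type stubClient struct {
	fail bool
	sent int
}

func (s *stubClient) SendToPeer([32]byte, []byte) error {
	if s.fail {
		return errors.New("relay unavailable")
	}
	s.sent++
	return nil
}
```

Because every agent registers on all relays, the first relay that accepts the packet can deliver it, so later clients in the list are never contacted for that packet.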

Scaling Relay

To scale the relay:

Deploy the relay as a Deployment with multiple replicas.
Create a Headless Service (clusterIP: None) that selects the relay pods.
The agent's pool resolves the Headless Service DNS name and gets all pod IPs.
Each agent registers on every replica, so any replica can route to any agent.

apiVersion: v1
kind: Service
metadata:
  name: wirekube-relay
  namespace: wirekube-system
spec:
  clusterIP: None
  selector:
    app: wirekube-relay
  ports:
    - port: 3478
      targetPort: 3478
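When replicas are added or removed, the pool's periodic re-resolution has to work out which addresses are new and which are stale. A minimal sketch of that comparison is shown below; the function names and the use of plain IP strings as keys are assumptions for illustration.

```go
package main

import (
	"context"
	"net"
	"sort"
)

// resolve returns the addresses behind host, for example the DNS name of
// a headless Service, which yields one record per ready pod.
func resolve(ctx context.Context, host string) ([]string, error) {
	return net.DefaultResolver.LookupHost(ctx, host)
}

// diff compares the currently connected set with a fresh DNS answer.
// added lists addresses to connect to; removed lists stale ones to close.
func diff(connected map[string]bool, resolved []string) (added, removed []string) {
	seen := make(map[string]bool, len(resolved))
	for _, ip := range resolved {
		if seen[ip] {
			continue // ignore duplicate records
		}
		seen[ip] = true
		if !connected[ip] {
			added = append(added, ip)
		}
	}
	for ip := range connected {
		if !seen[ip] {
			removed = append(removed, ip)
		}
	}
	// Sort for deterministic output, since map iteration order is random.
	sort.Strings(added)
	sort.Strings(removed)
	return added, removed
}
```

A pool would call `resolve` on its 30-second tick, pass the result to `diff`, then dial the added addresses and close the removed ones.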

Data Handler Callback

When the pool receives data from any relay, it routes the packet to the correct local UDP proxy based on the source WireGuard public key:

Relay → Pool.handleData(srcKey, payload) → proxies[srcKey].DeliverToWireGuard(payload)
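The routing step above amounts to a map lookup keyed by the source public key. In the sketch below, the `Pool` fields, the `proxy` interface, the error return of `DeliverToWireGuard`, and the behavior for an unknown key are assumptions; only the names `handleData` and `DeliverToWireGuard` come from the text.

```go
package main

import (
	"errors"
	"sync"
)

// proxy is the part of the local UDP proxy that the pool needs here.
// Assumption: DeliverToWireGuard returns an error.
type proxy interface {
	DeliverToWireGuard(payload []byte) error
}

// Pool is a simplified stand-in that only holds the proxy table.
type Pool struct {
	mu      sync.RWMutex
	proxies map[[32]byte]proxy
}

// handleData routes a packet received from any relay to the proxy
// that belongs to the sending peer.
func (p *Pool) handleData(srcKey [32]byte, payload []byte) error {
	p.mu.RLock()
	px, ok := p.proxies[srcKey]
	p.mu.RUnlock()
	if !ok {
		// Assumption: packets from peers without a proxy are rejected.
		return errors.New("no proxy for source key")
	}
	return px.DeliverToWireGuard(payload)
}

// recordingProxy is a test double that stores delivered payloads.
type recordingProxy struct {
	got []string
}

func (r *recordingProxy) DeliverToWireGuard(payload []byte) error {
	r.got = append(r.got, string(payload))
	return nil
}
```

Keying by source rather than destination works because each relayed peer has its own proxy, and WireGuard identifies that peer by the proxy's local address.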

Managed Relay Discovery

For provider: managed, the agent needs to connect to the relay before the mesh tunnel is up (a chicken-and-egg problem). The agent works around this by querying the Kubernetes Service API for the relay's externally reachable address:

ExternalIPs — Manually configured public IPs on the Service
LoadBalancer Ingress — Cloud-assigned external IP or hostname
NodePort — Service NodePort via a cluster node's public IP (ExternalIP or public InternalIP for cloud providers like OCI)

ClusterIP DNS is intentionally not used as a fallback because CoreDNS resolution depends on a functioning CNI, which may not be available on hybrid/NAT'd nodes before the mesh tunnel is established. If no external address is found, the agent retries with exponential backoff until the Service becomes externally reachable (e.g., LoadBalancer IP is assigned).
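The selection logic can be sketched as below, assuming the three sources are tried in the order listed. `serviceInfo` is a hypothetical, flattened view of the Service and node fields involved; real code would read them from the Kubernetes API types. `errNoExternalAddr` is the signal for the caller to retry with backoff.

```go
package main

import (
	"errors"
	"net"
	"strconv"
)

// serviceInfo is a hypothetical summary of what the agent reads from the API.
type serviceInfo struct {
	ExternalIPs   []string // spec.externalIPs
	LBIngress     []string // load balancer ingress IPs or hostnames
	Port          int      // Service port, e.g. 3478
	NodePort      int      // 0 when the Service has no NodePort
	NodePublicIPs []string // public IPs of cluster nodes
}

var errNoExternalAddr = errors.New("no externally reachable relay address yet")

// pickEndpoint returns the first usable host:port.
// Assumption: the priority is ExternalIPs, then LoadBalancer ingress, then NodePort.
func pickEndpoint(s serviceInfo) (string, error) {
	switch {
	case len(s.ExternalIPs) > 0:
		return net.JoinHostPort(s.ExternalIPs[0], strconv.Itoa(s.Port)), nil
	case len(s.LBIngress) > 0:
		return net.JoinHostPort(s.LBIngress[0], strconv.Itoa(s.Port)), nil
	case s.NodePort != 0 && len(s.NodePublicIPs) > 0:
		return net.JoinHostPort(s.NodePublicIPs[0], strconv.Itoa(s.NodePort)), nil
	}
	return "", errNoExternalAddr
}
```

`net.JoinHostPort` is used instead of string concatenation so that IPv6 addresses are bracketed correctly.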

Deployment Options

Managed Relay (In-Cluster)

kubectl apply -f config/relay/deployment.yaml

Configure in WireKubeMesh:

spec:
  relay:
    provider: managed
    managed:
      replicas: 1
      serviceType: LoadBalancer
      port: 3478

External Relay

Deploy on any machine with a public IP or behind a TCP load balancer:

wirekube-relay --addr :3478

Configure in WireKubeMesh:

spec:
  relay:
    provider: external
    external:
      endpoint: "relay.example.com:3478"
      transport: tcp

Behind a TCP Load Balancer

Internet ---- TCP LB :3478 ---- Relay Pod/Server :3478

The relay uses TCP as its transport specifically so that it works behind load balancers that only support TCP.

Capacity

A single relay instance can handle thousands of concurrent connections. Each connection is a lightweight TCP socket with minimal CPU overhead, because the relay only forwards opaque encrypted packets and never decrypts them.