Podman Rootless Containers: Understanding the Copy Fail Exploit and How to Defend Against It

Rootless containers were supposed to be the safe default — no root on the host, no problem. The Copy Fail vulnerability in Podman shatters that assumption by demonstrating how a seemingly mundane file-copy operation can become a privilege escalation vector. If you're running Podman in rootless mode in CI pipelines, developer workstations, or production edge nodes, this is worth your full attention right now.

🔍 What Is the Copy Fail Vulnerability?

The vulnerability centers on podman cp — the command used to copy files between a container and the host filesystem. In rootless mode, Podman uses a user namespace mapping: your unprivileged host UID (e.g., 1000) maps to UID 0 inside the container. This is the foundation of rootless isolation.

The exploit arises from a race condition and improper privilege handling during the copy operation. When podman cp transfers a file from a container to the host, it temporarily operates with elevated capabilities inside the user namespace to read files owned by the container's root. Under specific conditions, an attacker who controls the container's filesystem can craft a symlink or hardlink that redirects the write destination on the host — landing a file with attacker-controlled content in a path they shouldn't be able to write to.

The critical detail: the write happens in the host mount namespace context, not the container's. If the timing or symlink resolution is exploited correctly, you can overwrite host files that the unprivileged user nominally cannot touch.

🗺️ Attack Surface Breakdown

Prerequisites for Exploitation

The attacker controls or can influence files inside the container (e.g., a compromised image layer or writable volume).
A privileged user or automation script runs podman cp from that container to the host.
The host kernel's user namespace support is enabled (standard on most modern Linux distros).

Why Rootless Doesn't Fully Protect You Here

Rootless containers eliminate the need for a setuid daemon, but they still rely on newuidmap and newgidmap — setuid binaries that perform the UID mapping. The user namespace gives the container process apparent root inside the namespace. The vulnerability exploits the moment when Podman's copy logic bridges the namespace boundary without fully sanitizing the destination path against symlink attacks.

sequenceDiagram participant A as Attacker (Container) participant P as Podman cp Logic participant H as Host Filesystem A->>A: Create malicious symlink inside container P->>A: Read source path (container namespace) P->>P: Resolve destination path (crosses namespace boundary) A-->>P: Symlink redirects to sensitive host path P->>H: Write file to attacker-controlled host path H-->>P: File written outside intended destination

⚙️ Reproducing the Concept (Safely)

The following demonstrates the structural risk — a container with a malicious symlink that could redirect a copy operation. This is illustrative, not a working exploit payload.

# Pull a base image and start a container
podman run -d --name test-copy alpine sleep 300

# Inside the container, create a symlink pointing outside the container root
# (In a real attack, this would target a sensitive host path)
podman exec test-copy sh -c "
  mkdir -p /tmp/evil &&
  ln -s /etc/cron.d /tmp/evil/target
"

# A naive copy from the container to the host
# If vulnerable, this could follow the symlink into the host namespace
podman cp test-copy:/tmp/evil/target /tmp/host-output

# Clean up
podman rm -f test-copy

Patched versions of Podman resolve symlinks within the container's mount namespace before performing the host-side write, breaking the chain. Always verify your Podman version against published CVEs.

🛡️ Concrete Mitigation Strategies

1. Keep Podman Updated

This is non-negotiable. The fix lives in the Podman codebase itself. Check your version and compare against the relevant CVE advisory:

podman version
# Ensure you're on a patched release per your distro's security advisories

# On Fedora/RHEL/CentOS Stream:
sudo dnf update podman

# On Ubuntu/Debian (via upstream PPA or package manager):
sudo apt-get update && sudo apt-get install --only-upgrade podman

2. Restrict `podman cp` in Automation

If your CI/CD pipeline uses podman cp, scope it tightly. Never copy from containers running untrusted or user-supplied images without inspection:

import subprocess
import shlex

def safe_container_copy(container_id: str, src_path: str, dest_path: str) -> None:
    """
    Wrapper that validates destination path before invoking podman cp.
    Prevents writes outside an explicit allowed directory.
    """
    import os
    allowed_base = "/var/lib/ci-artifacts"
    real_dest = os.path.realpath(dest_path)

    if not real_dest.startswith(allowed_base):
        raise ValueError(
            f"Destination {dest_path} resolves to {real_dest}, "
            f"which is outside the allowed base {allowed_base}"
        )

    cmd = ["podman", "cp", f"{container_id}:{src_path}", real_dest]
    result = subprocess.run(cmd, capture_output=True, text=True, timeout=30)

    if result.returncode != 0:
        raise RuntimeError(f"podman cp failed: {result.stderr}")

    print(f"Copied to {real_dest} successfully.")

3. Use SELinux or AppArmor Confinement

Mandatory access controls add a host-side enforcement layer that can block unexpected writes even if the copy logic is tricked:

# Verify SELinux is enforcing on the host
getenforce
# Should return: Enforcing

# Run containers with the default SELinux label (Podman does this automatically on RHEL-family)
podman run --security-opt label=type:container_t alpine sh

# For stricter isolation, use a custom policy or the spc_t type only when absolutely needed

4. Drop Unnecessary Capabilities

Even in rootless mode, explicitly drop capabilities to reduce the blast radius:

podman run \
  --cap-drop=ALL \
  --security-opt no-new-privileges \
  --read-only \
  alpine sh

5. Audit User Namespace Configuration

Limit the number of subordinate UIDs/GIDs available to each user. Smaller UID ranges reduce the scope of what a compromised container can impersonate:

# Check current subordinate UID ranges
cat /etc/subuid
# Example output: youruser:100000:65536

# Reduce the range if your workloads don't need 65k UIDs
# Edit /etc/subuid and /etc/subgid, then run:
podman system migrate

Key Takeaways

Rootless ≠ risk-free: User namespaces reduce attack surface but don't eliminate it — namespace boundary crossings during operations like podman cp are where vulnerabilities hide.
Patch first, harden second: No amount of configuration compensates for running a version with a known code-level flaw. Update Podman immediately.
Treat container filesystems as untrusted: Any file copied from a container to the host should be treated like user input — validate destination paths programmatically.
Defense in depth works: SELinux/AppArmor enforcement, capability dropping, and no-new-privileges together make exploitation significantly harder even when a single layer fails.
Audit your automation: CI pipelines that blindly run podman cp on arbitrary images are the highest-risk scenario — add path validation wrappers like the Python example above.

Your next steps: audit every place podman cp appears in your scripts and pipelines, confirm your Podman version is patched, and enable SELinux enforcing mode if you're on a RHEL-family system. For deeper hardening, review the Red Hat rootless containers guide and subscribe to Red Hat Security Advisories for Podman CVE notifications.

Search This Blog

AI Dev Notes