Production-ready Docker packaging for Python developers

Table of Contents

Articles: The basics of Docker packaging

  1. Connection refused? Docker networking and how it impacts your image
    Learn how to fix connection refused errors when trying to connect to a Docker container.

  2. Where’s that log file? Debugging failed Docker builds
    Your Docker build just failed, and the reason is buried a log file—which is somewhere inside the build process. How do you read that log file?

  3. Faster or slower: the basics of Docker build caching
    Docker’s layer caching can speed up your image build—if you write your Dockerfile correctly.

  4. Debugging ImportError and ModuleNotFoundErrors in your Docker image
    There are many reasons your Python code might fail to import in Docker. Here’s a quick series of checks you can do to figure out the problem.

  5. A tableau of crimes and misfortunes: the ever-useful docker history
    Use the docker history command to understand how a Docker image is constructed, why an image is too big, and how Dockerfile commands work.

Looking for more? Learn the fundamental concepts of Docker packaging in just one afternoon, by reading my book: Just Enough Docker Packaging.


Free ebook: Introduction to Dockerizing for Production

Learn a step-by-step iterative DevOps packaging process in this free mini-ebook. You'll learn what to prioritize, the decisions you need to make, and the ongoing organizational processes you need to start.

Plus, you'll join my newsletter and get weekly articles covering practical tools and techniques, from Docker packaging to Python best practices.


Articles: Best practices for production

The broken status quo

  1. Broken by default: why you should avoid most Dockerfile examples
    Most Dockerfile examples for Python you’ll find on the Web are broken. And that’s a problem.

  2. Reviewing the official Dockerfile best practices: good, bad, insecure
    The official Docker documentation’s Dockerfile best practices are mostly good—but they are sometimes wrong, and if you’re using Python, too generic.

  3. The worst so-called “best practice” for Docker
    Many people (although fewer than in the past) will tell you not to install security updates in your Docker image. This is terrible advice.

Base image and dependencies

  1. The best Docker base image for your Python application (August 2021)
    Ubuntu? Official Python images? Alpine Linux? Here’s how to choose a good base Docker image for your Python application.

  2. Why you really need to upgrade pip
    Using old versions of pip can result in installing old packages, or needing to recompile packages from scratch. Make sure you upgrade pip before using it.

  3. A deep dive into the official Docker image for Python
    The official Python Docker image is useful, but to actually understand why, and to use it correctly, it’s worth understanding how exactly it’s constructed.

  4. Using Alpine can make Python Docker builds 50× slower
    Alpine Linux is often recommended as a smaller, faster Docker base image. But if you’re using Python, it will slow down your build and make your image larger.

  5. Why you can’t switch to Python 3.10 just yet
    Python 3.10 is out now—when should you start using it?

  6. Building on solid ground: reproducible Docker builds for Python
    Learn how to get reproducible Docker builds for your Python application, including base image, system packages, and Python dependencies.

  7. Push and pull: when and why to update your dependencies
    When should you update your software project’s dependencies? There are two rhythms to updates: security and critical bug fixes, and broader updates.

Security

  1. Installing system packages in Docker with minimal bloat
    Learn how to minimize your Docker image size while installing or updating system packages on on Debian, Ubuntu, and RHEL.

  2. Less capabilities, more security: preventing Docker escalation attacks
    Reduce the security risk from your Docker image by running as a non-root user and reducing capabilities.

  3. Avoiding insecure images from Docker build caching
    Docker’s layer caching is great for speeding up builds—but you need to be careful or it’ll cause you to have insecure dependencies.

  4. How to (not) use Docker to share your password with hackers
    Docker images can leak runtime secrets, build secrets, and even just some secret files you have lying around. Learn how to leak them, and how to avoid it.

  5. Don’t leak your Docker image’s build secrets
    When you’re building Docker images you often need some secrets: a password, an SS Hkey. The secure mechanism is BuildKit; others might leak them.

  6. Build secrets in Docker and Compose, the secure way
    Builds secrets like passwords may be used to build your Docker image; learn how to use them securely in Docker Compose without leaking them.

  7. The security scanner that cried wolf
    If you’ve ever been alarmed by how many security vulnerabilities your Docker image has, even after you’ve installed security updates, here’s what’s going on.

  8. Security scanners for Python and Docker: from code to dependencies
    How do you know your Python code is secure? How about your Docker image? Learn how to catch problems is using security scanners running in your CI setup.

Fast builds, small images

  1. The high cost of slow Docker builds
    A slow Docker build on the critical path for developer feedback is a lot more expensive than you think.

  2. Faster Docker builds with pipenv, poetry, or pip-tools
    Installing Python dependencies separately from your code speed ups Docker builds. Here’s how to do it with pipenv, poetry, or pip-tools.

  3. Elegantly activating a virtualenv in a Dockerfile
    How to activate a Python virtualenv in a Dockerfile without repeating yourself—plus, you’ll learn what activating a virtualenv actually does.

  4. Poetry vs. Docker caching: Fight!
    Poetry’s versioning scheme for Python dependencies makes Docker caching harder, which means slower images rebuilds. Learn some workarounds.

  5. Speeding up Docker builds in CI with BuildKit
    If your CI runners spin up an empty environment, your Docker builds will be slow. Speed up builds by warming the cache, plus BuildKit’s extra speedup.

  6. Speed up pip downloads in Docker with BuildKit’s new caching
    Every time you change your Python pip requirements and rebuild your Docker image, you’re going to redownload all your packages. You can fix this with BuildKit.

  7. Shrinking your Python application’s Docker image: an overview
    Learn the variety of techniques you can use to make your Python application’s Docker image a whole lot smaller.

  8. Multi-stage builds #1: Smaller images for compiled code
    Building Docker images with compiled code can lead to huge images. Learn how to shrink them with multi-stage builds.

  9. Multi-stage builds #2: Python specifics
    Once you understand generic Docker multi-stage builds, here’s how to implement them for Python applications, with virtualenvs or user installs.

  10. Multi-stage builds #3: Speeding up your builds
    Multi-stage Docker image builds give you small images and fast builds, but only if takes extra steps prevent slowness due to caching problems.

Conda

  1. Pip vs Conda: an in-depth comparison of Python’s two packaging systems
    Python has two packaging systems, pip and Conda. Learn the differences between them so you can pick the right one for you.

  2. Activating a Conda environment in your Dockerfile
    Learn how to activate a Conda environment in your Dockerfile.

  3. Scanning your Conda environment for security vulnerabilities
    Learn how to check your Conda environment and packages for security vulnerabilities.

  4. Shrink your Conda Docker images with conda-pack
    Docker images built for Conda tend to be quite large. Learn how to shrink them by using the conda-pack tool and multi-stage builds.

  5. Reproducible and upgradable Conda environments with conda-lock
    You want your packaging to be reproducible, and upgrade Conda dependencies without conflicts. Learn how to do it with a third-party tool: conda-lock.

Applications and runtime

  1. Configuring Gunicorn for Docker
    Running Gunicorn in a Docker container isn’t the same as running on a virtual machine or physical server. Learn what you need to do differently.

  2. What’s running in production? Making your Docker images identifiable
    It’s difficult to debug production problems if you don’t know what image is running in production.

  3. Decoupling database migrations from server startup: why and how
    Migrating your database schema when your application’s Docker container starts up? Here’s some reasons to rethink that choice.

  4. A Python prompt into a running process: debugging with Manhole
    Your Python process is acting strange—learn how to get a live Python interpreter prompt inside your running process for debugging.

Packaging as a process

  1. A thousand little details: developing software for ops
    Software for ops suffers both from historical complexity and from problem space complexity. Some generic suggestions, with Docker packaging as an example.

  2. Your Docker build needs a smoke test
    If you don’t test your Docker image before you push it, you’ll waste time (and maybe break production).

  3. “Let’s use Kubernetes!” Now you have 8 problems
    For smaller teams, Kubernetes is usually the wrong solution: too complex, too complicated, and with too much work to keep it running.

Docker variants and alternatives

  1. Docker BuildKit: faster builds, new features, and now it’s stable
    BuildKit is Docker’s new system for building images. It’s faster, has previously missing security featuers, and it’s finally stable.

  2. Options for Python packaging: Wheels, Conda, Docker, and more
    Learn and compare the many ways to package your Python server for distribution: wheels, PEX, RPM/DEB, Conda, executables, Docker.

  3. Docker vs. Singularity for data processing: UIDs and filesystem access
    Containers allow for reproducibility of data processing applications. Docker is the most popular option, but Singularity is also well-suited to this use case.

  4. Using Podman with BuildKit, the better Docker image builder
    Podman is a Docker replacment, and BuildKit is a new builder for Docker images. Learn how to use BuildKit together with Podman.

  5. Building Docker images on GitLab CI: Docker-in-Docker and Podman
    Building Docker images with Gitlab CI can be a little complicated. Learn how to do it with Docker-in-Docker, or the simpler option of using Podman.


Free ebook: Introduction to Dockerizing for Production

Learn a step-by-step iterative DevOps packaging process in this free mini-ebook. You'll learn what to prioritize, the decisions you need to make, and the ongoing organizational processes you need to start.

Plus, you'll join my newsletter and get weekly articles covering practical tools and techniques, from Docker packaging to Python best practices.


Products and services

Just Enough Docker Packaging ($29)

Just Enough Docker Packaging

New to Docker? Learn the fundamental concepts and the practical debugging techniques you need to understand Docker packaging—in just one afternoon.


Learn more


Introduction to Dockerizing for Production (FREE)

Level-up your DevOps understanding. Learn why production is different, why Docker images are great for packaging, and a step-by-step iterative Dockerizing process covered in this free mini-ebook. A good conceptual introduction for the Handbook.

Learn more


Python on Docker Production Handbook ($79)

Python on Docker Production Handbook

Quickly learn how to make your Python application’s Docker packaging production-ready. You’ll get a step-by-step plan, and a reference covering 70+ best practices, including security, fast builds, small images, Pipenv, Poetry, Conda, and much more.

Learn more


Production-Ready Python Containers template ($299)

Instead of wasting days of expensive developer time implementing and testing your own Docker packaging infrastructure, you can ship your Docker images with confidence—in just hours!—by using this template.

Learn more

Production-Ready Conda Containers template (coming soon)

Create a production-ready Docker image of your Conda-based application, in just one hour.

Learn more

Remote corporate training

Make you team more productive by upgrading their skills. Whether it’s the basics of Docker packaging, or the best practices you need to run in production, consider one of my Docker packaging training courses.