Posts

  • Benchmarking IP and Unix domain sockets (for real)

    In a previous post, an artificial benchmark was run to measure the performance difference between IP and Unix domain sockets. The results were somewhat impressive, with Unix sockets performing at least twice as fast as IP sockets. But how do these two forms of communication behave in the real world, over a battle-tested application protocol? Would the throughput really double just by switching between them? We’ll use a Flask app served by Gunicorn behind an nginx reverse proxy to find out.

    The following tests were executed on a c4.large (2 cores, 3.75 GB RAM) instance on Amazon Web Services (AWS). None of the multi-threading/multi-process options offered by Gunicorn were used, so what we’ve got here is really what it can serve using a single CPU core. This also leaves a free core to run both nginx and the benchmarking tool (wrk) itself.

    The application is pretty close to the standard Flask “hello world” example:

    requirements.txt

    Flask==0.12
    gunicorn==19.6.0
    

    server.py

    from flask import Flask
    app = Flask(__name__)
    
    @app.route("/")
    def hello():
        return "Hello there!"
    
    if __name__ == "__main__":
        app.run()
    

    Gunicorn was used to serve the application with no option other than --bind. (A quick way to check the Unix socket backend directly, without going through nginx, is sketched right after these commands.)

    • IP: gunicorn --bind 0.0.0.0:8000 server:app
    • Unix domain socket: gunicorn --bind unix:/tmp/gunicorn.sock server:app
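
    As a sanity check (not part of the original benchmark), the Unix-socket-bound instance can also be queried directly, bypassing nginx. Here is a minimal sketch, in a hypothetical check_uds.py, assuming the socket path used above:

    check_uds.py

    #!/usr/bin/env python

    import socket

    # Connect straight to the Gunicorn Unix socket and send a raw HTTP request.
    sock = socket.socket(socket.AF_UNIX, socket.SOCK_STREAM)
    sock.connect('/tmp/gunicorn.sock')
    sock.sendall(b'GET / HTTP/1.0\r\nHost: localhost\r\n\r\n')
    # The response should contain the "Hello there!" body from the Flask view.
    print(sock.recv(4096))
    sock.close()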

    This is the nginx virtual host configuration for both Gunicorn instances:

    /etc/nginx/sites-available/gunicorn

    server {
        listen 80;
        server_name bench-ip.myhro.info;
    
        location / {
            proxy_pass http://127.0.0.1:8000;
            proxy_redirect off;
            proxy_set_header Host $host;
            proxy_set_header X-Forwarded-For $proxy_add_x_forwarded_for;
            proxy_set_header X-Forwarded-Proto $scheme;
        }
    }
    
    server {
        listen 80;
        server_name bench-unix.myhro.info;
    
        location / {
            proxy_pass http://unix:/tmp/gunicorn.sock;
            proxy_redirect off;
            proxy_set_header Host $host;
            proxy_set_header X-Forwarded-For $proxy_add_x_forwarded_for;
            proxy_set_header X-Forwarded-Proto $scheme;
        }
    }
    

    We’ll have to append both hostnames to our /etc/hosts file to avoid the need for a DNS server:

    (...)
    127.0.0.1 bench-ip.myhro.info
    127.0.0.1 bench-unix.myhro.info
    

    The parameters used in this benchmark were pretty much what wrk offers by default. Experimenting with more threads or connections didn’t result in a significant difference, so the only parameter set was -d5s, which means “send as many requests as you can for five seconds” (i.e. wrk -d5s http://bench-ip.myhro.info/).

    IP benchmark

    Running 5s test @ http://bench-ip.myhro.info/
      2 threads and 10 connections
      Thread Stats   Avg      Stdev     Max   +/- Stdev
        Latency     5.44ms  303.33us  11.56ms   99.02%
        Req/Sec     0.92k    16.21     0.96k    66.00%
      9191 requests in 5.00s, 1.60MB read
    Requests/sec:   1837.29
    Transfer/sec:    328.26KB
    

    Unix domain socket benchmark

    Running 5s test @ http://bench-unix.myhro.info/
      2 threads and 10 connections
      Thread Stats   Avg      Stdev     Max   +/- Stdev
        Latency     4.95ms  283.81us  11.25ms   97.96%
        Req/Sec     1.01k    24.75     1.04k    90.00%
      10107 requests in 5.00s, 1.76MB read
    Requests/sec:   2019.39
    Transfer/sec:    360.79KB
    

    During multiple runs, these numbers were consistent. The Unix socket virtual host answered around 5 to 10% more requests on average. The difference is small but can be significant, especially for high-traffic web servers answering thousands of requests per minute. Still, this is nowhere near the 100% performance improvement we saw when comparing raw sockets rather than a real protocol like HTTP.

    It would still be interesting to compare how this application performs when running inside a Docker container. Docker is known for adding network overhead when using forwarded ports, so we’ll see how much that matters in this case. Two files will be used to create our application image and its containers:

    Dockerfile

    FROM ubuntu:xenial
    
    RUN apt-get update
    RUN apt-get install -y python-pip
    
    ADD . /app
    
    RUN pip install -r /app/requirements.txt
    
    WORKDIR /app
    

    docker-compose.yml

    version: "2"
    services:
      base:
        build: .
        image: flask
      ip:
        image: flask
        command: gunicorn --bind 0.0.0.0:8000 server:app
        ports:
          - "8000:8000"
        volumes:
          - .:/app
      uds:
        image: flask
        command: gunicorn --bind unix:/tmp/gunicorn.sock server:app
        volumes:
          - .:/app
          - /tmp:/tmp
    

    Let’s run wrk again, after docker-compose build and docker-compose up:

    Docker IP benchmark

    $ wrk -d5s http://bench-ip.myhro.info/
    Running 5s test @ http://bench-ip.myhro.info/
      2 threads and 10 connections
      Thread Stats   Avg      Stdev     Max   +/- Stdev
        Latency     7.03ms  791.63us  16.84ms   93.51%
        Req/Sec   713.54     20.21   747.00     70.00%
      7109 requests in 5.01s, 1.24MB read
    Requests/sec:   1420.17
    Transfer/sec:    253.73KB
    

    Docker Unix domain socket benchmark

    $ wrk -d5s http://bench-unix.myhro.info/
    Running 5s test @ http://bench-unix.myhro.info/
      2 threads and 10 connections
      Thread Stats   Avg      Stdev     Max   +/- Stdev
        Latency     4.94ms  266.67us  10.74ms   97.24%
        Req/Sec     1.02k    29.87     1.04k    95.00%
      10116 requests in 5.00s, 1.76MB read
    Requests/sec:   2022.18
    Transfer/sec:    361.29KB
    

    Under Docker, the difference between IP sockets over forwarded ports and Unix sockets via shared volumes was huge. Serving 40-45% more requests (2022 versus 1420 requests/s) is a big deal in web server terms: with a setup like this one, the port-forwarded variant would need roughly 40% more hardware resources to serve the same number of clients, which directly reflects on infrastructure and project costs as a whole.

    A few conclusions can be drawn from this experiment:

    • Avoid Docker forwarded ports in production environments. Use either Unix sockets or the host network mode instead, as they introduce virtually no overhead.
    • Ports can be easier to manage than a bunch of socket files when dealing with multiple processes, whether running many applications or scaling a single one. If you can afford a small drop in throughput, go for IP sockets.
    • If you have to extract every drop of performance available, use Unix domain sockets where possible.
  • How fast are Unix domain sockets?

    Warning: this is my first post written in English, after over five years writing only in Portuguese. After reading many technical articles written in English by non-native speakers, I wondered: imagine how much information I would be missing if they had written those posts in French or Russian. Following their example, this blog can now reach a much wider audience as well.

    It has probably happened more than once: you ask your team how a reverse proxy should talk to the application backend server. “Unix sockets. They are faster.”, they’ll say. But how much faster will this communication be? And why is a Unix domain socket faster than an IP socket when multiple processes are talking to each other on the same machine? Before answering those questions, we should figure out what Unix sockets really are.

    Unix sockets are a form of inter-process communication (IPC) that allows data exchange between processes on the same machine. They are special files, in the sense that they exist in the file system like a regular file (and hence have an inode and metadata like ownership and permissions associated with them), but they are read from and written to using the recv() and send() syscalls instead of read() and write(). When binding and connecting to a Unix socket, we use file paths instead of IP addresses and ports.
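
    A minimal sketch of this (not from the original post; the file name socket_file.py is just illustrative): after bind(), the socket shows up in the file system as an inode of socket type, carrying ordinary ownership and permission bits.

    socket_file.py

    #!/usr/bin/env python

    import os
    import socket
    import stat

    path = '/tmp/example.sock'
    if os.path.exists(path):
        os.unlink(path)

    # Bind to a file path instead of an (address, port) pair.
    sock = socket.socket(socket.AF_UNIX, socket.SOCK_STREAM)
    sock.bind(path)

    info = os.stat(path)
    print(stat.S_ISSOCK(info.st_mode))      # True: the inode is of socket type
    print(oct(stat.S_IMODE(info.st_mode)))  # regular permission bits, e.g. 0755

    sock.close()
    os.unlink(path)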

    In order to determine how fast a Unix socket is compared to an IP socket, two proofs of concept (POCs) will be used. They were written in Python, which keeps them small and easy to understand. Their implementation details will be clarified when needed.

    IP POC

    ip_server.py

    #!/usr/bin/env python
    
    import socket
    
    server_addr = '127.0.0.1'
    server_port = 5000
    
    sock = socket.socket(socket.AF_INET, socket.SOCK_STREAM)
    sock.setsockopt(socket.SOL_SOCKET, socket.SO_REUSEADDR, 1)
    sock.bind((server_addr, server_port))
    sock.listen(0)
    
    print 'Server ready.'
    
    while True:
        conn, _ = sock.accept()
        conn.send('Hello there!')
        conn.close()
    

    ip_client.py

    #!/usr/bin/env python
    
    import socket
    import time
    
    server_addr = '127.0.0.1'
    server_port = 5000
    
    duration = 1
    end = time.time() + duration
    msgs = 0
    
    print 'Receiving messages...'
    
    while time.time() < end:
        sock = socket.socket(socket.AF_INET, socket.SOCK_STREAM)
        sock.connect((server_addr, server_port))
        data = sock.recv(32)
        msgs += 1
        sock.close()
    
    print 'Received {} messages in {} second(s).'.format(msgs, duration)
    

    Unix domain socket POC

    uds_server.py

    #!/usr/bin/env python
    
    import os
    import socket
    
    server_addr = '/tmp/uds_server.sock'
    
    if os.path.exists(server_addr):
        os.unlink(server_addr)
    
    sock = socket.socket(socket.AF_UNIX, socket.SOCK_STREAM)
    sock.bind(server_addr)
    sock.listen(0)
    
    print 'Server ready.'
    
    while True:
        conn, _ = sock.accept()
        conn.send('Hello there!')
        conn.close()
    

    uds_client.py

    #!/usr/bin/env python
    
    import socket
    import time
    
    server_addr = '/tmp/uds_server.sock'
    
    duration = 1
    end = time.time() + duration
    msgs = 0
    
    print 'Receiving messages...'
    
    while time.time() < end:
        sock = socket.socket(socket.AF_UNIX, socket.SOCK_STREAM)
        sock.connect(server_addr)
        data = sock.recv(32)
        msgs += 1
        sock.close()
    
    print 'Received {} messages in {} second(s).'.format(msgs, duration)
    

    As we can see from those code snippets, both implementations are as close to each other as possible. The differences between them are:

    • Their address family: socket.AF_INET (IP) and socket.AF_UNIX (Unix sockets).
    • To bind a process using socket.AF_UNIX, the socket file has to be removed first if it already exists, so that bind() can create it again.
    • When using socket.AF_INET, the socket.SO_REUSEADDR flag has to be set in order to avoid socket.error: [Errno 98] Address already in use errors that may occur even when the socket is properly closed. This option tells the kernel to reuse the same port even if there are connections in the TIME_WAIT state.

    Both POCs were executed on a Core i3 laptop running Ubuntu 16.04 (Xenial) with its stock kernel. There is no output at every loop iteration, to avoid the huge performance penalty of writing to the screen. Let’s take a look at their performance.

    IP POC

    First terminal:

    $ python ip_server.py
    Server ready.
    

    Second terminal:

    $ python ip_client.py
    Receiving messages...
    Received 10159 messages in 1 second(s).
    

    Unix domain socket POC

    First terminal:

    $ python uds_server.py
    Server ready.
    

    Second terminal:

    $ python uds_client.py
    Receiving messages...
    Received 22067 messages in 1 second(s).
    

    The Unix socket implementation can send and receive more than twice the number of messages over the course of a second, compared to the IP one. During multiple runs this proportion was consistent, varying by around 10% up or down on both of them. Now that we’ve measured the performance difference, let’s find out why Unix sockets are so much faster.

    It’s important to note that both the IP and the Unix socket implementations are using TCP (socket.SOCK_STREAM), so the answer isn’t related to how TCP performs compared to another transport protocol like UDP, for instance (see update 1). What happens is that when Unix sockets are used, the operating system’s entire IP stack is bypassed: no headers are added, no checksums are calculated (see update 2), no encapsulation and decapsulation of packets is done, and no routing is performed. Although the OS performs those tasks really fast, there is still a visible difference in benchmarks like this one.

    There’s plenty of room for real-world comparisons beyond the synthetic measurement demonstrated here. What would the throughput difference be when a reverse proxy like nginx talks to a Gunicorn backend server over IP or Unix sockets? Would it impact latency as well? What about transferring big chunks of data, like huge binary files, instead of small messages? Can Unix sockets be used to avoid Docker network overhead when forwarding ports from the host to a container?

    Updates:

    1. John-Mark Gurney and Justin Cormack pointed out that SOCK_STREAM doesn’t mean TCP under Unix domain sockets. This makes sense, but I couldn’t find any reference either affirming or denying it.
    2. Justin Cormack also mentioned that there’s no checksumming on local interfaces by default. Looking at the source code of the Linux loopback driver, this behavior seems to have been present in the kernel since version 2.6.12-r2.
