15-Apr-2025

Distributed Locking

Distributed locking is a crucial aspect of building robust and reliable distributed systems. It ensures that shared resources, like databases, file systems, or in-memory data structures, are accessed by only one process or thread at a time, preventing data corruption and race conditions. In a non-distributed environment, this is typically achieved using mutexes or semaphores. However, these mechanisms don’t work across multiple machines or processes that don’t share memory.

Several approaches exist for achieving distributed locking. Let’s explore a few common strategies:

1. Redis-Based Locking:

Redis, an in-memory data structure store, offers a simple and effective way to implement distributed locks. The core idea revolves around using the SETNX (SET if Not eXists) command. If the key doesn’t exist (meaning the lock is free), SETNX sets it and returns 1, indicating success. If the key already exists (lock held by another process), it returns 0.

const redis = require('redis');

const client = redis.createClient();

async function acquireLock(key, expirySeconds) {
  return new Promise((resolve, reject) => {
    client.setnx(key, 'locked', 'EX', expirySeconds, (err, reply) => {
      if (err) {
        reject(err);
      } else {
        resolve(reply === 1); // Returns true if lock acquired, false otherwise
      }
    });
  });
}

async function releaseLock(key) {
  return new Promise((resolve, reject) => {
    client.del(key, (err, reply) => {
      if (err) {
        reject(err);
      } else {
        resolve(reply === 1); // Returns true if lock released, false otherwise.
      }
    });
  });
}

// Example usage:
async function doWork() {
    const lockAcquired = await acquireLock("my_resource", 10); // Lock for 10 seconds
    if (lockAcquired) {
      console.log("Lock acquired, performing work...");
      // ... Perform your critical section work ...
      await releaseLock("my_resource");
      console.log("Lock released.");
    } else {
      console.log("Failed to acquire lock.");
    }
  }

doWork();

The EX option sets an expiration time for the lock, preventing deadlocks if the holding process crashes before releasing it.

2. Database-Based Locking:

Most relational databases support row-level locking, which can be leveraged to implement distributed locks. You can create a dedicated table for locks, and acquiring a lock involves inserting a row with the lock identifier. Releasing the lock involves deleting the corresponding row.

-- Acquire lock
INSERT INTO locks (lock_name) VALUES ('my_resource');

-- Release lock
DELETE FROM locks WHERE lock_name = 'my_resource';

Transactions should be used to ensure atomicity of acquiring and releasing locks.

3. Distributed Coordination Services (ZooKeeper, etcd):

Specialized distributed coordination services like ZooKeeper and etcd provide more sophisticated locking mechanisms with features like ephemeral nodes (nodes that automatically disappear if the client disconnects) and change notifications. These tools offer higher reliability and flexibility but introduce additional complexity.

Choosing the Right Approach:

The best approach depends on the specific requirements of your application. Redis is often a good choice for simple scenarios due to its speed and simplicity. Databases can be suitable when you already have a database infrastructure. ZooKeeper or etcd are preferable for more complex systems where high availability and advanced features are crucial.