Supervisors are processes that monitor other processes (called child processes) and restart them when they crash. They’re the foundation of Elixir’s fault-tolerance and self-healing capabilities.
This creative video explains supervision strategies using the classic game Doom! It’s a fun way to understand how different supervisor strategies work in practice.
When a child process crashes, only that process is restarted.
defmodule MyApp.Supervisor do
use Supervisor
def start_link(init_arg) do
Supervisor.start_link(__MODULE__, init_arg, name: __MODULE__)
end
@impl true
def init(_init_arg) do
children = [
{Cache, []},
{TaskProcessor, []}
]
Supervisor.init(children, strategy: :one_for_one)
end
end
When a child crashes, all other children are terminated and restarted.
# Use when all children are interdependent
children = [
{DatabaseConnection, []},
{CacheLayer, []},
{WebServer, []}
]
Supervisor.init(children, strategy: :one_for_all)
When a child crashes, only that child and the children that started after it are restarted.
# Order matters! Later processes depend on earlier ones
children = [
{Config, []}, # If this crashes, all restart
{Database, []}, # If this crashes, WebServer restarts too
{WebServer, []} # If this crashes, only it restarts
]
Supervisor.init(children, strategy: :rest_for_one)
For a fixed number of children known at startup:
defmodule MyApp.Supervisor do
use Supervisor
def start_link(init_arg) do
Supervisor.start_link(__MODULE__, init_arg, name: __MODULE__)
end
def init(_init_arg) do
children = [
{Cache, []},
{WebServer, port: 4000},
{TaskQueue, []}
]
Supervisor.init(children, strategy: :one_for_one)
end
end
For children that are started and stopped dynamically:
defmodule MyApp.WorkerSupervisor do
use DynamicSupervisor
def start_link(init_arg) do
DynamicSupervisor.start_link(__MODULE__, init_arg, name: __MODULE__)
end
def start_worker(args) do
spec = {Worker, args}
DynamicSupervisor.start_child(__MODULE__, spec)
end
@impl true
def init(_init_arg) do
DynamicSupervisor.init(strategy: :one_for_one)
end
end
defmodule MyApp.Application do
use Application
def start(_type, _args) do
children = [
# Top-level supervisor
{MyApp.MainSupervisor, []},
# Dynamic supervisor for workers
{MyApp.WorkerSupervisor, []},
# Supervisor for web-related processes
{MyApp.Web.Supervisor, []}
]
opts = [strategy: :one_for_one, name: MyApp.Supervisor]
Supervisor.start_link(children, opts)
end
end
# Permanent child (default) - always restarted
child_spec = %{
id: Worker,
start: {Worker, :start_link, []},
restart: :permanent
}
# Temporary child - never restarted
temp_spec = %{
id: TempWorker,
start: {TempWorker, :start_link, []},
restart: :temporary
}
# Transient child - restarted only on abnormal termination
transient_spec = %{
id: TransientWorker,
start: {TransientWorker, :start_link, []},
restart: :transient
}
✅ Do:
❌ Don’t:
defmodule MyApp.Supervisor do
use Supervisor
def init(_init_arg) do
children = [
{Worker, []}
]
# Max 3 restarts in 5 seconds
Supervisor.init(children,
strategy: :one_for_one,
max_restarts: 3,
max_seconds: 5
)
end
end
children = [
%{
id: Worker,
start: {Worker, :start_link, []},
shutdown: 10_000 # Give 10 seconds for cleanup
}
]
✅ one_for_one when:
✅ one_for_all when:
✅ rest_for_one when:
Try this simple example in iex
:
# Define a simple worker
defmodule Worker do
use GenServer
def start_link(id) do
GenServer.start_link(__MODULE__, id)
end
def init(id) do
{:ok, id}
end
end
# Start a dynamic supervisor
{:ok, sup} = DynamicSupervisor.start_link(strategy: :one_for_one)
# Start some children
DynamicSupervisor.start_child(sup, {Worker, 1})
DynamicSupervisor.start_child(sup, {Worker, 2})
# Check supervisor status
DynamicSupervisor.count_children(sup)