Optimal control for gate compilation
Published: August 8, 2023. Last updated: November 6, 2024.
Today, quantum computations are largely phrased in the language of quantum circuits, composed of digital quantum gates. However, most quantum hardware does not come with such digital gates as elementary native operations. Instead, the hardware allows us to play sequences of analog electromagnetic pulses, for example by shining laser pulses on trapped ions or Rydberg atoms, or by sending microwave pulses onto superconducting qubit cavities. These pulses need to be calibrated to produce the desired digital gates, and in this tutorial we will be concerned with exactly this task.
For this, we will parametrize a pulse sequence, which leads to a whole space of possible sequences. Then we optimize the pulse parameters in order to find a configuration in this space that matches the target gate as closely as possible. More concretely, we will optimize simple pulse programs on two and three qubits to obtain a CNOT and a Toffoli gate. This training of control parameters to achieve a specific time evolution is a standard task in the field of quantum optimal control.

For an introduction, see the demo on differentiable pulse programming in PennyLane. Instead of optimizing pulses to yield digital quantum gates, we may use them directly to solve optimization problems, as is also showcased in this introductory demo. If you are interested in specific hardware pulses, take a look at an introduction to neutral-atom quantum computing or the tutorial on the QuEra Aquila device, which treat pulse programming with Rydberg atoms.
Quantum optimal control
The overarching goal of quantum optimal control is to find the best way to steer a microscopic physical system such that its dynamics matches a desired behaviour. The meaning of “best” and “desired behaviour” will depend on the specific task, and it is important to specify the underlying assumptions and constraints on the system controls in order to make the problem statement well-defined. Once we have specified all these details, optimal control theory is concerned with questions like “How close can the system get to the desired behaviour?”, “How can we find the best (sequence of) control parameters to obtain the desired behaviour?”, or “What is the shortest time in which the system can reach a specific state, given some initial state?” (controlling at the so-called quantum speed limit) 1.
In this tutorial, we consider the control of few-qubit systems through pulse sequences, with the goal of producing a given target, namely a digital gate, to the highest possible precision. To do this, we will choose an ansatz for the pulse sequence that contains free parameters and define a profit function that quantifies the similarity between the qubit operation and the target gate. Then, we maximize this function by optimizing the pulse parameters until we find the desired gate to a sufficient precision, or can no longer improve on the approximation we found. For the training phase, we will make use of fully-differentiable classical simulations of the qubit dynamics, allowing us to make use of backpropagation, an efficient differentiation technique widely used in machine learning, together with gradient-based optimization. At the same time we attempt to find pulse shapes and control parameters that are (to some degree) realistically feasible, including bounded pulse amplitudes and rates of change of the amplitudes.
Tutorials that use other techniques are available, for example, for the open-source quantum toolbox QuTiP.
Gate calibration via pulse programming
Here, we briefly discuss the general setup of pulse programs that we will use for our optimal control application. For more details, you may peruse the related tutorials focusing on pulse programming.
Consider a quantum system comprised of $n$ two-level systems, or qubits, described by a Hamiltonian

$$H(\boldsymbol{p}, t) = H_d + \sum_i f_i(\boldsymbol{p_i}, t)\, H_i.$$
As we can see, $H$ depends on the time $t$ and on a set of control parameters $\boldsymbol{p},$ which is composed of one parameter vector $\boldsymbol{p_i}$ per term. Both $t$ and $\boldsymbol{p}$ feed into functions $f_i$ that return scalar coefficients for the (constant) Hamiltonian terms $H_i.$ In addition, there is a constant drift Hamiltonian $H_d.$ We will assume that the Hamiltonian $H$ fully describes the system of interest and, in particular, we do not consider sources of noise in the system, such as leakage, dephasing, or crosstalk, i.e. the accidental interaction with other parts of a larger, surrounding system.
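As a small preview of how such a Hamiltonian is phrased in PennyLane’s pulse module (we will build the full model later on), here is a minimal sketch with one drift term and one parametrized term. The sine envelope and its two parameters are arbitrary placeholders for illustration, not the pulse shape used below.

import pennylane as qml
from jax import numpy as jnp

def envelope(p, t):
    # Hypothetical coefficient function f(p, t): a sine with trainable
    # amplitude p[0] and frequency p[1]
    return p[0] * jnp.sin(p[1] * t)

# Constant coefficient for the drift term, callable coefficient for the drive
H_sketch = qml.dot([1.0, envelope], [qml.PauliZ(0), qml.PauliX(0)])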
The time evolution of the state of our quantum system will be described by the Schrödinger equation associated with $H.$ However, for our purposes it will be more useful to consider the full unitary evolution that the Hamiltonian causes, independently of the initial state. This way, we can compare it to the digital target gate without iterating over different input and output states. The Schrödinger equation dictates the behaviour of the evolution operator $U$ to be

$$\frac{\partial}{\partial t} U(\boldsymbol{p}, t) = -i H(\boldsymbol{p}, t)\, U(\boldsymbol{p}, t),$$
where we implicitly fixed the initial time of the evolution to $t_0=0.$ It is possible to simulate the dynamics of sufficiently small quantum systems on a classical computer by solving the ordinary differential equation (ODE) above numerically. For a fixed pulse duration $T$ and given control parameters $\boldsymbol{p},$ a numerical ODE solver computes the matrix $U(\boldsymbol{p}, T).$
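To make this concrete, here is a deliberately crude sketch of such a solver: it approximates $U(\boldsymbol{p}, T)$ by a product of short-time propagators, assuming a function H_fn that returns the Hamiltonian matrix at a given time. The adaptive ODE solver we use below is more sophisticated, but the idea is the same.

import jax
from jax import numpy as jnp

jax.config.update("jax_enable_x64", True)

def evolve_unitary(H_fn, params, T, num_steps=1000):
    """Approximate U(params, T) as a product of short-time propagators
    exp(-i H(params, t) dt), evaluated at the midpoints of the time steps."""
    dt = T / num_steps
    U = jnp.eye(H_fn(params, 0.0).shape[0], dtype=complex)
    for step in range(num_steps):
        t = (step + 0.5) * dt
        U = jax.scipy.linalg.expm(-1j * H_fn(params, t) * dt) @ U
    return U

# Example: a single qubit driven around X with a tunable amplitude p
H_example = lambda p, t: p * jnp.array([[0.0, 1.0], [1.0, 0.0]])
print(jnp.round(evolve_unitary(H_example, 0.5, jnp.pi), 3))  # approximately -iX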
How can we tell whether the evolution of the qubit system is close to the digital gate we aim to produce? We will need a measure of similarity, or fidelity.
In this tutorial we will describe the similarity of two unitary matrices $U$ and $V$ on $n$ qubits with the fidelity function

$$f(U, V) = \frac{1}{2^n} \left|\operatorname{tr}\left(V^\dagger U\right)\right|.$$
It is similar to an overlap measure obtained from the Frobenius norm but it allows us to ignore differences in the global phase. Note that fidelity is often used to compare quantum states rather than gates, and that noise often plays a role in this context. Here we only consider unitary gates.
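In code, this fidelity is a one-liner; the same expression will appear in the profit function below. The absolute value makes it insensitive to a global phase, which we can check directly:

from jax import numpy as jnp

def fidelity(U, V, num_wires):
    """Normalized absolute trace overlap between two unitaries."""
    return jnp.abs(jnp.trace(V.conj().T @ U)) / 2**num_wires

U = jnp.eye(4, dtype=complex)
print(fidelity(U, U, num_wires=2))                  # 1.0
print(fidelity(jnp.exp(0.7j) * U, U, num_wires=2))  # still 1.0: global phase is ignored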
We can maximize the fidelity function above to train the pulse parameters. For this purpose, we write

$$F(\boldsymbol{p}) = f\left(U(\boldsymbol{p}, T), U_\text{target}\right).$$
Here $U_\text{target}$ is the unitary matrix of the gate that we want to compile. We consider the total duration $T$ as a fixed constraint to the optimization problem and therefore we do not denote it as a free parameter of $F.$

We can then maximize the fidelity $F,$ for example, using gradient-based optimization algorithms like Adam 2. But how do we obtain the gradient of a function that requires us to run an ODE solver to obtain its value? We are in luck! The implementation of pulse programming in PennyLane is fully differentiable via backpropagation thanks to its backend based on the machine learning library JAX. This enables us to optimize the gate sequences using efficiently computed gradients (provided the target gate is not too large).
Before we climb mount fidelity for particular example gates, let’s briefly talk about the pulse shape that we will use.
Smooth rectangle pulses
Let’s look at a building block that we will use a lot: smoothened rectangular pulses. We start with a simple rectangular pulse

$$R_\infty(t, (\Omega, t_0, t_1)) = \Omega\, \Theta(t - t_0)\, \Theta(t_1 - t),$$
where $\Omega$ is the amplitude, $t_0$ and $t_1$ are the start and end times of the pulse, and $\Theta(t)$ is the Heaviside step function which is one for $t\geq 0$ and zero otherwise. The trainable parameters of this pulse are the amplitude and the start/end times.
There are two main issues with $R_\infty$ for our purposes:

- The Heaviside step function is not differentiable with respect to the times $t_0$ and $t_1$ in the conventional sense (only in the sense of distributions), and in particular we cannot differentiate the resulting $U(\boldsymbol{p},T)$ within the automatic differentiation framework provided by JAX.
- The instantaneous change in the amplitude is not realizable in practice. In reality, the pulses describe some electromagnetic control field that can only be changed at a bounded rate and in a smooth manner. $R_\infty$ is not only not smooth, it is not even continuous. So we should consider smooth pulses with a bounded rate of change instead.
We can solve both of these issues by smoothening the rectangular pulse: we simply replace the step functions above by a smooth variant, namely by sigmoid functions,

$$R_k(t, (\Omega, t_0, t_1)) = \Omega\, \sigma(k (t - t_0))\, \sigma(k (t_1 - t)), \qquad \sigma(x) = \frac{1}{1 + e^{-x}}.$$
We introduced an additional parameter, $k,$ that controls the steepness of the sigmoid functions and can be adapted to the constraints posed by hardware on the maximal rate of change. In contrast to $R_\infty,$ its sister $R_k$ is smooth in all three arguments $\Omega$, $t_0$ and $t_1,$ and training these three parameters with automatic differentiation will not be a problem.

Let’s implement the smooth rectangle function using JAX’s numpy. We directly implement the product of the two sigmoids in the function sigmoid_rectangle:
import jax
from jax import numpy as jnp
jax.config.update("jax_enable_x64", True) # Use float64 precision
jax.config.update("jax_platform_name", "cpu") # Disables a warning regarding device choice
def sigmoid_rectangle(t, Omega, t_0, t_1, k=1.0):
    """Smooth-rectangle pulse between t_0 and t_1, with amplitude Omega."""
    return Omega / (
        1 + jnp.exp(-k * (t - t_0)) + jnp.exp(-k * (t_1 - t)) + jnp.exp(-k * (t_1 - t_0))
    )
Let’s look at a rectangular pulse and its smoothened sister, for a number of different smoothness parameters:
import matplotlib.pyplot as plt
t = jnp.linspace(0, 6, 1000)
t_0, t_1 = (1.3, 5.4)
amplitude = 2.3
ks = [5, 10, 50]
rect = amplitude * jnp.heaviside(t - t_0, 1.0) * jnp.heaviside(t_1 - t, 1.0)
for k in ks:
    smooth = sigmoid_rectangle(t, amplitude, t_0, t_1, k)
    plt.plot(t, smooth, label=f"Smooth rectangle $R_k$, $k={k}$")
plt.plot(t, rect, label="Rectangle $R_{\\infty}$, $k\\to\\infty$")
plt.legend(bbox_to_anchor=(0.6, 0.05), loc="lower center")
plt.xlabel("time $t$")
plt.ylabel("Pulse function")
plt.show()

We see that for very large $k,$ the smooth rectangle becomes practically indistinguishable from the original rectangle function $R_\infty.$ This means that we can consider the smooth $R_k$ a generalization of the pulse shape, rather than a restriction.
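Reusing the arrays from the plot above, we can also quantify this convergence: the time-averaged deviation between $R_k$ and $R_\infty$ shrinks roughly like $1/k,$ since the sigmoid transition regions narrow accordingly.

# Time-averaged deviation between the smooth and the hard rectangle
for k in ks:
    smooth = sigmoid_rectangle(t, amplitude, t_0, t_1, k)
    print(f"k = {k:3.0f}: mean |R_k - R_inf| = {jnp.mean(jnp.abs(smooth - rect)):.4f}")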
In the examples below, we will use a pulse ansatz $S_k$ that sums multiple smooth rectangles $R_k$ with the same value for $k$ but individual start/end times $t_{0/1}$ and amplitudes $\Omega.$ With this nicely trainable pulse shape in our hands, we now turn to the first gate calibration task.
Pulse ansatz for CNOT calibration
In this first example we will tune a two-qubit pulse to produce a standard CNOT gate.
We start by choosing a system Hamiltonian. It contains the drift term $H_d = Z_0 + Z_1,$ i.e. a Pauli $Z$ operator acting on each qubit, with a constant unit amplitude. The parametrized part uses five generating terms: Pauli $Z$ acting on the first qubit ($Z_0$), all three Pauli operators acting on the second qubit ($X_1, Y_1, Z_1$) and a single interaction term $Z_0X_1,$ resembling an abstract cross-resonance driving term. For all coefficient functions we choose the same function, $f_i=S_k\ \forall i$ (see the section above), but with distinct parameters. That is, our Hamiltonian is

$$H(\boldsymbol{p}, t) = Z_0 + Z_1 + \sum_{i=1}^{5} S_k(\boldsymbol{p_i}, t)\, H_i, \qquad (H_1, \dots, H_5) = (Z_0, X_1, Y_1, Z_1, Z_0 X_1).$$
Due to this choice, the $Z_0$ term commutes with all other terms, including the drift term, and can be considered a correction to the drift term that helps obtain the correct action on the first qubit. Although the interaction term was chosen to resemble a typical interaction in a superconducting cross-resonance drive, this Hamiltonian remains a toy model. Realistic hardware Hamiltonians may impose additional constraints or provide fewer controls, and we do not consider the unit systems of such real-world systems here.
The idea behind using the sum of smooth rectangles for the parametrization is the following: many methods in quantum optimal control work with discretized pulse shapes that keep the pulse envelope constant for short time bins. This approach leads to a large number of parameters that need to be trained, and it requires us to manually enforce that the values do not differ too much between neighbouring time bins. The smooth rectangles introduced above have a limited rate of change by design, and the number of parameters is much smaller than in generic discretization approaches. Each coefficient function $S_k$ sums $P$ smooth rectangles $R_k$ with individual amplitudes and start and end times. Overall, this leads to $5\cdot 3P=15P$ parameters in $H.$ In this and the next example, we chose $P$ heuristically.
Before we define this Hamiltonian, we implement the sum over multiple sigmoid_rectangle functions, including two normalization steps. First, we normalize the start and end times of the rectangles to the interval $[\epsilon, T-\epsilon],$ which makes sure that the pulse amplitudes are close to zero at $t=0$ and $t=T.$ Without this step, we might be tuning the pulses to turn on (off) instantaneously at the beginning (end) of the sequence, negating our effort to keep the pulse shape itself from varying too quickly. Second, we normalize the final output value to the interval $(-\Omega_\text{max}, \Omega_\text{max}),$ which allows us to bound the maximal amplitudes of the pulses to a realizable range while maintaining differentiability. For the normalization steps, we define a sigmoid and a normalize function. The first is a straightforward implementation of the sigmoid $\sigma$ from above, whereas the second uses sigmoid to normalize real numbers to the interval $(-1, 1).$
def sigmoid(t, k=1.0):
    """Sigmoid function with steepness parameter k."""
    return 1 / (1 + jnp.exp(-k * t))


def normalize(t, k=1.0):
    """Smoothly normalize a real input value to the interval (-1, 1) using 'sigmoid'
    with steepness parameter k."""
    return 2 * sigmoid(t, k) - 1.0


def smooth_rectangles(params, t, k=2.0, max_amp=1.0, eps=0.0, T=1.0):
    """Compute the sum of P smooth-rectangle pulses and normalize their
    start and end times, as well as the total output amplitude.

    Args:
        params (tensor_like): Amplitudes and start and end times for the rectangles,
            in the order '[amp_1, ... amp_P, t_{1, 0}, t_{1, 1}, ... t_{P, 0}, t_{P, 1}]'.
        t (float): Time at which to evaluate the pulse function.
        k (float): Steepness of the sigmoid functions that delimit the rectangles.
        max_amp (float): Maximal amplitude of the rectangles. The output will be normalized
            to the interval '(-max_amp, max_amp)'.
        eps (float): Margin to the beginning and end of the pulse sequence within which the
            start and end times of the individual rectangles need to lie.
        T (float): Total duration of the pulse.

    Returns:
        float: Value of the sum of smooth-rectangle pulses at 't' for the given parameters.
    """
    P = len(params) // 3
    # Split amplitudes from times
    amps, times = jnp.split(params, [P])
    # Normalize times to be sufficiently far away from 0 and T
    times = sigmoid(times - T / 2, k=1.0) * (T - 2 * eps) + eps
    # Extract the start and end times of single rectangles
    times = jnp.reshape(times, (-1, 2))
    # Sum products of sigmoids (unit rectangles), rescaled with the amplitudes
    rectangles = [sigmoid_rectangle(t, amp, *ts, k) for amp, ts in zip(amps, times)]
    value = jnp.sum(jnp.array(rectangles))
    # Normalize the output value to the interval (-max_amp, max_amp) with standard steepness
    return max_amp * normalize(value, k=1.0)
Let’s look at this function for some example parameters, with the same steepness parameter $k=20$ for all rectangles in the sum:
from functools import partial
T = 2 * jnp.pi # Total pulse sequence time
k = 20.0 # Steepness parameter
max_amp = 1.0 # Maximal amplitude \Omega_{max}
eps = 0.1 * T # Margin for the start/end times of the rectangles
# Bind hyperparameters to the smooth_rectangles function
S_k = partial(smooth_rectangles, k=k, max_amp=max_amp, eps=eps, T=T)
# Set some arbitrary amplitudes and times
amps = jnp.array([0.4, -0.2, 1.9, -2.0]) # Four amplitudes
times = jnp.array([0.2, 0.6, 1.2, 1.8, 2.1, 3.7, 4.9, 5.9]) # Four pairs of start/end times
params = jnp.hstack([amps, times]) # Amplitudes and times form the trainable parameters
plot_times = jnp.linspace(0, T, 300)
plot_S_k = [S_k(params, t) for t in plot_times]
plt.plot(plot_times, plot_S_k)
ax = plt.gca()
ax.set(xlabel="Time t", ylabel=r"Pulse function $S_k(p, t)$")
plt.show()

Note that the rectangles are rather round for these generic parameters. The optimized parameters in the training workflows below will lead to more sharply defined pulses that resemble rectangles more closely. The amplitude normalization step in smooth_rectangles enables us to produce them in a differentiable manner, as was our goal in introducing $R_k.$
Also note that the normalization of the final output value is not a simple clipping step, but again a smooth function. As a consequence, the amplitudes 1.9 and -2.0 in the example above, which lie outside the interval [-1, 1], are not set to 1 and -1 but take smaller absolute values.
Finally, note that the start and end times of the smooth rectangles are normalized as well, so that they do not end up too close to the boundaries of the total time interval. While this makes the pulse times differ from the input times, our pulse training automatically accounts for this normalization step, so it has no major consequences for us.
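To see the effect of the smooth amplitude normalization in isolation, we can feed the two out-of-range amplitudes from above through normalize. For an isolated rectangle, the plateau value of the summed pulse is approximately the raw amplitude, so this roughly reproduces the plateau heights in the plot:

# Smooth normalization of the out-of-range amplitudes 1.9 and -2.0:
# they end up well inside (-1, 1) instead of being clipped to +-1.
print(max_amp * normalize(jnp.array([1.9, -2.0]), k=1.0))  # approximately [0.74, -0.76]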
Using this function, we can now build the parametrized pulse Hamiltonian and the fidelity function discussed above. We make use of just-in-time (JIT) compilation, which makes the first execution of profit and grad slower but speeds up subsequent executions a lot. For optimization workflows with small-scale functions, this almost always pays off.
import pennylane as qml

X, Y, Z = qml.PauliX, qml.PauliY, qml.PauliZ

num_wires = 2
# Hamiltonian terms of the drift and parametrized parts of H
ops_H_d = [Z(0), Z(1)]
ops_param = [Z(0), X(1), Y(1), Z(1), Z(0) @ X(1)]

# Coefficients: 1 for the drift Hamiltonian and smooth rectangles for the parametrized part
coeffs = [1.0, 1.0] + [S_k for op in ops_param]

# Build H
H = qml.dot(coeffs, ops_H_d + ops_param)

# Set tolerances for the ODE solver
atol = rtol = 1e-10

# Target unitary is CNOT. We get its matrix and note that we do not need the dagger
# because CNOT is Hermitian.
target = qml.CNOT([0, 1]).matrix()
target_name = "CNOT"
print(f"Our target unitary is a {target_name} gate, with matrix\n{target.astype('int')}")


def pulse_matrix(params):
    """Compute the unitary time evolution matrix of the pulse for given parameters."""
    return qml.evolve(H, atol=atol, rtol=rtol)(params, T).matrix()


@jax.jit
def profit(params):
    """Compute the fidelity function for given parameters."""
    # Compute the unitary time evolution of the pulse Hamiltonian
    op_mat = pulse_matrix(params)
    # Compute the fidelity between the target and the pulse evolution
    return jnp.abs(jnp.trace(target.conj().T @ op_mat)) / 2**num_wires


grad = jax.jit(jax.grad(profit))
Our target unitary is a CNOT gate, with matrix
[[1 0 0 0]
[0 1 0 0]
[0 0 0 1]
[0 0 1 0]]
For the arbitrary parameters from above, of course we get a rather arbitrary unitary time evolution, which does not match the CNOT at all:
params = [params] * len(ops_param)
arb_mat = jnp.round(pulse_matrix(params), 4)
arb_profit = profit(params)
print(
    f"The arbitrarily chosen parameters yield the unitary\n{arb_mat}\n"
    f"which has a fidelity of {arb_profit:.6f}."
)
The arbitrarily chosen parameters yield the unitary
[[-0.0315+0.2074j 0.967 +0.1442j 0. +0.j 0. +0.j ]
[ 0.6004+0.7717j -0.1653+0.1292j 0. +0.j 0. +0.j ]
[ 0. +0.j 0. +0.j 0.8553-0.1974j 0.4013+0.2618j]
[ 0. +0.j 0. +0.j -0.0311-0.4781j -0.5918+0.6482j]]
which has a fidelity of 0.052782.
Before we can start the optimization, we require initial parameters. We set small alternating amplitudes and evenly distributed start and end times for $P=3$ smoothened rectangles. This choice leads to a total of $15P=45$ parameters in the pulse sequence.
P = 3 # Number of rectangles P
# Initial parameters for the start and end times of the rectangles
times = [jnp.linspace(eps, T - eps, P * 2) for op in ops_param]
# All initial parameters: small alternating amplitudes and times
params = [jnp.hstack([jnp.array([0.1 * (-1) ** i for i in range(P)]), time]) for time in times]
Now we are all set up to train the parameters of the pulse sequence to produce our target gate, the CNOT. We will use the Adam optimizer 2, conveniently implemented in the optax library. We keep track of the optimization via a list that contains the parameters and fidelity values, which lets us plot the fidelity across the optimization. As we will run a second optimization later on, we code up the optimizer run as a function. This function reports on the optimization progress and duration, and it plots the trajectory of the profit function during the optimization.
import time
import optax
def run_adam(profit_fn, grad_fn, params, learning_rate, num_steps):
    start_time = time.process_time()
    # Initialize the Adam optimizer
    optimizer = optax.adam(learning_rate, b1=0.97)
    opt_state = optimizer.init(params)
    # Initialize a memory buffer for the optimization
    hist = [(params.copy(), profit_fn(params))]
    for step in range(num_steps):
        g = grad_fn(params)
        updates, opt_state = optimizer.update(g, opt_state, params)
        params = optax.apply_updates(params, updates)
        hist.append([params, c := profit_fn(params)])
        if (step + 1) % (num_steps // 10) == 0:
            print(f"Step {step+1:4d}: {c:.6f}")
    _, profit_hist = list(zip(*hist))
    plt.plot(list(range(num_steps + 1)), profit_hist)
    ax = plt.gca()
    ax.set(xlabel="Iteration", ylabel="Fidelity $F(p)$")
    plt.show()
    end_time = time.process_time()
    print(f"The optimization took {end_time-start_time:.1f} (CPU) seconds.")
    return hist
learning_rate = -0.2 # negative learning rate leads to maximization
num_steps = 500
hist = run_adam(profit, grad, params, learning_rate, num_steps)

Step 50: 0.993980
Step 100: 0.999696
Step 150: 0.999628
Step 200: 0.999854
Step 250: 0.999977
Step 300: 0.999995
Step 350: 0.999998
Step 400: 1.000000
Step 450: 1.000000
Step 500: 1.000000
The optimization took 14.3 (CPU) seconds.
As we can see, Adam steadily increases the fidelity, bringing the pulse program closer and closer to the target unitary. On its way, the optimizer produces a mildly oscillating behaviour. The precision to which the optimization can reproduce the target unitary depends on the expressivity of the pulses we use, but also on the precision with which we run the ODE solver and on the hyperparameters of the optimizer.
Let’s pick the parameters with the largest fidelity observed during the training and take a look at the pulses we found. We again prepare a function that plots the pulse sequence, which we can reuse later on. For the single-qubit terms, we encode the qubit they act on in the color and the type of Pauli operator in the line style of the plotted line.
colors = {0: "#70CEFF", 1: "#C756B2", 2: "#FDC357"}
dashes = {"X": [10, 0], "Y": [2, 2, 10, 2], "Z": [6, 2]}
def plot_optimal_pulses(hist, pulse_fn, ops, T, target_name):
    _, profit_hist = list(zip(*hist))
    fig, axs = plt.subplots(2, 1, figsize=(10, 9), gridspec_kw={"hspace": 0.0}, sharex=True)

    # Pick the optimal parameters from the buffer of all observed profit values
    max_params, max_profit = hist[jnp.argmax(jnp.array(profit_hist))]
    plot_times = jnp.linspace(0, T, 300)
    # Iterate over pulse parameters and parametrized operators
    for p, op in zip(max_params, ops):
        # Create the label and pick the correct axis
        label = str(op)
        dash = dashes[label[0]]
        ax = axs[0] if len(op.wires) == 1 else axs[1]
        # Set the color according to the qubit the term acts on
        col = colors[op.wires[0]]
        # Plot the pulse
        values = [pulse_fn(p, t) for t in plot_times]
        ax.plot(plot_times, values, label=label, dashes=dash, color=col)

    # Set legends and axis descriptions
    axs[0].legend(title="Single-qubit terms", ncol=int(jnp.sqrt(len(ops))))
    axs[1].legend(title="Two-qubit terms")
    title = f"{target_name}, Fidelity={max_profit:.6f}"
    axs[0].set(ylabel=r"Pulse function $f(p, t)$", title=title)
    axs[1].set(xlabel="Time $t$", ylabel=r"Pulse function $S_k(p, t)$")
    plt.show()
plot_optimal_pulses(hist, S_k, ops_param, T, target_name)

We observe that a single rectangular pulse is sufficient for some of the generating terms in the Hamiltonian, whereas others end up with rather intricate pulse shapes. Their shapes are closer to actual rectangles now, in particular for those with a saturated amplitude.
The final fidelity tells us that we achieved our goal of finding a pulse sequence that implements a unitary close to a CNOT gate. It could be optimized further, for example by running the optimization for more training iterations, by tuning the optimizer to avoid oscillations, or by increasing the precision with which we run the ODE solver. This would likely also allow us to reduce the total duration of the pulse.
Pulse sequence for Toffoli
The second example we consider is the compilation of a Toffoli (or CCNOT) gate. We reuse most of the workflow from above and only change the pulse Hamiltonian as well as a few hyperparameters.
In particular, the Hamiltonian uses the drift term $H_d=Z_0+Z_1+Z_2$ and the generators are all single-qubit Pauli operators on all three qubits, together with the interaction generators $Z_0X_1, Z_1X_2, Z_2X_0.$ Again, all parametrized terms use the coefficient function smooth_rectangles. We allow for a longer pulse duration of $T=3\pi$ and five smooth rectangles in each pulse shape.
In summary, we use nine single-qubit generators and three two-qubit generators, with five rectangles in each pulse shape and each rectangle being given by an amplitude and a start and end time. The pulse sequence thus has $(9+3)\cdot 5\cdot 3=180$ parameters.
num_wires = 3

# New pulse hyperparameters
T = 3 * jnp.pi  # Longer total duration
eps = 0.1 * T
P = 5  # More rectangles in the sum: P=5
S_k = partial(smooth_rectangles, k=k, max_amp=max_amp, eps=eps, T=T)

# Hamiltonian terms of the drift and parametrized parts of H
ops_H_d = [Z(0), Z(1), Z(2)]
ops_param = [pauli_op(w) for pauli_op in [X, Y, Z] for w in range(num_wires)]
ops_param += [Z(0) @ X(1), Z(1) @ X(2), Z(2) @ X(0)]

# Coefficients: 1 for the drift Hamiltonian and smooth rectangles for the parametrized part
coeffs = [1.0, 1.0, 1.0] + [S_k for op in ops_param]

# Build H
H = qml.dot(coeffs, ops_H_d + ops_param)

# Set tolerances for the ODE solver
atol = rtol = 1e-10

# Target unitary is Toffoli. We get its matrix and note that we do not need the dagger
# because Toffoli is Hermitian.
target = qml.Toffoli([0, 1, 2]).matrix()
target_name = "Toffoli"
print(f"Our target unitary is a {target_name} gate, with matrix\n{target.astype('int')}")


def pulse_matrix(params):
    """Compute the unitary time evolution matrix of the pulse for given parameters."""
    return qml.evolve(H, atol=atol, rtol=rtol)(params, T).matrix()


@jax.jit
def profit(params):
    """Compute the fidelity function for given parameters."""
    # Compute the unitary time evolution of the pulse Hamiltonian
    op_mat = pulse_matrix(params)
    # Compute the fidelity between the target and the pulse evolution
    return jnp.abs(jnp.trace(target.conj().T @ op_mat)) / 2**num_wires


grad = jax.jit(jax.grad(profit))
Our target unitary is a Toffoli gate, with matrix
[[1 0 0 0 0 0 0 0]
[0 1 0 0 0 0 0 0]
[0 0 1 0 0 0 0 0]
[0 0 0 1 0 0 0 0]
[0 0 0 0 1 0 0 0]
[0 0 0 0 0 1 0 0]
[0 0 0 0 0 0 0 1]
[0 0 0 0 0 0 1 0]]
We create initial parameters similar to the above, but allow for a larger number of $1200$ optimization steps and use a smaller learning rate (in absolute value) in the optimization with Adam. Our run_adam function from above comes in handy and also provides an overview of the optimization progress in the produced plot.
times = [jnp.linspace(eps, T - eps, P * 2) for op in ops_param]
params = [jnp.hstack([jnp.array([0.2 * (-1) ** i for i in range(P)]), time]) for time in times]
num_steps = 1200
learning_rate = -2e-3
hist = run_adam(profit, grad, params, learning_rate, num_steps)
params_hist, profit_hist = list(zip(*hist))
max_params = params_hist[jnp.argmax(jnp.array(profit_hist))]

Step 120: 0.772116
Step 240: 0.827255
Step 360: 0.890429
Step 480: 0.930445
Step 600: 0.984312
Step 720: 0.995645
Step 840: 0.998453
Step 960: 0.999210
Step 1080: 0.999538
Step 1200: 0.999708
The optimization took 140.7 (CPU) seconds.
This looks promising: Adam maximized the fidelity successfully, and we thus compiled a pulse sequence that implements a Toffoli gate! To inspect how close the compiled pulse sequence is to the Toffoli gate, we can apply it to an example quantum state, say $|110\rangle,$ and investigate the returned probabilities. A perfect Toffoli gate would flip the third qubit, returning a probability of one in the last entry and zeros elsewhere.
dev = qml.device("default.qubit", wires=3)


@qml.qnode(dev, interface="jax")
def node(params):
    # Prepare |110>
    qml.PauliX(0)
    qml.PauliX(1)
    # Apply the pulse sequence
    qml.evolve(H, atol=atol, rtol=rtol)(params, T)
    # Return the measurement probabilities
    return qml.probs()
probs = node(max_params)
print(f"The state |110> is mapped to the probability vector\n{jnp.round(probs, 6)}.")
The state |110> is mapped to the probability vector
[0.000134 0.00029 0.000212 0.000078 0.000118 0.000053 0.000011 0.999104].
We see that the returned probabilities are close to the expected vector. The last entry is close to one, the others are almost zero. However, there are more possible inputs to the gate, and we hardly want to stare at eight probability vectors to understand the quality of the compiled pulse sequence. Instead, let’s plot the transition amplitudes with which our compiled pulse sequence maps computational basis vectors to each other. We include the complex phase of the amplitudes in the color of the bars.
import matplotlib as mpl

fig = plt.figure(figsize=(10, 8))
ax = fig.add_subplot(111, projection="3d")
dim = 8
x = jnp.tile(jnp.arange(dim), dim) - 0.27  # Input state indices
y = jnp.repeat(jnp.arange(dim), dim) - 0.36  # Output state indices
mat = pulse_matrix(max_params).ravel()  # Pulse matrix, flattened into a sequence of values
phases = jnp.angle(mat)  # Complex phases
color_norm = mpl.colors.Normalize(-jnp.pi, jnp.pi)
bar_colors = mpl.cm.turbo(color_norm(phases))

# Bar plot with x, y positions, bottom, width, depth and height values for the bars
ax.bar3d(x, y, 0.0, 0.6, 0.6, jnp.abs(mat), shade=True, color=bar_colors)
# Specify a few visual attributes of the axes object
ax.set(
    xticks=list(range(dim)),
    yticks=list(range(dim)),
    xticklabels=[f"|{bin(i)[2:].rjust(3, '0')}>" for i in range(dim)],
    yticklabels=[f"|{bin(i)[2:].rjust(3, '0')}>" for i in range(dim)],
    zticks=[0.2 * i for i in range(6)],
    xlabel="Input state",
    ylabel="Output state",
)
# Add axes for the colorbar
cax = plt.axes([0.85, 0.15, 0.02, 0.62])
sc = mpl.cm.ScalarMappable(cmap=mpl.cm.turbo, norm=color_norm)
sc.set_array([])
# Plot the colorbar
plt.colorbar(sc, cax=cax, ax=ax)
plt.show()

The transition amplitudes are as expected, except for very small deviations. All computational basis states are mapped to themselves, except for the last two, which are swapped. The color of the entries close to one does not correspond to a phase of zero. However, the fact that they all have the same color tells us that this deviation is only a global phase, so that the pulse sequence is equivalent to the Toffoli gate.
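We can back this up numerically with a small sanity check: estimate the global phase from the trace overlap with the target and compare the phase-corrected matrix entrywise to the Toffoli matrix.

mat = pulse_matrix(max_params)
# Estimate the global phase from the trace overlap with the target
phase = jnp.angle(jnp.trace(target.conj().T @ mat))
# Remove the global phase and compare entrywise
deviation = jnp.max(jnp.abs(mat * jnp.exp(-1j * phase) - target))
print(f"Maximal entrywise deviation after removing the global phase: {deviation:.6f}")

Let’s also look at the pulse sequence itself: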
plot_optimal_pulses(hist, S_k, ops_param, T, target_name)

As we can see, the optimized smooth rectangles do not fill out the time at maximal amplitudes. This means that we probably can find shorter pulse sequences with larger amplitudes that produce a Toffoli with the same fidelity. If you are interested, take a shot at it and try to optimize the sequence with respect to the number of generators and pulse duration!
Conclusion
In this tutorial we calibrated a two-qubit and a three-qubit pulse sequence
to obtain a CNOT and a Toffoli gate, respectively. For this, we used smooth
rectangular pulse shapes together with toy pulse Hamiltonians, and obtained
very good approximations to the target gates.
Thanks to JAX, just-in-time (JIT) compiling and the PennyLane pulse
module, training the pulse sequences was simple to implement and fast to run.
There are many different techniques in quantum optimal control that can be used to calibrate pulse sequences, some of which include gradient-based training. A widely-used technique called GRAPE 3 makes use of discretized pulses, which leads to a large number of free parameters to be optimized with gradient ascent. The technique shown here reduces the parameter count significantly and provides smooth, bounded shapes by definition.
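For comparison, a GRAPE-style envelope could be sketched as follows: one trainable amplitude per time bin, held constant within the bin, with the same (params, t) signature as the coefficient functions above. The binning logic here is a hypothetical illustration, not the parametrization used in this tutorial.

def piecewise_constant(params, t, T=1.0):
    """GRAPE-style envelope sketch: one trainable amplitude per time bin."""
    num_bins = len(params)
    idx = jnp.clip(jnp.floor(t / T * num_bins).astype(int), 0, num_bins - 1)
    return params[idx]

Such an envelope is differentiable with respect to the bin amplitudes, but the number of parameters grows with the time resolution, and smoothness between neighbouring bins must be enforced separately.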
Yet another method that does not use gradient-based optimization is the chopped random-basis quantum optimization (CRAB) algorithm 4. It uses a different parametrization altogether, exploiting randomized basis functions for the pulse envelopes.
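A CRAB-style envelope could be sketched along these lines: a truncated expansion in a few Fourier components with randomized frequencies, where only the expansion coefficients are optimized (typically with gradient-free methods). Again, this is a hypothetical illustration, not code used above.

key = jax.random.PRNGKey(0)
r = jax.random.uniform(key, (3,), minval=-0.5, maxval=0.5)  # frequency randomization
freqs = 2 * jnp.pi * (jnp.arange(1, 4) + r) / T

def crab_envelope(params, t):
    """CRAB-style envelope sketch: randomized truncated Fourier basis."""
    a, b = jnp.split(params, 2)
    return jnp.sum(a * jnp.sin(freqs * t) + b * jnp.cos(freqs * t))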
While setting up the application examples, we accommodated some requirements of realistic hardware, like smooth pulse shapes with bounded maximal amplitudes and bounded rates of change, and we tried to use only a few interaction terms between qubits. However, it is important to note that the optimization shown here remains a toy model for the calibration of quantum hardware. We did not take into account the interaction terms or pulse shapes available on realistic devices and their control electronics. We also did not consider a unit system tied to real devices, and we ignored noise, which plays a very important role in today’s quantum devices and in quantum optimal control. We leave the extension to real-world pulse Hamiltonians and noisy systems to a future tutorial, or maybe your work?
References

1. T. Caneva, M. Murphy, T. Calarco, R. Fazio, S. Montangero, V. Giovannetti and G. Santoro, “Optimal Control at the Quantum Speed Limit”, Phys. Rev. Lett. 103, 240501, arXiv:0902.4193 (2009).
2. D. Kingma and J. Ba, “Adam: A Method for Stochastic Optimization”, arXiv:1412.6980 (2014).
3. N. Khaneja, T. Reiss, C. Kehlet, T. Schulte-Herbrüggen and S.J. Glaser, “Optimal Control of Coupled Spin Dynamics: Design of NMR Pulse Sequences by Gradient Ascent Algorithms”, J. Magn. Reson. 172, 296-305 (2005).
4. P. Doria, T. Calarco and S. Montangero, “Optimal Control Technique for Many-Body Quantum Dynamics”, Phys. Rev. Lett. 106, 190501, arXiv:1003.3750 (2011).
David Wierichs
I like to think about differentiation and representations of quantum programs, and I enjoy coding up research ideas and useful features for anyone to use in PennyLane.
Total running time of the script: (2 minutes 58.394 seconds)