
Learning dynamics incoherently: Variational learning using classical shadows

Published: August 14, 2024. Last updated: October 06, 2024.

How can we recreate and simulate an unknown quantum process with a quantum circuit? One approach is to learn the dynamics of this process incoherently, as done by Jerbi et al. 1. Here, we’ll reproduce the numerical simulations of 1 using the authors’ data, as provided in the Learning Dynamics Incoherently PennyLane Dataset.

This approach differs from learning the quantum process coherently 2 because it does not require the model circuit to be connected to the target quantum process. That is, the model circuit does not receive quantum information from the target process directly. Instead, we train the model circuit using classical information from the classical shadow measurements. This works well for low-entangling processes but can require an exponential number of classical shadow measurements, depending on the unknown quantum process 1. This is useful because it’s not always possible to port the quantum output of a system directly to hardware without first measuring it.


In simple terms, learning dynamics incoherently consists of two steps. First, we measure the output of the unknown process for many different inputs. In this tutorial, we do this by measuring classical shadows of the target process output.

Then, we adjust a variational quantum circuit until it produces the same input-output combinations as the unknown process. Here, we will simulate the model circuit output and use the classical shadow measurements to estimate the overlap between the model output states and the unknown process output states.

1. Creating an unknown target quantum process

For our unknown quantum process, we will use the time evolution of a Hamiltonian:

\[U(H, t) = e^{-i H t / \hbar} .\]

For the Hamiltonian, \(H,\) we choose a transverse-field Ising Hamiltonian (as in 1):

\[H = \sum_{i=0}^{n-2} Z_iZ_{i+1} + \sum_{i=0}^{n-1}\alpha_iX_i,\]

where \(n\) is the number of qubits and \(\alpha\) are randomly generated weights.

More specifically, we will approximate \(U(H, t)\) via Trotterization. We first create the Hamiltonian and Trotterize it later with TrotterProduct.
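Before moving to PennyLane code, the idea behind Trotterization can be sanity-checked with plain NumPy and SciPy. The sketch below (standalone, with hypothetical example weights, separate from the demo code) builds the two-qubit transverse-field Ising matrix directly and shows that a first-order product formula approaches the exact propagator \(e^{-iHt}\) as the number of steps grows:

```python
import numpy as np
from scipy.linalg import expm

# single-qubit Pauli matrices
I2 = np.eye(2, dtype=complex)
X = np.array([[0, 1], [1, 0]], dtype=complex)
Z = np.array([[1, 0], [0, -1]], dtype=complex)

def kron_all(ops):
    """Kronecker product of a list of single-qubit operators."""
    out = np.array([[1.0 + 0j]])
    for op in ops:
        out = np.kron(out, op)
    return out

# two-qubit transverse-field Ising Hamiltonian: H = Z0 Z1 + a0 X0 + a1 X1
alphas = [0.3, -0.4]  # example weights (hypothetical, not the demo's random draw)
H_mat = kron_all([Z, Z]) + alphas[0] * kron_all([X, I2]) + alphas[1] * kron_all([I2, X])

def trotter(t, n):
    """First-order Trotter product: (e^{-i ZZ t/n} e^{-i a0 X0 t/n} e^{-i a1 X1 t/n})^n."""
    step = (
        expm(-1j * t / n * kron_all([Z, Z]))
        @ expm(-1j * t / n * alphas[0] * kron_all([X, I2]))
        @ expm(-1j * t / n * alphas[1] * kron_all([I2, X]))
    )
    return np.linalg.matrix_power(step, n)

exact = expm(-1j * 2 * H_mat)  # exact evolution for time t = 2
# the approximation error shrinks as the number of Trotter steps grows
err_1 = np.linalg.norm(trotter(2, 1) - exact)
err_10 = np.linalg.norm(trotter(2, 10) - exact)
```

With more steps the product converges to the exact evolution, which is why the single step (n=1) used below is only an approximation of \(U(H, t)\).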

import pennylane as qml
from pennylane import numpy as pnp
import numpy as np
import matplotlib.pyplot as plt

# number of qubits for the Hamiltonian
n_qubits = 2

# set random seed for reproducibility
pnp.random.seed(7)
np.random.seed(7)

# generate random weights
alphas = np.random.normal(0, 0.5, size=n_qubits)

# create the Hamiltonian
hamiltonian = qml.sum(
    *[qml.PauliZ(wires=i) @ qml.PauliZ(wires=i + 1) for i in range(n_qubits - 1)]
) + qml.dot(alphas, [qml.PauliX(wires=i) for i in range(n_qubits)])

2. Creating random initial states

The next step is to prepare a set of initial states. We will then apply the unknown quantum process to each of these states and measure the output to create input-output pairs. Later, we will train a model circuit to generate the same input-output pairs, thus reproducing the unknown quantum process.

Ideally, our input states should be uniformly distributed over the state space. If they are all clustered together, our model circuit will not learn to approximate the unknown quantum process behavior for states that are very different from our training set.

For quantum systems, this means we want to sample Haar-random states, as done below.
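As an aside, an equivalent way to sample Haar-random states (not the method used below, which takes columns of Haar-random unitaries) is to normalize a vector of i.i.d. complex Gaussian entries:

```python
import numpy as np

rng = np.random.default_rng(7)

def haar_random_state(dim):
    """Sample a Haar-random pure state by normalizing a complex Gaussian vector."""
    vec = rng.normal(size=dim) + 1j * rng.normal(size=dim)
    return vec / np.linalg.norm(vec)

state = haar_random_state(2**2)  # one random 2-qubit state
```

The rotational invariance of the Gaussian distribution guarantees the resulting states are uniformly distributed over the state space.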

from scipy.stats import unitary_group

n_random_states = 100

# Generate several random unitaries
random_unitaries = unitary_group.rvs(2**n_qubits, n_random_states)
# Take the first column of each unitary as a random state
random_states = [random_unitary[:, 0] for random_unitary in random_unitaries]

Note

On a personal computer, this method becomes slow (>1 second) around 10 qubits.

3. Time evolution and classical shadow measurements

Now we can evolve the initial states using a Trotterized version of the Hamiltonian above. This will approximate the time evolution of the transverse-field Ising system.

dev = qml.device("default.qubit")

@qml.qnode(dev)
def target_circuit(input_state):
    # prepare training state
    qml.StatePrep(input_state, wires=range(n_qubits))

    # evolve the Hamiltonian for time=2 in n=1 steps with the order 1 formula
    qml.TrotterProduct(hamiltonian, time=2, n=1, order=1)
    return qml.classical_shadow(wires=range(n_qubits))


qml.draw_mpl(target_circuit)(random_states[0])
plt.show()

Since target_circuit returns classical_shadow(), running the circuit with a shot value gives the desired number of classical shadow measurements. We use this to create a set of shadows for each initial state.

n_measurements = 10000

shadows = []
for random_state in random_states:
    bits, recipes = target_circuit(random_state, shots=n_measurements)
    shadow = qml.ClassicalShadow(bits, recipes)
    shadows.append(shadow)
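To build intuition for what these shadows encode, here is a minimal single-qubit NumPy simulation of the protocol, independent of qml.ClassicalShadow: each snapshot applies the inverse of the measurement channel, \(3U^\dagger|b\rangle\langle b|U - I,\) and the snapshot average converges to the measured state:

```python
import numpy as np

rng = np.random.default_rng(0)

# rotations that map the X, Y, and Z measurement bases to the computational basis
H_gate = np.array([[1, 1], [1, -1]], dtype=complex) / np.sqrt(2)
Sdg = np.array([[1, 0], [0, -1j]], dtype=complex)
rotations = [H_gate, H_gate @ Sdg, np.eye(2, dtype=complex)]  # measure X, Y, or Z

rho = np.array([[0.5, 0.5], [0.5, 0.5]], dtype=complex)  # |+><+|, the state to learn

n_snapshots = 20000
estimate = np.zeros((2, 2), dtype=complex)
for _ in range(n_snapshots):
    U = rotations[rng.integers(3)]  # pick a random Pauli basis
    probs = np.clip(np.real(np.diag(U @ rho @ U.conj().T)), 0, None)
    b = rng.choice(2, p=probs / probs.sum())  # simulate one computational-basis outcome
    ket = np.eye(2, dtype=complex)[:, [b]]
    # invert the measurement channel: snapshot = 3 U^dag |b><b| U - I
    estimate += 3 * U.conj().T @ (ket @ ket.conj().T) @ U - np.eye(2)
estimate /= n_snapshots
```

The average of the snapshots is an unbiased estimator of the density matrix, which is exactly what qml.ClassicalShadow uses under the hood when estimating expectation values.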

4. Creating a model circuit that will learn the target process

Now that we have the classical shadow measurements, we need to create a model circuit that learns to produce the same output as the target circuit.

As done in 1, we create a model circuit with the same gate structure as the target circuit. If the target quantum process were truly unknown, then we would choose a general variational quantum circuit like in the Variational classifier demo.

Note

We use local measurements to keep the computational complexity low and because classical shadows are well-suited to estimating local observables 1. For this reason, the following circuit returns local density matrices for each qubit. In hardware, the density matrix is obtained via state tomography using Pauli measurements or classical shadows.

@qml.qnode(dev)
def model_circuit(params, random_state):
    qml.StatePrep(random_state, wires=range(n_qubits))
    # parameterized quantum circuit with the same gate structure as the target
    for i in range(n_qubits):
        qml.RX(params[i], wires=i)

    for i in reversed(range(n_qubits - 1)):
        qml.IsingZZ(params[n_qubits + i], wires=[i, i + 1])
    return [qml.density_matrix(i) for i in range(n_qubits)]


initial_params = pnp.random.random(size=n_qubits*2-1, requires_grad=True)

qml.draw_mpl(model_circuit)(initial_params, random_states[0])
plt.show()

5. Training using classical shadows in a cost function

We now have to find the optimal parameters for model_circuit to mirror the target_circuit. We can estimate the similarity between the circuits according to this cost function (see Appendix B of 1):

\[C^l_N(\theta) = 1 - \frac{1}{nN}\sum^N_{j=1}\sum^n_{i=1}Tr[U|\psi^{(j)}\rangle\langle\psi^{(j)}|U^\dagger O^{(j)}_i(\theta)],\]

where \(n\) is the number of qubits, \(N\) is the number of initial states, \(\psi^{(j)}\) are random states, \(U\) is our target unitary operation, and \(O_i\) is the local density matrix for qubit \(i\) after applying the model_circuit. That is, the local states \(\rho_{i}^{(j)}\) are used as the observables:

\[O_{i}^{(j)}(\theta) := \rho_{i}^{(j)}.\]

We can calculate this cost for our system by using the shadow measurements to estimate the expectation value of \(O_i.\) Roughly, this cost function measures the fidelity between the model circuit and the target circuit, by proxy of the single-qubit reduced states \(\rho_{i}^{(j)}\) of the model over a variety of input-output pairs.

def cost(params):
    cost = 0.0
    for idx, random_state in enumerate(random_states):
        # obtain the density matrices for each qubit
        observable_mats = model_circuit(params, random_state)
        # convert to a PauliSentence
        observable_pauli = [
            qml.pauli_decompose(observable_mat, wire_order=[qubit])
            for qubit, observable_mat in enumerate(observable_mats)
        ]
        # estimate the overlap for each qubit
        cost = cost + qml.math.sum(shadows[idx].expval(observable_pauli))
    cost = 1 - cost / n_qubits / n_random_states
    return cost


params = initial_params

optimizer = qml.GradientDescentOptimizer(stepsize=5)
steps = 50

costs = [None]*(steps+1)
params_list = [None]*(steps+1)

params_list[0] = initial_params
for i in range(steps):
    params_list[i + 1], costs[i] = optimizer.step_and_cost(cost, params_list[i])

costs[-1] = cost(params_list[-1])

print("Initial cost:", costs[0])
print("Final cost:", costs[-1])
Initial cost: 0.5769866128154675
Final cost: 0.19673998333089882

We can plot the cost over the iterations and compare it to the ideal cost.

# find the ideal parameters from the original Trotterized Hamiltonian
ideal_parameters = [
    op.decomposition()[0].parameters[0]
    for op in qml.TrotterProduct(hamiltonian, 2, 1, 1).decomposition()
]
ideal_parameters = ideal_parameters[:n_qubits][::-1] + ideal_parameters[n_qubits:]

ideal_cost = cost(ideal_parameters)

plt.plot(costs, label="Training")
plt.plot([0, steps], [ideal_cost, ideal_cost], "r--", label="Ideal parameters")
plt.ylabel("Cost")
plt.xlabel("Training iterations")
plt.legend()
plt.show()

In this case, we see that the ideal cost is greater than 0. This is because, for the ideal parameters, the observables coincide with the single-qubit reduced states of the target outputs:

\[O_{i}^{(j)} = \rho_{i}^{(j)} = Tr_{k \neq i}\left[U|\psi^{(j)}\rangle\langle\psi^{(j)}|U^\dagger\right].\]

Since the single-qubit reduced states used in the cost function are mixed states, the trace of their square is less than one:

\[Tr[(\rho_{i}^{(j)})^2] < 1.\]

The ideal cost \(C^l_N(\theta)\) is therefore greater than 0.
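As an extreme illustration (a standalone NumPy check, not part of the demo workflow), the single-qubit reduced state of a maximally entangled Bell state has purity \(1/2,\) so each such qubit contributes \(1 - 1/2\) to the cost floor:

```python
import numpy as np

# Bell state (|00> + |11>)/sqrt(2)
psi = np.zeros(4, dtype=complex)
psi[0] = psi[3] = 1 / np.sqrt(2)
rho = np.outer(psi, psi.conj())

# partial trace over the second qubit: reshape to (i, j, k, l) and trace over j, l
rho_0 = np.trace(rho.reshape(2, 2, 2, 2), axis1=1, axis2=3)

purity = np.real(np.trace(rho_0 @ rho_0))  # Tr[rho^2] = 0.5: maximally mixed
```

The less entangling the target process, the purer the reduced states and the closer the ideal cost is to 0.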

We can also look at the trace_distance between the unitary matrix of the target circuit and the model circuit. As the circuits become more similar with each training iteration, we should see the trace distance decrease and reach a low value.
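Concretely, the trace distance between matrices \(A\) and \(B\) is \(\frac{1}{2}\|A - B\|_1,\) half the sum of the singular values of their difference. A minimal NumPy version (a sketch of what qml.math.trace_distance computes for density matrices) is:

```python
import numpy as np

def trace_distance(a, b):
    """Trace distance T(A, B) = 0.5 * sum of singular values of A - B."""
    return 0.5 * np.sum(np.linalg.svd(a - b, compute_uv=False))

rho = np.array([[1, 0], [0, 0]], dtype=complex)    # |0><0|
sigma = np.array([[0, 0], [0, 1]], dtype=complex)  # |1><1|
d = trace_distance(rho, sigma)  # orthogonal pure states are maximally distant: 1.0
```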


target_matrix = qml.matrix(
    qml.TrotterProduct(hamiltonian, 2, 1, 1),
    wire_order=range(n_qubits),
)

zero_state = [1] + [0]*(2**n_qubits-1)

# model matrix using the all-|0> state to negate state preparation effects
model_matrices = [
    qml.matrix(model_circuit, wire_order=range(n_qubits))(params, zero_state)
    for params in params_list
]
trace_distances = [
    qml.math.trace_distance(target_matrix, model_matrix)
    for model_matrix in model_matrices
]

plt.plot(trace_distances)
plt.ylabel("Trace distance")
plt.xlabel("Training iterations")
plt.show()

print("The final trace distance is: \n", trace_distances[-1])
The final trace distance is:
 0.10087468405477117

Using the Learning Dynamics Incoherently PennyLane Dataset

In Jerbi et al. 1, the authors perform this procedure to learn dynamics incoherently on a larger, 16-qubit transverse-field Ising Hamiltonian, and use classical shadow samples from quantum hardware to estimate the cost function. The corresponding Learning Dynamics Incoherently PennyLane Dataset can be downloaded via the qml.data module.

[ds] = qml.data.load("other", name="learning-dynamics-incoherently")

# print the available data
print(ds.list_attributes())

# print more information about the hamiltonian
print(ds.attr_info["hamiltonian"]["doc"])
['field_strengths', 'hamiltonian', 'shadow_bases', 'shadow_meas', 'training_circuits', 'training_states']
Transverse-field Ising Hamiltonian for the problem studied.

The unknown target Hamiltonian, Haar-random initial states, and resulting classical shadow measurements are all available in the dataset.

Note

We use only a subset of the shadow measurements to keep the computational time low. Note that the dataset contains only two training states.

random_states = ds.training_states

n_measurements = 10000
shadows = [
    qml.ClassicalShadow(shadow_meas[:n_measurements], shadow_bases[:n_measurements])
    for shadow_meas, shadow_bases in zip(ds.shadow_meas, ds.shadow_bases)
]

We only need to create the model circuit, cost function, and train. For these we use the same model circuit as above, updated to reflect the increased number of qubits.

dev = qml.device("default.qubit")

@qml.qnode(dev)
def model_circuit(params, random_state):
    # this is a parameterized quantum circuit with the same gate structure as the target unitary
    qml.StatePrep(random_state, wires=range(16))
    for i in range(16):
        qml.RX(params[i], wires=i)

    for i in reversed(range(15)):
        qml.IsingZZ(params[16 + i], wires=[i, i + 1])
    return [qml.density_matrix(i) for i in range(16)]


initial_params = pnp.random.random(size=31, requires_grad=True)

qml.draw_mpl(model_circuit)(initial_params, random_states[0])
plt.show()

We can then minimize the cost to train the model to output the same states as the target circuit. For this, we can use the cost function from before, as long as we update the number of qubits and the number of random states.

n_qubits = 16
n_random_states = len(ds.training_states)

optimizer = qml.GradientDescentOptimizer(stepsize=5)
steps = 50

costs = [None]*(steps+1)
params_list = [None]*(steps+1)

params_list[0] = initial_params
for i in range(steps):
    params_list[i + 1], costs[i] = optimizer.step_and_cost(cost, params_list[i])

costs[-1] = cost(params_list[-1])

print("Initial cost:", cost(initial_params))
print("Final cost:", costs[-1])
Initial cost: 0.18513920753086577
Final cost: 0.05960174499912396

As a quick check, we can take a look at the density matrices to see whether the training was successful:

original_matrices = model_circuit(initial_params, random_states[0])
learned_matrices = model_circuit(params_list[-1], random_states[0])
target_matrices_shadow = np.mean(shadows[0].local_snapshots(), axis=0)

print("Untrained example output state\n", original_matrices[0])
print("Trained example output state\n", learned_matrices[0])
print("Target output state\n", target_matrices_shadow[0])
Untrained example output state
 [[ 0.21126203+0.j         -0.34162652+0.13423424j]
 [-0.34162652-0.13423424j  0.78873797+0.j        ]]
Trained example output state
 [[ 0.35039227+0.j         -0.39923078+0.26089092j]
 [-0.39923078-0.26089092j  0.64960773+0.j        ]]
Target output state
 [[ 0.3689+0.j     -0.3108+0.2724j]
 [-0.3108-0.2724j  0.6311+0.j    ]]

After training, the model outputs are closer to the target outputs, but not quite the same. This reflects the limitations of this learning method: even for a circuit as simple as a single first-order Trotter step of a short time evolution, faithfully reproducing the underlying quantum process requires a large number of classical shadow measurements and training states. The results can be improved by increasing both.

References

1

Sofiene Jerbi, Joe Gibbs, Manuel S. Rudolph, Matthias C. Caro, Patrick J. Coles, Hsin-Yuan Huang, Zoë Holmes, “The power and limitations of learning quantum dynamics incoherently”, arXiv:2303.12834, 2023.

2

Hsin-Yuan Huang, Michael Broughton, Jordan Cotler, Sitan Chen, Jerry Li, Masoud Mohseni, Hartmut Neven, Ryan Babbush, Richard Kueng, John Preskill, and Jarrod R. McClean, “Quantum advantage in learning from experiments”, Science, 2022.

About the author

Diego Guala

Diego is a quantum scientist at Xanadu. His work is focused on supporting the development of the datasets service and PennyLane features.

Total running time of the script: (2 minutes 14.507 seconds)