Limitations of Amplitude Encoding on Quantum Classification
Paper Information
Title: Limitations of Amplitude Encoding on Quantum Classification
Authors: Xin Wang, Yabo Wang, Bo Qi, Rebing Wu
Published: ([2025])
DOI: https://arxiv.org/abs/2503.01545
Reproduction Status: ✅ Complete
Reproducer: Louis-Félix Vigneux (louis-felix.vigneux@quandela.com)
Abstract
This paper prooves mathematically the main limitations of the amplitude encoding paradigm is QML. Indeed, the authors introduce the concentraion phenomena of encoded states that tend towards a loss plateau. The authors then proove on simple synthtic datasets that the limitations are also showing experimentally. To show that those cases are not just outliers, the paper shows that for known datasets such as CIFAR-10, EURO-SAT and MedMNIST, the same limitations occur.
Significance
This paper is significant as the QML community mostly sees amplitude encoding as the most efficient and useful encoding strategy. Indeed, it is a strategy that can encode classical data in very few qubits. Although, there is not much discussion on the impact normalization factor in amplitude encoding that greatly limits its capabilities. It is important for the community to question the choice of embedding if a quantum model does not perform as well as they thought.
MerLin Implementation
MerLin is used to create small and easy to train model for the experimental part of the paper.
Key Contributions Reproduced
- Limitations of amplitude encoding
Three simple datasets are shown to have the same amplitude encoded average state
These datasets are then classified with a simple amplitude encoding model.
- Classify known datasets
Four known datasets ((MNIST, EuroSAT, CIFAR-10 and PathMNIST)) are shown to have the same amplitude encoded average state
These datasets are then classified with QCNN amplitude encoding bases-models to show that the limitations are also visible in practical cases.
Implementation Details
We use the CircuitBuilder to create small quantum models easily with amplitude encoding.
import merlin as ml
circuit = ml.CircuitBuilder(n_modes=n_modes)
for _ in range(num_layers):
if n_modes == 1:
circuit.add_rotations(trainable=True)
else:
circuit.add_entangling_layer()
qlayer = ml.QuantumLayer(
builder=circuit,
amplitude_encoding=True,
n_photons=n_photons,
measurement_strategy=measurement_strategy,
)
Experimental Results
To see the result plots, consult the ReadMe of the project.
Interactive Exploration
Jupyter Notebook: Limitations of Amplitude Encoding on Quantum Classification (AA_study)
The notebook provides shows that even for simple datasets, amplitude encoding is not always the best suited encoding strategy.
Extensions and Future Work
This paper sparked the interest to explore more encoding strategies and find their strengths and limitations so that MerLin users can choose the encoding that is the most appropriate for them.
Code Access and Documentation
GitHub Repository: merlin/reproductions/AA_study
The complete implementation includes:
The simple synthetic datasets proving the limitations of amplitude encoding.
For those datasets, classification tasks on a amplitude and angle encoding based models.
A gate based and photonic based QCNN used to classify known datasets (MNIST, EuroSAT, CIFAR-10 and PathMNIST). They both use amplitude encoding.
Citation
@misc{wang_limitations_2025,
title = {Limitations of {Amplitude} {Encoding} on {Quantum} {Classification}},
url = {http://arxiv.org/abs/2503.01545},
doi = {10.48550/arXiv.2503.01545},
abstract = {It remains unclear whether quantum machine learning (QML) has real advantages when dealing with practical and meaningful tasks. Encoding classical data into quantum states is one of the key steps in QML. Amplitude encoding has been widely used owing to its remarkable efficiency in encoding a number of \$2{\textasciicircum}\{n\}\$ classical data into \$n\$ qubits simultaneously. However, the theoretical impact of amplitude encoding on QML has not been thoroughly investigated. In this work we prove that under some broad and typical data assumptions, the average of encoded quantum states via amplitude encoding tends to concentrate towards a specific state. This concentration phenomenon severely constrains the capability of quantum classifiers as it leads to a loss barrier phenomenon, namely, the loss function has a lower bound that cannot be improved by any optimization algorithm. In addition, via numerical simulations, we reveal a counterintuitive phenomenon of amplitude encoding: as the amount of training data increases, the training error may increase rather than decrease, leading to reduced decrease in prediction accuracy on new data. Our results highlight the limitations of amplitude encoding in QML and indicate that more efforts should be devoted to finding more efficient encoding strategies to unlock the full potential of QML.},
publisher = {arXiv},
author = {Wang, Xin and Wang, Yabo and Qi, Bo and Wu, Rebing},
month = mar,
year = {2025},
note = {arXiv:2503.01545 [quant-ph]},
keywords = {Quantum Physics},
annote = {Comment: 18 pages, 11 figures},
}
