Experimental data re-uploading with provable enhanced learning capabilities
Paper Information
Title: Experimental data re-uploading with provable enhanced learning capabilities
Authors: Martin F. X. Mauser, Solène Four, Lena Marie Predl, Riccardo Albiero, Francesco Ceccarelli, Roberto Osellame, Philipp Petersen, Borivoje Dakić, Iris Agresti, and Philip Walther
Published: arXiv preprint, 14 (2025)
DOI: https://doi.org/10.48550/arXiv.2507.05120
Reproduction Status: ✅ Complete
Reproducer: Hugo Izadi (hugoizadi@gmail.com) and Philippe Schoeb (philippe.schoeb@quandela.com)
Abstract
The reference paper’s main contribution is to present a well performing and resource-efficient data re-uploading scheme on a photonic quantum processor. It showcases the model’s performance on four datasets of increasing complexity. It also provides an analytical proof that the proposed model in a universal classifier.
Significance
This research is significant since it proposes a resource-efficient model that could reduce ernegy consumption for classifying tasks. In addition, energy efficiency is becoming an increasingly important argument for using quantum circuits in machine learning.
MerLin Implementation
MerLin’s QuantumLayer is used in our reproduction to define the optimizable photonic circuit that comprises almost all the proposed model.
Key Contributions Reproduced
- Effect of increasing number of re-uploading layers on the circular dataset
Three re-uploading layers are necessary to fully capture the expressivity of the circular dataset
- Experimental accuracies or the photonic data re-uploading model on four different datasets based on its number of layers
Increasing number of layers increases the obtained train and test accuracies
Faster improvement of accuracy when increasing number of layers on the Tetromino dataset
Slower improvement of accuracy when increasing number of layers on the circular dataset
Implementation Details
The trainable photonic circuit used to encode data is defined with MerLin:
import merlin as ml
quantum_layer = ml.QuantumLayer(
input_size=self.dimension,
circuit=circuit_model.circuit,
trainable_parameters=["var"],
input_parameters=["x"],
input_state=circuit_model.input_state,
measurement_strategy=ml.MeasurementStrategy.probs(),
)
Extensions and Future Work
The MerLin implementation extends beyond the original paper:
- Circuit architecture benchmark
Three different circuit schemes were explored for data encoding and for training
Design C is clearly defective
- Hyperparameter grid search
Exploration for different values of tau (Fisher loss temperature) and alpha (phase scaling)
Trade off between too simple and too complex decision boundaries
- Hardware Considerations
Every experiment done for this reproduction has been designed for simulation on CPU
- Future work
Extend to more complex datasets to see the limit of this architecture (if there is any)
Compare with classical baseline models
Deploy on quantum hardware
Code Access and Documentation
GitHub Repository: merlin/reproductions/data_reuploading
The complete implementation includes:
Model implementation
Testing on the four dataset used
Two additional benchmarking
A notebook to explore with the proposed architecture in different settings
Citation
@misc{mauser2025experimentaldatareuploadingprovable,
title={Experimental data re-uploading with provable enhanced learning capabilities},
author={Martin F. X. Mauser and Solène Four and Lena Marie Predl and Riccardo Albiero and Francesco Ceccarelli and Roberto Osellame and Philipp Petersen and Borivoje Dakić and Iris Agresti and Philip Walther},
year={2025},
eprint={2507.05120},
archivePrefix={arXiv},
primaryClass={quant-ph},
url={https://arxiv.org/abs/2507.05120},
}