Applied Quantum Allocation.
Executed on Actual Qubits.
Qubitrade is an active R&D framework bridging reinforcement learning and hybrid quantum-classical algorithms. Currently in Week 5 of live-capital execution, we bypass simulated buzzwords to run mathematically dense QAOA arrays on real physical hardware.
The Theorem
The Combinatorial Wall
Classical Mean-Variance Optimization functions elegantly in continuous, unconstrained environments. It takes milliseconds on a standard CPU. However, real-world institutional capital is constrained by transaction costs, discrete trade lots, and strict cardinality limits (e.g., "select exactly K assets from a universe of N").
Introducing these discrete, binary constraints transforms portfolio allocation into an intractable Mixed-Integer Non-Linear Programming (MINLP) problem. As the asset universe ($N$) scales, the state space explodes exponentially ($2^N$). Classical heuristics routinely stall in sub-optimal local minima. Qubitrade hypothesizes that Quantum Approximate Optimization Algorithms (QAOA) provide a mathematically superior pathway to search these hyper-dimensional energy landscapes natively.
-
×
Cardinality Penalty: Forcing $\sum x_i = K$ requires exhaustive combinatorial traversal.
-
×
Local Minima Traps: Simulated Annealing and CPLEX relaxations fail to capture true global minima in non-convex risk surfaces.
Translation Layer
QUBO Formulation
To execute on gate-based QPUs, financial constraints must be translated into the native language of quantum mechanics. We map the allocation problem into a Quadratic Unconstrained Binary Optimization (QUBO) structure, natively equivalent to an Ising Hamiltonian.
Expected Returns ($\mu$)
The linear term. Predicted short-term returns generated by our QSVR ensemble. Mapped to local fields (Pauli-Z) on individual qubits.
Covariance Risk ($\Sigma$)
The quadratic term. Represents asset correlation, scaled by risk aversion ($\theta$). Mapped directly to ZZ-entanglement couplings between qubits.
Cardinality Penalty ($\lambda$)
A massive algebraic penalty ensuring the solver selects exactly $K$ assets, forcing invalid portfolios into high-energy (sub-optimal) states.
System Architecture
The Execution Pipeline
QSVR Covariance
Classical data streams are engineered into high-dimensional feature maps. We actively benchmark Quantum Support Vector Regressors (QSVR) against classical counterparts to predict covariance matrices based on 88,000+ ticks of rolling data.
RL Control Agent
A reinforcement learning (PPO) agent dictates state action. The RL policy triggers optimization only when mathematical drift exceeds thresholds computed in the current volatility regime.
QUBO / QAOA Solvers
Target allocations are isolated, constructed into QUBOs, and handed to async Celery workers. The workers submit the parametric circuits via IBM Qiskit Runtime, proving viability on real quantum hardware.
Active State
Current R&D Operations
U100 Real-Capital Execution
Currently executing Month 2 (Week 5) of autonomous live operations. The architecture is managing real capital across a 100-symbol universe, grounding theoretical math in actual market microstructure and slippage.
U500 Offline Overhaul
Simultaneously scaling offline data pipelines to target 500-asset indices. We are running rigorous constraint sensitivity analyses and testing the agent's stability against massive topological state spaces.
Batched Quantum Jobs
Preparing to introduce new users by aggregating disparate QUBO formulations into batched quantum sessions. This optimizes NISQ-era hardware queue latency and minimizes QPU idle-time overheads.
Model Validation
Institutional Friction & The Counter-Intuitive Alpha
In quantitative finance, an unconstrained backtest is largely fiction. To validate our new reinforcement agent ahead of a Q2 U500 deployment, we intentionally choked the model with severe institutional friction to observe where the logic breaks.
We locked in a strict 30 bps transaction drag across a 252-day out-of-sample holdout (500-symbol universe), running a constraint sensitivity analysis by progressively strangling the daily turnover allowance from 15% down to exactly 5%.
The telemetry revealed a counter-intuitive structural advantage. When heavily constrained, the agent did not degrade. It mathematically adapted, abandoning marginal high-frequency noise to hold higher-conviction, longer-duration states—fundamentally improving its risk-adjusted survival.
| Turnover Cap | Sharpe | Max DD | Return | Calmar |
|---|---|---|---|---|
| 15.0% Max | 1.44 | -12.80% | +32.67% | 2.56 |
| 10.0% Max | 1.59 | -11.22% | +34.38% | 3.08 |
| 07.0% Max | 1.73 | -10.49% | +35.99% | 3.45 |
| 05.0% Hard Cap | 2.19 | -09.13% | +44.41% | 4.89 |
Authentic Telemetry
Raw Production Logs
These are strictly un-indented stdout traces from our offline
training environments and physical QPU executions on March 4, 2026. For OPSEC, critical
hyperparameters, active feature vectors, and network paths have been strictly redacted [REDACTED] to highlight
only the raw computational scale: massive data ingestion, out-of-sample cross-validation, and
grueling topological hardware transpilation.
Research & Development Collaboration
Qubitrade is strictly an experimental PoC. We are not offering commercial SaaS or managing capital. We welcome dialogue with quantitative researchers, deep-tech infrastructure funds, and quantum hardware foundries interested in algorithm validation.