Multi-objective reward generalization: improving performance of Deep Reinforcement Learning for applications in single-asset trading

Federico Cornalba, Constantin Disselkamp, Davide Scassola, Christopher Helf

Research output: Contribution to journalArticlepeer-review

2 Citations (SciVal)

Abstract

We investigate the potential of Multi-Objective, Deep Reinforcement Learning for stock and cryptocurrency single-asset trading: in particular, we consider a Multi-Objective algorithm which generalizes the reward functions and discount factor (i.e., these components are not specified a priori, but incorporated in the learning process). Firstly, using several important assets (BTCUSD, ETHUSDT, XRPUSDT, AAPL, SPY, NIFTY50), we verify the reward generalization property of the proposed Multi-Objective algorithm, and provide preliminary statistical evidence showing increased predictive stability over the corresponding Single-Objective strategy. Secondly, we show that the Multi-Objective algorithm has a clear edge over the corresponding Single-Objective strategy when the reward mechanism is sparse (i.e., when non-null feedback is infrequent over time). Finally, we discuss the generalization properties with respect to the discount factor. The entirety of our code is provided in open-source format.

Original languageEnglish
Number of pages19
JournalNeural Computing and Applications
DOIs
Publication statusPublished - 5 Oct 2023

Bibliographical note

Funding Information:
Open access funding provided by Università degli Studi di Trieste within the CRUI-CARE Agreement. Funding was provided by Austrian Science Fund (Grant No. F65), Horizon 2020 (Grant No. 754411) and Österreichische Forschungsförderungsgesellschaft.

Funding

Open access funding provided by Università degli Studi di Trieste within the CRUI-CARE Agreement. Funding was provided by Austrian Science Fund (Grant No. F65), Horizon 2020 (Grant No. 754411) and Österreichische Forschungsförderungsgesellschaft.

Keywords

  • Cryptocurrency trading
  • Deep Reinforcement Learning
  • Discount factor generalization
  • Multi-objective generalization
  • Multi-task learning
  • Stock trading

ASJC Scopus subject areas

  • Software
  • Artificial Intelligence

Fingerprint

Dive into the research topics of 'Multi-objective reward generalization: improving performance of Deep Reinforcement Learning for applications in single-asset trading'. Together they form a unique fingerprint.

Cite this