Author: Denis Avetisyan
Researchers have unveiled TradeFM, a large-scale generative model capable of learning and replicating the complex dynamics of financial markets from vast transaction data.

TradeFM is a generative foundation model for market microstructure that learns unified trade-flow dynamics from billions of transactions and serves as a robust environment for agent-based modeling.
Despite the success of foundation models across diverse domains, capturing the complex dynamics of financial markets has remained a significant challenge. This paper introduces ‘TradeFM: A Generative Foundation Model for Trade-flow and Market Microstructure’, a 524M-parameter Transformer trained on billions of trade events that learns scale-invariant representations of order flow. TradeFM successfully reproduces key stylized facts of financial returns and generalizes to new markets without asset-specific calibration, demonstrating the potential for transferable structure in market microstructure. Could this approach pave the way for robust synthetic data generation and the development of more effective learning-based trading agents?
Decoding the Noise: Why Markets Defy Simple Models
Conventional financial models, often reliant on assumptions of normality and efficient markets, frequently fall short when applied to actual trading data. These models struggle to account for persistent ‘Stylized Facts’ – observable characteristics of financial time series that defy simple explanations. Real-world markets exhibit excess kurtosis – extreme events occur far more often than a normal distribution predicts – and negatively skewed return distributions, in which large negative returns occur more often than comparably large positive ones. This disconnect arises because traditional approaches oversimplify the interplay of investor behavior, information flow, and market mechanics. Consequently, predictions generated by these models can be inaccurate and risk assessments misleading, highlighting the need for more robust and realistic frameworks capable of capturing the inherent complexities of financial markets.
Financial markets routinely exhibit patterns that deviate significantly from the predictions of traditional statistical models, primarily through the phenomenon of ‘heavy-tailed returns’ and volatility clustering. Unlike a normal distribution where extreme events are rare, market returns often display a higher probability of large, unexpected gains or losses – a ‘heavy tail’ – suggesting inherent risks are underestimated by conventional methods. Simultaneously, periods of high volatility tend to cluster together, meaning large price swings are more likely to follow other large swings, and vice versa, rather than occurring randomly. These characteristics invalidate assumptions of constant volatility and normally distributed errors underpinning many economic models, necessitating the development of more sophisticated approaches – such as those incorporating stochastic volatility, extreme value theory, or agent-based modeling – to accurately capture market behavior and better manage financial risk.
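Both stylized facts are easy to verify numerically. The sketch below (illustrative only; the GARCH(1,1)-style generator and its parameters are stand-ins, not from the paper) produces synthetic returns with clustered volatility, then measures excess kurtosis and the autocorrelation of squared returns:

```python
import numpy as np

rng = np.random.default_rng(0)

# GARCH(1,1)-style synthetic returns as a stand-in for real trade data.
n = 10_000
ret = np.zeros(n)
vol = np.zeros(n)
vol[0] = 0.01
for t in range(1, n):
    # Volatility clustering: today's variance depends on yesterday's shock and variance.
    vol[t] = np.sqrt(1e-6 + 0.1 * ret[t - 1] ** 2 + 0.85 * vol[t - 1] ** 2)
    ret[t] = vol[t] * rng.standard_normal()

def excess_kurtosis(x):
    """Positive values indicate heavier tails than a normal distribution."""
    x = x - x.mean()
    return (x ** 4).mean() / (x ** 2).mean() ** 2 - 3.0

def autocorr(x, lag=1):
    """Lag-k autocorrelation; positive for squared returns => volatility clusters."""
    x = x - x.mean()
    return (x[:-lag] * x[lag:]).mean() / x.var()

print(excess_kurtosis(ret))       # > 0: heavy tails
print(autocorr(ret ** 2, lag=1))  # > 0: volatility clustering
```

A Gaussian i.i.d. series would drive both statistics toward zero; the positive values here are exactly the deviations that invalidate constant-volatility, normal-error models.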
A comprehensive understanding of market microstructure, specifically the dynamics within the limit order book, is increasingly recognized as essential for building robust financial models. The limit order book represents all outstanding buy and sell orders for an asset, and its intricate interplay reveals how price discovery truly occurs. Researchers are finding that traditional modeling approaches, which often rely on simplified assumptions about market behavior, frequently fail because they overlook the granular details of order placement, cancellation, and execution. By analyzing the patterns within this book – including order imbalances, order flow toxicity, and the impact of different order types – analysts can gain valuable insights into short-term price movements and overall market stability. This detailed examination allows for more accurate predictions of asset prices and risk assessment, ultimately improving the effectiveness of trading strategies and portfolio management.
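As a concrete illustration of one such signal, the toy example below (a hypothetical snapshot, not TradeFM’s feature set) computes a depth-weighted order-book imbalance and the mid-price:

```python
# Hypothetical limit order book snapshot: (price, size) per side, best level first.
bids = [(100.0, 500), (99.9, 300), (99.8, 200)]
asks = [(100.1, 200), (100.2, 400), (100.3, 600)]

def order_imbalance(bids, asks, depth=3):
    """Depth-weighted imbalance in [-1, 1]; positive values signal buy pressure."""
    bid_vol = sum(size for _, size in bids[:depth])
    ask_vol = sum(size for _, size in asks[:depth])
    return (bid_vol - ask_vol) / (bid_vol + ask_vol)

def mid_price(bids, asks):
    """Midpoint of the best bid and best ask."""
    return (bids[0][0] + asks[0][0]) / 2.0

print(order_imbalance(bids, asks))  # (1000 - 1200) / 2200 ≈ -0.0909
print(mid_price(bids, asks))        # 100.05
```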

TradeFM: A Foundation for Modeling Market Genesis
TradeFM is a 524 million parameter foundation model engineered for the specific task of modeling financial time series data and the associated trade flow. This model class represents a departure from traditional methods by offering a scalable and adaptable architecture capable of learning directly from data. The model’s parameter count signifies its capacity to represent complex relationships within financial data, while its design focuses on capturing the dynamic and sequential nature of trade activity over time. By functioning as a ‘foundation model’, TradeFM is intended to serve as a base for further specialization and application to a variety of financial modeling tasks.
TradeFM utilizes the Transformer architecture, a deep learning model originally developed for natural language processing, to model financial time series data. This architecture employs self-attention mechanisms, allowing the model to weigh the importance of different points in a time series when predicting future values. Specifically, the Transformer’s capacity for sequence modeling enables TradeFM to capture complex temporal dependencies – relationships between data points across varying time lags – without requiring pre-defined window sizes or feature engineering. This is achieved through parallel processing of the entire input sequence, unlike recurrent neural networks which process data sequentially, improving computational efficiency and allowing the model to identify long-range correlations within the trade-flow data.
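A minimal NumPy sketch of the core mechanism (single-head causal self-attention, greatly simplified relative to any production Transformer and not TradeFM’s actual code) shows how step t attends only to steps ≤ t, which is what makes autoregressive training on trade sequences possible:

```python
import numpy as np

def causal_self_attention(x, Wq, Wk, Wv):
    """Single-head causal attention over a sequence of event embeddings.

    x: (T, d) array; each row is one embedded trade event.
    The causal mask zeroes attention to future positions, so position t
    is a function of positions <= t only.
    """
    T, d = x.shape
    q, k, v = x @ Wq, x @ Wk, x @ Wv
    scores = q @ k.T / np.sqrt(d)
    mask = np.triu(np.ones((T, T), dtype=bool), k=1)   # strictly future positions
    scores[mask] = -np.inf
    weights = np.exp(scores - scores.max(axis=1, keepdims=True))
    weights /= weights.sum(axis=1, keepdims=True)      # row-wise softmax
    return weights @ v

rng = np.random.default_rng(0)
T, d = 8, 16
x = rng.standard_normal((T, d))
Wq, Wk, Wv = (rng.standard_normal((d, d)) for _ in range(3))
out = causal_self_attention(x, Wq, Wk, Wv)
print(out.shape)  # (8, 16)
```

Because the mask removes all future positions, editing the last event in the sequence leaves every earlier output unchanged, which is the property an autoregressive trade-flow model relies on.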
Traditional methods for modeling trade flow, such as Zero-Intelligence agents and Compound Hawkes Processes, rely on pre-defined rules or statistical assumptions. Zero-Intelligence agents generate trades based on probabilistic order placement without considering market impact, while Compound Hawkes Processes model trade arrival as a self-exciting point process. In contrast, TradeFM utilizes a data-driven approach, learning directly from historical time series data to identify and replicate complex patterns and dependencies. This allows TradeFM to generate more realistic simulations of market behavior, capturing nuanced interactions and dependencies that are not explicitly programmed into rule-based or statistically simplified models.
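For reference, a Hawkes process of the kind mentioned above can be simulated with Ogata’s thinning algorithm; the sketch below (parameters are illustrative, not from the paper) shows how self-excitation produces clustered trade arrivals:

```python
import math
import random

def simulate_hawkes(mu, alpha, beta, horizon, seed=0):
    """Ogata thinning for a self-exciting (Hawkes) point process.

    Intensity: lambda(t) = mu + sum_i alpha * exp(-beta * (t - t_i)).
    Each event temporarily raises the intensity, so arrivals cluster.
    Valid because the intensity only decays between events, so its value
    at the current time dominates it until the next event.
    """
    rng = random.Random(seed)
    events, t = [], 0.0
    while t < horizon:
        lam_bar = mu + sum(alpha * math.exp(-beta * (t - ti)) for ti in events)
        t += rng.expovariate(lam_bar)          # candidate inter-arrival time
        if t >= horizon:
            break
        lam_t = mu + sum(alpha * math.exp(-beta * (t - ti)) for ti in events)
        if rng.random() <= lam_t / lam_bar:    # accept with prob lambda(t)/lambda_bar
            events.append(t)
    return events

# Stable regime requires alpha/beta < 1; long-run rate is mu / (1 - alpha/beta).
events = simulate_hawkes(mu=0.5, alpha=0.8, beta=1.5, horizon=100.0)
print(len(events))
```

Note that every behavior here is hand-specified (mu, alpha, beta); TradeFM’s contrast is that it learns such dynamics from data rather than assuming a parametric form.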

Stress Testing Reality: Rigorous Evaluation via Simulation
The TradeFM Simulator is a deterministic computational environment directly integrated with the TradeFM model, serving as the primary tool for performance evaluation. This closed system allows for controlled generation of market scenarios, isolating specific variables and enabling repeatable testing of TradeFM’s predictive capabilities. Unlike stochastic or randomized simulations, the deterministic nature of the TradeFM Simulator ensures that identical inputs will always produce identical outputs, facilitating precise analysis and debugging of model behavior. The simulator’s integration with TradeFM allows for efficient iteration on model parameters and algorithms, as results are directly accessible within the model’s framework without the need for external data processing or interpretation.
The TradeFM Simulator generates market scenarios by simulating the submission and execution of individual orders – constituting ‘Order Flow’ – and subsequently deriving ‘Price Formation’ from the interaction of these orders. This deterministic environment allows for controlled experimentation, enabling evaluation of TradeFM’s predictive capabilities regarding order book dynamics and its accuracy in modeling the price discovery process. By manipulating parameters within the simulator, researchers can generate a diverse range of market conditions – varying volumes, volatility levels, and order types – to assess TradeFM’s performance under stress. The model’s predictions are then compared against the simulated outcomes to quantify its ability to forecast market behavior.
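Price formation in any order-driven simulator ultimately reduces to matching incoming orders against the resting book. The toy matcher below (an illustrative sketch, not TradeFM’s actual engine) walks a market buy through the ask queue and reports the fills, the updated book, and the average execution price:

```python
# Resting asks as (price, size), best price first.
asks = [(100.1, 200), (100.2, 400), (100.3, 600)]

def execute_market_buy(asks, qty):
    """Fill a market buy against resting asks; return (fills, remaining book)."""
    fills, remaining = [], []
    for price, size in asks:
        if qty <= 0:
            remaining.append((price, size))
            continue
        take = min(qty, size)
        fills.append((price, take))
        qty -= take
        if size > take:                      # partially consumed level stays
            remaining.append((price, size - take))
    return fills, remaining

fills, book = execute_market_buy(asks, 500)
avg_px = sum(p * q for p, q in fills) / sum(q for _, q in fills)
print(fills)   # [(100.1, 200), (100.2, 300)]
print(book)    # [(100.2, 100), (100.3, 600)]
print(avg_px)  # 100.16
```

The gap between avg_px and the pre-trade best ask (100.1) is the market impact of walking the book, one of the quantities a faithful simulator must reproduce.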
TradeFM’s robustness is assessed through the implementation of both Multi-Agent Simulation and Reinforcement Learning techniques. Multi-Agent Simulation involves populating the simulated environment with numerous independent agents, each adhering to predefined behavioral rules, to generate complex, emergent market dynamics. Reinforcement Learning is then applied by training agents within the simulation to optimize trading strategies, effectively stress-testing TradeFM’s predictive capabilities under varied and challenging conditions. This combined approach allows for the systematic exploration of TradeFM’s performance across a wide range of market scenarios, including those not observed in historical data, thereby identifying potential vulnerabilities and ensuring model stability.
The fidelity of the TradeFM Simulator is quantitatively assessed using the Wasserstein distance, a metric for comparing probability distributions. Simulations consistently achieve a Wasserstein distance of less than 0.1 when comparing the distributions of key market features – specifically, trade volume and lot counts – in the simulated environment against those observed in real market data. This low Wasserstein distance indicates a high degree of similarity between the simulated and real distributions, demonstrating the simulator’s ability to accurately replicate these critical characteristics of market behavior. The metric is calculated across a representative dataset of historical trades, ensuring the result is statistically meaningful and generalizable.
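For equal-size 1-D samples, the Wasserstein-1 distance has a simple closed form: the mean absolute difference between sorted values (the quantile coupling). The sketch below is illustrative only – synthetic lognormal draws stand in for real and simulated volumes, and in practice the features would likely be normalized before comparison against the paper’s 0.1 threshold:

```python
import numpy as np

def wasserstein_1d(a, b):
    """Wasserstein-1 distance between two equal-size empirical samples:
    mean absolute difference of their sorted values."""
    a, b = np.sort(np.asarray(a)), np.sort(np.asarray(b))
    assert a.shape == b.shape
    return np.abs(a - b).mean()

rng = np.random.default_rng(0)
real = rng.lognormal(3.0, 1.0, 50_000)   # stand-in for observed trade volumes
sim = rng.lognormal(3.0, 1.0, 50_000)    # stand-in for simulator output
print(wasserstein_1d(real, sim))         # near 0 when distributions match
```

A useful sanity check: shifting one sample by a constant c moves the distance to exactly c, which is the “earth moving” interpretation of the metric.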
Feature stationarity within the TradeFM Simulator was assessed using the Kolmogorov-Smirnov (KS) statistic over a one-year simulation period. This statistical test evaluates whether the distribution of key features remains consistent over time, indicating the model’s resilience to temporal drift. Results consistently demonstrate a KS statistic of ≤0.05 for critical variables, signifying that any observed changes in feature distributions are within statistically insignificant bounds. This level of stationarity validates the robustness of the simulated market environment and the TradeFM model’s ability to maintain predictive accuracy even as market conditions evolve over extended periods.
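The two-sample KS statistic is simply the largest gap between two empirical CDFs. A self-contained sketch of how such a stationarity check might be run (synthetic data in place of actual simulator features):

```python
import numpy as np

def ks_statistic(a, b):
    """Two-sample Kolmogorov-Smirnov statistic: the largest gap between
    the two empirical CDFs, evaluated at every observed point."""
    a, b = np.sort(a), np.sort(b)
    grid = np.concatenate([a, b])
    cdf_a = np.searchsorted(a, grid, side="right") / len(a)
    cdf_b = np.searchsorted(b, grid, side="right") / len(b)
    return np.abs(cdf_a - cdf_b).max()

rng = np.random.default_rng(0)
early = rng.standard_normal(20_000)   # a feature sampled early in the simulated year
late = rng.standard_normal(20_000)    # the same feature sampled late in the year
print(ks_statistic(early, late))      # small value => distribution is stationary
```

A statistic at or below 0.05 for samples of this size is consistent with no temporal drift, which is the criterion the evaluation reports.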

Beyond Prediction: Implications and Future Directions
TradeFM introduces a powerful capability for financial modeling: the generation of realistic, synthetic market data. This innovation addresses a critical need within the industry, as access to historical data is often limited, costly, or insufficient for comprehensively evaluating trading strategies. By creating data that mirrors the statistical properties of real markets, TradeFM allows for rigorous backtesting under diverse conditions, identifying potential weaknesses and optimizing performance before live deployment. Furthermore, the synthetic datasets prove invaluable for stress-testing risk management systems, simulating extreme market scenarios – such as flash crashes or unexpected volatility spikes – to assess their resilience and ensure adequate safeguards are in place. This ability to proactively identify and mitigate risks, coupled with enhanced strategy validation, represents a significant advancement in quantitative finance and algorithmic trading.
TradeFM’s capacity to internalize and reproduce established trading benchmarks, such as Volume Weighted Average Price (VWAP), offers a novel lens through which to examine market dynamics. The model doesn’t simply mimic these benchmarks; its internal representation allows for the identification of subtle deviations from expected behavior, potentially revealing inefficiencies or fleeting arbitrage opportunities. By analyzing how TradeFM predicts and reacts to VWAP, researchers can gain insights into the factors driving price discovery and the limits of market efficiency. Furthermore, the model’s ability to accurately forecast benchmark-related price movements suggests its utility in developing advanced trading algorithms designed to exploit momentary discrepancies and optimize execution strategies. This understanding extends beyond mere profit maximization, offering a computational framework for assessing the true cost of trading and the impact of various market participants.
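VWAP itself is a simple quantity – total traded value divided by total volume – and a minimal example makes the benchmark concrete (the trades are hypothetical):

```python
# VWAP over a list of (price, size) trades.
trades = [(100.0, 200), (100.5, 300), (99.5, 500)]

def vwap(trades):
    """Volume Weighted Average Price: total value traded / total volume."""
    value = sum(price * size for price, size in trades)
    volume = sum(size for _, size in trades)
    return value / volume

print(vwap(trades))  # (20000 + 30150 + 49750) / 1000 = 99.9
```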
TradeFM exhibits a remarkable capacity for generalization, successfully maintaining performance when applied to previously unseen market data from China and Japan. Evaluation using perplexity – a measure of how well the model predicts a sample – revealed minimal degradation when tested on these held-out datasets. This indicates that TradeFM doesn’t simply memorize patterns from its training data, but rather learns underlying principles of market behavior applicable across diverse geographical contexts. The ability to perform well in these zero-shot scenarios – without any specific training on Chinese or Japanese markets – suggests a powerful and adaptable framework with potential for broader application in global financial modeling and analysis.

The performance of TradeFM exhibits a predictable relationship between model scale, data volume, and achieved accuracy, following a power law for out-of-sample test loss: Loss ∝ Size^(−α), with α ≈ 0.18–0.19. This scaling law implies diminishing but quantifiable returns: a modest increase in model or data size yields a substantial reduction in loss, while further expansion yields progressively smaller improvements. This characteristic is crucial for efficient resource allocation during model development, enabling researchers to estimate the performance gains achievable with larger models or datasets and avoid unnecessary computational expense. The observed exponent provides a valuable benchmark for future iterations, informing decisions about model architecture and training data requirements.
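The exponent can be recovered by a straight-line fit in log-log space, since log L = log C − α·log N. The sketch below uses synthetic, noise-free losses with α = 0.185 (the midpoint of the reported range) and hypothetical parameter counts purely to illustrate the fit:

```python
import numpy as np

# Generate losses that follow L = C * N^(-alpha) exactly (no noise),
# then recover alpha via linear regression in log-log space.
alpha_true, C = 0.185, 5.0
sizes = np.array([1e6, 1e7, 1e8, 5e8])        # hypothetical parameter counts
losses = C * sizes ** (-alpha_true)

slope, intercept = np.polyfit(np.log(sizes), np.log(losses), 1)
alpha_hat = -slope
print(alpha_hat)  # ≈ 0.185, since log L = log C - alpha * log N is a line
```

With real training runs the points would be noisy, but the same regression gives the exponent used to extrapolate returns from further scaling.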
Continued development of TradeFM centers on broadening its analytical scope beyond current limitations, with planned expansions to include diverse asset classes – from fixed income and derivatives to commodities and real estate. Researchers aim to refine the model’s performance across varied market conditions, encompassing periods of high volatility, low liquidity, and significant macroeconomic shifts. This includes investigating its potential within sophisticated financial modeling applications, such as portfolio optimization, risk assessment under extreme scenarios, and the dynamic pricing of complex financial instruments, ultimately pushing the boundaries of AI-driven financial analysis and forecasting.

The development of TradeFM echoes a sentiment articulated by Grace Hopper: “It’s easier to ask forgiveness than it is to get permission.” This model doesn’t simply accept the established rules of market microstructure; it actively generates them from data, effectively reverse-engineering the underlying dynamics of billions of transactions. By learning directly from raw data, TradeFM bypasses the need for pre-defined assumptions, a process mirroring the iterative, exploratory spirit Hopper championed. The ability of TradeFM to generalize across new markets isn’t about adherence to existing structures, but about identifying and replicating the fundamental principles governing trade-flow, a testament to the power of questioning established norms and forging new paths in understanding complex systems.
What Breaks Down Next?
The construction of TradeFM, a generative model for market microstructure, isn’t a culmination, but a carefully engineered fracture point. The model successfully reproduces established patterns – a parlor trick, really – but the true test lies in forcing its failure. What happens when the latent space is pushed beyond the bounds of observed transactions? Can the model predict, or even generate, novel market anomalies – flash crashes seeded in simulation, liquidity black holes, or emergent herd behaviors? The current architecture excels at imitation; the next iteration must embrace purposeful disruption.
Generalization to new markets is presented as a strength, but this merely shifts the problem. A model that performs well across existing exchanges hasn’t truly learned market dynamics, only the shared biases of their data. The challenge isn’t adapting to new instruments, but to fundamentally different rules. What happens when the model encounters a market with asymmetric information, manipulative trading practices deliberately obscured, or a regulatory structure designed to actively prevent predictable behavior?
Finally, framing TradeFM as a reinforcement learning environment is…convenient. A robust environment isn’t one where agents consistently profit, but one that reliably punishes flawed strategies. The model’s true value won’t be realized until it can generate adversarial conditions – market states specifically designed to expose the weaknesses of any trading algorithm. Only then will it transcend the role of a sophisticated echo chamber and become a true instrument for understanding – and breaking – the logic of financial markets.
Original article: https://arxiv.org/pdf/2602.23784.pdf
Contact the author: https://www.linkedin.com/in/avetisyan/
2026-03-02 07:28