At Wall Street Scholars, we believe that the cornerstone of sound financial decision-making lies in the robustness and reliability of the models used. It’s not enough to develop sophisticated models; they must also undergo rigorous validation to ensure accuracy and reliability.
One of the key questions every model validator must ask is:
How accurate and reliable is the model under real-world conditions?
Models drive decisions that lead to financial actions, impacting every facet of our economic and social systems.
Failure to identify and mitigate model risks has historically led to significant financial crises, underscoring the importance of thorough validation.
Clear examples of inadequate model validation and insufficient testing
1.Stranded London Whale (2012):
On May 10, 2012, JPMorgan Chase lost $2 billion in trading due to trader Bruno Iksil, nicknamed the "London Whale." Iksil accumulated large, risky positions in credit default swaps (CDS), leading to massive losses.
Inadequate validation of the risk models used to assess the size and impact of Bruno Iksil’s credit default swap (CDS) positions likely contributed to the losses. If JPMorgan Chase had stronger model validation processes, it might have identified the outsized risk exposure early on, preventing the accumulation of such large, unhedged positions.
2.Société Générale’s $7.2 Billion Loss (2008):
In January 2008, Société Générale suffered a loss of approximately €4.9 billion over three days of trading, beginning on January 21. The bank closed out massive equity positions during a period of steep market declines, exacerbating the damage.
Société Générale’s massive equity positions could have been better controlled with validated models that properly measured market exposure and anticipated the risk of sharp market drops. Lack of model validation likely contributed to the inability to detect the risks associated with large, concentrated positions, leading to rushed unwinding in a declining market.
3.Flash Crash of (2010):
On May 6, 2010, U.S. equity markets experienced a "flash crash," where prices plummeted and quickly recovered. Between 2:41 pm and 2:45:27 pm, E-Mini futures dropped over 5%, and the SPY ETF fell by more than 6%. At 2:45:28 pm, the Chicago Mercantile Exchange (CME) triggered a five-second trading pause to prevent further declines. Despite the pause, sell algorithms continued to execute until about 2:51 pm, during which time prices rapidly rebounded.
This event highlights how poorly validated trading algorithms, especially those using high-frequency trading (HFT), can malfunction under stress. More rigorous validation of algorithms and the market's microstructure could have prevented or mitigated the rapid sell-off. Proper stress testing of models could have helped anticipate these extreme outcomes.
4.Knight Capital’s $440 Million Loss (2012):
On August 1, 2012, Knight Capital Group, a major market maker, lost $440 million in just 30 minutes due to a trading algorithm glitch. The error caused millions of unintended trades, leading to small but frequent losses. High-frequency trading algorithms, operating in under 30 milliseconds, exposed the firm to massive risks. Automatic safeguards, such as stop-loss limits, could have prevented such a catastrophic event.
Knight Capital's loss was directly linked to a software glitch in its algorithm, a clear case of insufficient model validation and testing. If Knight had thoroughly tested its trading algorithms before deployment, it could have avoided the errors that resulted in millions of unintended trades.
5.U.S. Treasury Intra-Day Crash (2014):
On October 15, 2014, the U.S. Treasury market experienced a 12-minute, 37 basis point swing between 9:33 and 9:45 am ET, with no significant news triggering the volatility. A report from the U.S. Department of the Treasury noted an imbalance between buyer-initiated trades (pushing yields lower) and seller-initiated trades (driving yields back up) during the event window.
The 12-minute crash in the U.S. Treasury market can be attributed to models not adequately considering liquidity risks and market imbalances. More robust validation processes could have better accounted for market dynamics and anticipated the imbalance between buyers and sellers during periods of stress.
6.August 24 Volatility Test (2015):
On August 24, 2015, U.S. stock prices experienced a sharp decline following panic caused by the Chinese stock market crash. Trading of stocks and ETFs was paused over 1,278 times due to Limit Up/Limit Down (LULD) halts, with 773 Limit Up halts and 505 Limit Down halts. Each halt lasted five minutes, triggered when a stock moved by 5% or more. However, market-wide circuit breakers did not activate as the S&P 500 did not reach the 7% decline threshold.
This event underscores the importance of validating models related to market volatility and circuit breakers. If these mechanisms had been stress-tested and validated more thoroughly, the market might have been better equipped to handle the volatility induced by the Chinese market panic, potentially avoiding the excessive number of trading halts.
In all these cases, rigorous model validation—including stress testing, back testing, and real-time monitoring—could have mitigated or prevented the financial losses and market disruptions. Each incident highlights the critical importance of validating models to ensure they perform as expected, even in extreme market conditions.
Our commitment to model validation doesn’t end after the initial checks. At Wall Street Scholars, we provide ongoing post-validation support to banks and fintech, helping them continuously monitor their models' performance in live environments, ensuring they remain effective over time.