@Vu
July 2025
This report covers a deep learning model designed to predict the opening gap percentage of the SPY ETF. The opening gap, or the price change between the previous day's close and the current day's open, is a significant source of daily volatility. By leveraging a hybrid CNN-LSTM architecture, the model analyzes pre-market and overnight 15-minute candlestick data to forecast the direction and magnitude of the gap.
The model was trained on data from February 2024 to June 2025. On the unseen test set, it achieved a Directional Accuracy of 57.1%, indicating a statistically significant edge in predicting whether the market will gap up or down. While the model's R-squared value is modest (0.13), its ability to correctly forecast the gap's direction presents a valuable signal for pre-market trading strategies.
The model's predictive power is derived from a rich set of engineered features designed to capture market dynamics in the crucial pre-open hours.
Gap_Percentage, calculated as ((Current_Day_Open - Previous_Day_Close) / Previous_Day_Close) * 100.The task is formulated as a many-to-one sequence-to-value regression problem. The model takes a sequence of feature vectors (one for each 15-minute candle) as input and outputs a single continuous value representing the predicted gap percentage.