The significant wave height (SWH) is the actual wave height calculated in a certain way. SWH is an important parameter used to measure the marine climate and plays a crucial role in marine disaster prediction (
Ardhuin et al., 2019), marine engineering construction (
Vanem, 2016) and ship design (
Caires and Sterl, 2005). The accurate and quick prediction of the SWH has become a considerable problem in the development of modern ocean technology. Currently, the SWH forecasting can be divided into three categories: numerical wave, machine learning and deep learning models. Given appropriate SWH forecasting models, the accuracy of prediction depends on the forecast time span. The longer the prediction time is, the lower the accuracy of prediction is. With the rapid development of computer science and technology, more attention has been paid to the numerical wave models. The third-generation numerical wave prediction models (e.g., Wave Modeling, Wave Watch III Model, and Simulating Waves Nearshore Model) are the most widely used numerical wave models. The machine learning methods can fit complex nonlinear processes and solve complex nonlinear problems of the physical mechanism without prior knowledge of the system. Therefore, these methods are widely applied to SWH prediction. One is the single prediction model, such as artificial neural network (
Deo and Naidu, 1998), support vector machine (
Mahjoobi and Mosabbeb, 2009), M5′ model tree (
Etemad-Shahidi and Mahjoobi, 2009); and the other is the composite prediction model, such as hybrid empirical mode decomposition support vector regression model (
Duan et al., 2016), a fuzzy KNN-based model (
Nikoo et al., 2018). Here, traditional machine learning methods usually require manual feature engineering. If important features cannot be learned, then the prediction accuracy of the model will be significantly reduced. As a result, manually selecting feature is a laborious task. The development of computer technology in recent years realizes the possible application of complex models. Deep learning algorithms such as the simulating waves nearshore (SWAN)-long short-term memory (LSTM) (
Fan et al., 2020), convolutional neural network (
Yang et al., 2021), convolutional LSTM network (ConvLSTM) (
Zhou et al., 2021a) and convolutional neural network-bidirectional LSTM-attention mechanism model (CNN-BiLSTM-attention) (
Wang et al., 2022), can automatically learn data features and achieve successful results in the field of ocean prediction. However, the above methods do not fully mine the signal characteristics of the SWH. The original data sequence is decomposed into a set of intrinsic mode functions (IMFs) by signal decomposition technology to reduce the non-stationarity of wave height data, and then the model is constructed for prediction at each intrinsic mode function sub-sequence.
Oh and Suh (2018) proposed a hybrid model combining the empirical orthogonal function and wavelet analysis with the neural network algorithm (EOFWNN) and predicted the wave height values of eight wave observation stations along the coast of the Sea of Japan. Compared with the wavelet and neural network hybrid models, EOFWNN performed effectively despite the decomposition level of wavelet analysis.
Zhou et al. (2021b) proposed the forecasting algorithm using a joint empirical mode decomposition LSTM (EMD-LSTM) model, in which the EMD algorithm is applied to decompose the SWH, and the decomposed IMF sub-sequences are trained by the LSTM network. Effective results are achieved in the SWH from two buoys in the Atlantic Ocean, east of the Bahamas.
Raj and Brown (2021) jointly adopted a hybrid Boruta random forecast ensemble EMD (EEMD) bidirectional LSTM algorithm to predict SWH over 24 h along the coastal areas of Queensland, Australia. Considering the prediction of different forecast windows,
Sutskever et al. (2014) proposed a sequence-to-sequence deep learning model (Seq-to-Seq) based on encoder–decoder, in which one recurrent neural network (RNN) network encodes the input information and the other RNN network decodes the encoded information, which can demonstrate increased continuity for long-term prediction. The Seq-to-Seq has been applied to long-term prediction problems in different fields, such as ocean waves (
Pirhooshyaran and Snyder, 2020), wind power (
Zhang et al., 2020), power load (
Gong et al., 2019) and semantic trajectories (
Karatzoglou et al., 2018), and achieved high prediction accuracy.