LI Jingjing, ZHANG Qijin, ZHANG Yumei, et al., “Hidden Phase Space Reconstruction: A Novel Chaotic Time Series Prediction Method for Speech Signals,” Chinese Journal of Electronics, vol. 27, no. 6, pp. 1221-1228, 2018, doi: 10.1049/cje.2018.09.010
Citation: LI Jingjing, ZHANG Qijin, ZHANG Yumei, et al., “Hidden Phase Space Reconstruction: A Novel Chaotic Time Series Prediction Method for Speech Signals,” Chinese Journal of Electronics, vol. 27, no. 6, pp. 1221-1228, 2018, doi: 10.1049/cje.2018.09.010

Hidden Phase Space Reconstruction: A Novel Chaotic Time Series Prediction Method for Speech Signals

doi: 10.1049/cje.2018.09.010
Funds:  This work is supported by the National Key Research and Development Program of China (No.2017YFB1402100), the National Natural Science Foundation of China (No.11772178, No.11502133, No.61701291), the 111 Project (No.B16031), the Fundamental Research Fund for the Central Universities (No.2017CBY008), the China Postdoctoral Science Foundation funded project (No.2017M613053), and the Shaanxi Science & Technology Co-ordination & Innovation Project (No.2015KTZDGY06-05-01).
More Information
  • Corresponding author: WU Xiaojun (corresponding author) is a professor in Shaanxi Normal University, Xi'an, China. His research interests include pattern recognition, intelligent system and system complexity. (Email:xjwu@snnu.edu.cn)
  • Received Date: 2018-04-23
  • Rev Recd Date: 2018-05-17
  • Publish Date: 2018-11-10
  • Speech signals are nonlinear chaotic time series. This paper proposes a novel speech signal nonlinear prediction model with the hidden phase space reconstruction method. The parameters, embedding dimension m, time delay τ and model structure are solved simultaneously, breaking the restriction of phase space, which needs to be reconstructed before modeling for the existing prediction method. Subsequently, an explicit speech signal prediction model is generated. Meanwhile, the introduction of the frame length parameter k effectively extends the prediction length. Experimental results show that the values of m and τ solved by the proposed method are consistent with the values addressed by the Cao method and mutual information method, respectively. In addition, the optimal value of k is further discussed. The prediction results obtained using the proposed model are more accurate than those of linear prediction coding, the radial basis function neural network model and the long short-term memory network.
  • loading
  • J. Sun, N. Zheng and X. Wang, “Enhancement of Chinese speech based on nonlinear dynamics”, Signal Processing, Vol.87, No.10, pp.2431-2445, 2007.
    O.H. Kocal, E. Yuruklu and I. Avcibas, “Chaotic-type features for speech steganalysis”, IEEE Transactions on Information Forensics and Security, Vol.3, No.4, pp.651-661, 2008.
    D.G. Stavrakoudis and J.B. Theocharis, “Pipelined recurrent fuzzy neural networks for nonlinear adaptive speech prediction”, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics), Vol.37, No.5, pp.1305-1320, 2007.
    F.M. Bianchi, S. Scardapane, A. Uncini, A. Rizzi and A. Sadeghian, “Prediction of telephone calls load using echo state network with exogenous variables”, Neural Networks, Vol.71, pp.204-213, 2015.
    N. Chouikhi, B. Ammar, N. Rokbani, et al., “PSO-based analysis of echo state network parameters for time series forecasting”, Applied Soft Computing, Vol.55, pp.211-225, 2017.
    L. Deng, G. Hinton and B. Kingsbury, “New types of deep neural network learning for speech recognition and related applications: An overview”, IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Vancouver, Canada, pp.8599-8603, 2013.
    C. Weng, D. Yu, S. Watanabe, et al., “Recurrent deep neural networks for robust speech recognition”, IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Florence, Italy, pp.5532-5536, 2014.
    M.K. Luka, I.A. Frank and G. Onwodi, “Neural network based hausa language speech recognition”, International Journal of Advanced Research in Artificial Intelligence, Vol.1, No.2, pp.39-44, 2012.
    Y.M. Zhang, X.J. Hu, X.J. Wu, et al., “Volterra prediction model for speech signal series”, Acta Physica Sinica, Vol.64, No.20, pp.117-129, 2015. (in Chinese)
    Y.S. Lee and L.I. Tong, “Forecasting time series using a methodology based on autoregressive integrated moving average and genetic programming”, Knowledge-Based Systems, Vol.24, No.1, pp.66-72, 2011.
    M. Ogorzalek, “Signal coding and compression based on discrete-time chaos: Statistical approaches”, IEEE International Symposium on Circuits and Systems (ISCAS), Scottsdale, Arizona, USA, Vol.4, pp.IV-IV, 2002.
    X.J. Wu and Z.Z. Yang, “Nonlinear speech coding model based on genetic programming”, Applied Soft Computing, Vol.13, No.7, pp.3314-3323, 2013.
    L. Yang, J.X. Zhang, X.J. Wu, Y.M. Zhang and J.J. Li, “A chaotic time series prediction model for speech signal encoding based on genetic programming”, Applied Soft Computing, Vol.38, pp.754-761, 2016.
    K. James and E. Russell, “Particle swarm optimization”, IEEE International Conference on Neural Networks, Perth, Australia, Vol.4, pp.1942-1948, 1995.
    D. Pradeepkumar and V. Ravi, “Forecasting financial time series volatility using particle swarm optimization trained quantile regression neural network”, Applied Soft Computing, Vol.58, pp.35-52, 2017.
    H.G. Han, W. Lu and J.F. Qiao, “A multi objective particle swarm optimization algorithm based on the diversity information and covergence degree”, Acta Electronica Sinica, Vol.46, No.2, pp.315-324, 2018. (in Chinese)
    X.J. Wu, Z.Z. Yang and M. Zhao, “A uniform searching particle swarm optimization algorithm”, Acta Electronica Sinica, Vol.39, No.6, pp.1261-1266, 2011. (in Chinese)
    X.F. Xie, W.J. Zhang and Z.L. Yang, “A dissipative particle swarm optimization”, Eprint Arxiv Cs, Vol.abs/cs/0505065, No.8, pp.1456-1461, 2005.
    X.J. Wu, F. Li, Y. Ma, et al., “The convergence analysis of the uniform search particle swarm optimization”, Acta Electronica Sinica, Vol.40, No.6, pp.1115-1120, 2012. (in Chinese)
    J.J. Jiang, Y. Zhang and C. Mcgilligan, “Chaos in voice, from modeling to measurement”, Journal of Voice, Vol.20, No.1, pp.2-17, 2006.
    M.P. Fay and M.A. Proschan, “Wilcoxon-Mann-Whitney or ttest? On assumptions for hypothesis tests and multiple interpretations of decision rules”, Statistics Surveys, Vol.4, pp.1-39, 2010.
  • 加载中

Catalog

    通讯作者: 陈斌, bchen63@163.com
    • 1. 

      沈阳化工大学材料科学与工程学院 沈阳 110142

    1. 本站搜索
    2. 百度学术搜索
    3. 万方数据库搜索
    4. CNKI搜索

    Article Metrics

    Article views (481) PDF downloads(270) Cited by()
    Proportional views
    Related

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return