HE Tingting, CHEN Jiyang, LEI Yuanwu, et al., “High-Performance FP Divider with Sharing Multipliers Based on Goldschmidt Algorithm,” Chinese Journal of Electronics, vol. 26, no. 2, pp. 292-298, 2017, doi: 10.1049/cje.2016.10.004
Citation: HE Tingting, CHEN Jiyang, LEI Yuanwu, et al., “High-Performance FP Divider with Sharing Multipliers Based on Goldschmidt Algorithm,” Chinese Journal of Electronics, vol. 26, no. 2, pp. 292-298, 2017, doi: 10.1049/cje.2016.10.004

High-Performance FP Divider with Sharing Multipliers Based on Goldschmidt Algorithm

doi: 10.1049/cje.2016.10.004
Funds:  This work is supported by the Aerospace Science Foundation of China (No.2013ZC88003), and the Natural Science Foundation of China (No.61402499).
More Information
  • Corresponding author: LEI Yuanwu (corresponding author) is an assistant professor of National University of Defense and Technology (NUDT), China. His research interests include high performance computer architecture and computing engineering. (Email:yuanwulei@nudt.edu.cn)
  • Received Date: 2015-05-28
  • Rev Recd Date: 2016-01-05
  • Publish Date: 2017-03-10
  • Focused on the issue that division is complex and needs a long latency to compute, a method to design the unit of high-performance Floating-point (FP) divider based on Goldschmidt algorithm was proposed. Bipartite reciprocal tables were adopted to obtain initial value of iteration with area-saving, and parallel multipliers were employed in the iteration unit to reduce latency. FP divider to support pipeline execution with the control of state machine is presented to increase the throughput. The design was implemented in Digital signal process (DSP) chip by sharing the existed multipliers.
  • loading
  • S.F. Oberman and M.J. Flynn, "Division algorithms and implementations", IEEE Transactions on Computers, Vol.46, No.8, pp.833-854, 1997.
    P. Kornerup, "Digit selection for SRT division and square root", IEEE Transactions on Computers, Vol.54, No.3, pp.727-739, 2005.
    D. Wang and M.D. Ercegovac, "A radix-16 combined complex division/square root unit with operand prescaling", IEEE Transactions on Computers, Vol.61, No.9, pp.1243-1255, 2012.
    R.E. Goldschmidt, "Applications of division by convergence", M.S. Thesis, Massachusetts Institute of Technology, 1964.
    P.W. Markstein, "Computation of elementary function on the IBM RISC system/6000 processor", IBM J. Research and Development, Vol.34, No.1, pp.111-119, 1990.
    S.F. Anderson, J.G. Earle, R.E. Goldschmidt, et al., "The IBM system/360 model 91:Floating-point execution unit", IBM J. Research and Development, Vol.11, No.1, pp.34-53, 1967.
    S.F. Oberman, "Floating-point division and square root algorithms and implementation in the AMD-K7 microprocessor", Proc. of 14th IEEE Symposium on Computer Arithmetic, Adelaide, SA, pp.106-115, 1999.
    M.D. Ercegovac, Laurent Imbert and Jean-Michel Muller, "Improving Goldschmidt division, square root", and square root reciprocal, IEEE Transactions on Computers, Vol.49, No.7, pp.759-763, 2000.
    I. Kong and E.E. Swartzlander, "A Goldschmidt division method with faster than quadratic convergence", IEEE Transactions on Very Large Scale Integration Systems, Vol.19, No.4, pp.759-763, 2011.
    E.V. Krishnamurthy, "On optimal iterative schemes for high speed division",IEEE Transactions on Computers, Vol.19, No.3, pp.227-231, 1970.
    Guy Evena, P.M. Seidelb and W.E. Fergusonc, "A parametric error analysis of Goldschmidt's division algorithm", Journal of Computer and System Sciences, Vol.70, No.1, pp.118-139, 2005.
    Daniel Piso and J.D. Bruguera, "ariable latency Goldschmidt algorithm based on a new rounding method and a remainder estimate", IEEE Transactions on Computers, Vol.60, No.11, pp.1535-1546, 2011.
    W. Yan, X.J. Qu, H. Chen, et al., "Improved Goldschmidt division method using mapping of divisors", Science China, Vol.56, No.6, pp.1535-1546, 2013.
    David Stevenson, "An American national standard IEEE standard for binary floating-point arithmetic", ACM SIGPLAN Notices, Vol.22, No.2, pp.53-57, 1987.
    D.D. Sarma and D. Matula, "Measuring the accuracy of ROM reciprocal tables", IEEE Transactions on Computers, Vol.43, No.8, pp.932-940, 1994.
    D.D. Sarma and D. Matula, "Faithful bipartite Rom reciprocal tables", Proc. of 12th IEEE Symposium on Computer Arithmetic, Bath, England, pp.17-28, 1995.
    D.D. Sarma and D. Matula, "Faithful interpolation in reciprocal tables", Proc. of 13th IEEE Symposium on Computer Arithmetic, California, USA, pp.82-91, 1997.
    E. Schwarz and M.J. Flyn, "Hardware starting approximation for the square root operation", Proc. of 11th IEEE Symposium on Computer Arithmetic, Windsor, Canada, pp.103-111, 1993.
    S.F. Oberman and M.J. Flynn, "Design issues in division and other floating-point operations", IEEE Transaction on Computers, Vol.46, No.2, pp.154-161, 1997.
    S.M. Chen, Y.H. Wang and J.H. Wan, "FT-Matrix:A coordination-aware-architecture for signal processing", IEEE MICRO., Vol.34, No.6, pp.64-73, 2014.
    L. Wei and Alberto Nannarelli, "Power efficient division and square root unit", IEEE Transaction on Computers, Vol.61, No.8, pp.1059-1070, 2012.
    Taek-Jun Kwon and Jeffrey Draper, "Floating-point division and square root using a Taylor-series expansion algorithm", IEEE Transaction on Computers, Vol.46, No.2, pp.1601-1605, 2009.
  • 加载中

Catalog

    通讯作者: 陈斌, bchen63@163.com
    • 1. 

      沈阳化工大学材料科学与工程学院 沈阳 110142

    1. 本站搜索
    2. 百度学术搜索
    3. 万方数据库搜索
    4. CNKI搜索

    Article Metrics

    Article views (460) PDF downloads(358) Cited by()
    Proportional views
    Related

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return