Power Aware and Density Optimized High Fan-in Posit-HUB Using Modified CSLA

J.Prasanthi Kumari, B.Ratna Raju

Abstract


In recent years, the posit number system has gained attention as a promising alternative to the traditional IEEE floating-point representation, especially within the field of deep learning. Thanks to its non-uniform distribution of values, the posit format aligns more closely with the data characteristics observed in deep learning models. This advantage enables faster training and improved computational efficiency. Among various arithmetic operations, multiplication is one of the most frequently used in these applications. To address the hardware complexity associated with such operations, the HUB (Hybrid Unum Binary) approach, introduced in 2016, focused on reducing the cost of floating-point hardware units. Building upon this idea, a novel format known as the HUB posit has been proposed. This format aims to further decrease the hardware overhead typically found in posit arithmetic units. Specifically, this research introduces a power-efficient posit adder architecture. While the mantissa adder is still designed to accommodate the maximum possible bit-width, the architecture cleverly segments the adder into several smaller units. These smaller adders operate based on the regime bit-width, which effectively determines the required mantissa bit-width for a given operation. This design methodology is implemented and tested on both 6-bit and 11-bit posit configurations. Moreover, the architecture is enhanced by incorporating a modified carry select adder along with Binary Excess Converters. This enhancement significantly optimizes the performance by reducing both power consumption and processing delay, making it well-suited for efficient and scalable deep learning hardware solutions.


Keywords


Posit number system, posit addition, computer arithmetic, low-power arithmetic circuit, HUB posit, Carry select adder, Binary Excess Converter.

References


E. Grosswald, “Topics from the Theory of Numbers,” New York: McMillan, 1996.

W. K. Jenkins, “Finite Arithmetic Concepts in Handbook for DSP,” S. K. Mitra and J. F. Kaiser, eds., Wiley, 1993, pp. 611-675.

F. J. Taylor, “Residue arithmetic: A tutorial with examples,” Computer (IEEE), vol. 17, no. 5, pp. 50-63, May 1984.

A. Omondi and B. Premkumar, “Residue Number System: Theory and Implementation,” Imperial College Press 2007, ISBN 978-1-86094-866-4.

N. Szabo and R. Tanaka, “Residue Arithmetic and its Applications to Computer Technology,” New York: McGraw Hill, 1967.

M. A. Soderstrand, W. K. Jenkins, G. A. Jullien, and F. J. Taylor, “Residue Number System Arithmetic: Modern Applications in Digital Signal Processing,” New York: IEEE Press 1986.

Wei Wang, M. N. S. Swamy, and M. O. Ahmad, “An area-time-efficient residue-tobinary converter,” Proceedings of the 43rd IEEE Midwest Symposium on Circuits and Systems, Vol. 2, pp. 904-907, 2000.

F. Barsi and P. Maestrini, “Error correcting properties of redundant residue number systems,” IEEE Transactions on Computers, vol. 23, no.9, pp. 915-923.

R. Conway and J. Nelson, “Improved POSIT MUL FIR Filter Architectures,” IEEE Transactions On Circuits and Systems II, Vol. 51, No. 1, pp. 26-28, 2004.

M. K. Jaiswal and H. K. So, “PACoGen: A hardware posit arithmetic core generator,” IEEE Access, vol. 7, pp. 74586–74601, 2019.

L. Sommer, L. Weber, M. Kumm, and A. Koch, “Comparison of arithmetic number formats for inference in sum-product networks on FPGAs,” in Proc. IEEE 28th Annu. Int. Symp. Field-Program. Custom Comput. Mach. (FCCM), 2020, pp. 75–83.

R. Murillo, A. A. Del Barrio, and G. Botella, “Customized posit adders and multipliers using the FloPoCo core generator,” in Proc. IEEE Int. Symp. Circuits Syst. (ISCAS), Oct. 2020, pp. 1–5.

H. Zhang and S.-B. Ko, “Design of power efficient posit multiplier,” IEEE Trans. Circuits Syst. II, Exp. Briefs, vol. 67, no. 5, pp. 861–865, May 2020.

L. Crespo, P. Tomás, N. Roma, and N. Neves, “Unified posit/IEEE-754 vector MAC unit for transprecision computing,” IEEE Trans. Circuits Syst. II, Exp. Briefs, vol. 69, no. 5, pp. 2478–2482, May 2022.

J. Hormigo and J. Villalba, “Measuring improvement when using HUB formats to implement floating-point systems under round-to-nearest,” IEEE Trans. Very Large Scale Integr. (VLSI) Syst., vol. 24, no. 6, pp. 2369–2377, Jun. 2016.

J. Hormigo and J. Villalba, “HUB floating point for improving FPGA implementations of DSP applications,” IEEE Trans. Circuits Syst. II, Exp. Briefs, vol. 64, no. 3, pp. 319–323, Mar. 2017.

S. Bose, A. De, and I. Chakrabarti, “Framework for automated earthquake event detection based on denoising by adaptive filter,” IEEE Trans. Circuits Syst. I, Reg. Papers, vol. 67, no. 9, pp. 3070–3083, Sep. 2020.


Full Text: PDF [Full Text]

Refbacks

  • There are currently no refbacks.


Copyright © 2013, All rights reserved.| ijseat.com

Creative Commons License
International Journal of Science Engineering and Advance Technology is licensed under a Creative Commons Attribution 3.0 Unported License.Based on a work at IJSEat , Permissions beyond the scope of this license may be available at http://creativecommons.org/licenses/by/3.0/deed.en_GB.

Â