Low-Error and High-Throughput Discrete Cosine Transform (DCT) Design

Sadiq Ali Mohammad, Sumanth K


In this paper, an error-compensated adder-tree (ECAT) that performs shifting and addition in parallel is proposed to deal with truncation errors and to achieve a low-error, high-throughput discrete cosine transform (DCT) design. Many earlier DCT architectures were based on systolic designs that aim to reduce the number of multipliers, since multipliers consume high power and occupy a large area. Instead of the 12 bits used in previous works, 9-bit distributed arithmetic (DA) precision is chosen for this work so as to meet peak-signal-to-noise-ratio (PSNR) requirements. The proposed 2-D DCT core is synthesized using Xilinx ISE 9.1, and on the Xilinx XC2VP30 FPGA it achieves a throughput rate of 792 megapixels per second (M-pels/sec).
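To make the truncation-error idea concrete, the following is a minimal Python sketch, not the authors' ECAT circuit. It assumes a generic model in which partial term k is weighted by 2^-k, the shifted-out bits are simply dropped, and a single fixed bias (the expected value of the dropped bits, assumed uniformly distributed) is added back. The function names, the 12-bit signed range of the partial terms, and the trial count are all hypothetical choices made for illustration.

```python
import random
from fractions import Fraction


def exact_sum(partials):
    """Reference result: sum of partials[k] * 2**-k with no truncation."""
    return sum(Fraction(p, 2 ** k) for k, p in enumerate(partials))


def truncated_sum(partials):
    """Shift term k right by k bits, dropping the shifted-out bits, then add."""
    # Python's >> on ints is an arithmetic shift, i.e. floor division by 2**k.
    return sum(p >> k for k, p in enumerate(partials))


def compensated_sum(partials):
    """Truncated sum plus a fixed integer bias (assumed compensation scheme)."""
    # A dropped k-bit field, if uniform, has mean (1 - 2**-k) / 2 output LSBs.
    bias = sum(Fraction(2 ** k - 1, 2 ** (k + 1)) for k in range(len(partials)))
    return truncated_sum(partials) + round(bias)


def mean_error(fn, trials=2000, width=9, seed=0):
    """Average of fn(partials) - exact_sum(partials) over random inputs."""
    rng = random.Random(seed)
    total = Fraction(0)
    for _ in range(trials):
        # Hypothetical 12-bit signed partial terms, one per precision bit.
        partials = [rng.randrange(-(1 << 11), 1 << 11) for _ in range(width)]
        total += fn(partials) - exact_sum(partials)
    return float(total / trials)


if __name__ == "__main__":
    print("mean error, plain truncation:", mean_error(truncated_sum))
    print("mean error, fixed-bias compensation:", mean_error(compensated_sum))
```

Under these assumptions, plain truncation always rounds toward minus infinity, so its average error is negative, while the fixed bias moves the average error closer to zero. Because the bias is rounded to an integer, a small residual offset can remain.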


Distributed arithmetic (DA)-based, error-compensated adder-tree (ECAT), 2-D discrete cosine transform (DCT).






Copyright © 2013. All rights reserved. | ijseat.com

International Journal of Science Engineering and Advance Technology is licensed under a Creative Commons Attribution 3.0 Unported License. Based on a work at IJSEat. Permissions beyond the scope of this license may be available at http://creativecommons.org/licenses/by/3.0/deed.en_GB.