IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, VOL. 20, NO. 2, FEBRUARY 2010
An Efficient Architecture for 3-D Discrete Wavelet Transform
Anirban Das, Anindya Hazra, and Swapna Banerjee, Senior Member, IEEE
Abstract—This paper presents an architecture of the liftingbased running 3-D discrete wavelet transform (DWT), which is a powerful image and video compression algorithm. The proposed design is one of the first lifting based complete 3-DDWT architectures without group of pictures restriction. The new computing technique based on analysis of lifting signal flow graph minimizes the storage requirement. This architecture enjoys reduced memory referencing and related low power consumption, low latency, and high throughput compared to those of earlier reported works. The proposed architecture has been successfully implemented on Xilinx Virtex-IV series field-programmable gate array, offering a speed of 321 MHz, making it suitable for realtime compression even with large frame dimensions. Moreover, the architecture is fully scalable beyond the present coherent Daubechies filterbank (9, 7). Index Terms—Discrete wavelet transform, image compression, lifting, video, VLSI architecture.
I. Introduction
S
TILL IMAGE compression technique based on 2-D discrete wavelet transform (DWT) has already gained superiority over traditional JPEG based on discrete cosine transform and is standardized in forms like JPEG2000 [1]. Quite similarly, the application of its 3-D superset, i.e., 3-D-DWT on video, outperforms the current predictive coding standards, like H.261-3, MPEG1-2,4 by rendering the quality features like better peak signal-to-noise ratio (PSNR), absence of blocky artifacts in low bit rates. Furthermore, it has the added provisions of highly scalable compression, which is mostly coveted in modern communications over heterogeneous channels like the Internet [2]. Successful application of 3-D-DWT has been reported in the literature in
References: [1] A. Skodras, C. Christopoulos, and T. Ebrahimi, “The JPEG 2000 still image compression standard,” IEEE Signal Process. Mag., vol. 18, no. 5, pp. 36–58, Sep. 2001. [2] J.-R. Ohm, M. van der Schaar, and J. W. Woods, “Interframe wavelet coding: Motion picture representation for universal scalability,” J. Signal Process. Image Commun., vol. 19, no. 9, pp. 877–908, Oct. 2004. [3] G. Menegaz and J.-P. Thiran, “Lossy to lossless object-based coding of 3-D MRI data,” IEEE Trans. Image Process., vol. 11, no. 9, pp. 1053– 1061, Sep. 2002. [4] J. E. Fowler and J. T. Rucker, “3-D wavelet-based compression of hyperspectral imagery,” in Hyperspectral Data Exploitation: Theory and Applications, C.-I. Chang, Ed. Hoboken, NJ: Wiley, 2007, ch. 14, pp. 379–407. [5] L. R. C. Suzuki, J. R. Reid, T. J. Burns, G. B. Lamont, and S. K. Rogers, “Parallel computation of 3-D wavelets,” in Proc. Scalable HighPerformance Computing Conf., May 1994, pp. 454–461. [6] E. Moyano, P. Gonzalez, L. Orozco-Barbosa, F. J. Quiles, P. J. Garcia, and A. Garrido, “3-D wavelet compression by message passing on a Myrinet cluster,” in Proc. Can. Conf. Electr. Comput. Eng., vol. 2. 2001, pp. 1005–1010. [7] W. Badawy, G. Zhang, M. Talley, M. Weeks, and M. Bayoumi, “Low power architecture of running 3-D wavelet transform for medical imaging application,” in Proc. IEEE Workshop Signal Process. Syst., Taiwan, 1999, pp. 65–74. [8] G. Bernab´ , J. Gonz´ lez, J. M. Garc´ and J. Duato, “Memory conscious e a ia, 3-D wavelet transform,” in Proc. 28th Euromicro Conf. Multimedia Telecommun., Dortmund, Germany, Sep. 2002, pp. 108–113. [9] M. Weeks and M. A. Bayoumi, “Three-dimensional discrete wavelet transform architectures,” IEEE Trans. Signal Process., vol. 50, no. 8, pp. 2050–2063, Aug. 2002. [10] Q. Dai, X. Chen, and C. Lin, “Novel VLSI architecture for multidimensional discrete wavelet transform,” IEEE Trans. Circuits Syst. Video Technol., vol. 14, no. 8, pp. 1105–1110, Aug. 2004. V. Conclusion The applications of 3-D wavelet based coding are opening new vistas in video and other multidimensional signal compression and processing. The prominent needs in these diversified application areas are efficient 3-D-DWT engines with good computing power which draws the attention of the dedicated VLSI architectures as the best possible solution. Though the researches of 2-D-DWT architectures are progressing quite fast, fewer approaches are reported in the literatures designing their 3-D counterpart. 296 IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, VOL. 20, NO. 2, FEBRUARY 2010 [11] W. Badawy, M. Weeks, G. Zhang, M. Talley, and M. A. Bayoumi, “MRI data compression using a 3-D discrete wavelet transform,” IEEE Eng. Med. Biol. Mag., vol. 21, no. 4, pp. 95–103, Jul.–Aug. 2002. [12] J. Xu, Z. Xiong, S. Li, and Y.-Q. Zhang, “Memory-constrained 3-D wavelet transform for video coding without boundary effects,” IEEE Trans. Circuits Syst. Video Technol., vol. 12, no. 9, pp. 812–818, Sep. 2002. [13] B. Das and S. Banerjee, “Low power architecture of running 3-D wavelet transform for medical imaging application,” in Proc. Eng. Med. Biol. Soc./Biomed. Eng. Soc. Conf., vol. 2. 2002, pp. 1062–1063. [14] B. Das and S. Banerjee, “Data-folded architecture for running 3-D DWT using 4-tap Daubechies filters,” IEE Proc. Circuits Devices Syst., vol. 152, no. 1, pp. 17–24, Feb. 2005. [15] W. Sweldens, “The lifting scheme: A custom-design construction of biorthogonal wavelets,” Appl. Comput. Harmon. Anal., vol. 3, no. 15, pp. 186–200, 1996. [16] I. Daubechies and W. Sweldens, “Factoring wavelet transforms into lifting steps,” J. Fourier Anal. Appl., vol. 4, no. 3, pp. 247–269, 1998. [17] Z. Taghavi and S. Kasaei, “A memory efficient algorithm for multidimensional wavelet transform based on lifting,” in Proc. IEEE Int. Conf. Acoust. Speech Signal Process. (ICASSP), vol. 6. 2003, pp. 401–404. [18] C.-T. Huang, P.-C. Tsneg, and L.-G. Chen, “Flipping structure: An efficient VLSI architecture for lifting-based discrete wavelet transform,” IEEE Trans. Signal Process., vol. 52, no. 4, pp. 1080–1090, Apr. 2004. [19] C. Pilrisot, M. Antonini, and M. Barlaud, “3-D scan based wavelet transform and quality control for video coding,” Eur. Assoc. Signal Process. J. Appl. Signal Process., vol. 2003, pp. 56–65, Jan. 2003. [20] S. Barua, J. E. Carletta, K. A. Kotteri, and A. E. Bell, “An efficient architecture for lifting-based two-dimensional discrete wavelet transforms,” VLSI J. Integration, vol. 38, no. 3, pp. 341–352, Jan. 2005. [21] G. Kuzmanov, B. Zafarifar, P. Shrestha, and S. Vassiliadis, “Reconfigurable DWT unit based on lifting,” in Proc. Program Res. Integr. Syst. Circuits, Veldhoven, The Netherlands, Nov. 2002, pp. 325–333. [22] I. S. Uzun and A. Amira, “Design and FPGA implementation of nonseparable 2-D biorthogonal wavelet transforms for image/video coding,” in Proc. Int. Conf. Image Process. (ICIP), vol. 4. Belfast, U.K., Oct. 2004, pp. 2825–2828. [23] B. Girod and S. Han, “Optimum update for motion-compensated lifting,” IEEE Signal Process. Lett., vol. 12, no. 2, pp. 150–153, Feb. 2005. [24] A. Secker and D. Taubman, “Motion-compensated highly scalable video compression using an adaptive 3-D wavelet transform based on lifting,” in Proc. IEEE Int. Conf. Image Process., Thessaloniki, Greece, Oct. 2001, pp. 1029–1032. [25] B. Pesquet-Popescu and V. Bottreau, “Three-dimensional lifting schemes for motion compensated video compression,” in Proc. IEEE Int. Conf. Acoust. Speech Signal Process., Salt Lake City, UT, May 2001, pp. 1793–1796. [26] N. Bozinovi´ , J. Konrad, T. Andr´ , M. Antonini, and M. Barlaud, c e “Motion-compensated lifted wavelet video coding: Toward optimal motion/transform configuration,” in Proc. 12th Eur. Signal Process. Conf., Vienna, Austria, Sep. 2004, pp. 1975–1978. [27] L. Luo, S. Li, Z. Zhuang, and Y.-Q. Zhang, “Motion compensated lifting wavelet and its application in video coding,” in Proc. IEEE Int. Conf. Multimedia Expo, Tokyo, Japan, 2001, pp. 365–368. [28] Architecture and Features of a Fully Scalable Motion-Compensated 3-D Subband Codec, document M7977.doc, ISO/IEC/JTC1/SC29/ WG11, Mar. 2002. [29] Improved MC-EZBC with Quarter-Pixel Motion Vectors, document 813 MPEG2002/M8366, ISO/IEC JTC1/SC29/WG11, May 2002. Anirban Das received the B.E. degree in electronics and telecommunication engineering from Bengal Engineering and Science University, Shibpur, India, and the M.S. degree in electronics and electrical communication engineering from the Indian Institute of Technology Kharagpur, Kharagpur, India. His M.S. thesis title was “Lifting based architecture for realizing two and 3-D discrete wavelet transform.” His research work was related to the design of highly efficient reduced memory 2-D and 3-D discrete wavelet transform architectures to be used in lossless and lossy image and video compression systems. Since 2006, he has been with the Bangalore Design Center, Nvidia Corporation, Bangalore, India, where he holds the position of an ApplicationSpecific Integrated Circuit Design Engineer, working on Nvidia’s latest motherboard graphics processors. His research interests include multimedia and signal processing architectures, platform architectures, system-on-chip design, and controller design of industry standard bidirectional universal switches. Anindya Hazra received the B.E. degree in electronics and telecommunication engineering from Bengal Engineering and Science University, Shibpur, India, in 2002, and the M.S. degree in electronics and electrical communication engineering from the Indian Institute of Technology Kharagpur, Kharagpur, India, in 2005. From 2005 to 2006, he was with the Computer Maintenance Corporation Ltd., Hyderabad, India. He is currently a Complex Digital Designer at the STMicroelectronics Private Ltd., Greater Noida, Uttar Pradesh, India. His research interests include image and video processing architecture, and complex system-on-chip design. Swapna Banerjee (SM’99) received the B.E. and M.E. degrees from the Department of Electronics and Telecommunication Engineering, Jadavpur University, Jadavpur, India, in 1971 and 1974, respectively. She received the Ph.D. degree from the Department of Electronics and Electrical Communication Engineering, Indian Institute of Technology Kharagpur, Kharagpur, India, in 1981. She is currently a Professor at the Department of Electronics and Electrical Communication Engineering, Indian Institute of Technology Kharagpur. She has published in several reputed journals and conferences. She has served as a mentor for a number of doctoral students over the last two decades. She is supervising a number of projects funded by national and international agencies, mainly related to VLSI-DSP-based noninvasive biomedical instrumentations. Her research interests include analog and digital VLSI design, and digital signal/image processing for biomedical instrumentations. Dr. Banerjee received the Postdoctoral Fellowship from Tokyo University, Tokyo, Japan. She received the prestigious Matsumae International Foundation Award in 1985.