A pipelined shuffle-exchange network is proposed as a generalized 2D orthogonal transforms with flexible transform lengths. The 16×16 points 2-D DCT processor is chosen as the target design for its applicability in video processing. It is constructed in a modulized radix-4 pipelined structure and is implemented with high data rate and low hardware cost. According to the circuit simulations with 1.2 μm standard cell technology, the processing throughput of this DCT circuit can be 40 MHz. Extensibility for various indices and transform lengths is also discussed. Besides, a number of orthogonal transform algorithms are also implemented in the shuffle-exchange network with the same throughput rate.