A Very Fast and Low Power Incrementer and Decrementer Circuit

Samiappa Sakthikumaran¹, S. Salivahanan, V. S. Kanchana Bhaaskaran², V. Kavinilavu
B. Brindha and C. Vinoth
SSN College of Engineering, Kalavakkam, Rajiv Gandhi Salai, (Off) Chennai, Tamil Nadu, India
Email: vskanchana@hotmail.com

Abstract—An Incrementer/Decrementer (INC/DEC) is an important building block in many digital systems such as the address generation unit of microcontrollers and microprocessors. In this paper, we propose novel architectures and designs for very fast and low power INC/DEC circuits that can be used in low power RISC processors. The designs operate on the 2N-2N2P energy recovery logic design approach. The 350nm technology library files from AMS350nm standard CMOS technology are used in the work.

Index Terms—Adiabatic Logic, low power, 2N-2N2P adiabatic logic, Incrementer, Decrementer

I. INTRODUCTION

The design of high-speed and low-power VLSI architectures needs efficient processing units, which are optimized for the speed and power consumption. Hence, the trend has been to search for techniques to reduce the power dissipation, low power operation, and designing for energy recovery and recycling. Energy recovery is proving to be a promising approach for the design of low power VLSI circuits. The primary advantage of these circuits results from its inherent characteristics of

- deriving a constant current from the power clock
- getting the FETs working with the minimum voltage between the source and drain terminals. The basic complementary function Block of 2N-2N2P is shown in Fig. 1.

The module INC/DEC counts up or counts down during every clock cycle. This building block is widely used in digital systems, such as the program counter [1] and frequency divider [2]. A pre-loadable up/down counter [3] [4] is used as an INC/DEC. However, [3] or [4] incur a speed limitation in the process of generating the carry bit. This is due to the fact that the up/down counter is designed using the concept of conventional addition operation, although [5] uses the half-adder to replace the full-adder. [6] Presents a decrementer architecture employing a data-out multiplexer array and a decision block but the latency remains high due to the generation of select bit.

In this paper, we propose a structure which can realize the incrementing and decrementing operations, during incrementing/decrementing operations in the arithmetic units, for incrementing the address information of the program counter and for incrementing and decrementing operations of the stack pointer in the processors. The INC/DEC circuit structure realizes reduced delay and power, and enhanced low power dissipation characteristics.

The rest of this paper is organized as follows. Section II presents a review of the conventional INC/DEC structures. Section III presents the proposed architecture. Section IV depicts the design flow of CMOS and adiabatic INC/DEC design counterparts. The front-end design flow is also discussed in this section. Section V presents the results and analysis. The comparative performance behavior between the proposed architecture and the conventional CMOS circuits is discussed. Section VI concludes.

II. CONVENTIONAL INCREMENTER CIRCUITS

Figure 2 shows the internal logic circuit schematic of an incrementer, based on the conventional 4-bit ripple carry adder. The half adders, denoted by HA, are used for constructing the incrementer structure. Fig. 3 depicts the internal logic diagram of a 4-bit ripple carry decrementer. In these designs, the critical delay for the incrementing/decrementing operation is determined by the delay incurred in the carry bit propagation time, through the chain of AND gates. As the size of the chain grows, this delay also increases, making the delay of the most significant bit (MSB), the largest one in the process. In the proposed design, the carry is calculated using parallel AND chain thereby reducing the delay incurred.
by MSB. The latency, power dissipation, the combinational path delay and the energy consumed are calculated and compared for these circuits.

III. PROPOSED ARCHITECTURE

The design employs the carry select adder method in place of the conventional ripple-carry adder structure. This speeds up the process of the action of incrementing and decrementing (INC/DEC). The basic schematic block of the proposed carry select incrementer is shown in Fig. 4. It considers the carry_in to be a logic High always. The addition function of the proposed circuit is accomplished by the 4-bit basic block. The incrementing action or the retaining of the current value is achieved by the use of the 2-to-1 multiplexer. This accomplishes fast incrementing action along with a reduced number of function stages, thus reducing the latency of the circuit. The resultant advantage of this basic block is the large silicon area reduction realized, when the INC/DEC is designed for a wider word length of data. To explain the operation of the logic block better, the Boolean expressions depicting the 4-bit basic block are given below:

\[
\begin{align*}
B_1 &= \neg A_0; \quad B_2 = A_1 \oplus A_0; \\
C_1 &= A_1 \cdot A_0; \quad C_2 = A_2 \cdot A_3; \\
C_0 &= C_1 \cdot C_2; \quad C_3 = C_1 \cdot A_2; \\
B_3 &= A_2 \oplus C_1; \quad B_4 = A_3 \oplus C_3;
\end{align*}
\]

It can be seen that the carry_out bit of the block is calculated in parallel along with B3 by using a parallel chain of AND gates. On the other hand, the RCA structure uses a series pattern of carry propagation. This reduces the delay when compared with the conventional methods of incrementer structures.

The block diagram of the INC/DEC is shown in Fig. 5 and Fig. 6 respectively. The incrementer structure divides the word-size equally into blocks of 4-bits each. Each ripple carry block is replaced by the proposed block, which reduces the latency and number of devices utilized when the function is designed with large number of bits. The decrementer structure inherits the fast incrementer structure with inverters in both sides to provide a fast decrementing action, along with gaining the advantages of the incrementer structure.

IV. DESIGN FLOW

A. FRONT-END DESIGN

The INC/DEC design has been developed using Verilog-HDL and synthesized using the Synopsys Front-end Design Compiler. The generic libraries of SAED 90nm technology has been used in the designs. The design area and the combinational path delay have been estimated for both the conventional and proposed INC/DEC designs and the results are compared in Tables I and II.
B. BACK-END DESIGN

The conventional INC/DEC is designed in Tanner EDA using AMS350nm technology from Austria Micro Systems. The design is implemented by using the 2N-2N2P quasi-adiabatic logic style. The design is also implemented using the conventional CMOS logic counterpart for justifiable comparison. Figs. 7 to 12 depict the circuit diagrams. The latency, power dissipation and energy are calculated for each of the circuit design methods.

The latency represents the number of logic stages incurred by the circuit to produce the output in response to the input. The power dissipation measured indicates the average power measured over the simulation time. The simulation is carried out over a wide range of input data values. The results so obtained are tabulated in Tables III and IV.

<table>
<thead>
<tr>
<th>Circuit Structure</th>
<th>Incrementer</th>
<th>Decrementer</th>
</tr>
</thead>
<tbody>
<tr>
<td></td>
<td>Combinational path delay (ns)</td>
<td>Area (μm²)</td>
</tr>
<tr>
<td>Ripple-carry</td>
<td>16.201</td>
<td>374.01</td>
</tr>
<tr>
<td>Proposed</td>
<td>12.122</td>
<td>346.5</td>
</tr>
</tbody>
</table>

<table>
<thead>
<tr>
<th>Circuit Structure</th>
<th>Incrementer</th>
<th>Decrementer</th>
</tr>
</thead>
<tbody>
<tr>
<td></td>
<td>N</td>
<td>L</td>
</tr>
<tr>
<td>Ripple-carry</td>
<td>1152</td>
<td>16</td>
</tr>
<tr>
<td>Proposed CMOS</td>
<td>804</td>
<td>6</td>
</tr>
<tr>
<td>Adiabatic</td>
<td>1040</td>
<td>5</td>
</tr>
</tbody>
</table>

TABLE I
PATH DELAY AND AREA

TABLE II
NO. OF DEVICES (N) AND LATENCY (L) COMPARISON
RESULTS AND DISCUSSIONS

To get the practical performance comparison made between the already circuits and the proposed incrementer decrementer circuits, the results obtained in the front end is tabulated in Table I. The proposed incrementer proves to be 25% faster than conventional one, while the decrementer is about 17 % faster. An area reduction of 8% of the proposed circuit from that of the conventional circuit has also been achieved in case of an incrementer and 6.2% in case of decrementer. Table III and IV present the power dissipation incurred by the three types of incrementer and decrementer circuits. The power presented is the average power component, while taking various combinations of the input signals at identified frequency values. This method helps us obtain a justifiable result.

TABLE III

<table>
<thead>
<tr>
<th>Circuit</th>
<th>Frequency</th>
<th>50</th>
<th>200</th>
<th>300</th>
<th>500</th>
</tr>
</thead>
<tbody>
<tr>
<td>Ripple-carry</td>
<td>0.655</td>
<td>2.6</td>
<td>5.19</td>
<td>7.76</td>
<td></td>
</tr>
<tr>
<td>Proposed CMOS</td>
<td>0.352</td>
<td>0.92</td>
<td>2.89</td>
<td>4.83</td>
<td></td>
</tr>
<tr>
<td>Adiabatic</td>
<td>0.038</td>
<td>0.36</td>
<td>0.927</td>
<td>1.68</td>
<td></td>
</tr>
</tbody>
</table>

TABLE IV

<table>
<thead>
<tr>
<th>Circuit</th>
<th>Frequency</th>
<th>50</th>
<th>200</th>
<th>300</th>
<th>500</th>
</tr>
</thead>
<tbody>
<tr>
<td>Ripple-carry</td>
<td>0.55</td>
<td>3.4</td>
<td>5.35</td>
<td>8.0</td>
<td></td>
</tr>
<tr>
<td>Proposed CMOS</td>
<td>0.32</td>
<td>2.4</td>
<td>4.25</td>
<td>6.0</td>
<td></td>
</tr>
<tr>
<td>Adiabatic</td>
<td>0.075</td>
<td>0.35</td>
<td>0.86</td>
<td>1.7</td>
<td></td>
</tr>
</tbody>
</table>

V. RESULTS AND DISCUSSIONS

The adiabatic incrementer realizes 17 times less power than the ripple-carry incrementer type at 50MHz. It realizes 4.62 times less power at 500MHz frequency of operation. The corresponding values of decrementer are 7.33 and 4.7 at 50MHz and 500MHz respectively.

The adiabatic power is the power incurred by the circuit, and it depicts the advantage of the adiabatic or energy recovery property of the circuit.

VI. CONCLUSION

A new circuit for the incrementer/decrementer is presented in this paper. The proposed circuit reduces the operation complexity and number of devices used. It paves way for reduced area, less power dissipation and lower energy added with increase in speed performance. The comparison results also show that the proposed adiabatic design reduces the circuit delay of a 16-bit incrementer by 25%, along with a power reduction from 0.55mW of the ripple-carry adder to 0.075mW of the adiabatic decrementer. The corresponding values of incrementer are reduction of adiabatic power from 0.655mW to 0.038mW at 50MHz. At 500MHz, power reduction from 7.76mW to 1.68mW of adiabatic circuit has been realized for the incrementer.

The decrementer realizes power reduction from 8.0mW to 1.7mW. The comparison pertaining the proposed CMOS incrementer circuit shows a power reduction of nearly 2 times at 50MHz to 1.6 times at 500MHz. The corresponding power reduction for decrementer is nearly 1.72 times at 50MHz to 1.33 times at 500MHz.

ACKNOWLEDGMENTS

The work is supported by the Department of Electronics and Communication Engineering, SSN College of Engineering, Kalavakkam, Tamil Nadu, India.

REFERENCES