0

https://doi.org/10.1038/s44335-024-00013-1

# Adiabatic leaky integrate and fire neurons with refractory period for ultra low energy neuromorphic computing

Check for updates

Marco Massarotto 🖂, Stefano Saggini, Mirko Loghi & David Esseni

In recent years, the in-memory-computing in charge domain has gained significant interest as a promising solution to further enhance the energy efficiency of neuromorphic hardware. In this work, we explore the synergy between the brain-inspired computation and the adiabatic paradigm by presenting an adiabatic Leaky Integrate-and-Fire neuron in 180 nm CMOS technology, that is able to emulate the most important primitives for a valuable neuromorphic computation, such as the accumulation of the incoming input spikes, an exponential leakage of the membrane potential and a tunable refractory period. Differently from previous contributions in the literature, our design can exploit both the charging and recovery phases of the adiabatic operation to ensure a seamless and continuous computation, all the while exchanging energy with the power supply with an efficiency higher than 90% over a wide range of resonance frequencies, and even surpassing 99% for the lowest frequencies. Our simulations unveil a minimum energy per synaptic operation of 470 fJ at a 500 kHz resonance frequency, which yields a 9x energy saving with respect to a non-adiabatic operation.

Computers designed according to the von Neumann separation between processing and memory, tightly synchronized by clock signals and based on a high-precision digital representation have not been conceived for the complex cognitive tasks and the extreme energy efficiency required by edge artificial intelligence (AI) applications. In-memory-computing has thus emerged as a compelling paradigm to improve energy efficiency in hardware solutions dedicated to AI. Moreover, neuromorphic computing is being actively explored as a bio-inspired approach to information processing that is based on spiking neural networks<sup>1</sup>, and is capable of sophisticated tasks including real-world sensory processing<sup>2</sup>. The essential components of neuromorphic hardware are artificial synapses and neurons (see Fig. 1), with artificial synapses being pivotal also for crossbar arrays employed in accelerators for neural network inference<sup>3-5</sup>.

Many implementations of artificial synapses have been proposed including CMOS sub-threshold circuits<sup>6-9</sup>, switched capacitors<sup>10</sup>, oxide-RAM<sup>11</sup>, and Li-ion transistors<sup>12</sup>. Moreover, in order to reduce the area and improve the energy efficiency, single-device synapses have been explored using diverse kinds of memristors<sup>13–15</sup>.

Capacitive synapses can enable a highly energy efficient, charge domain in-memory-computing<sup>16-18</sup>, and non-volatile tunable capacitors, or memcapacitors, could provide an ideal platform for such an implementation<sup>19-22</sup>. In fact, in contrast to memristors, the read-out of capacitive synapses is not inherently dissipative<sup>23,24</sup>. Nevertheless, the usual operation of capacitors, sketched in Fig. 1b, implies the dissipation in the

driving circuitry of an energy comparable to the energy stored in the capacitor itself.

Thus, in the quest for a further leap forward in the energy efficiency of neuromorphic hardware and in-memory-computing, the adiabatic operation of the capacitors seems very attractive<sup>22,25</sup>.

The adiabatic operation illustrated by Fig. 1c has already been explored for logic circuits, but without reaching a significant impact. This was due to the challenges in the design of logic gates and of multi-phase power-clock generators, to the penalty in dynamic performance<sup>26</sup>, as well as to the fact that the adiabatic operation can effectively suppress the dynamic energy but not as much the static leakage energy, whose impact has become comparatively more important with the scaling of CMOS technologies and at low operating frequencies.

The adiabatic operation can be much more rewarding in neuromorphic circuits employing capacitive synapses<sup>22,25</sup>, because these bioinspired circuits inherently target a relatively low-frequency operation and, moreover, the dynamic energy consumption tends to dominate the static dissipation due to the use of quite large capacitors.

In this paper, we expand on our previous work in ref. 27 by presenting, for the first time to our knowledge, an adiabatic Leaky Integrate-and-Fire neuron featuring an accumulation of the incoming input spikes, an exponential leakage of the membrane potential and a tunable refractory period. We report transistor-level simulations, carried out with the commercially available XP018-180 nm XFAB CMOS technology, demonstrating that our

DPIA-Polytechnic Department of Engineering and Architecture, University of Udine, Udine, Italy. 🖂 e-mail: massarotto.marco001@spes.uniud.it



Fig. 1 | Adiabatic neuromorphic computing. a Sketch of an artificial spiking neuron in a spiking Hopfield neural network with self-recursion. The input spikes are weighted by the synapses and then integrated into the membrane potential, which finally fires an output spike upon reaching an appropriate threshold. b Nonadiabatic driving of a capacitor by an abrupt input voltage V<sub>IN</sub> having practically null rise and fall times (T = 0). The dissipation during each charging or discharging is  $\frac{1}{2}CV_{DD}^2$  and does not depend on R. c Adiabatic driving of a capacitor by a  $V_{IN}$ 

waveform with rise and fall times (T) sufficiently longer than the time constant  $\tau = RC$  of the circuit. In this case, the conduction losses become proportional to  $\tau/T$ and so can be arbitrarily reduced by slowing down the driving signal. Since in the charging-discharging cycle only a fraction of the supplied energy is dissipated on *R*, during the discharging the power supply can recover a significant fraction of the supplied energy.



Fig. 2 | Schematic of the adiabatic neuromorphic architecture. a Global architecture featuring an inductive driver (yellow) that resonates with the word-line capacitance selected by its WL controller (pink). Each word-line comprises a row of differential synapses (orange) that weigh and forward the spikes to the soma of the neurons on the bit-lines (blue), where they are integrated into the differential membrane potential ( $\Delta V_m$ ), that is finally monitored by a comparator. **b** Timemultiplexing of a single driver among all the neurons of the Hopfield network.

design is able to recover the charge supplied to the capacitive synapses and soma with minimal losses, in fact the circuit can reach an energy efficiency higher than 99% for the slowest resonance frequencies. Our results reveal a minimum energy per synaptic operation of 470 fJ at a 500 kHz resonance frequency with a supply voltage  $V_{DD} = 1.8$  V. Such an energy per synaptic operation translates into a 9× energy saving with respect to a non-adiabatic operation.

## Results

The adiabatic architecture proposed in this work is shown in Fig. 2. The core of our circuit is a crossbar array of synapses which natively implements a spiking Hopfield network with self-recursion (see Fig. 2b). Nonetheless, any other network architecture can be obtained by pruning the undesired network connections, namely by setting their corresponding synaptic weight to zero. The spikes are distributed through the network based on the Address-Event Representation (AER) protocol<sup>28</sup>. In particular, both the word-lines (WL) and the neurons on the bit-lines (BL) are identified by matching Article

configuration of the synapse based on the leakage and refractory period of the corresponding neuron. e Binary-encoded capacitor bank that implements the differential capacitive synapse. f Encoding of the synaptic weight and corresponding number of Least Significant Bit (LSB) capacitors that form the positive and negative synaptic capacitors  $(C_{syn}^{\pm})$ .

identifiers, so that the spikes fired by the *i*-th neuron can be forwarded to all neurons through the synapses of the *i*-th word-line.

In our architecture, all the energy involved in the neuromorphic computation is supplied to the neurons and synapses in an adiabatic fashion, thus greatly reducing the conduction losses in the circuit. More details about the operation of the system will be presented in the "Discussion" section. In brief, upon each input spike, an inductive driver (yellow) resonates with the target capacitive word-line, which consists of differential capacitive synapses (orange) that form two capacitive dividers with the differential soma of the neurons (blue). Thanks to the resonance, the driver can adiabatically transfer or recover energy to and from the word-lines. The supplied current is weighted by the synapses and then integrated by the soma into the *dif*ferential membrane potential  $(\Delta V_m)$  as a function of their synaptic weight (SW). Finally, the membrane potential is monitored by the neuron comparator, which fires an output spike upon the crossing of a certain threshold.

One specific novelty of our design is its capability to make use of both the adiabatic charging and recovery phases to accumulate the incoming



**Fig. 3** | **Neuromorphic computation and ultra-low energy performance.** Example of the neuromorphic computation. **a** Synchronous clock spikes (CLK, gray) that provide a time base for both the neuron leakage and refractory period, and asynchronous spikes (SPK, black) weighted by the synaptic weights. **b** Output spikes fired by the neurons in (**c**) to (**f**) for different synaptic weights (SW) and duration of the refractory period (REFR). All neurons share the same time constant for the exponential leakage. **g** Time evolution of the energy supplied to the circuit (black) and stored in the capacitors (gray), whose difference provides the dissipated energy. Energy performance simulated from the example on the left and expressed in terms of Energy per Synaptic Operation. **h** Adiabatic conduction losses (green line) versus the resonance frequency  $f_{LC}$  and corresponding adiabatic efficiency (right *y*-axis),

spikes in the membrane potential of the neurons. Following the same working principle, a synchronous *spiking clock* (CLK) provides the neurons with an adiabatic time-base, which is used to induce discrete inhibitions of the membrane potential, so as to achieve an adiabatic implementation of the leakage and refractory period of the neurons.

Figure 3 shows an example of the overall computation. The train of spikes from the neurons (black SPK) and clock spikes (gray CLK) in Fig. 3a are forwarded to the neurons through four different synaptic weights (SW), resulting in different behaviors of their membrane potential, which are reported in Fig. 3c–f. In particular, each input SPK increases the membrane potential proportionally to the excitatory SW, and then the membrane potential exponentially decays towards the resting state due to the CLK-driven leakage of the neuron. Furthermore, whenever the membrane potential overcomes the spiking threshold set by the comparator (dashed lines), the neurons fire the output spikes of Fig. 3b and then enter a refractory period. The black triangles show how the circuit can adjust the duration of the refractory period of the neurons, which are also reported in the figure. As a reference, the pink neuron having the highest SW as well as the shortest refractory period exhibits the most lively spiking activity.

Below, Fig. 3g reports the energy supplied to the circuit over time (black), as well as the total energy stored in all its internal capacitors (gray). Since the energy remaining in the synapses and neurons plays an active role



defined as the saving with respect to the non-adiabatic energy dissipation (red line). **i** Dynamic energy per synaptic operation comprising the conduction losses (green) and the consumption of the logic (gray) plotted versus  $f_{LC}$ . **j** Total ESOP extracted at the Typical Mean at 27 °C (TM, green), Worst Power at 0 °C (WP, yellow) and Worst Speed at 100 °C (WS, cyan) corners including both the dynamic and static energy contributions (reported for TM corner only). The opposite  $f_{LC}$  dependence of the dynamic and static energies results in a Minimum Energy Point (MEP, colored stars) at  $f_{LC}$  = 500 kHz for all corners and corresponding to 470 fJ for the TM corner. **k** Comparison between the nominal (TM @ 27 °C) adiabatic MEP at  $f_{LC}$  = 500 kHz and the corresponding energy in nominal non-adiabatic mode.

in the computation, the actual dissipated energy is evaluated as the difference between the supplied and said stored energy.

The simulations on the left side of Fig. 3 were repeated for many different resonance frequencies  $f_{LC}$ . The resulting energy performance are analyzed as a function of  $f_{LC}$  in the right side of Fig. 3, where they are expressed in terms of *Energy per Synaptic Operation* (ESOP), namely normalized to the number of spikes (including clock spikes), and to the number of synapses served by each WL.

First, the left *y*-axis of Fig. 3h reports the total conduction losses, which correspond to the adiabatic driving efficiency on the right *y*-axis. Such efficiency is here defined as the difference between the conduction losses resulting from our adiabatic operation (green), and the non-adiabatic upper limit  $\frac{1}{2}C_{WL}V_{DD}^2$  obtained by operating our circuit in a completely non-adiabatic reduction of the conduction losses for decreasing  $f_{LC}^{29}$ . As can be seen, our design results in conduction losses roughly proportional to the square root of  $f_{LC}$  consistently with the frequency-dependent optimization of the transmission gates discussed in "Methods" and in Supplementary Note 4. Thanks to such a frequency scaling, our adiabatic efficiency remains higher than 90% over the whole explored frequency range and even exceeds a 99% efficiency at the lowest resonance frequencies.

In Fig. 3i it is shown the dynamic ESOP, which comprises the conduction losses of Fig. 3h (green), as well as the energy consumed by the

#### Table 1 | Comparison with state-of-the-art

| Chip Name<br>Reference | Loihi<br>Davies<br>IEEE Micro <sup>30</sup> | ODIN<br>Frenkel<br>IEEE TBCAS <sup>31</sup> | DYNAP-SE2<br>Richter<br>IOP Neuro <sup>8</sup> | –<br>Rubino<br>IEEE TCAS <sup>9</sup> | —<br>Maheshwari<br>IEEE TCAS <sup>25</sup> | —<br>Massarotto<br>This work |
|------------------------|---------------------------------------------|---------------------------------------------|------------------------------------------------|---------------------------------------|--------------------------------------------|------------------------------|
| Implementation         | Digital                                     | Digital                                     | Mixed: Subthreshold                            | Mixed: Subthreshold                   | Mixed: Adiabatic                           | Mixed: Adiabatic             |
| Technology             | 14 nm FinFET                                | 28 nm FDSOI                                 | 180 nm                                         | 22 nm FDSOI                           | 180 nm                                     | 180 nm                       |
| Supply Voltage         | 0.5–1.25 V                                  | 0.55–1 V                                    | 1.8 V                                          | 0.8 V                                 | 1.8 V                                      | 1.8 V                        |
| Neuron Model           | CUBA-LIF                                    | LIF & Izhikevich                            | LIF & exLIF                                    | AdExp-I&F                             | Perceptron                                 | LIF                          |
| Dynamics               | N/A                                         | Bio. to accel.                              | Biological                                     | Biological                            | -                                          | Biological                   |
| Synaptic Weight        | 9 bit                                       | 3+1 bit                                     | 4+2 bit                                        | N/A                                   | 8 bit                                      | 8 bit                        |
| ESOP - Energy per      | >23.6 pJ <sup>a</sup>                       | >12.7 pJ                                    | 150 pJ⁵                                        | <14 pJ°                               | 8.5 pJ <sup>d</sup>                        | >470 fJ                      |
| Synaptic Operation     | @ V <sub>DD</sub> = 0.75 V                  | @ V <sub>DD</sub> = 0.55 V                  | @ Fire rate = 80 Hz                            | @ Fire rate = 30 Hz                   | @ <i>f<sub>LC</sub></i> = 1 MHz            | $@f_{LC} = 500 \text{ kHz}$  |
|                        | (Experimental)                              | (Experimental)                              | (Experimental)                                 | (Simulation)                          | (Simulation)                               | (Simulation)                 |

Comparison of Energies per Synaptic Operation with previously published low-power neuromorphic circuits. The notes of the table report details about the definition and evaluation of the ESOP in the different references.

<sup>a</sup>Does not account for either synaptic or neuron update energy; extracted from pre-silicon SDF and SPICE simulations, in accordance with early post-silicon characterization.

<sup>b</sup>Energy per "Neuron Spike Operation", comprising one-to-many MAC-like operations depending on the integration period and on the assumed time-step fidelity.

<sup>c</sup>Energy reported versus the average neuron firing rate. At less biologically-plausible rates higher than 30 Hz, the ESOP has been shown to decrease (e.g., 850 fJ at 1 kHz).

<sup>d</sup>The ESOP for an 8-bit synaptic weight is reported only for a resonance frequency f<sub>LC</sub> = 1 MHz. Out of the total 8.5 pJ ESOP, 4 pJ are due to the adiabatic conduction losses and control logic of the switches, while the remaining 4.5 pJ are due to the neuron comparator implemented with RRAM memristors.

auxiliary logic (gray). The Dynamic ESOP mirrors the scaling of the conduction losses at relatively high frequencies, while it deviates at the lower frequencies as the consumption of the logic circuits becomes progressively dominant with respect to the adiabatic conduction losses.

In adiabatic systems operating at low frequencies, the static dissipation may become the dominant source of energy consumption. As such, the static power of each sub-circuit has been thoroughly evaluated for each corner, and it is reported in Supplementary Note 4 for the TM corner at 27 °C only.

Figure 3j reports the total ESOPs of all the design corners (colored solid lines), obtained by adding the dynamic energy of Fig. 3i to the static energy estimated for the full crossbar; the corners TM, WS, WP are defined and described in the "Simulation Framework" paragraph of the "Methods" section. The isolated dynamic (black dotted line) and static energy (black dashed line) are also shown in Fig. 3j, but only for the TM corner at 27 °C. The opposite frequency scaling of the static and dynamic energy leads to a Minimum Energy Point (MEP, colored stars) at a resonance frequency of about 500 kHz for all corners.

In particular, thanks to the high adiabatic efficiency reached by both the TM at 27 °C (green) and WP at 0 °C (yellow) corners, their MEPs result as low as 470 fJ and 490 fJ, respectively. On the other hand, for the WS corner at 100 °C (cyan) the MEP increases to 620 fJ, due to both higher conduction losses (owing to a higher resistance of the transmission gates) and higher static consumption (due to a degradation of the MOSFETs off-current at the high operating temperature).

As summarized in Fig. 3k, the nominal MEP yields a 9× energy saving compared to an identical circuit operated in a completely non-adiabatic mode. The detailed energy breakdown of the nominal adiabatic and non-adiabatic benchmarks can be found in Supplementary Note 5.

In Table 1, we compare the performance of our adiabatic LIF neurons with previously published neuromorphic circuits, including digital implementations<sup>30,31</sup>, mixed-signal subthreshold options<sup>8,9</sup>, and adiabatic circuits<sup>25</sup>. Mixed-signal systems tend to reach an energy efficiency higher than digital solutions, albeit at the cost of a lower configurability. Compared to the adiabatic perceptron in ref. 25, this work has both introduced novel functional aspects and achieved a lower overall ESOP. Indeed, our results suggest that the adiabatic operation is a promising option to further improve the energy-efficiency of neuromorphic circuits.

#### Discussion

#### Adiabatic resonance-driven charge recovery

The inductive driver (yellow in Fig. 2a) is tasked to transfer the energy back and forth between the power supply  $(V_{DD})$  and the target WL of the



**Fig. 4** | **Adiabatic driving and charge recovery. a** Asynchronous input spikes that trigger either a charging or a recovery phase. **b** adiabatic integration phase (INTG) followed by a HOLD phase that corrects the incomplete transition of the word-line before the comparator evaluates the membrane potential (EVAL). c Word-line voltage which is alternatively charged (red) and discharged (green) at any subsequent spike. The insets show the correction during the HOLD phase. **d** Sinusoidal current waveform that transfers a charge  $C_{WL}V_{DD}$  back and forth between the supply and the WL upon each spike. The zero-current switching points are marked with a black star.

crossbar. In particular, the driver consists of an inductor *L* and a large flying capacitor  $C_{FLY}$ , which is pre-charged at  $V_{DD}/2$  and should always preserve such a voltage. The operation of the driver is demonstrated in Fig. 4. The arrival of each input spike (Fig. 4a) starts the *adiabatic integration phase* (INTG, Fig. 4b), where the selected WL controller enables a transmission gate (TG-WL, see Fig. 2b) to connect the driver to the word-line. This triggers the resonance between the inductance *L* and the series of  $C_{FLY}$  and the WL capacitance  $C_{WL}$ . Such resonance is crucial to convert any abrupt voltage step across the inductor to an adiabatic-compatible voltage swing that drives the WL capacitance. In particular, if the WL voltage begins at GND/ $V_{DD}$ , the connection with the driver causes a positive/negative  $V_{DD}/2$  voltage drop across the inductor. In turn, this triggers the rising/falling

sinusoidal swing of the word-line voltage shown in Fig. 4c, which can be expressed as:

$$V_{WL} = \frac{V_{DD}}{2} \left[ 1 \mp \cos(2\pi f_{LC}(t - t_0)) \right]$$
(1)

with upper and lower sign applying respectively to falling and rising. The charging/discharging of the WL is driven by the positive/negative sinusoidal current reported in Fig. 4d, which can be written as:

$$I_L = \pm I_{L,pk} \sin\left[2\pi f_{LC}(t-t_0)\right] \tag{2}$$

where  $t_0$  is the spike arrival instant,  $f_{LC}$  the resonance frequency and  $I_{L,pk} = \pi C_{WL} V_{DD} f_{LC}$  is the peak amplitude of the current oscillation. Upper and lower signs in Eqs. (1) and (2) correspond to a  $V_{WL}$  waveform starting at GND or  $V_{DD}$ , respectively.

Without any external intervention, the resonance would continue to shuttle energy from the power supply to the WL during the upswing *(charging phase* in red), and recover that same energy back during the downswing *(recovery phase* in green). All the resistive elements on the path of the current would however damp such oscillation by dissipating energy via Joule heating.

In our architecture, however, the timer forces the WL controller to disable the TG-WL and so interrupt the INTG phase after a single halfperiod  $T_{LC}/2$ , namely when the word-line nominally completes the charge to  $V_{DD}$  (charging phase) or the discharge to *GND* (recovery phase). Such a timing ensures the zero-current switching of the TG-WL, which is a well-known requirement for achieving an adiabatic operation because it minimizes the losses when turning off the transistors<sup>32</sup>.

As it can be seen in Fig. 4, the input spikes trigger INTG phases that alternatively start with either a completely charged or discharged word-line. This behavior ensures the seamless alternation of charging and recovery phases despite the identical control signals. For clarity, we will denote the spikes that trigger a charging phase as *charging spikes*, and those triggering a recovery phase as *recovery spikes*.

After any INTG, the word-line is kept in HOLD (yellow of Fig. 4b) until the next INTG. While in HOLD, the tristate of the WL controller takes control of the word-line by enforcing and then maintaining either  $V_{DD}$  (after a charging phase) or GND (after a recovery phase). As shown in Fig. 2b, such tristate consists of a Toggle Flip-Flop (T-FF), which inverts its output upon each input spike thus mirroring the transitions of the word-line, and then a transmission gate (TG-TRS) that disconnects T-FF during the adiabatic integration. Before the evaluation by the comparator (EVAL, gray in Fig. 4b), the HOLD phase corrects any incomplete charging or discharging of the word-line, as shown in the insets of Fig. 4c. In doing so, both the T-FF and the TG-TRS produce non-adiabatic conduction losses which, as such, depend only on the amplitude of the correction required by the WL voltage but not on the sizing of the transistors.

#### Capacitive synapses and soma

Our architecture employs differential capacitive synapses realized with the binary-encoded capacitor bank sketched in Fig. 2e, which gets split by a digital encoding of the *Synaptic Weight* (SW) in two differential synaptic capacitors,  $C_{syn}^+$  and  $C_{syn}^-$ . In particular, depending on the *i*-th bit of the synaptic weight (SW<sub>i</sub>), a row of transmission gates (TG-SYN) connect the *i*-th capacitor ( $C_{syn,i}$ ) either to the left branch to make up the positive synaptic capacitor (SW<sub>i</sub> = 1), or to the right to be part of the negative one (SW<sub>i</sub> = 0).

Such differential structure serves two interlinked purposes. First, it represents the synaptic weight in the difference between the two synaptic capacitors (i.e.,  $SW \propto (C_{syn}^+ - C_{syn}^-))$ , which can be either positive for an excitatory synapse  $(C_{syn}^+ > C_{syn}^-)$ , negative for an inhibitory synapse  $(C_{syn}^+ < C_{syn}^-)$ , or null  $(C_{syn}^+ = C_{syn}^-)$ . Second, it results in a total synaptic capacitance  $C_{syn} = (C_{syn}^+ + C_{syn}^-)$  independent of the synaptic weight, thus ensuring a good tracking of the resonance irrespectively of the synaptic

weight and without resorting to a large equalizing capacitor<sup>25</sup>. In the table of Fig. 2f are reported the attainable values of the synaptic weight, together with their digital encoding and their corresponding  $C_{syn}^+$  and  $C_{syn}^-$  capacitances expressed in terms of number of Least Significant Bits (LSB). More details on the variability of the capacitor bank and on the linearity of the resulting transfer function can be found in Supplementary Note 1.

Below the synapses, there is another row of transmission gates (TG-SOMA, see Fig. 2a, e) that connect each synaptic capacitor to one of the two identical capacitors  $C_{soma}$  that form the differential soma that acts as the bitline shared by all WLs. The differential membrane potential  $\Delta V_m$  between the  $C_{soma}$  capacitors represents the neuron membrane potential that is finally monitored by the neuron comparator, which is described in Supplementary Note 2.

As it will be detailed in the following sections, each synapse is controlled by a synapse controller (SYN-CTRL, see Fig. 2d), which dynamically manipulates all the TG-SYNs and TG-SOMAs depending on the state of the neuron and word-line, respectively. In particular, the SYN-CTRL can dynamically set the SW to zero to ignore the incoming spikes (e.g., during the refractory period), or restore it to its user-defined value. As for the TG-SOMAs, they select which synapse  $C_{syn}^{\pm}$  is connected to each node  $V_m^{\pm}$ . Moreover, the TG-SOMAs can also be disabled to put the word-line in IDLE and enable the multiplexing of the external inductor among multiple WLs (Fig. 2b).

#### Accumulation of charging and recovery spikes

In previous literature, the resonant adiabatic driving was successfully employed to compute Multiply-and-Accumulate (MAC) operations<sup>25,33</sup>. As sketched in Fig. 5a, in those designs the charge is adiabatically provided during an initial charging phase as the input to each branch of the WL ( $x_i$ ). Such inputs are then weighted by capacitive synapses ( $w_i$ ) and accumulated into the terminal somas shared by multiple WLs, whose voltage is finally interpreted as the result of the MAC operation ( $\sum_i w_i x_i$ ). Immediately afterward, the charge is recovered by adiabatically discharging the synapsesoma capacitive dividers. As a consequence, each recovery phase cancels out the effect of the previous charging phase, thus making each MAC operation independent of the previous ones.

However, in order to implement the functionalities required by a spiking neural network, the results of many MAC operations need to be accumulated into the soma along successive computation cycles. To this purpose, in our architecture, the charging and the recovery phases have a cumulative effect on the membrane potential of the neurons, as sketched in Fig. 5b, which is crucial to compute a valuable MAC operation during both adiabatic phases. In particular, the recovery phase accomplishes both a computational task by modifying the membrane potential, and an electrical one by enabling the adiabatic recovery of the charge transferred and then stored in the WL by the previous charging phase.

The accumulation of the input spikes is achieved thanks to a proper reconfiguration of the differential structure of both the synapses and neurons. The  $C_{syn}^+$ ,  $C_{syn}^-$  can be connected either at  $V_m^+$  or  $V_m^-$  to form two capacitive dividers that govern the charge induced to the corresponding  $C_{soma}$ . In particular, the larger synaptic capacitor will induce a larger charge variation  $\delta Q_{soma}$  in the connected  $C_{soma}$  as dictated by:

$$\delta Q_{soma} = \Delta V_{WL} \left[ \frac{C_{soma} C_{syn}^{\pm}}{C_{soma} + C_{syn}^{\pm}} \right]$$
(3)

where the upper and lower signs correspond to the  $C_{soma}$  connected to respectively  $C_{syn}^+$  and  $C_{syn}^-$ , while  $\Delta V_{WL}$  is the voltage swing at the word-line induced by the input spike, namely  $\Delta V_{WL} = +V_{DD}$  for a charging spike and  $\Delta V_{WL} = -V_{DD}$  for a recovery spike (see Fig. 4c).

The resulting charges in the two  $C_{soma}$  capacitors give rise to two singleended potentials  $V_m^{\pm}$  and thus to a differential voltage  $\Delta V_m = (V_m^+ - V_m^-)$  that represents the neuron membrane potential.



**Fig. 5** | **Accumulation of charging and recovery spikes. a** Sketch of an adiabatic multiply-and-accumulation operation (MAC). The charge is first supplied adiabatically (red), then adiabatically recovered (green) after sampling the outcome of the MAC (blue), thus resetting the state of the integrating capacitor. b Sketch of an adiabatic neuromorphic operation, whereby an alternation of charge (red) and recovery (green) spikes are accumulated into the membrane potential of the soma. Synapse-soma capacitive dividers controlled by the signal *D* during the charging

In order to accumulate the effects produced by many  $\delta Q_{soma}$  on the differential membrane potential, the SYN-CTRL inverts the capacitive dividers by properly controlling the TG-SOMAs through the signal *D*. More specifically, the following configurations are possible: (1) *direct divider* used during charging spikes ( $D = 1, C_{syn}^+$  connected to  $V_m^+$  and  $C_{syn}^-$  connected to  $V_m^-$ ); (2) *inverted divider* used during recovery spikes ( $D = 0, C_{syn}^+$  connected to  $V_m^-$ ); (3) *IDLE mode*, where *D* is bypassed to disable all TG-SOMAs and thus disconnect the WL. This latter configuration makes it possible to multiplex the inductor among multiple WLs.

Figure 5c,d illustrate an example for a positive synaptic weight (obtained by setting  $C_{syn}^+ > C_{syn}^-$ ), where the direct divider makes each charging spike charge the positive soma more than the negative one, thus increasing their differential membrane potential. Conversely, the inverted divider makes each recovery spike discharge the negative soma more than the positive one, which increases again the differential membrane potential, thus accumulating its contribution to the charging spikes. Figure 5e, f illustrate such behavior also for a negative synaptic weight (namely for  $C_{syn}^+ < C_{syn}^-$ ).

By doing so, every spike yields a change  $\delta V_m$  of the differential membrane potential equal to:

$$\delta V_m = V_{DD} \left[ \frac{C_{syn}^+}{C_{syn}^+ + C_{soma}} - \frac{C_{syn}^-}{C_{syn}^- + C_{soma}} \right]$$
(4)

If we let  $sw = SW/2^{Nbit}$  be the signed synaptic weight SW normalized between -1 and +1, and recall the  $C_{syn}$  definition  $C_{syn} = (C_{syn}^+ + C_{syn}^-)$ , the synaptic capacitances can be expressed in terms of sw as  $C_{syn}^{\pm} = C_{syn}(1 \pm sw)/2$  and substituted in Eq. (4) to obtain:

$$\delta V_m = V_{DD} \left[ \frac{c(1+sw)}{1+c(1+sw)} - \frac{c(1-sw)}{1+c(1-sw)} \right]$$
(5)



(red) and recovery phase (green) for either a positive synaptic weight  $(C^+_{syn} > C^-_{syn})$ , **c**, **d** in blue) or a negative synaptic weight  $(C^+_{syn} < C^-_{syn})$ , **e**, **f** in orange). In (**c**) and (**e**), the larger synapse results in a greater charging of the  $C_{soma}$  connected respectively at  $V^+_m$  and  $V^-_m$ , thus increasing and reducing  $\Delta V_m$ , respectively. Conversely, in (**d**) and (**f**), the larger synapse results in a greater discharging of the  $C_{soma}$  connected respectively at  $V^-_m$  and  $V^+_m$ , again increasing and reducing  $\Delta V_m$ .

where  $c = C_{syn}/(2C_{soma})$ . We reiterate that, for a given synaptic weight,  $\delta V_m$  is the same for both charging and recovery spikes.

Finally, by neglecting the neuron leakage that will be discussed in the following sections, we can estimate as  $N_{SPK} = V_{th}/\delta V_m$  the number of successive excitatory spikes necessary for a neuron to cross the threshold  $V_{th}$  of the comparator starting from the resting state at  $\Delta V_m = 0$ .

#### Spiking clock as adiabatic time reference

In biology, it is observed that the membrane potential of the neurons decays exponentially towards its resting potential due to the leakage of ions through the neuron membrane<sup>34</sup>. In complex SNNs, the inclusion of this leakage mechanism into a *Leaky Integrate-and-Fire* model (LIF, see Fig. 6a) enables the neurons to process temporally structured stimuli that carry relevant information also through the arrival times of the spikes<sup>35</sup>. This is the case for many edge applications such as real-time keyword spotting, health monitoring and gesture recognition<sup>36–38</sup>. Indeed, for this kind of tasks, it has been extensively demonstrated that the LIF model performs remarkably better than the simpler *Integrate-and-Fire* (IF) model, particularly in terms of an increased noise-robustness, better generalization capabilities and higher network accuracy<sup>39,40</sup>.

In neuromorphic circuits, the leakage of the membrane potential is typically realized with a resistance *R* that discharges the soma *C* with an exponential time decay and a time constant  $\tau = RC$ , as illustrated in Fig. 6a. In bio-inspired architectures that target both low-energy and biologically plausible time-scales, the required high effective resistance is typically realized with FETs working in the sub-threshold region<sup>9,41</sup>. Such implementations, however, are inherently dissipative and not compatible with our adiabatic approach.

For this reason, in our architecture, we devised an adiabatic implementation of the leakage, which makes use of the inductor to gradually decrease the membrane potential via adiabatic steps.

As it is depicted in Fig. 6b, a time reference is obtained by propagating through the network periodic *clock spikes* (CLK) with a fixed spiking rate



Fig. 6 | Spiking clock as time-base for adiabatic neuron leakage and refractory period. a Leaky-Integrate-and-Fire (LIF) neuron model. b Distribution of the clock spikes (CLK) to the neurons. The Clock Forwarders (CLK-FWD) of a dedicated Clock Word-line (CLK-WL) are electrically analogous to inhibitory synapses. c Network-level interpretation of the spiking clock. d, e Time decay of the membrane potential for  $T_{CLK} = 100 \,\mu$ s, whose linearity is set by the decay linearity factor (DL) of the CLK-FWD: d linear decays obtained with increasingly negative DLs and exploited for the neuron refractory period. e exponential decay achieved with null DL and used to implement the neuron leakage; f, g Inversion of the capacitive

 $f_{CLK}$  in addition to the asynchronous spikes fired by the neurons (SPK). The time resolution is set by the spiking period  $T_{CLK} = f_{CLK}^{-1}$  of the CLKs. Indeed, the  $f_{CLK}$  should be adequately faster than the average  $f_{SPK}$  to approximate the desired time decay of the membrane potential by a large number of small steps given by Eq. (5).

The CLK is distributed to the neurons via an additional *clock word-line* (CLK-WL), which inhibits the membrane potential following a decay profile that depends on the negative *Decay Linearity* factor (DL) of the *clock forwarders* (CLK-FWD). Actually, the CLK-FWDs and their decay linearity factors are electrically and functionally analogous to inhibitory synapses with negative synaptic weights, as they both drive the membrane potential in discrete steps given by Eq. (5).

As such, our leakage implementation can be interpreted at the network level as illustrated by Fig. 6c, namely by a clocked neuron that fires inhibitory spikes with a fixed period  $T_{CLK}$  to all the other neurons of the network.

## Adiabatic neuron leakage

In order to implement a biologically plausible neuron leakage, it is necessary for the membrane potential to follow an exponential decay, whereby each discrete step gradually decreases in magnitude until the neuron returns to its resting state. Indeed, a negative decay linearity factor cannot achieve such an exponential decay, because it yields a linear dynamics with fixed voltage steps given by Eq. (5), namely a linear time dependence with a fixed rate  $d(\Delta V_m)/dt$  proportional to the DL, as depicted in Fig. 6d. Instead, the exponential decay in Fig. 6e can be achieved by exploiting a specific feature of the capacitive network in Fig. 6f, g occurring when DL is set to zero.

In fact, during each integration phase (INTG, see Fig. 4), the change induced to the membrane potential has two different origins:

dividers and corresponding charge sharing in the capacitors network (orange). **h** Resistance ( $R_{eq}$ , left *y*-axis) and exponential time constant ( $\tau_{eq}$ , right *y*-axis) of the equivalent RC circuit in the inset, as a function of the clock period. The simulations of the exponential decay at the Typical Mean at 27 °C (TM, orange circles) and Worst Speed at 100 °C (WS, yellow crosses) corners agree with the model of Eq. (6) (dashed), and demonstrate the insensitivity to the operating temperature. Biologically plausible time constants in the order of the ms can be achieved with  $T_{CLK}$  in the range of hundreds of µs.

- First, at the very start of the INTG, the inversion of the dividers from the configuration in Fig. 6f to that in Fig. 6g triggers a non-adiabatic and nearly instantaneous redistribution of charge between the C<sup>±</sup><sub>syn</sub> and C<sub>soma</sub> capacitors that slightly reduces the membrane potential towards zero proportionally to its starting value during Fig. 6f.
- Second, during the adiabatic integration, the membrane potential varies according to Eq. (5), i.e., proportionally to either the synaptic weight (for the neuron spikes) or the decay linearity factor (for the CLK spikes).

In Supplementary Note 3, we show that the tiny change of  $\Delta V_m$  induced by a single event (1) can be approximated as  $-\Delta V_m \cdot C_{syn}/C_{soma}$ , which is negligible compared to the  $\Delta V_m$  produced by a non-null SW or DL value during the phase (2). Still, the charge redistribution occurring in phase (1) can be exploited to achieve an exponential decay of the membrane potential by concatenating many clock spikes with DL = 0, as demonstrated in Fig. 6e. In particular, such a charge redistribution can be modeled as an equivalent resistance  $R_{eq}$  resulting in a time constant  $T_{CLK}$  given respectively by:

$$R_{eq} = 2 \cdot \frac{T_{CLK}}{C_{syn}} \Rightarrow \tau_{eq} = T_{CLK} \cdot \frac{C_{soma}}{C_{syn}}$$
(6)

as it is sketched in the inset of Fig. 6h.

This exponential decay can be easily scaled in time by the clock period  $T_{CLK}$  as confirmed by the simulations in Fig. 6h which are in excellent agreement with Eq. (6). Moreover, the time constant of the decay depends on the ratio between the synaptic and soma capacitors, which guarantees a stable matching independent of the working temperature, as opposed to other leakage implementations relying on discrete resistive elements<sup>9,41</sup>.

Nonetheless, if we relax the requirement for a biologically plausible exponential leakage, the linear decay can be more easily adjusted by tuning both the  $T_{CLK}$  and decay linearity. In principle, the DL can also be trained alongside the synaptic weights to optimize the leakage behavior of each neuron, which is seen to improve the performance of the network<sup>42,43</sup>.

The leakage of each neuron is controlled by the *LEAK* signal, which is activated by the comparator whenever  $\Delta V_m > 0$ , and resets back to zero when the neuron returns to its resting state. In this latter case, the controller of the clock forwarder re-configures its DL to zero so that the CLK cannot reduce the membrane potential any further, all the while conveniently correcting any undershoot below  $\Delta V_m = 0$  thanks to the aforementioned charge redistribution (Eq. (6)). The SYN-CTRL enforces a similar control to mask also any inhibitory SPK when the neuron is in its resting state. It's important to stress that the incoming spikes cannot be masked simply by turning off the TG-SOMAs, because that would change the overall capacitance of the WL and the actual resonance frequency, thus compromising the zero-current switching of the driver.

#### Adiabatic refractory period

Another fundamental behavior observed in biological neurons is the presence of a refractory period that follows the firing of an output spike. During this time, the neuron is unresponsive to any additional stimuli, and so is unable to fire new output spikes. At the network level, this ultimately limits the maximum spiking activity in the network<sup>44</sup>.

In our architecture, even the refractory period is implemented adiabatically by making use of the time-base provided by the clocking spikes to keep track of the elapsed time directly in the membrane potential of the neurons. The comparator of each neuron starts its own refractory phase by raising the REFR signal when its membrane potential crosses the spiking threshold, which marks the firing of an output spike. While REFR=1, the SYN-CTRLs reconfigure all the synapses belonging to the neuron to a null synaptic weight, so as to mask any incoming input spike until the refractory phase has ended. At the same time, the decay linearity may also be changed to a different negative value to make the duration of the refractory period uncorrelated from the neuron leakage. The periodic arrival of each CLK discharges the membrane potential towards zero in discrete steps that are proportional to the programmed DL. Finally, the refractory phase ends when the membrane potential returns to the zero-voltage resting state.

In our architecture, the refractory period is implemented by the linear decay of Fig. 6d, which allows us to achieve biologically plausible refractory periods in the sub-ms to ms range<sup>45–47</sup>, while still maintaining the time constant for the membrane potential leakage longer than the ms<sup>48</sup>. The duration of the refractory period can be estimated as  $T_{REFR} = T_{CLK}V_{th}/\delta V_{m}$ , and so it depends on both  $T_{CLK}$  and the refractory linearity factor through the  $\delta V_m$  given by Eq. (5). This flexibility could be exploited to tailor the refractory period of each neuron on the specific application, so as to enhance the encoding capabilities of the network<sup>49</sup>.

#### **Design scaling and perspectives**

In this work, we have reported how the seamless alternation between the charging and recovery phases of the resonance-powered adiabatic driving can be exploited to implement a Leaky Integrate-and-Fire artificial neuron. In particular, we have demonstrated the main primitives required for a valuable neuromorphic computation, namely the accumulation and weighting of the incoming input spikes, an exponential leakage of the membrane potential and a tunable refractory period. These functionalities have been implemented with an adiabatic efficiency higher than 90% over a wide range of resonance frequencies, and even surpassing 99% for the lowest frequencies. The frequency scaling of the dynamic and static energy consumption has unveiled a minimum energy per synaptic operation of about 470 fJ at a resonance frequency around 500 kHz, yielding a 9× energy saving with respect to the non-adiabatic operation.

As of now, the admittedly quite large size of the capacitors employed in our design hinders a large-scale integration of the synaptic crossbar. However, more advanced technologies can offer a higher capacitance density per unit area even in BEOL capacitors and also reduce the onresistance per unit width of the transistors, thus improving both the conduction and driving losses and, in turn, the overall energy efficiency. On the other hand, technology scaling may also bring along an increase of the static power, which is the main limitation to the reduction of the ESOP at low operating frequency in any adiabatic system.

Moreover, CMOS compatible memcapacitors, capable of a nonvolatile storage of the capacitive synaptic weights, represent emerging devices that are inherently synergetic with the adiabatic operation, thus opening novel perspectives for charge-based, adiabatic neuromorphic computing.

The best validation of the circuitry solutions proposed in this work is a tape-out in the commercial 180 nm technology employed throughout our design, which is in fact our top-of-the-list item for the future work.

### Methods Simulation framework

All the results presented in this work have been obtained from transistorlevel *Spectre* simulations carried out with CADENCE<sup>®</sup> Virtuoso<sup>®</sup> Studio version IC23.1, loaded with the commercially available XP018-180 nm XFAB CMOS technology with supply voltage  $V_{DD}$  = 1.8 V. The simulation outputs have been post-processed through the Virtuoso ADE Explorer and Virtuoso ADE Assembler environments.

The results presented in this work have been evaluated for various operating temperatures and for the design corners provided by the technology manufacturer. In particular, the explored corners are defined as follows: Typical Mean (TM) refers to the nominal characteristics of both transistors and capacitors; Worst Power (WP) corresponds to transistors more conductive than the TM case, coupled with a lower BEOL capacitance per unit area compared to the TM case; Worst Speed (WS) refers to lower conductive transistors, coupled with a higher BEOL capacitance per unit area.

#### **Circuit design**

In this work, we implemented the synapse with a capacitor bank in the BEOL in order to ensure a very good linearity of the synaptic weight. In particular, the synapses have been designed for an  $N_{bit} = 8$  resolution of the synaptic weight with an elementary, LSB capacitance of  $(10.0 \pm 0.4)$  fF. This results in a full-scale synaptic capacitance  $(C_{syn})$  equal to  $2^{Nbit} = 256$  times said LSB, namely about  $(2.56 \pm 0.10)$  pF.

On the other hand, each soma capacitor ( $C_{soma}$ ) has an effective capacitance of around (51.0 ± 1.6) pF and has been implemented in the active silicon area using grounded minimum-sized n-MOSFETs each contributing with (342 ± 11) aF, and so leverage their higher capacitance per unit area compared to the BEOL capacitors. The variability of these capacitors was investigated by means of Monte Carlo simulations and the resulting statistical distributions can be found in Supplementary Note 1.

Indeed, despite the differential structure of the synapses, the total wordline capacitance ( $C_{WL}$ ) inevitably depends slightly on the synaptic weights due to the synapse-soma dividers. In particular, during the benchmark of Fig. 3, we estimated the total  $C_{WL}$  as about 630 pF. Moreover, based on the technology specifications for the interconnections, the parasitics in the capacitor bank are expected to be tiny compared to the word-line capacitance and can thus be safely neglected.

The flying capacitance of the driver  $(C_{FLY})$  can be implemented offchip and, as such, it was designed much larger than  $C_{WL}$  and set to  $100 \,\mu$ F. Such a large  $C_{FLY}$  helps maintain stable its pre-charged  $V_{DD}/2$  voltage during the adiabatic resonance.

Different resonance frequencies  $f_{LC}$  have been obtained by changing the inductance of the driver as  $L = (2\pi f_{LC})^{-2}/C_{WL}$ , where the contribution of  $C_{FLY}$  to the resonance condition has been safely neglected due to the condition  $C_{FLY} \gg C_{WL}$ . At the MEP condition illustrated in Fig. 3j, the inductance turns out to be about 100 µH, which is compatible with an offchip implementation and in line with previous papers<sup>25</sup>. The volume occupied by the inductor is roughly proportional to its maximum stored energy<sup>50</sup>, so namely proportional to  $L \cdot I_{L,pk}^2$ , where  $I_{L,pk} = \pi C_{WL} V_{DD} f_{LC}$  is the peak amplitude of the current oscillation. Hence, due to the opposite  $f_{LC}$ dependence of L and  $I_{L,pk}^2$ , we expect that the inductor volume should be roughly independent of the resonance frequency.

## Optimization of the transmission gates

When a capacitor *C* is charged or discharged by an abrupt voltage step, the conduction losses are always equal to  $\frac{1}{2}CV_{DD}^2$  and so do not depend on the effective resistance *R* of the charging circuit. On the other hand, when such a capacitor is driven adiabatically, the conduction losses become proportional to  $\tau = RC$  (refer to Fig. 1c).

In our architecture, R is the resistance of the transmission gates (TGs) present on the path of the resonant current. Hence, R can be lowered by sizing the TGs with a minimum channel length and a large channel width W. However, by enlarging W we also increase the gate input capacitance of the TGs and, in turn, the energy required to drive them. As a consequence, the total ESOP dissipated by a transmission gate involves a trade-off between the adiabatic conduction dissipation and the non-adiabatic driving losses.

We minimized the overall energy with respect to the *W* of each n-MOS and p-MOS of the TGs, resulting in a minimum dissipation per transistor given by:

$$ESOP_{tot}^{opt} = V_{DD}I_{pk}\sqrt{\frac{R_{DS}C_G}{f_{LC}}}$$
(7)

where  $R_{DS}$  and  $C_G$  are respectively the triode resistance and gate capacitance per unit width of a minimum length MOSFET, while  $I_{pk}$  is the peak amplitude of the current flowing through said transistor during the resonance.

This minimum energy is achieved for an optimum width given by:

$$W_{opt} = \frac{1}{2} \frac{I_{pk}}{V_{DD}} \sqrt{\frac{R_{DS}}{C_G} \frac{1}{f_{LC}}}$$
(8)

This optimization has been computed for every TG and at each explored resonance frequency, resulting in adiabatic conduction losses scaling with the square root of  $f_{LC}$ , as also confirmed by our simulations (see Fig. 3h).

More details and numerical results about the optimization of the TGs can be found in Supplementary Note 4.

#### Evaluation of the energy performance

This work aimed to explore the synergy between the brain-inspired neuromorphic computation and the adiabatic operation to minimize the energy per operation of the system. As such, a thorough evaluation of the energy performance is of utmost importance to assess quantitatively the effectiveness of our adiabatic design.

The energy figures of Fig. 3 have been extracted from a transient simulation involving 100 spikes forwarded to 256 neurons either through a single WL of synapses, for the asynchronous neuron spikes, or through a clock WL, for the clocking spikes.

The 256 neurons have been grouped into four different classes based on the synaptic weight of their input synapse, where the neurons belonging to the same class share the same value of the synaptic weight (SW) and so exhibit the same profile of their membrane potential. In particular, such four weights have been chosen as +32 (+0.125), +64 (+0.25), +128 (+0.5), and +256(+1.0) so as to emulate the different spiking behaviors shown in Fig. 3c–f.

The evaluation of the energy performance is based on three different energy components:

• energy supplied by a tree of ideal voltage sources where each branch (B) provides  $V_{DD}$  to a single sub-circuit, calculated as  $E_{supply}(t) = \sum_{j=1}^{N_B} \int_t V_{DD} I_j(t) dt$ , where  $I_j(t)$  is the current supplied to the *j*-th branch and  $N_B$  is the number of branches;

- conduction losses on all the transmission gates (TG) computed by  $E_{cond}(t) = \sum_{k=1}^{N_{TG}} \int_t \Delta V_k(t) I_k(t) dt$ , where  $I_k(t)$  and  $\Delta V_k(t)$  are respectively the current and the voltage drop across the *k*-th TG and  $N_{TG}$  is the number of TGs;
- *energy stored* in all the capacitors estimated as  $E_{cap} = \sum_{l=1}^{N_c} \frac{1}{2} C_l \Delta V_l(t)^2$ , where  $\Delta V_l(t)$  is the voltage drop across the *l*-th capacitor  $C_l$  and  $N_c$  is the number of capacitors.

From these three contributions, we can compute the instantaneous *dissipated energy* as:

$$E_{diss}(t) = E_{supply} - E_{cap} = E_{cond} + E_{logic}$$
(9)

where we recall that  $E_{cap}$  can be to a large extent recovered thanks to the adiabatic operation. We note that Eq. (9) can also be used to determine the energy consumption due to the logic ( $E_{logic}$ ).

All the aforementioned energy contributions are simulated over time and, from their final value at  $t = t_{end}$ , the *Energy per Synaptic Operation* for either the overall dissipation  $E_{diss}$ , conduction losses  $E_{cond}$  and logic consumption  $E_{logic}$  can be obtained as:

$$ESOP = \frac{E_{diss}, E_{cond} \text{ or } E_{logic} @ t_{end}}{N_{neur} \cdot (N_{SPK} + N_{CLK})}$$
(10)

where  $(N_{SPK} + N_{CLK})$  is the total number of neuron (SPK) and clock spikes (CLK), while  $N_{neur}$  denotes the number of neurons, also corresponding to the number of synapses served by each WL.

The ESOP of the logic includes both the dynamic and the static components. The static energy absorbed by each sub-circuit was also isolated in the overall transient  $E_{diss}(t)$ . Hence, by subtracting such a static component from the ESOP, we could in turn identify the dynamic component reported in Fig. 3i. The breakdown of the total ESOP into the different dynamic and static contributions is discussed in Supplementary Note 5.

The extraction of the static power of each sub-circuit also allowed us to accurately estimate the total static power absorbed by the complete crossbar (see Fig. 3j), which comprises 256 synaptic WLs and one clock WL, all connected to 256 neurons.

Finally, the benchmark on the left side of Fig. 3 has been repeated in a wide range of resonance frequencies, by re-resizing the transmission gates at each frequency according to the optimized channel width in Eq. (8).

#### Adiabatic and non-adiabatic operating modes

Our architecture can be operated either in an adiabatic or in a non-adiabatic mode.

When operated in adiabatic mode, the energy is provided adiabatically during the integration phase (INTG, see Fig. 4) by the resonance between the capacitive WL and inductive driver. Then, we recall that every INTG phase is followed by a HOLD phase (yellow in Fig. 4b), that nonadiabatically supplies energy to the word-line to correct any incomplete charging or discharging of its voltage.

Hence, the non-adiabatic operating mode can be achieved by skipping the adiabatic INTG altogether or, equivalently, by using only the HOLD phase to supply the charge non-adiabatically to all synapses and neurons.

Therefore, we evaluated the efficiency of the adiabatic driving shown in Fig. 3h and the energy saving reported in Fig. 3k by simulating the benchmark on the left side of Fig. 3 in both adiabatic and non-adiabatic operation mode.

Our analysis confirmed that, when the system is operated in nonadiabatic mode, the energy consumption is dominated by the conduction losses in the transmission gates, which amount to about 4 pJ per synaptic operation (see  $ESOP_{cond}$  in Fig. 3h), independently of the resonance frequency and of the sizing of the TGs. The detailed breakdown of the energy consumption in non-adiabatic mode is shown in Supplementary Note 5. Finally, we also exploited the non-adiabatic losses to accurately estimate the

## Data availability

The authors declare that the data supporting this work are available within the main paper and its supplementary information. The data are also available from the corresponding author upon reasonable request.

Received: 30 April 2024; Accepted: 26 September 2024; Published online: 04 December 2024

## References

- Marković, D., Mizrahi, A., Querlioz, D. & Grollier, J. Physics for neuromorphic computing. *Nat. Rev. Phys.* 2, 499–510 (2020).
- Moro, F. et al. Neuromorphic object localization using resistive memories and ultrasonic transducers. *Nat. Commun.* 13, 1–13 (2022).
- Qiao, N. et al. A reconfigurable on-line learning spiking neuromorphic processor comprising 256 neurons and 128K synapses. *Front. Neurosci.* 9, 141 (2015).
- Rolls, E. T. The connections of neocortical pyramidal cells can implement the learning of new categories, attractor memory, and topdown recall and attention. *Brain Struct. Funct.* 226, 2523–2536 (2021).
- Purves, D. et al. Neuroscience 3rd edition. https://www.ncbi.nlm.nih. gov/books/NBK10865/ (2004).
- Chicca, E., Stefanini, F., Bartolozzi, C. & Indiveri, G. Neuromorphic electronic circuits for building autonomous cognitive systems. *Proceedings of the IEEE* **102**, 1367–1388 (2014).
- Yu, T., Joshi, S., Rangan, V. & Cauwenberghs, G. Subthreshold MOS dynamic translinear neural and synaptic conductance. In: *Proc. 5th International IEEE/EMBS Conference on Neural Engineering NER* 2011 68–71 (2011).
- Richter, O. et al. Dynap-se2: a scalable multi-core dynamic neuromorphic asynchronous spiking neural network processor. *Neuromorphic Comput. Eng.* 4, 014003 (2024).
- Rubino, A., Livanelioglu, C., Qiao, N., Payvand, M. & Indiveri, G. Ultralow-power fdsoi neural circuits for extreme-edge neuromorphic intelligence. *IEEE Trans. Circuits Syst. I: Regul. Pap.* 68, 45–56 (2021).
- Noack, M., Krause, M., Mayr, C., Partzsch, J. & Schuffny, R. VLSI implementation of a conductance-based multi-synapse using switched-capacitor circuits. In: *Proc. IEEE International Symposium* on *Circuits Systems* 850–853 (2014).
- 11. Vianello, E. et al. Multiple binary OxRAMs as synapses for convolutional neural networks. *Cogn. Syst. Monogr.* **31**, 109–127 (2017).
- 12. Fuller, E. J. et al. Li-lon synaptic transistor for low power analog computing. *Adv. Mater.* **29**, 1604310 (2017).
- Covi, E. et al. Analog memristive synapse in spiking networks implementing unsupervised learning. *Front. Neurosci.* 10, 482 (2016).
- Papandroulidakis, G., Serb, A., Khiat, A., Merrett, G. V. & Prodromakis, T. Practical implementation of memristor-based threshold logic gates. *IEEE Trans. Circuits Syst. I Regul. Pap.* 66, 3041–3051 (2019).
- Saxena, V. High LRS-resistance CMOS memristive synapses for energy-efficient neuromorphic SoCs. *Midwest Symp. Circuits Syst.* August, 1143–1146 (2019).
- 16. Hur, J. et al. Nonvolatile capacitive crossbar array for in-memory computing. *Adv. Intell. Syst.* **4**, 2100258 (2022).
- Pershin, Y. V. et al. Device-algorithm co-optimization for an on-chip trainable capacitor-based synaptic device with IGZO TFT and Retention-Centric Tiki-Taka Algorithm. *Adv. Sci.* **10**, 2303018 (2023).
- Yu, S., Luo, Y.-C., Kim, T.-H. & Phadke, O. Nonvolatile capacitive synapse: device candidates for charge domain compute-in-memory. *IEEE Electron. Devices Mag.* 1, 23–32 (2023).
- Ventra, M. D., Pershin, Y. V. & Chua, L. O. Circuit ElementsWithMemory: memristors, memcapacitors, and meminductors. *Proc. IEEE* 97, 1717 (2009).

- 20. Pershin, Y. V. & Di Ventra, M. Memcapacitive neural networks. *Electron. Lett.* **50**, 141–143 (2014).
- Kwon, D. & -Y. Chun, I. Capacitive neural network using charge-stored memory cells for pattern recognition applications. *IEEE Electron. Device Lett.* 41, 493–496 (2020).
- Demasius, K. U., Kirschen, A. & Parkin, S. Energy-efficient memcapacitor devices for neuromorphic computing. *Nat. Electron.* 4, 748–756 (2021).
- 23. Wang, Z. et al. Capacitive neural network with neuro-transistors. *Nat. Commun.* **9**, 1–10 (2018).
- Ge, N., Strachan, J. P., Yang, J. & Hu, M. Memcapacitive cross-bar array for determining a dot product. https://patents.google.com/ patent/US10249356B2/en (2019).
- Maheshwari, S., Serb, A., Papavassiliou, C. & Prodromakis, T. An adiabatic capacitive artificial neuron with RRAM-based threshold detection for energy-efficient neuromorphic computing. *IEEE Trans. Circuits Syst. I Regul. Pap.* 69, 3512–3525 (2022).
- Frank, M. P., W, B. R., Tierney, B. D. & A, M. N. Reversible computing with fast, fully static, fully adiabatic CMOS. In: *Proc. International Conference on Rebooting Computing (ICRC)* (2020).
- Massarotto, M., Saggini, S., Loghi, M. & Esseni, D. Adiabatic spiking neurons and synapses for ultra-low energy neuromorphic computing. In (ed.) Proc. 30th IEEE International Conference on Electronics, Circuits and Systems (ICECS), 1–4 (2023).
- Payvand, M., Nair, M. V., Müller, L. K. & Indiveri, G. A neuromorphic systems approach to in-memory computing with non-ideal memristive devices: from mitigation to exploitation. *Faraday Discuss.* 213, 487–510 (2019).
- 29. Frank, P. Reversible computing and truly adiabatic circuits: the next great challenge for digital engineering. In (ed.) Proc. *Fifth IEEE Dallas Circuits and Systems Workshop on Design, Applications, Integration and Software (DCAS-06)*, 31–38 (2006).
- Davies, M. et al. Loihi: a neuromorphic manycore processor with onchip learning. *IEEE Micro* 38, 82–99 (2018).
- Frenkel, C., Lefebvre, M., Legat, J.-D. & Bol, D. A 0.086-mm<sup>2</sup> 12.7-pj/ sop 64k-synapse 256-neuron online-learning digital spiking neuromorphic processor in 28-nm cmos. *IEEE Trans. Biomed. Circuits Syst.* 13, 145–158 (2019).
- Frank, M. P. Common mistakes in adiabatic logic design and how to avoid them. In (ed.) Proceedings of the International Conference on Embedded Systems and Applications. https://api.semanticscholar. org/CorpusID:10087823 (2003).
- Karakiewicz, R., Genov, R. & Cauwenberghs, G. 1.1 TMACS/mW finegrained stochastic resonant charge-recycling array processor. *IEEE* Sens. J. 12, 785–792 (2012).
- Huang, S., Hong, S. & De Schutter, E. Non-linear leak currents affect mammalian neuron physiology. *Front. Cell. Neurosci.* 9, 432 (2015).
- Abbott, L. F. Lapicque's introduction of the integrate-and-fire model neuron (1907). *Brain Res. Bull.* 50, 303–304 (1999).
- Dalgaty, T. et al. Mosaic: in-memory computing and routing for smallworld spike-based neuromorphic systems. *Nature Commun.* 15, 142 (2024).
- Vitale, A., Donati, E., Germann, R. & Magno, M. Neuromorphic edge computing for biomedical applications: gesture classification using emg signals. *IEEE Sens. J.* 22, 19490–19499 (2022).
- Covi, E. et al. Adaptive extreme edge computing for wearable devices. Front. Neurosci. 15, 611300 (2021).
- Bouanane, M. S., Cherifi, D., Chicca, E. & Khacef, L. Impact of spiking neurons leakages and network recurrences on event-based spatio-temporal pattern recognition. *Front. Neurosci.* 17, 1244675 (2023).
- 40. Chowdhury, S. S., Lee, C. & Roy, K. Towards understanding the effect of leak in spiking neural networks. *Neurocomputing* **464**, 83–94 (2021).
- 41. Indiveri, G. et al. Neuromorphic silicon neuron circuits. *Front. Neurosci.* **5**, 73 (2011).

- Fang, W. et al. Incorporating learnable membrane time constant to enhance learning of spiking neural networks. In (ed.) 2021 IEEE/CVF International Conference on Computer Vision (ICCV), 2641–2651 (2021).
- 44. Gerstner, W. & Kistler, W. M. Spiking Neuron Models: Single Neurons, Populations, Plasticity (Cambridge University Press, 2002).
- Yeomans, J. S. The absolute refractory periods of self-stimulation neurons. *Physiol. Behav.* 22, 911–919 (1979).
- Bielajew, C., Lapointe, M., Kiss, I. & Shizgal, P. Absolute and relative refractory periods of the substrates for lateral hypothalamic and ventral midbrain self-stimulation. *Physiol. Behav.* 28, 125–132 (1982).
- Burke, D., Kiernan, M. C. & Bostock, H. Excitability of human axons. *Clin. Neurophysiol.* **112**, 1575–1585 (2001).
- Fontaine, B., Peña, J. L. & Brette, R. Spike-threshold adaptation predicted by membrane potential dynamics in vivo. *PLoS Comput. Biol.* **10**, e1003560 (2014).
- Carrillo-Medina, J. L. & Latorre, R. Influence of the refractory period on neural networks based on the recognition of neural signatures. In (ed.) *Proc. International Joint Conference on Neural Networks (IJCNN)*, 1–9 (2015).
- Hurley, W. G. & Wölfle, W. H. Transformers and Inductors for Power Electronics: Theory, Design and Applications (John Wiley and Sons, Ltd, 2013). https://doi.org/10.1002/9781118544648.

# Acknowledgements

This work was supported by the Departmental Strategic Plan (PSD) of the University of Udine – Interdepartmental project of Artificial Intelligence AI and by the Italian MUR through the "FeNeL" project, PNRR M4.C2.1.1 – PRIN 2022 – No. 2022JE5SK2 – CUP G53D23000380006.

## Author contributions

D.E. proposed the concept; S.S. devised the adiabatic driving, the synapse architecture to accumulate the spikes and modeled it as an equivalent resistance; M.M. devised the adiabatic implementation of the neuron leakage and refractory period, designed and optimized the overall circuit,

carried out the simulations, analyzed and post-processed the data and produced the figures; M.L. provided insights into the algorithmic- and network-level interpretations and requirements. All authors contributed to the writing of the manuscript. D.E. supervised the project.

# **Competing interests**

The authors declare no competing interests.

# Additional information

Supplementary information The online version contains supplementary material available at https://doi.org/10.1038/s44335-024-00013-1.

**Correspondence** and requests for materials should be addressed to Marco Massarotto.

Reprints and permissions information is available at http://www.nature.com/reprints

**Publisher's note** Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

**Open Access** This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/bync-nd/4.0/.

© The Author(s) 2024