# A subthreshold MOS circuit for the Lotka-Volterra neural network producing the winners-share-all solution 

T. Asai ${ }^{\mathrm{a}, *}$, T. Fukai ${ }^{\text {b,c }}$, S. Tanaka ${ }^{\text {c }}$<br>${ }^{\text {a }}$ Department of Electrical and Electronic Engineering, Toyohashi University of Technology, Hibarigaoka 1-1, Tempaku-cho, Toyohashi, Aichi, 441-8580, Japan<br>${ }^{\mathrm{b}}$ Department of Electronics, Tokai University, Kitakaname 1117, Hiratsuka, Kanagawa, 259-1207, Japan<br>${ }^{\mathrm{c}}$ Laboratory for Neural Modeling, Brain Science Institute, The Institute of Physical and Chemical Research (RIKEN), 2-1, Hirosawa, Wako, Saitama 351-0198, Japan

Received 22 July 1998; received in revised form 9 September 1998; accepted 9 September 1998


#### Abstract

An analog MOS circuit is proposed for implementing a Lotka-Volterra (LV) competitive neural network which produces winners-shareall solutions. The solutions give multiple winners receiving large inputs and are particularly useful for selecting a set of inputs through "decision by majority". We show that the LV network can easily be implemented using subthreshold MOS transistors. Results of extensive circuit simulations prove that the proposed circuit does exhibit a reliable selection compared with winner-take-all circuits, in the possible presence of device mismatches. These results pave a way to future implementation on a real device. © 1999 Elsevier Science Ltd. All rights reserved.


## 1. Introduction

The winner-take-all (WTA) competition provides a way to select the input of upmost importance (Cohen \& Grossberg, 1983; Majani et al., 1989, 1989; Yuille, 1989; Wolfe et al., 1991; Kaski \& Kohonen, 1994; Taylor \& Alavi, 1995). The result of competition is typically represented by the activation of a single neuron which receives the largest input. The WTA competition, however, does not suit for an analog circuit implementation since the selection of a single winner may easily suffer from noise in inputs, partial destructions of the circuits or deviations in physical parameters of a group of equally designed devices (Lakshmikumar et al., 1986; Pelgrom et al., 1989).

A Lotka-Volterra (LV) neural network, which was recently derived from the membrane dynamics of competing neurons, possesses three types of steady-state solutions (Fukai \& Tanaka, 1997). Those solutions are classified as the WTA, winners-share-all (WSA) and variant winner-take-all (VWTA). The WSA solution gives multiple winners in the order of magnitudes of external inputs. The number of winners can easily be adjusted by a single parameter in the model. The selection of winners for WTA and

[^0]WSA cases does not depend on initial values of neuron variables, whereas that for VWTA case does.

Among those solutions, the WSA solution can be used to reduce the influence of the noise and device mismatches of the analog circuits if an external signal is selected by multiple winners. In this letter, we propose a MOS circuit producing the WSA solution of the LV networks for exploring a possible solution to the difficulties in the matching analog devices. Since the MOS transistors in the proposed circuit operate in a subthreshold region, the electric power dissipation from the circuit is very small. The low power dissipation is critical for realizing a large scale circuit. In order to examine the influences of device mismatches on the circuit's performance, we conduct extensive simulations of the proposed LV circuit with Simulation Program of Integrated Circuit Emphasis (SPICE). The implementation on a real device is reported elsewhere (Asai et al., 1997).

## 2. The Lotka-Volterra competitive neural network

The LV equation which describes competitive behavior among $N$ identical neurons is given as (Fukai \& Tanaka,


Fig. 1. The circuit for the LV neural network with $N$ external inputs. The circuit consists of a single inhibitory neuron unit (H-Cell) and $N$ excitatory neuron units (E-cells).
1997)

$$
\begin{equation*}
\tau \frac{\mathrm{d} z_{i}}{\mathrm{~d} t}=z_{i}\left(\gamma+W_{i}-z_{i}-\lambda \sum_{j \neq i} z_{j}\right)+\varepsilon, \quad i=1, \ldots, N \tag{1}
\end{equation*}
$$

where $z_{i}$ is the activity of the $i$ th neuron, $\gamma$ represents an input which is nonspecific to each neuron, $W_{i}$ represents neuron-dependent inputs and $\epsilon$ is a small positive constant. The $\epsilon$ term prevents any $z_{i}$ from becoming 0 , so that losers and winners can interchange if the magnitudes of $\left\{W_{i}\right\}$ are changed occasionally. Each neuron has a self-inhibitory connection of the strength normalized to unity and $\lambda$ is the relative strength of all-to-all lateral inhibitory connections.

Let the external inputs obey
$W_{1} \geq W_{2} \geq W_{3} \geq \cdots \geq W_{N-1} \geq W_{N} \geq 0$,
and $\epsilon=0$ for the time being. The WSA solution to the LV equation appears when
$\lambda<\lambda_{-} \equiv \frac{\gamma+W_{1}}{\gamma+2 W_{1}-W_{2}}$.
The winners are neurons which receive $d$ largest external inputs among all. The steady-state solution with $d$ winners is given by
$z_{i}=\frac{1}{\alpha}+\frac{W_{i}}{1-\lambda}+\frac{\lambda d}{(\lambda-1) \alpha}\langle W\rangle_{D}, \quad i=1, \ldots, d$
$z_{i}=O(\varepsilon), \quad i=d+1, \ldots, N$
independently of initial values of $z_{i}$, where $\alpha \equiv d \lambda+1-\lambda,\langle W\rangle_{D} \equiv d^{-1} \sum_{j=1}^{d} W_{j}$. The actual number of winners can be determined implicitly by
$\gamma+W_{d}>\frac{\lambda d}{1-\lambda}\left(\langle W\rangle_{D}-W_{d}\right)$.
The r.h.s. of (5) is an increasing function of $d$ while the l.h.s. is a decreasing function of $d$. Thus there exists an
upper bound for $d$ above which condition (5) is not satisfied. This upper bound gives the number of winners.

Condition(5) indicates that the number of winners decreases, and finally becomes one, as the relative strength $\lambda$ of the lateral inhibition approaches unity. On the other hand, all the neurons remain active for $\lambda$ less than $\lambda_{L} \equiv\left(\gamma+W_{N}\right) /\left(\gamma+W_{N}+\sum_{j=1}^{N} W_{j}-N W_{N}\right)$, which means that no neural selection occurs for $0<\lambda<\lambda_{L}$. Note that $0<\lambda_{L}<\lambda_{-}<1$.

The WTA $\left(\lambda_{-}<\lambda<\lambda_{+} \equiv\left(\gamma+W_{1}\right) /\left(\gamma+W_{2}\right)\right)$ and VWTA $(\lambda>\lambda)$ solutions to the LV equation were also obtained by Fukai \& Tanaka (1997), but they are not discussed here since we consider those solutions less important in the circuit than the WSA solutions.

## 3. An analog circuit for the Lotka-Volterra neural network

The LV equation can be transformed into a form more suitable for analog circuit implementation.

Introducing new variables $y_{i}=\ln z_{i}$, we can rewrite (1) as
$\tau \dot{y}_{i}=\gamma^{\prime}+W_{i}-\exp \left(y_{i}\right)-\lambda \sum_{j \neq i}^{N} \exp \left(y_{i}\right)$.
where $\gamma^{\prime}$ represents $\gamma+\varepsilon \exp \left(-y_{i}\right)$. The circuit implementation of the system described by (6) is much easier than that of (1), because (i) the products of the system variables disappear, and (ii) each processing unit has an exponential response function, a fundamental characteristics of many types of semiconductor device. Although the present LV neural network has all-to-all connections, the complexity of the connections can be easily reduced to $O(\mathrm{~N})$ owing the uniformity in the strength of lateral inhibition. Namely, we rewrite (6) as

$$
\begin{equation*}
\tau \dot{y}_{i}=\gamma+W_{i}-(1-\lambda) \exp \left(y_{i}\right)-\lambda \sum_{j=1}^{N} \exp \left(y_{i}\right) \tag{7}
\end{equation*}
$$



Fig. 2. The E-cell circuit composed of four MOS transistors. Each external input is given to the circuit as the gate voltage $V_{i}^{(e)}$. The E-cell circuits are connected with the H-cell circuit through the terminals (b) and (c).
and introduce $N$ excitatory cells (E cells) and one inhibitory cell ( H cell) as shown in Fig. 1. Note that the $\epsilon$-dependent term in $\gamma^{\prime}$ is omitted since the term is not necessary in the circuit implementation of (6) because of a property of MOS transistors, as will be seen later. The $H$ cell receives an excitatory connection of unit strength from each E cell, while an E cell receives an afferent input $\gamma+W_{i}$, a selfinhibitory connection of strength $(1-\lambda)$ and an inhibitory connection of strength $\lambda$ from the $H$ cell. The response functions of the E cell and H cell are defined as $\exp (x)$ and x , respectively. We use this system for the circuit implementation of our LV neural network. In this way, a large-scale neural network could be implemented on a small chip area owing to the simple circuit structure of neuron units and $O(\mathrm{~N})$ complexity of connections among the units.

Figs. 2 and 3 show schematic diagrams of the E- and Hcell circuits realized by a small number of MOS transistors,


Fig. 3. The H-cell circuit composed of five MOS transistors. The parameter $\lambda$ of the LV neural network is controlled by the current source magnitude ratio $\left(I_{H}^{(1)} / I_{H}^{(2)}\right)$.
respectively. Applying Kirchhoff's current law (KCL) at node (a) in Fig. 2, we can obtain the equation

$$
\begin{equation*}
C \dot{V}_{i}=I_{i}^{M_{4}}-I_{i}^{M_{1}}-I_{i}^{M_{2}} \tag{8}
\end{equation*}
$$

where $C$ and $I_{i}^{M a}$ stand for a MOS capacitance and the current of transistor $M_{i}^{a}$ of the $i$ th E-cell circuit. In the subthreshold region of operation, $I_{i}^{M_{2}}$ is ideally given by
$I_{i}^{M_{2}}=I_{0} \exp \left(k V_{i} / V_{T}\right)$,
where $I_{0}$ is the zero-bias current for the given device, $V_{T}=$ $k T / q$ ( $k$ is the Boltzmann constant, $T$ is temperature and $q$ is the charge of an electron), and $k$ measures the effectiveness of the gate potential. Similarly, $I_{i}^{M_{1}}$ is given as
$I_{i}^{M_{1}}=I_{0} \exp \left(k V_{\text {com }} / V_{T}\right)$
in terms of the gate-source voltage of transistor $M_{i}^{1}$, as long as it operates in the saturation region $\left(V_{i} \geq 4 V_{T}\right)$. The current mirror structure of $M_{i}^{2}$ and $M_{i}^{3}$ implies that $I_{i}^{M_{3}}$ or the output current of the $i$ th circuit is equal to $I_{i}^{M_{2}}$. Applying KCL at common node (b), we obtain the current $I_{H}^{M_{1}}$ of $M_{H}^{1}$ in the H -cell circuit as
$I_{H}^{M_{1}}=I_{0} \sum_{j=1}^{N} \exp \left(k V_{j} / V_{T}\right)$,
which is equal to $I_{H}^{M_{2}}$ of the current mirror structure. The current $I_{i}^{M_{2}}$ which represents the global inhibition on E cells needs to be mirrored to $I_{i}^{M_{1}}$ with an externally modifiable ratio. To this end, the transistors $M_{H}^{3}, M_{H}^{4}, M_{H}^{5}$ in the H-cell circuit and $M_{i}^{1}(i=1,2 \ldots N)$ in the E-cell circuits are employed as translinear multiplier/divider (Andreou et al., 1991). Then summing the voltages around the loop GND-(e)-(d)-(c)-GND in Fig. 3 imposes the following relation on the voltages in the H-cell circuit:
$V_{a}+V_{b}+V_{c}+V_{c o m}=0$,
where $V_{a}, V_{b}$ and $V_{c}$ stand for the gate-source voltages of transistors $M_{H}^{3}, M_{H}^{4}$ and $M_{H}^{5}$, respectively. Representing the gate-source voltages with their respective drain-source subthreshold currents, and assuming that all devices have identical values for $k$ and $\mathrm{I}_{0}$, we obtain

$$
\begin{align*}
& \frac{V_{T}}{k} \ln \left(\frac{I_{H}^{M_{1}}}{I_{0}}\right)+\frac{V_{T}}{k} \ln \left(\frac{I_{H}^{(1)}}{I_{0}}\right)-\frac{V_{T}}{k} \ln \left(\frac{I_{H}^{(2)}}{I_{0}}\right) \\
& \quad-\frac{V_{T}}{k} \ln \left(\frac{I_{i}^{M_{1}}}{I_{0}}\right) \\
& \quad=0 \tag{13}
\end{align*}
$$

From (11) and (13), we can easily derive

$$
\begin{equation*}
I_{i}^{M_{1}}=\frac{I_{H}^{(1)} I_{H}^{M_{1}}}{I_{H}^{(2)}}=\beta I_{0} \sum_{j=1}^{N} \exp \left(k V_{j} / V_{T}\right) \tag{14}
\end{equation*}
$$

where $\beta$ represents the ratio of $I_{H}^{(1)}$ to $I_{H}^{(2)}$. By replacing


Fig. 4. The dynamic behavior of the circuit model showing the WSA solution with three winners. The time courses of the output currents $I_{i}^{M_{2}}$ and the membrane voltages $V_{I}$ at the node of the E cells are presented in (a) and (b), respectively.
the current sources for $I_{H}^{(1)}$ and $I_{H}^{(2)}$ with MOS transistors operating in the saturation region, $\beta$ is rendered externally modifiable through the gate voltages of those transistors.

The afferent input to each E cell is given by an input current to transistor $M_{i}^{4}$. Therefore, the strength of the afferent input can be externally controlled by changing the gate voltage $V_{i}^{(e)}$ of $M_{i}^{4}$ through
$I_{i}^{M_{4}}=I_{0} \exp \left(-k V_{i}^{(e)} / V_{T}\right)$.
Here $V_{i}^{(e)}(<0)$ must be in the range which ensures the operation of $M_{i}^{4}$ in the subthreshold region.

Now substituting (9), (14) and (15) into (8), we obtain

$$
\begin{align*}
C \dot{V}_{i}= & I_{0} \exp \left(-k V_{i}^{(e)} / V_{T}\right)-I_{0} \exp \left(k V_{i} / V_{T}\right) \\
& -\beta I_{0} \sum_{j=1}^{N} \exp \left(k V_{j} / V_{T}\right) \tag{16}
\end{align*}
$$

which becomes equivalent to (6) with the following substitution:
$\frac{C V_{T}}{I_{0} k(1+\beta)} \equiv \tau, \quad \frac{k}{V_{T}} V_{i} \equiv y_{i}$,
$\frac{\exp \left(-k V_{i}^{(e)} / V_{T}\right)}{1+\beta} \equiv \gamma^{\prime}+W_{i}, \quad \frac{\beta}{1+\beta} \equiv \lambda$.
Note that $\lambda$ is in the interval $(0,1)$ since $\beta>0$. Therefore, the proposed circuit produces WSA solutions as required.

In practice we can omit the $\epsilon$-term in the circuit implementation and simply replace $\gamma^{\prime}$ with $\gamma$ in (17). The reason why this term can be omitted is as follows. Remember that the term was necessary in (1) to prevent $z_{i}$ from becoming zero. Suppose that the LV neural network gives the WTA solution with cell 1 being a winner: $z_{1}>0$ and $z_{i}=0$ for $i \neq 1$. In the transformed system (7), this means that $y_{1}=$ $\ln \left(\left(\gamma^{\prime}+W_{1}\right) / k\right)$ and all other $y_{i}{ }^{\prime}$ s are $-\infty$ because of the logarithmic relation $y_{i}=\ln z_{i}$. Large negative $y_{i}$ for the losers implies that the inhibitory effects of the losers on the winner vanish and the effects of the winner on the losers dominate to give the WTA solution.

This divergence to negative infinity, however, never occurs in the circuit. When $V_{i} \leq 4 V_{T}$ and $M_{i}^{1}$ starts to leave the saturation region, the relation (10) ceases to be valid. Consequently, the lateral inhibition term represented by $\beta I_{0} \sum_{j=1}^{N} \exp \left(k V_{j} / V_{T}\right)$ in (16) decreases rapidly to zero, rather than to $I_{0}$, as $V_{I}$ approaches zero in the equation for the losers. This indicates that $V_{i}$ does not go to negative infinity for the losers because the driving term itself vanishes. Thus the losers acquire a small nonvanishing $V_{i}$, giving $z_{i}>0$.

## 4. Simulation results

In the following SPICE simulations, the capacitance of the E cells and the dimensions of the MOS transistors are assumed as $\mathrm{C}=10 \mathrm{pF}$ and $\mathrm{W} / \mathrm{L}=6 \mu \mathrm{~m} / 4 \mu \mathrm{~m}$. For other parameters of the transistors, we employ the typical values provided by MOSIS (CMOS process, feature size: $2 \mu \mathrm{~m}$ ).

A transient response of the WSA circuit with $N=10$ is shown in Fig. 4. In this simulation, all transistors have identical values for the device parameters. Figs. 4(a) and (b) show the time courses of output currents $\left\{I_{i}^{M_{2}}\right\}$ and membrane voltages $\left\{V_{i}\right\}$ for $\lambda=0.7$, which correspond to $z_{i}$ and $y_{i}$ respectively. The value of the afferent input current for each neural circuit is given by
$I_{i}^{M_{4}}=300-20 \times i(\mathrm{nA}), \quad i=1, \ldots, 10$.
The inequality (5) predicts that winners should be the


Fig. 5. Correlation between equilibrium voltages of the LV circuits with and without device mismatches as functions of a standard deviation of the mismatches.
neurouns receiving the first three largest inputs. The result shown in Fig. 4(a) is consistent with this prediction. From Fig. 4(b), it is also found that $V_{i}$ for losers do not diverge to negative infinity, as expected.

Device mismatches of on-chip transistors are considered to be a drawback when we use the subthreshold regions. In the following simulations, the mismatches are embedded in the values of zero-bias current $I_{0}$ since they may vary for different transistors to a much greater extent than the values of $k$ (Pavasovic et al., 1994). The following Gaussian distribution is assumed for $I_{0}$ :
$P\left(I_{0}\right)=\frac{1}{\sqrt{2 \pi} \sigma} \exp \left(\frac{\left(I_{0}-I_{\mathrm{av}}\right)^{2}}{2 \sigma^{2}}\right)$,
where $I_{\mathrm{av}}$ and $\sigma$ represent the average value of the zero-bias currents and the standard deviation of $I_{0}$ respectively. $I_{\mathrm{av}}$ is fixed at 16 fA . The mismatches of $I_{0}$ will give rise to the deviations of the strength of lateral inhibition, and the inhomogeneous strength of lateral inhibition can result in erroneous selection of winners. We are interested in studying to what extent the WSA solution produced by the LV circuit is influenced by the magnitude of $\sigma$. The network performance is measured by the correlation function:
$\operatorname{corr}\left(\boldsymbol{V}_{c}, \boldsymbol{V}_{n}\right)=\frac{\left\langle\boldsymbol{V}_{c} \cdot \boldsymbol{V}_{n}\right\rangle-\left\langle\boldsymbol{V}_{c}\right\rangle\left\langle\boldsymbol{V}_{n}\right\rangle}{\sqrt{\left\langle\boldsymbol{V}_{c}^{2}\right\rangle-\left\langle\boldsymbol{V}_{c}\right\rangle^{2}} \sqrt{\left\langle\boldsymbol{V}_{n}^{2}\right\rangle-\left\langle\boldsymbol{V}_{n}\right\rangle^{2}}}$
where $\boldsymbol{V}_{c}$ represents the vector of equilibrium voltages $V_{i}$ 's in the LV circuit in the absence of the mismatches, while $V_{n}$ represents the vector of the voltages in the presence of the mismatches. If the value of the correlation is close to unity, the influence of the device mismatches is small. The value of the afferent input current for each neural circuit is given by
$I_{i}^{M_{4}}=i \quad(n A), \quad i=1, \ldots, 100$.

Fig. 5 shows the results of simulations for LV circuit with $N=100$. The figure indicates that when the LV circuit produces a WTA solution $(\lambda \rightarrow 1)$, even a small $\sigma$ makes the selection of a solution unpredictable since an inappropriate neuron accidentally becomes a winner because of the mismatches. On the other hand, the selection becomes reliable as the number of winners is increased in the WSA solution $(\lambda<0.9)$. In the cases, correct winners are activated with high probability according to 'decision by majority'.

In actual analog VLSIs, the $I_{0}$ values of two nominally identical transistors may maximally differ by a factor of two (Mead, 1989;Andreou et al., 1991). If the distribution is Gaussian as assumed here, these values lie within a range $I_{\mathrm{av}} / 9 \approx 1.8 \mathrm{fA}$. Fig. 5 indicates that the WSA circuits with $\lambda \leq 0.8$ will perform nearly perfect selection even in the presence of the device mismatches.

## 5. Concluding remark

We have proposed a MOS circuit representing a LotkaVolterra competitive neural network and showed its characteristics. The influences of the device mismatches were investigated by circuit simulations, varying the magnitudes of the deviation in the zero-bias current. With the proposed circuits, a large number of unit cells can be integrated on a chip since (i) the complexity of connections is reduced to $O(N)$ by introducing a single inhibitory cell; (ii) the quadratic interaction terms from the original LV neural network were removed by adopting an exponential transfer characteristics for excitatory neuron units, which makes the circuit organization extremely simple; (iii) the electric power dissipation from the circuit is expected to be very small owing to the subthreshold operation of the MOS transistors.

Encouraged by the results of the present simulations, the LV integrated circuit was recently fabricated in Toyohashi University of Technology. The experimental results showed that the circuit does exhibit reliability in WSA solutions (Asai et al., 1997).

## References

Andreou, G. A., Boahen, A. K., Pouliquen, O. P., Pavasovic, A., Jenkins, R. E., \& Strohbehn, K. (1991). Current-mode subthreshold MOS circuits for analog VLSI neural systems. IEEE Trans. Neural Networks, 2, 205-213.
Asai, T., Ohtani, M., \& Yonezu, H. (1997). Analog integrated circuits for the Lotka-Volterra competitive neural networks with a winners-shareall solution. In: Proc. Of the 6th. International Conference on Microelectronics for Neural Networks, Evolutionary and Fuzzy Systems (pp. 91-98).
Cohen, A. M., \& Grossberg, S. (1983). Absolute stability of global pattern formation and parallel memory storage by competitive neural networks. IEEE Trans. Systems Man and Cybern., SMC-13, 815-826.
Fukai, T., \& Tanaka, S. (1997). A simple neural network exhibiting selective activation of neuronal ensembles: from winner-take-all to winners-share-all. Neural Compuration, 9, 77-97.

Kaski, S., \& Kohonen, T. (1994). Winner-take-all networks for physiological models of competitive learning. Neural Networks, 7, 973984.

Lakshmikumar, R. K., Hadaway, A. R., \& Copeland, A. M. (1986). Characterization and modeling of mismatch in MOS transistors for precision analog design. IEEE J. Solid-State Circuits, SC-21, 1057-1066.
Majani, E., Erlanson, R., \& Abu-Mostafa, Y. (1989). On the K-winners-take-all network. Advances in Neural Information Processing Systems, I. p. 634-641.

Mead, C. (1989). Analog VLSI and Neural Systems. New York: Addison Wesley.
Pavasovic, A., Anderou, G. A., \& Westgate, R. C. (1994). Characterization
of subthreshold MOS mismatch in transistors for VLSI systems. J. Analog Integrated Circuits and Signal Processing, 6, 75-85.
Pelgrom, M. J. M., Duinmaijer, J. C. A., \& Welbers, G. P. A. (1989). Matching properties of MOS transistors. IEEE J. Solid-State Circuits, 24, 1433-1440.
Taylor, G. J., \& Alavi, N. F. (1995). A global competitive neural network. Biol. Cybern., 72, 233-248.
Wolfe, J. W., Mathis, D., Anderson, C., Rothman, J., Gottler, M., Brady, G., Walker, R., Duane, G., \& Alaghband, G. (1991). K-winner networks. IEEE Trans. Neural Networks, 2, 310-315.
Yuille, L. A., \& Grzywacz, M. N. (1989). A winner-take-all mechanism based on presynaptic inhibition feedback. Neural Computation, 1, 334-347.


[^0]:    * Corresponding author; e-mail: asai@ postone.com

