# Hierarchical Verification and Hierarchical Synthesis of the Power-Down Mode of Analog Circuits 

Maximilian Ulrich Neuner

Vollständiger Abdruck der von der TUM School of Computation, Information and Technology zur Erlangung des akademischen Grades eines

Doktors der Ingenieurwissenschaften (Dr.-Ing.)
genehmigten Dissertation.

## Vorsitz:

Prof. Dr.-Ing. Thomas Eibert

Prüfer der Dissertation: 1. apl. Prof. Dr.-Ing. habil. Helmut Gräb
2. Prof. Dr.-Ing. Steffen Paul
3. Prof. Sachin Sapatnekar, Ph.D.

Die Dissertation wurde am 24.04.2023 bei der Technischen Universität München eingereicht und durch die TUM School of Computation, Information and Technology am 15.10.2023 angenommen.


#### Abstract

To save power, dedicated transistors can switch off the currents of analog circuits in powerdown mode. Hierarchical circuits consist of several blocks. Even if the power-down mode of the individual subblocks was correctly designed, their connectivity can cause unexpected errors in the power-down mode of the overall system, e.g. currents. This work describes a new method to identify such errors and one to construct the power-down mode for hierarchical analog circuits fault-free.


## Zusammenfassung

Um Energie zu sparen, stellen dedizierte Transistoren analoge Schaltungen im Power-DownModus stromlos. Hierarchische Schaltungen bestehen aus mehreren Blöcken. Deren Verdrahtung kann unerwartete Fehler, wie z.B. Ströme, im Power-Down-Modus des Gesamtsystems verursachen, obwohl dieser für dessen Einzelblöcke korrekt entworfen wurde. Diese Arbeit stellt eine neue Methode zur Erkennung solcher Fehler und eine zur fehlerfreien Konstruktion des Power-Down-Modus hierarchischer analoger Schaltungen vor.

## Preface

During my early bachelor studies at TUM, I realized that I enjoyed the systematic, step-bystep approach software development offers to tackle and solve complex technical problems. This interest led me to my first internship at the Chair of Electronic Design Automation where I learned how to program and design graphical user interfaces for CAD tools specialized on integrated circuit design. Several more internships as well as my bachelor's thesis followed and I also completed my master studies at the same chair under the supervision of Michael Zwerger - working on a topic which should later turn out to be the foundation of my dissertation.

Many people accompanied me throughout my time at the institute; without their help and encouragement this work would most likely not exist as it is in its current form. I want to thank Prof. Schlichtmann, head of the institute, for giving me the opportunity to conduct my own research and to work in academia - I especially enjoyed teaching and passing on knowledge which I learned myself not too long ago in the same lecture rooms. I want to express my deep gratitude towards my supervisor Prof. Graeb. He always encouraged me to follow new ideas and his ability to discover new aspects and value in my work (which I did not even think of) were crucial for the successful completion of this work. Kudos as well for his patience for revising my drafts and helping me to improve my presentation skills. A special thanks also to Prof. Steffen Paul from the University of Bremen and Prof. Sachin Sapatnekar from the University of Minnesota for taking interest in my work and the time to review it.

I am still feeling honored and glad to be given the chance to conduct the lab part of the lecture "Mathematical Methods of Circuit Design" at Nanyang Technological University in Singapore which afterwards allowed me to travel the world - and in the end to meet my future wife in Malaysia.

A big thanks to my colleagues at the institute for the good working atmosphere which enabled fruitful discussions, the get-togethers during lunch time, table soccer matches during coffee breaks and other leisure time activities outside of work.

I also want to thank all the students who were actively helping me to improve the implemented software, especially Inga Abel who was always willing to explore unusual ideas which lead to unexpected or surprising results.

Finally, I want to thank my friends and family who were always supporting me in my activities, took off my mind from work and helped me in that way to refresh and collect new ideas.

## Contents

1. Introduction ..... 3
1.1. Analog Power-Down Mode Design ..... 5
1.2. State of the Art ..... 8
1.3. Problem Formulation and Contributions ..... 10
1.4. Thesis Structure and Prior Publications ..... 12
2. Hierarchical Circuit Representation ..... 15
3. Hierarchical Power-Down Verification ..... 17
3.1. Overview ..... 17
3.2. Voltage Propagation ..... 18
3.3. Short Circuit Path Computation ..... 21
3.4. Matching and Symmetry Assertion ..... 24
3.4.1. Power-up Transformation ..... 24
3.4.2. Assertion ..... 26
3.5. Experimental Results ..... 29
3.5.1. Dual Gain Amplifier ..... 29
3.5.2. High Input Impedance Differential Amplifier ..... 36
4. Hierarchical Power-Down Synthesis ..... 43
4.1. Overview ..... 43
4.2. Rip-Up Heuristic ..... 45
4.3. Gate Shut-Off ..... 45
4.3.1. Constraint Optimization Problem ..... 49
4.3.2. Selection of the Optimal Solution ..... 59
4.3.3. Computational Complexity ..... 62
4.4. Experimental Results ..... 64
4.4.1. Current Mirror Operational Amplifier ..... 65
4.4.2. Low Resistance Operational Amplifier ..... 66
4.4.3. Fully Differential Operational Amplifier ..... 67
4.4.4. Dual Gain Amplifier ..... 69
4.4.5. High Input Impedance Differential Amplifier ..... 73
5. Library-Free Structure Recognition ..... 77
5.1. State of the Art ..... 78
5.2. Overview ..... 78
5.3. Creation of New Library Elements ..... 80
5.4. Library Extension ..... 83
5.5. Experimental Results ..... 84
5.5.1. Folded Cascode Amplifier Variants ..... 84
5.5.2. Level Shifter Topologies ..... 87
6. Conclusion ..... 91
A. Hierarchical Structure Recognition ..... 93
A.1. Introduction ..... 93
A.2. State of the Art and Contributions ..... 93
A.3. Overview ..... 97
A.4. Partial Circuit Flattening ..... 98
A.5. Structure Recognition ..... 100
A.6. Experimental Result ..... 105
B. Hierarchical Symmetry Computation ..... 109
B.1. Introduction ..... 109
B.2. State of the Art and Contributions ..... 109
B.3. Overview ..... 110
B.4. Signal Flow Graph Generation ..... 111
B.5. Signal Path Analysis ..... 116
B.6. Experimental Result ..... 124
B.7. Discussion and Limitations ..... 124
C. Manual Power-Down Mode Revision ..... 129
Bibliography ..... 141

## Contents

## 1. Introduction

Analog circuits are widely considered as the interface between the real and the digital world. Most often they are the first point of interaction with time continuous electrical signals within an integrated circuit (IC).

Fig. 1.1 shows the simplified block diagram of a wireless speaker. Its antenna receives a time continuous wireless signal which is filtered and amplified by dedicated analog circuitry. The resulting signal is then binarized by an analog-to-digital converter (ADC) and processed by a digital signal processor (DSP). Finally, the signal is translated back into the analog domain by a digital-to-analog converter (DAC) and output by a speaker. The whole system is powered by a battery whose power is distributed to the system's components by a power management unit (PMU).

The runtime of the speaker is limited by the electrical energy stored in its battery. Hence, each component of the speaker should be designed as power efficient as possible in order to increase its runtime.

Fig. 1.2 shows a digital inverter and a simple analog amplifier circuit, i.e., a source circuit.
The output signal $z$ of an inverter takes the opposite voltage level of its input signal $x$ which corresponds to either the ground, i.e., logical zero, or the supply voltage, i.e., logical one. The inverter is embedded in sequential logic, i.e., in-between the two registers reg1 and reg2. Registers store incoming signal values from a previous logic stage and pass it onto the next one whenever the clock signal $c l k$ transitions from the ground to the supply voltage.

The analog source circuit amplifies a small signal input voltage at net in and converts it into a small signal output current at node out. The amplification factor, the so-called "gain", is thereby defined as the ratio of the output and input voltage in decibel.


Figure 1.1.: Simplified block diagram of a wireless speaker.

## 1. Introduction


(a) inverter circuit

(b) source circuit

Figure 1.2.: Schematics of a simple digital (a) and a simple analog circuit (b).

The power consumption of a digital circuit has two contributors [1]:

$$
\begin{equation*}
P_{d i g}=P_{s t a t}+P_{d y n} . \tag{1.1}
\end{equation*}
$$

The static power dissipation

$$
\begin{align*}
P_{\text {stat }} & =I_{\text {leak }} \cdot V_{d d} \quad \text { with }  \tag{1.2}\\
I_{\text {leak }} & \sim \frac{W}{L} I_{s} e^{\frac{V_{g s}-V_{t h}}{a V_{T}}} \tag{1.3}
\end{align*}
$$

is caused by sub-threshold leakage currents of the implemented CMOS transistors.
The dynamic power dissipation

$$
\begin{equation*}
P_{d y n}=\alpha C V_{d d}^{2} f \tag{1.4}
\end{equation*}
$$

is the power required to charge or discharge small capacitors at the affected nodes of the circuit with $\alpha$ describing the switching frequency in percent.

Analog circuits require a constant bias current $I_{B}$ to set the operating point of the circuit and hence:

$$
\begin{equation*}
P_{\text {analog }}=I_{B} \cdot V_{d d} . \tag{1.5}
\end{equation*}
$$

The static power dissipation of digital circuits can be reduced by threshold voltage scaling [2]. The threshold voltage of the transistors can be increased by applying a voltage to the substrate of the chip. This reduces the leakage current exponentially according to eq. (1.3). This technique however is not applicable for analog circuits as the transistors typically operate in saturation and not in sub-threshold region.

Another technique to reduce the static power dissipation is shown by Fig. 1.3a. Powergating, also known as multi-threshold voltage circuits [3], uses two additional transistors to disconnect the circuit block from the ground and the supply voltage when not required for system operation. These so-called header ( $P 1$ ) and footer ( $N 1$ ) transistors have a higher threshold voltage than the transistors of the digital logic block which hence reduces the leakage current of the whole block according to eq. (1.3).

This technique can principally also be applied to analog circuits but has the following two disadvantages:


Figure 1.3.: Two digital power-saving techniques.

- the header and footer transistors must be sized large in order to carry the bias current of the circuit,
- the additional voltage drops between the drain and source pins of the power-gating transistors limits the available voltage swing of the circuit.

The first disadvantage is unacceptable for area-critical devices and the reduced voltage swing might cause a circuit to fail its specifications. Hence, this technique is often not suitable in practice.

Fig. 1.3b shows a technique to reduce the dynamic power dissipation of a digital circuit. Clock gating disables the clk signal at the registers of sequential logic circuits when not required for system operation. Hence, the dynamic power drops to zero according to eq. (1.4) as the block's frequency is zero.

This technique -as well as frequency scaling - cannot be applied to analog circuits as they are not synchronized by a clock signal.

In summary, common digital power saving techniques cannot be transferred to the analog domain. Instead, a so-called power-down mode is implemented.

### 1.1. Analog Power-Down Mode Design

Fig. 1.4a illustrates the basic principle of the analog power-down mode. The power-down mode is implemented by additional transistors, highlighted in blue, which pull the internal nets of the analog block either to the ground or to the supply voltage. This forces the devices of the circuit either into the conducting or non-conducting state with the overall goal to shut off all bias currents in the circuit.

The power-down circuitry is controlled by the digital signal $p w d$. It is inactive when $p w d$ is assigned to the ground voltage level gnd, i.e., the analog block is in normal operation.

The power-down circuitry becomes active when $p w d$ is set to the supply voltage level $v d d$. The analog block does not consume any power in this mode.
Fig. 1.4b shows the source circuit from Fig. 1.2 with its power-down circuitry highlighted in blue. The transistor $N P 1$ pulls the gate of $N 1$ to the ground voltage in power-down mode

## 1. Introduction



Figure 1.4.: Basic principle of the analog power-down mode (a) and the source circuit from Fig. 1.2 b with its power-down circuitry (b) highlighted in blue.


Figure 1.5.: Power-down mode implementation variants considering device matching.
to force $N 1$ into the non-conducting state. This switches the bias current $I_{B}$ of the circuit off and reduces its power-consumption to approximately zero. NP1 is inactive in normal operation and the bias current $I_{B}$ can set the operating point of the circuit as usual.

Analog circuits require precise matching, i.e., functionally identical devices must be sized identically and should be exposed to similar operating conditions in order to work as intended [4]. This design principle also affects the analog power-down mode design.
Fig. 1.5 shows a differential input stage. It converts a differential input voltage at its input pins in and ip into a differential output current at the nodes $n 2$ and $n 3$. The two circuit halves processing the differential input signal, i.e., $(P 1, N 2)$ and $(P 2, N 3)$, must be built identically to show symmetrical electrical behaviour. Hence, the devices of the differential pair $d p(N 2, N 3)$ and of the simple current mirror load $\operatorname{scm}(P 1, P 2)$ must be matched.
The publications $[5 ; 6 ; 7]$ showed that matching deteriorates over time when the affected devices are exposed to asymmetric operating conditions, especially due to mismatched pin voltages. Thorough care must be taken to avoid voltage mismatch in power-down mode
as the affected devices could be stressed by the maximum available voltage difference in the circuit as its internal nets are either pulled to $v d d$ or $g n d$. In the worst case, the circuit might fail after powering it up again due to time-dependent voltage mismatch in power-down mode. Matched devices must hence be exposed to matched voltages in power-down mode to prevent these problems.

The power-down mode of the differential input stage from Fig. 1.5a has been implemented by only one power-down transistor, namely $N P 1$. The nets $i n, i p, n 1$ and $n 3$ are highimpedant, i.e., floating, in this implementation as no connection to the supply or ground net can be established by conducting devices. The state of these nodes is hard to predict as it depends on the sub-threshold operation of $N 1, N 2$ and $N 3$ which is often inaccurately captured by their corresponding device models [8;9]. The diode-connected transistor P1 furthermore pulls $n 2$ to the supply voltage level which can lead to asymmetric stress at the differential pair $d p(N 2, N 3)$ and the simple current mirror $\operatorname{scm}(P 1, P 2)$ of the circuit.

The power-down circuitry from Fig. 1.5b overcomes these problems by four more powerdown transistors which pull $i n$, $i p$ and $n 1$ to the ground and $n 3$ to the supply voltage level. Now, all voltages levels in the circuit are defined, i.e., not floating, and the differential pair $d p(N 2, N 3)$ as well as the simple current mirror $\operatorname{scm}(P 1, P 3)$ are exposed to symmetrical voltages in power-down mode.

This solution however is not optimal with respect to the die area occupied by the additional power-down circuitry. The area requirements of the power-down circuitry is mainly determined by the number of inserted power-down transistors. It should be minimized whilst fulfilling the circuit's matching conditions as good as possible. The optimal implementation variant for the differential input stage is shown in Fig. 1.5c. It requires three power-down transistors in total. The transistors $P P 1$ and $P P 2$ turn on the differential pair $d p(N 2, N 3)$ in power-down mode which allows the supply voltage to propagate to $n 1$ and $n 3$. All matching conditions are still fulfilled whilst minimizing the area requirements of the power-down circuitry. In this example, it has been assumed that the voltages at the input and output pins of the circuit, i.e., in, ip and out, can be arbitrarily pulled to the supply or ground voltage level. In reality however, these voltages depend on external circuitry connected to these pins and must be considered during power-down mode design.

In summary, the design of the analog power-down mode has the following targets [10]:
(A) All current paths must be shut off in power-down mode.
(B) Floating nodes should be avoided in power-down mode.
(C) Maximize matching to prevent time-dependent mismatch.
(D) Minimize the area requirements of the power-down circuitry.

Target (A) must be fulfilled, requirements (B), (C) and (D) should be fulfilled as good as possible.

Additionally, the normal operation of the circuit should not be affected by the inserted transistors. This must be verified by numerical simulation after the implementation of the power-down mode.

Fig. 1.6 shows the three shut-off patterns presented by [11] to implement the power-down mode of a circuit whilst following design goals (A) - (D).

The gate shut-off pattern inserts power-down transistors in parallel into the circuit to pull its internal nets either to the ground or to the supply voltage in order to turn off all current


Figure 1.6.: Shut-off patterns used for power-down mode implementation [11].
flow in the circuit. It is the main tool to implement the power-down mode of a given circuit and corresponds to the technique which has been used to switch off the source circuit and the differential stage from Figs. 1.4b and 1.5, respectively.

Some current paths in the circuit however cannot be switched off using the gate shut-off pattern. Fig. 1.6b shows a current path running over the resistor $R 2$ and diode-connected transistor $N 2$. The power-down transistor $N P 2$ tries to switch off $N 2$ by pulling its gate to the ground voltage which creates a new current path over the just inserted power-down transistor. The diode rip-up pattern inserts $N P 3$ into the circuit which disables the diode configuration of $N 2$ in power-down mode. Now, N2 can be safely switched off by NP2.

A serial rip-up, as shown in Fig. 1.6c, must be performed if there is no diode-connected transistor on the problematic current path. The transistor is inserted in series in-between the supply node and the first device of that current path and must be sized large enough to carry the bias current of that branch in normal operation. The serial rip-up furthermore introduces an additional voltage drop in that path which could affect the normal operation of the circuit. Hence, the serial rip-up is the last resort and a diode rip-up should be preferred whenever possible [12].

### 1.2. State of the Art

The manual design of the analog power-down mode is error prone, repetitive and tedious. Unintended current flows, floating nodes and mismatched voltages can be easily overseen in more complex circuits. This is especially true for subtle design errors as leakage currents through the bulk substrates of transistors [9]. Therefore, automatic tools to detect such errors are required.

The power-down verification method presented by [8] detects current paths and floating nodes in power-down mode using voltage propagation. The algorithm initializes the supply and ground nets with their corresponding voltage level; the internal nets are set to floating. Starting from the ground and supply nets, the algorithm scans the circuit for conducting devices and propagates the supply and ground voltage into the circuit. This process is repeated until no more propagation occurred. Conducting devices which connect the supply with the ground voltage are reported back to indicate that a current flow in the circuit has been found. The method supports resistors, diodes and MOS transistors and guarantees to find all potential current flow in the circuit.

The verification method presented in $[10]^{1}$ models the static DC behavior of a circuit in power-down mode by a graph and uses voltage propagation to estimate its node voltages. Current flows are detected by a depth-first search from the supply to the ground nets of the circuit. The method additionally uses structure recognition ${ }^{2}$ to identify basic analog building blocks, e.g., differential pairs and current mirrors, in the circuit which must be matched. The identified structures are then compared against the voltage estimates to detect mismatch in power-down mode.

The method presented by [18] shows how constraint programming can be used to detect floating nodes efficiently in power-down mode.

These tools are of great help for circuit designers as they either automatically verify the correct power-down mode functionality of a circuit or indicate error spots in the current power-down circuitry.

The authors of [10], however, even showed that the manual power-down mode design process can be automated which leaves designers with more time for more creative tasks. The presented method uses a set of rules to construct faultless power-down circuitry which has been derived from the verification rules presented in the same work. These rules are used to formulate two constraint programs [19] whose solutions guarantee the correct power-down mode behavior of the circuit. The constraint programs are modelled and solved using [20]. The first constraint program computes all possible net voltage combinations in power-down mode such that no current flow and no floating node can occur. Furthermore, matching constraints based on structure recognition results are formulated which must be fulfilled as good as possible. The second constraint program determines the number of required powerdown transistors to implement such a net voltage combination and inserts them at the correct locations, i.e., at the nets which must be pulled to the supply or ground voltage, into the circuit. The solutions which cannot be implemented with a minimum number of power-down transistors are thereby discarded. Finally, a solution is manually selected from the remaining ones for implementation. Before formulating and solving the constraint programs, the circuit is scanned for problematic current paths which require a rip-up. The presented algorithm recursively checks for each net of the circuit if there is a conducting path from that net to the supply voltage which cannot be switched off by the gate shut-off pattern. If there is additionally another conducting path from that net to the ground voltage which cannot be switched off by the gate shut-off pattern, a problematic current path has been detected. A diode or serial rip-up is then applied to each identified problematic current path which guarantees at least one solution of the two constraint programs.

Power-down transistor sizing was investigated in [14] with the conclusion that the minimum transistor sizes are adequate for most applications. Larger transistors are only necessary for designs which require fast transition times from the normal operation to the power-down mode. The wake-up time, i.e., the transition time from the power-down to the normal operation mode, is only determined by the original circuitry and cannot be influenced by power-down transistor sizing.

The paper by [15] describes how the power-down synthesis results from [10] can be visualized by displaying the power-down transistors in the schematic editor of Cadence Virtuoso [21]

[^0]

Figure 1.7.: Potential current flows in a source circuit (a), faulty power-down mode implementation for a source circuit series (b).
which indicates that the synthesis method can be seamlessly integrated into an analog design flow.

### 1.3. Problem Formulation and Contributions

The power-down verification and synthesis methods described by [8;10] operate on transistor level. Complex chips however undergo a hierarchical design process: the desired functionality is first translated into a technical specification which describes the overall system behavior. That system is then successively divided into smaller and smaller parts, until single blocks can be implemented on device level [22].
A chip is thereby divided into analog and digital parts, the analog parts are sub-divided into, e.g., filters, phase locked loops (PLLs), analog to digital converters (ADCS) or digital to analog converters (DACs). These circuits are, e.g., composed of RC networks and operational amplifiers (OpAmps) which can be implemented on device level. Afterwards, the single components are assembled to form the overall system.
The power-down verification and synthesis methods by [8;10] do not consider the interaction with and the connectivity to external circuitry which occurs in hierarchical analog designs. This can lead to design errors in the power-down mode as will be illustrated with the following example.
Fig. 1.7a shows the already known source circuit (SC) from Fig. 1.4b. The dashed red lines indicate a potential current flow in the power-down mode of the circuit which depends on the external circuitry connected to the input and output pins of the amplifier. Its input pin in is connected to gnd by the conducting power-down transistor NP1. A new and unintended current path would arise in the source circuit if that pin is connected to $v d d$ by conducting external circuitry. The output pin out of the source circuit is furthermore connected to vdd by the resistor $R 1$. Hence, an external, conducting connection to $g n d$ with that pin would create another current path in power-down mode.


Figure 1.8.: Data flows of the state-of-the-art (a) and the proposed power-down verification and power-down synthesis method (b) for hierarchical analog circuits. The diagram on the right (c) shows the data flow for the new structure recognition and symmetry computation methods.

This situation is illustrated by Fig. 1.7b. Two source circuits $S C 1$ and $S C 2$ are connected with each other to form a source circuit series (SCS), i.e., an amplifier with two amplification stages. Now, the output pin of $S C 1$ is connected to the input pin of $S C 2$ which results in a current flow over the resistor $S C 1 / R 1$ and the transistor $S C 2 / N P 1$ in power-down mode. Such new short circuit paths, created by hierarchical circuit composition, must be detected and prevented.

The methods presented by [10] must flatten a given hierarchical design in order to be able to support it as shown in Fig. 1.8a. Circuit flattening replaces instantiated subcircuit blocks by their netlist implementation until the device level is reached. This approach does not preserve the circuit hierarchy and is hence not suitable for industrial and IP based designs. Furthermore, intermediate verification and synthesis results on the block level cannot be reused which increases the computational complexity of the verification and synthesis tasks.

This work presents a new power-down verification method and a new power-down synthesis method for hierarchical circuits which overcome these issues as shown in Fig. 1.8b. The new methods are based on [10] with the following extensions:

- all algorithms have been developed in order to preserve the circuit hierarchy. The implemented verification and synthesis methods traverse the circuit hierarchy either in a top-down or bottom-up approach or in both directions.
- intermediate results for all encountered subcircuit blocks are stored in libraries to enable their reuse and to avoid costly recomputations.

The methods presented by [10] do not consider symmetries alongside the signals paths of a given circuit. Such symmetries pose additional matching conditions on the power-down

## 1. Introduction

circuitry which are now incorporated into the new power-down verification and synthesis methods.

Furthermore, the following enhancements of the analog power-down synthesis have been created:

- the rip-up procedure by [10] has been simplified by using methods from the verification process to identify short circuit paths.
- the two constraint programs by [10] have been unified which allows to compute tradeoffs between the two design goals "maximize matching" and "minimize area".

The new hierarchical methods require automatic procedures to extract analog basic building blocks and signal symmetries from the circuit hierarchy in order to be able to verify and synthesize power-down circuitry considering voltage matching. Hence, a new structure recognition method has been developed based on [10] that supports hierarchical circuits following a similar approach as presented in [23]. Fig. 1.8c shows that the new approach flattens the circuit hierarchy partially, i.e., structural information connected to the pins of a subcircuit block is propagated between the hierarchy levels when required for the computations. Partial circuit flattening decouples the hierarchy levels from each other, which enables and allows the parallelization of traditional structure recognition approaches for hierarchical circuits. Furthermore, a symmetry computation method similar to [24] has been implemented and adjusted to support hierarchical circuits. The algorithm starts at the device level and propagates the collected signal flow information between the input and output pins of subcircuit blocks to the next higher hierarchy levels recursively until the top level is reached.

Last but not least, a new method to automatically generate the building block description of a given circuit has been developed. This method overcomes the problems of traditional structure recognition procedures: the scope and the complexity of the provided building block library. The library of the sizing rules method contains building blocks formed by only up to six devices [10]. New and more complicated structures, e.g., operational amplifiers, and their recognition rules must be added manually to the library which is prone to errors and can lead to wrong recognition results. The new procedure automatically partitions a given circuit into its building blocks without requiring a predefined library. Instead, three characteristics, i.e., connectivity, substrate type and tier difference of neighboring devices, are analyzed and used to determine the subblocks of the circuit. The algorithm then generates the recognition rules for the detected elements and furthermore creates new building block libraries from scratch or complements existing ones by new elements.

### 1.4. Thesis Structure and Prior Publications

This work is partitioned into two parts.
The first part focuses on the new power-down verification and synthesis method of hierarchical analog circuits (Chapters 3 and 4). Both methods rely on the hierarchical circuit representation described in Chapter 2.

The hierarchical structure recognition and symmetry computation method complementing the verification and synthesis procedures are described in Appendices A and B. Appendix C provides guidelines on how to manually fix detected errors in the analog power-down mode.

The second part of this work describes the new library-free structure recognition method (Chapter 5).

Each chapter provides experimental results for the respective methods.
Chapter 6 concludes this thesis.
The presented methods, algorithms and ideas have been partially published at conferences or journals. The hierarchical power-down verification method from Chapter 3 was originally published in [25], an extended version can be found in [26]. The unified constraint program and the hierarchical power-down synthesis method from Chapter 4 have been presented in [27] and [28], respectively. The library-free structure recognition method from Chapter 5 has been published at [29].

Additionally, the presented methods, other ideas and previous implementations have been partially realized in the student theses [30; 31; 32; 33; 34; 35; 36].

1. Introduction

## 2. Hierarchical Circuit Representation

Hierarchical designs are formally constructed by a set of circuits

$$
\begin{equation*}
\mathcal{C}=\left\{C_{1}\left(P_{1}, N_{1}, C_{\text {sub }, 1}, t_{1}\right), C_{2}\left(P_{2}, N_{2}, C_{\text {sub }, 2}, t_{2}\right), \ldots, C_{|\mathcal{C}|}\left(P_{|\mathcal{C}|}, N_{|\mathcal{C}|}, C_{\text {sub },|\mathcal{C}|}, t_{|\mathcal{C}|}\right)\right\} . \tag{2.1}
\end{equation*}
$$

A circuit $C_{i}\left(P_{i}, N_{i}, C_{s u b, i}, t_{i}\right)$ consists of

- a set of pins $P_{i}=\left\{p_{i, 1}, p_{i, 2}, \ldots, p_{i,\left|P_{i}\right|}\right\}$,
- a set of nets $N_{i}=\left\{n_{i, 1}, n_{i, 2}, \ldots, n_{i,\left|N_{i}\right|}\right\}$,
- its subcircuits $C_{s u b, i}=\left\{c_{s u b, i, 1}, c_{s u b, i, 2}, \ldots, c_{s u b, i,\left|C_{s u b, i}\right|}\right\}$,
- and is of type $t_{i}$, e.g., $S C S$ or $S C$.

A subcircuit $c_{s u b, i, j} \in C_{s u b, i}$ instantiates another circuit from set $\mathcal{C}$. A net $n_{i, k} \in N_{i}$ connects one or more subcircuits of $C_{\text {sub }, i}$ with each other via their pins. The set of pins $P_{i} \subseteq N_{i}$ is a subset of the circuit's nets that can be connected to external circuitry. Each subcircuit $c_{\text {sub }, i, j}$ has its own set of subcircuits, hence forming a hierarchical circuit recursively.
A device is abstracted as a circuit which does not contain any other subcircuits and its set of pins is identical to its set of nets:

$$
\begin{equation*}
\text { " } C_{i} \in \mathcal{C} \text { is a device" } \Leftrightarrow C_{s u b, i}=\emptyset \wedge P_{i}=N_{i} . \tag{2.2}
\end{equation*}
$$

The following device types $t_{i} \in T_{d}$ are thereby supported:

$$
\begin{equation*}
T_{d}=\{n m o s, p m o s, n p n, p n p, \text { res }, \text { cap }, \text { diode }, i n d\} \tag{2.3}
\end{equation*}
$$

The methods presented in Chapters 3-5 are however not limited to these device types and can be easily extended by additional device types.

Fig. 2.1 shows four different devices types of $T_{d}$ and their corresponding pins. The drain, gate and source pins of the transistors are denoted as $d$, $g$, and $s$, respectively. The pins of a resistor or capacitor are plus $(+)$ and minus ( - ).
The top-level circuit $C_{\text {top }}$ of a hierarchical design is not part of any other circuit of set $\mathcal{C}$. The bottom level of a circuit hierarchy is reached with the device level.


Figure 2.1.: Different device types and their corresponding pins.
2. Hierarchical Circuit Representation

| circuit <br> type $t_{i}$ | circuit <br> pins $P_{i}$ | circuit <br> nets $N_{i}$ | subcircuits <br> $C_{\text {sub }, i}$ | subcircuit <br> type $t_{\text {sub }, i, j}$ |
| :---: | :---: | :---: | :---: | :---: |
| source circuit | in, out, | in, out, | SC1, SC2 | SC |
| series $(S C S)$ | $p w d$ | $p w d, n 1$ |  |  |
| source circuit $(S C)$ | in, out, | in, out, | $N 1, N P 1$ | nmos |
|  | $p w d$ | $p w d$ | $R 1$ | res |
| nmos | $d, g, s$ | $d, g, s$ | $\emptyset$ | - |
| res | ,+- | ,+- | $\emptyset$ | - |

Ground and supply nets not shown.
Table 2.1.: Set of circuits $\mathcal{C}$ used to construct the source circuit series from Fig. 1.7.

Table 2.1 shows the set $\mathcal{C}$ for the source circuit series from Fig. 1.7b. The circuit consists of three pins, four nets and the two source circuits $S C 1$ and $S C 2$. Each of these subcircuits has three pins, three nets and three devices, i.e., two NMOS transistors and one resistor.

## 3. Hierarchical Power-Down Verification

Power-down mode verification checks whether the design goals (A) - (C) from Sec. 1.1 are met for a given circuit. Hence, power-down verification must provide methods

- to detect all (potential) current flows,
- to identify all floating nodes and
- to recognize all (potential) matching violations
of a circuit in power-down mode. A circuit fulfils (A) - (C) if the verification method does not detect any of the errors listed above. In the following, a new power-down verification method for hierarchical designs is presented.


### 3.1. Overview

Fig. 3.1 gives an overview of the new hierarchical power-down verification methodology. It is based on the methods presented in [10]. In comparison to the state-of-the-art methods, the new method does not flatten the circuit hierarchy and additionally verifies that symmetry pairs alongside symmetrical signal paths are not stressed in power-down mode. The newly introduced voltage propagation library $L_{V}$ furthermore enables the reuse of intermediate results.

The method takes a top-level circuit $C_{\text {top }}$, the supply nets $N_{\text {supply }}=N_{v d d} \cup N_{\text {gnd }}$ and the vector $V_{P, \text { top }}$ as inputs. The vector $V_{P, \text { top }}$ defines the initial voltage levels at the pins of $C_{\text {top }}$. The verification report contains all detected current paths $P_{s c}$, floating nets $N_{\text {float }}$ and the matching and symmetry violations $E_{M}, E_{S}$ of the circuit in power-down mode.

In the first step, voltage propagation estimates the net voltages of the circuit in power-down mode (Sec. 3.2). This method starts at the top-level and propagates the voltages on the current level as far as possible before descending into subcircuit blocks. The algorithm traverses the circuit hierarchy in both directions as a change in the voltage level at a subcircuit pin can affect the states of the devices in the underneath or above lying hierarchy levels.

In the second step, a depth-first search algorithm detects all short circuit paths in the circuit based on the previously computed net voltage estimates (Sec. 3.3). This algorithm also traverses the circuit hierarchy in both directions as a current path can descend and ascend to lower or higher hierarchy levels through subcircuit pins.

The voltage levels at the internal nets of a short circuit path cannot be accurately predicted anymore as they would take a value in-between gnd and $v d d$. Hence, the previously computed voltage propagation result is not valid anymore and must be recomputed. In this case, the detected short circuit nodes $N_{s c}$ are initialized with the voltage level $s c$ which indicates that a short circuit path is running other them. The subsequent voltage propagation run might eventually reveal further potential current flows in the circuit based on this new information.

## 3. Hierarchical Power-Down Verification



Figure 3.1.: Overview of the hierarchical power-down verification method.

Hence, voltage propagation and short circuit path computation are repeated until no more new currents paths are detected.

The last step of the verification method identifies matching and symmetry violations at analog basic building blocks and symmetry pairs (Sec. 3.4). The checks traverse the circuit hierarchy recursively from the top to its bottom using the final voltage propagation results to evaluate predefined rule sets for basic building blocks and symmetry pairs.

### 3.2. Voltage Propagation

The behavior of a circuit in power-down mode is determined by the static DC behavior of its devices [9]. Its internal nets are either charged or discharged to $v d d$ or $g n d$ by conducting devices or are high-impedant, i.e., floating float. Hence, the net voltages of a circuit in power-down mode can be estimated by voltage propagation.

The supply or ground voltage can propagate to another net of the circuit via conducting devices. Fig. 3.2 shows the static DC behavior of resistors, capacitors and NMOS transistors. A resistor propagates the ground and supply voltage between its plus and minus pin in both directions. A capacitor is treated as open loop in DC operation and does not propagate any voltage in power-down mode. An NMOS transistor propagates $v d d$ or gnd between its drain and source pins when it is conducting, i.e., when its gate-source voltage is greater than its threshold voltage. In power-down mode, all nets will ideally either take the $v d d$ or $g n d$ voltage level. Hence, NMOS transistors are conducting if their gate pin is exposed to vdd. Similarly, PMOS transistors only propagate voltages between their drain and source pins if


Figure 3.2.: Static DC behavior for different device types.
their gate is on gnd. A diode-connected transistor propagates the supply voltage $v d d$ from its drain to its source pin and the ground voltage gnd in the opposite direction. A complete overview over all supported device types, including bipolar transistors and bulk junction diodes of transistors, can be found in [9].

Algorithm 1 uses the described static DC behavior to estimate the node voltages of a hierarchical circuit in power-down mode. The algorithm takes a circuit $C_{i}\left(P_{i}, N_{i}, C_{s u b, i}, t_{i}\right)$, the voltages at its pins $V_{P_{i}}$, the supply nets $N_{\text {supply }}=N_{v d d} \cup N_{\text {gnd }}$, the short circuit nets $N_{s c}$ and the voltage propagation results library $L_{V}$ as inputs.

The library $L_{V}$ allows to store and fetch intermediate voltage propagation results $V_{N_{i}}$ for a specific circuit type $t_{i}$ with pin voltages $V_{P_{i}}$. The vector

$$
\begin{equation*}
V_{N_{i}}=\left(v_{n_{i, 1}}, v_{n_{i, 2}}, \ldots, v_{n_{i,|N|}}\right) \tag{3.1}
\end{equation*}
$$

with

$$
\begin{equation*}
v_{n_{i, j}} \in\{g n d, v d d, \text { float }, s c\} \tag{3.2}
\end{equation*}
$$

contains the voltage estimates of each net $n_{j} \in N_{i}$ of a circuit of type $t_{i}$ in power-down mode.

The set of short circuit nodes $N_{s c}$ is empty in the initial voltage propagation run.
The algorithm iteratively propagates $g n d$ and $v d d$ to the internal nets of $C_{i}$ via conducting devices according to Fig. 3.2. The voltages are thereby propagated as far as possible on the current hierarchy level before descending into subcircuit blocks, i.e., devices are always prioritized to subcircuit blocks during voltage propagation. This behavior is ensured by the propagation queue $Q_{\text {prop }}=\left(c_{s u b, 1}, c_{s u b, 2}, \ldots, c_{s u b, k}\right)$ which orders the inserted subcircuits such that devices are processed first. Subcircuits are queued up for propagation whenever a voltage level at one of its pins changed from float to $v d d$ or gnd. For devices, an additional device type specific propagation condition must be fulfilled before it is inserted into $Q_{\text {prop }}$ : the device must be conducting and $v d d$ or $g n d$ at one of its pins must be connected to a floating net at one of its other pins via its conducting channel as shown in Fig. 3.2.

```
Algorithm 1 Voltage propagation for hierarchical circuits based on [9]
    procedure VoltagePropagation \(\left(C_{i}\left(P_{i}, N_{i}, C_{\text {sub }, i}, t_{i}\right), V_{P_{i}}, N_{\text {supply }}, N_{s c}, L_{V}\right)\)
        \(Q_{\text {prop }}, V_{N_{i}}=\operatorname{initialize}\left(V_{P, i}, C_{i}, N_{\text {supply }}, N_{\text {sc }}\right)\)
        for all \(c_{\text {sub }} \in Q_{\text {prop }}\) do
            \(V_{P_{s u b}}=\operatorname{pinVoltages}\left(c_{s u b}, V_{N_{i}}\right)\)
            if not hasResult \(\left(c_{s u b}, V_{P_{\text {sub }}}, L_{V}\right)\) then
                if isDevice \(\left(c_{\text {sub }}\right)\) then
                    propagateDevice \(\left(c_{s u b}, V_{P_{\text {sub }}}, L_{V}\right) \quad / /\) According to Fig. 3.2
                else
                    voltagePropagation \(\left(c_{\text {sub }}, V_{P_{s u b}}, N_{\text {supply }}, N_{s c}, L_{V}\right) \quad / /\) Descend hierarchy
                end if
            end if
            \(V_{N_{\text {sub }}}=\) findResult \(\left(c_{\text {sub }}, V_{P_{\text {sub }}}, L_{V}\right)\)
            update \(\left(Q_{\text {prop }}, V_{N_{i}}, V_{N_{\text {sub }}}, C_{i}\right)\)
        end for
        storeResult \(\left(C_{i}, V_{P_{i}}, V_{N_{i}}, L_{V}\right)\)
    end procedure
```

Algorithm 1 initializes the propagation queue $Q_{p r o p}$ and the net voltages $V_{N_{i}}$ in line 2 by invoking the function initialize $\left(V_{P, i}, C_{i}, N_{\text {supply }}, N_{s c}\right)$. This function sets the pin, ground, supply and short circuit nodes from $V_{P, i}, N_{g n d}, N_{v d d}$ and $N_{s c}$ to their corresponding voltage levels. Subcircuits and devices which are connected to these nets and which are eligible for voltage propagation are inserted into $Q_{\text {prop }}$ during that initialization step. The remaining nets of the circuit are assigned to voltage level float.

In the next step, the algorithm iterates over all elements $c_{\text {sub }} \in Q_{\text {prop }}$ (line 3). In each iteration, the pin voltages $V_{P_{s u b}}$ of $c_{\text {sub }}$ are determined in line 4 based on the current state of $V_{N_{i}}$ by the function pinVoltages $\left(c_{s u b}, V_{N_{i}}\right)$. If another subcircuit of the same type and pin voltages as $c_{\text {sub }}$ has already been encountered during voltage propagation, the corresponding intermediate result is reused (lines 5 and 12). Else, the voltages at $c_{s u b}$ must be propagated (lines 6-10). The following two cases are thereby distinguished:

- $c_{s u b}$ is a device: voltages are propagated according to Fig. 3.2.
- $c_{\text {sub }}$ is a subcircuit block: the algorithm descends into the circuit hierarchy by calling itself recursively.

The corresponding propagation result $V_{N_{s u b}}$ is stored in $L_{V}$ in both cases. In line 12, that result is fetched from the library and used to update $Q_{p r o p}$ and $V_{N_{i}}$ of the current hierarchy level. The nets connected to the pins of $c_{\text {sub }}$ are set to the voltage levels according to $V_{N_{\text {sub }}}$ and any subcircuits connected to these nets are inserted into $Q_{\text {prop }}$. The above steps are repeated until $Q_{\text {prop }}$ is empty.

Finally, in line 15, the voltage propagation result $V_{N_{i}}$ for the current hierarchy level is stored in $L_{V}$ under the entry $\left(C_{i}, V_{P_{i}}\right)$.

Tables 3.1 and 3.2 show the voltage propagation results for the source circuit series (SCS) of Fig. 1.7b and its two source circuits $S C 1, S C 2$ from Fig. 1.7a, respectively. The pin voltages of the source circuit series are given as $V_{P_{S C S}}=\left(v_{\text {in }}, v_{o u t}, v_{p w d}\right)=($ float, float, vdd $)$ and its supply voltages as $N_{\text {supply }}=N_{\text {gnd }} \cup N_{v d d}=\{$ ground, supply\}. Algorithm 1 initializes the nets of the top-level circuit accordingly (see Table 3.1, circuit $S C S$, iteration one, initial

| circuit | iteration | state | $v_{\text {in }}$ | $v_{\text {out }}$ | $v_{n 1}$ | $v_{\text {ground }}$ | $v_{\text {supply }}$ | $v_{p w d}$ |
| :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: |
| $S C S$ | 1 | initial | float | float | float | gnd | $v d d$ | $v d d$ |
|  |  | end | gnd | $v d d$ | $v d d$ | gnd | $v d d$ | $v d d$ |
|  | $(2)$ | initial | float | $s c$ | $s c$ | gnd | $v d d$ | $v d d$ |
|  |  | end | gnd | $s c$ | $s c$ | gnd | $v d d$ | $v d d$ |

Table 3.1.: Voltage propagation results for the source circuit series (SCS) of Fig. 1.7b.

| subcircuit | iteration | state | $v_{\text {in }}$ | $v_{\text {out }}$ | $v_{\text {ground }}$ | $v_{\text {supply }}$ | pwd |
| :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: |
| $S C 1$ | $(1)$ | initial | float | float | gnd | $v d d$ | $v d d$ |
|  |  | end | gnd | vdd | gnd | $v d d$ | $v d d$ |
|  | $(2)$ | initial | float | $s c$ | gnd | $v d d$ | $v d d$ |
|  |  | end | gnd | $s c$ | gnd | $v d d$ | $v d d$ |
| $S C 2$ | $(1)$ | initial | $v d d$ | float | gnd | $v d d$ | $v d d$ |
|  |  | end | $v d d$ | $v d d$ | gnd | $v d d$ | $v d d$ |
|  | $(2)$ | initial | $s c$ | $s c$ | gnd | $v d d$ | $v d d$ |
|  |  | end | $s c$ | $s c$ | gnd | $v d d$ | $v d d$ |

Table 3.2.: Voltage propagation results for the source circuits $S C 1$ and $S C 2$ from Fig. 1.7a.
state) and queues up the source circuits SC1 and SC2 for voltage propagation as both subcircuits are connected to the power-down signal pwd, i.e., $Q_{p r o p, S C S}=(S C 1, S C 2)$. In the next step, the algorithm descends into $S C 1$ by recursively calling itself with pin voltages $V_{P, S C 1}=\left(v_{\text {in }}, v_{\text {out }}, v_{p w d}\right)=($ float, float, $v d d)$, which are determined based on the current net voltage estimates of the source circuit series, and the supply nets $N_{\text {supply }}$. The algorithm initializes the net voltages of source circuit SC1 accordingly and sets the ground and supply node to their corresponding values (see Table 3.2, subcircuit SC1, iteration one, initial state). The resistor $R 1$ and transistor $N P 1$ of $S C 1$ are hence conducting in power-down mode and propagate $v d d$ to the output pin out and gnd to the input pin in of $S C 1$, respectively. Afterwards, the algorithm ascends back to the top-level, i.e., to the source circuit series, and updates its net voltages $v_{i n}$ and $v_{n 1}$ to $g n d$ and $v d d$. It then descends into source circuit $S C 2$, initializes its net voltages according to $V_{P, S C 2}=\left(v_{\text {in }}, v_{\text {out }}, v_{p w d}\right)=(v d d$, float, $v d d)$ and $N_{\text {supply }}$. The devices $R 1$ and $N 1$ of $S C 2$ are inserted into the propagation queue $Q_{p r o p, S C 2}$. Both devices are connected to the output pin of $S C 2$ and are eligible for propagation. In this case, resistor $R 1$ has been randomly selected first for propagation and the output pin voltage of $S C 2$, i.e., $v_{o u t}$, is set to $v d d$. Finally, the algorithm ascends back to the schematic of the source circuit series and updates the voltage at its output pin $v_{\text {out }}$ to $v d d$.

### 3.3. Short Circuit Path Computation

Algorithm 2 detects current flow of a circuit in power-down mode based on the net voltages computed by voltage propagation (Sec. 3.2). It performs a depth-first search from the supply to the ground nets of the circuit alongside conducting devices (Algorithm 3). Both algorithms are based on [9] and have been adapted to hierarchical designs.

```
Algorithm 2 Short circuit path computation based on [9]
    procedure ShortCircuitPathComputation \(\left(C_{\text {top }}, N_{v d d}, N_{\text {gnd }}, V_{P_{\text {top }}}, L_{V}\right)\)
        for all \(n \in N_{v d d}\) do
            DepthFirstSearch \(\left(n, C_{\text {top }}, N_{\text {gnd }}, V_{P_{\text {top }}}, L_{V}\right) \quad / /\) Algorithm 3
        end for
    end procedure
```

```
Algorithm 3 Depth-first search based on [9]
    procedure DepthFirstSearch \(\left(n, C_{i}, N_{g n d}, V_{P_{i}}, L_{V}\right)\)
        if \(n \in N_{\text {gnd }}\) then
        return // short circuit path detected.
        end if
        \(V_{N_{i}}=\operatorname{findResult}\left(C_{i}, V_{P_{i}}, L_{V}\right)\)
        if connectedToHigherLevel( \(n\) ) then
            DepthFirstSearch \(\left(n_{\text {super }}, C_{\text {super }}, N_{\text {gnd }}, V_{P_{\text {super }}}, L_{V}\right) \quad / /\) Ascend hierarchy
        end if
        for all \(C_{\text {sub }} \in\) connectedSubCircuitBlocks \((n)\) do
            \(V_{P_{s u b}}=\operatorname{pinVoltages}\left(C_{\text {sub }}, V_{N_{i}}\right)\)
            DepthFirstSearch \(\left(n_{\text {sub }}, C_{\text {sub }}, N_{\text {gnd }}, V_{P_{\text {sub }}}, L_{V}\right) \quad / /\) Descend hierarchy
            end for
        for all \(n_{\text {con }} \in \operatorname{connectedNets(n)}\) do
            DepthFirstSearch \(\left(n_{\text {con }}, C_{i}, V_{P_{i}}, L_{V}\right) \quad / /\) Continue on this level
            end for
    end procedure
```

The depth-first search algorithm first checks whether its termination criterion is fulfilled, i.e., whether the given net $n$ is a ground net (lines 2-4). In this case, a short circuit path has been found and the algorithm returns.

The search continues in all other cases.
In line 5 , the algorithm loads the net voltages $V_{N, i}$ for the currently investigated circuit $C_{i}$ with input voltages $V_{P, i}$ from the voltage propagation results library $L_{V}$ by calling function findResult $\left(C_{i}, V_{P_{i}}, L_{V}\right)$. These net voltages are used to determine the conducting or nonconducting state of the devices on the current hierarchy level and the pin voltages of lowerlevel circuit blocks before descending the circuit hierarchy.

The algorithm thereby checks with function connectedToHigherLevel $(n)$ whether the given net $n$ is connected to external circuitry, i.e., whether $n$ is a pin of the current circuit, and ascends the circuit hierarchy when required by calling itself recursively (lines 6-8).

The depth-first search furthermore has to determine whether it has to descend into the circuit hierarchy (lines 9-12). Thereby, function connectedSubCircuitBlocks( $n$ ) returns all subcircuits of $C_{i}$ connected to net $n$. The depth-first search algorithm then calls itself recursively for every subcircuit block connected to $n$ with $n_{s u b}, C_{s u b}$ and $V_{P_{s u b}}$ as input arguments. The net $n_{\text {sub }}$ denotes the net, i.e., pin, of $C_{s u b}$ which is connected to net $n$ of the current hierarchy level. The function pinVoltages $\left(C_{s u b}, V_{N_{i}}\right)$ determines the subcircuit's pin voltages $V_{P_{\text {sub }}}$ based on the current net voltages $V_{N_{i}}$ of the current hierarchy level (line 10).

Lastly, the search continues on the current hierarchy level in lines 13-15 by calling itself recursively for all nets $n_{\text {con }}$ connected to $n$ by conducting devices. This set of nets is thereby computed by function connectedNets $(n)$ which determines the conducting or non-conducting state of each device connected to $n$ based on $V_{N_{i}}$.

During the computations, the algorithm must track the hierarchy levels it has traversed in order to be able to ascend it again, as a circuit $C_{i}$ does not know where in the hierarchy it has been instantiated by another circuit. This part of the algorithm is not shown, but the hierarchy path is tracked by a vector which stores tuples of the visited subcircuits and their corresponding pin voltages $\left(C_{\text {super }}, V_{P_{\text {super }}}\right)$. Whenever the depth-first search algorithm descends into a subcircuit, the corresponding tuple is appended to that vector and whenever the algorithm ascends, the last visited subcircuit is restored from that vector thus allowing the algorithm to traverse the hierarchy in both directions.

A detected short-circuit path can be classified into one of the following three categories [10]:

- definite: all devices on the detected path are conducting. The path cannot be switched off by the gate shut-off pattern and a rip-up required as shown in Fig. 1.6.
- potential: at least one device on the detected path is only potentially conducting, e.g., the gate of a transistor on that path is floating. Such a path can be switched off by the gate shut-off pattern by inserting a power-down transistor which forces the corresponding device into the non-conducting state.
- induced: the detected path is potentially conducting due to another definite, potential or induced short-circuit path in power-down mode, i.e., the state of at least one of its devices is controlled by a short-circuit node. Such a path can be typically switched off by switching off the short-circuit path causing it.


## 3. Hierarchical Power-Down Verification

The source circuit series from Fig. 1.7b contains one supply net supply which is connected to the source circuits $S C 1$ and $S C 2$. Short circuit path computation loads the voltage propagation result from the first iteration for $S C 1$ from Table 3.2 and descends into it. The supply net is connected to the output pin out of $S C 1$ via the resistor $R 1$. The output pin of SC1 is connected to net $n 1$ of the source circuit series which again is connected to the input pin of $S C 2$. Hence, the algorithm consecutively first ascends back to the top-level and then descends down into $S C 2$. The input pin in of $S C 2$ is connected to the ground node via power-down transistor NP1. This transistor is conducting in power-down mode according to Table 3.2 and hence a short circuit path, i.e., $P 1=(S C 1 / R 1, S C 2 / R 2)$, has been found. The net $n 1$ of the source circuit series and the nets $S C 1 / o u t$ and $S C 2 /$ in lie on that short circuit path and are inserted into the set of short circuit nodes $N_{s c}$.
Short circuit path computation also descends into $S C 2$ via the supply net of the source circuit series. This net is connected to the ground node by the devices $R 1$ and $N 1$ inside of $S C 2$ which are both conducting in power-down mode according to the voltage propagation results from Table 3.2, iteration one. Hence, a second current flow $P 2=(S C 2 / R 1, S C 2 / N 1)$ has been found and the output pins of $S C 2$ and the source circuit series are inserted into $N_{s c}$ as these nets lie on $P 2$.

Short circuit path computation terminates afterwards and voltage propagation is repeated with $N_{s c}=\{n 1$, out, $S C 1 /$ out, $S C 2 /$ in, $S C 2 /$ out $\}$. The corresponding voltage propagation result is shown by Tables 3.1 and 3.2 , iteration two. The current path $P 2$ is now classified as induced current flow as the state of transistor $N 1$ inside of $S C 2$ cannot be accurately predicted anymore as the input pin of $S C 2$ is a short circuit node. No new current flow is detected in the second iteration of the short circuit path computation and voltage propagation must hence not be repeated anymore.

### 3.4. Matching and Symmetry Assertion

Fig. 3.3 gives an overview of the matching and symmetry assertion method for hierarchical circuits in power-down mode. The method is based on [13] and has been extended to support hierarchical circuits.

It takes the top-level circuit $C_{\text {top }}$, the pin voltages $V_{P_{\text {top }}}$, the power-down pins $P_{p w d}$, the voltage propagation library $L_{V}$ and the supply nets $N_{\text {supply }}$ of the circuit as inputs.
Structure recognition (App. A) and symmetry computation (App. B) are performed to identify the basic building blocks and symmetry pairs of the given circuit. The power-down circuitry has to be removed from the circuit in a preprocessing step as it might interfere with the above-mentioned methods. Finally, the identified building blocks and symmetry pairs are asserted by evaluating specific block type and symmetry pair matching rules against voltage mismatch. The detected matching and symmetry violations, $E_{M}$ and $E_{S}$, are reported back to the user.

### 3.4.1. Power-up Transformation

Power-up transformation removes the power-down circuitry from a given circuit. It thereby tracks the power-down signals pwd and $\overline{p w d}$ through electrostatic discharge (ESD) protection


Figure 3.3.: Overview of the symmetry assertion procedure [13].
circuitry, level shifters and inverters of the integrated circuit until it reaches the gates of the power-down transistors inside the analog subcircuit blocks of a hierarchical design. The identified power-down transistors are then removed from the circuit as follows: transistors which are conducting in power-down mode pull an internal net of the circuit to $v d d$ or $g n d$ and have been inserted in parallel into the circuit. Hence, these transistors can simply be removed. Power-down transistors which are off in power-down mode have been inserted in series into the circuit to disrupt a current path in power-down mode. Hence, the drain and source nets connected to these transistors must be shorted after removing them. The state of the power-down transistors is determined based on the net voltage estimates stored in the voltage propagation results library $L_{V}$. Finally, the subcircuit blocks of the powerdown signal path, e.g. inverters, are removed from the hierarchical design. The power-up transformation above has first been described in [13] and has been extended to support hierarchical designs.

Fig. 3.4a shows a differential stage with its power-down circuitry highlighted in blue. The voltage propagation result for this circuit is annotated with italic letters next to the net names. No short circuit path runs through the circuit.

Fig. 3.4b shows the differential stage after power-up transformation. The gate shut-off transistors $N P 1, P P 2$ and $P P 3$ have been removed from the circuit. The nets $n 1$ and nrip 1 have been shorted after removal of diode rip-up transistor $P P 1$. The removal of $P P 1$ allows the identification of the simple current mirror $\operatorname{scm}(P 1, P 2)$ by structure recognition which furthermore enables symmetry computation to detect the symmetry pairs $(N 1, N 2)$ and $(P 1, P 2)$ alongside the signal paths from the differential input $(i n, i p)$ to the single ended output out of the circuit.


Figure 3.4.: Differential stage before (a) and after (b) power-down up transformation.

### 3.4.2. Assertion

Fig. 3.5a shows the matching rules for two analog basic building blocks, i.e., a differential pair and a simple current mirror. The voltages at the gate and drain pins of the differential pair should be pairwise matched in order to avoid asymmetric stress in power-down mode. Similarly, the voltages at the drain pins of a simple current mirror should be matched. Matching rules for other analog basic building blocks can be found in [10]. The work by [10] additionally introduced several levels of strictness for each rule set, e.g., a strict rule set for differential pairs would enforce the same voltage level at all of its pins to prevent any electrical field at its devices. In the following, the presented assertion methods are only described for matched pin pairs as stricter rule sets can be similarly evaluated.

The matching rules for symmetry pairs in the signal paths of a circuit are shown in Fig. 3.5b. Same pin types should be exposed to the same voltage level to avoid asymmetric stress in power-down mode. Similarly to analog basic building blocks, strict rule sets for symmetry pairs could enforce the same voltage level at all of its pins.

Furthermore, when symmetrical signals paths inside a subcircuit block have been detected, their corresponding differential input and output pins should be matched as shown in Fig. 3.5c.

Building block and symmetry information at (sub)circuit pins are propagated in-between circuit hierarchy levels by structure recognition and symmetry computation as this information is important to reduce the solution space of the power-down synthesis constraint optimization problem (Sec. 4.3).

The evaluation of a matching rule, i.e., the comparison of the voltage levels at a pin pair, has three different outcomes:

- fulfilled $(\checkmark)$ : both pins are exposed to the same defined voltage level (vdd or gnd). Any additional requirement, e.g., a specific voltage level at those pins, is fulfilled as well.
- violated $(\boldsymbol{X})$ : both pins are exposed to different voltage levels and hence stressed. The voltage level $s c$ is also treated as an error as there is a short circuit path in the circuit.


Figure 3.5.: Matching rules for: (a) analog basic building blocks [10], (b) symmetry pairs in signal paths and (c) symmetrical pin pairs of subcircuit blocks due to underlying signal path symmetries.

- warning ( $\widehat{\wedge}$ ): at least one of the pins is floating, i.e., it cannot be guaranteed that the matching condition is fulfilled.

Algorithm 4 shows the outline of the matching and symmetry assertion method. It takes a circuit $C_{i}\left(P_{i}, N_{i}, C_{s u b, i}, t_{i}\right)$, its pin voltages $V_{P_{i}}$ and the voltage propagation results library $L_{V}$ as inputs. The libraries $L_{\text {struct }}$ and $L_{\text {sym }}$ contain all analog basic building blocks and symmetry pairs detected by structure recognition and symmetry computation (App. A and App. B). These correspond to the subcircuits and devices whose matching conditions have to be evaluated. The library $L_{\text {error }}$ stores the detected rule violations and is filled in during the execution of the algorithm.

In lines 2-7, the algorithm first checks the matching rules for the identified building blocks and symmetry pairs on the current hierarchy level. It fetches the corresponding voltage propagation, structure recognition and symmetry computation results from the respective libraries by using different versions of the implemented findResult() function (lines 2 4). The evaluation results, i.e., the sets of matching and symmetry violations $E_{M}$ and $E_{S}$ computed by checkBuildingBlocks $\left(C_{B}, V_{N_{i}}\right)$ and checkSymmetryPairs $\left(C_{S P}, V_{N_{i}}\right)$, are then stored under the entry $\left(C_{i}, V_{P_{i}}\right)$ in $L_{\text {error }}$ in lines 5-7.

In lines 8-15, the algorithm descends into the circuit hierarchy recursively and performs the symmetry checks for all subcircuit blocks $c_{s u b}$ of $C_{i}$ if necessary. It first determines the corresponding pin voltages $V_{P_{s u b}}$ of $c_{s u b}$ based on the net voltage estimates $V_{N_{i}}$ (line 10). Function hasResult $\left(c_{\text {sub }}, V_{P_{\text {sub }}}, L_{\text {error }}\right)$ checks, whether intermediate results for subcircuit $c_{\text {sub }}$ with pin voltages $V_{P_{s u b}}$ already exist in library $L_{\text {error }}$ (line 11). If yes, that intermediate results is reused; if not, the matching and symmetry assertion algorithm calls itself recursively with the just determined pin voltage information (line 12).
Table 3.3 shows the matching and symmetry assertion results for the differential stage from Fig. 3.4a. A warning is issued for simple current mirror $\operatorname{scm}(P 1, P 2)$ as its output pin, i.e., net out, is floating. Another warning is issued for the differential pair $d p(N 1, N 2)$ as one of its output pins, i.e., net out is floating. Furthermore, the voltages at the input pins of

```
Algorithm 4 Matching and symmetry assertion for hierarchical analog circuits
    procedure matchingAsSertion \(\left(C_{i}\left(P_{i}, N_{i}, C_{s u b, i}, t_{i}\right), V_{P_{i}}\right)\)
        // Additional inputs:
            - \(L_{V}\) : voltage propagation results library
            - \(L_{\text {struct }}\) : structure recognition results library
            - \(L_{\text {sym }}\) : symmetry computation results library
            - \(L_{\text {error }}\) : matching and symmetry violations results library
        // Perform checks for current hierarchy level
        \(V_{N_{i}}=\) findResult \(\left(C_{i}, V_{P_{i}}, L_{V}\right)\)
        \(C_{B}=\operatorname{findResult}\left(C_{i}, L_{\text {struct }}\right)\)
        \(C_{S P}=\operatorname{findResult}\left(C_{i}, L_{\text {sym }}\right)\)
        \(E_{M}=\) checkBuildingBlocks \(\left(C_{B}, V_{N_{i}}\right) \quad / /\) See Fig. 3.5a
        \(E_{S}=\operatorname{checkSymmetryPairs}\left(C_{S P}, V_{N_{i}}\right) \quad / /\) See Fig. 3.5b and Fig. 3.5c
        \(\operatorname{store}\left(C_{i}, V_{P_{i}}, E_{M}, E_{S}, L_{\text {error }}\right)\)
        // Descend into the circuit hierarchy when necessary
        for all \(c_{s u b} \in C_{\text {sub }, i}\) do
            if \(i s S u b C i r c u i t B l o c k\left(c_{s u b}\right)\) then
                    \(V_{P_{s u b}}=\operatorname{pinVoltages}\left(c_{s u b}, V_{N_{i}}\right)\)
                    if not hasResult \(\left(c_{\text {sub }}, V_{P_{s u b}}, L_{\text {error }}\right)\) then
                        matchingAssertion \(\left(c_{s u b}, V_{P_{s u b}}\right)\)
            end if
            end if
        end for
    end procedure
```

| building-blocks// | Evaluated |  | Result |  |
| :---: | :---: | :---: | :---: | :---: |
| symmetry pairs | pins | nets |  |  |
| $\operatorname{scm}(P 1, P 2)$ | (in,out) | ( $n 2$, out) | (gnd, float) | ¢ |
| $d p(N 1, N 2)$ | (in1, in2) | (in, ip) | ( gnd, vdd) | $x$ |
| $d p(N 1, N 2)$ | (out1, out2) | ( $n 2$, out) | (gnd, float) | ¢ |
| $(P 1, P 2)$ | $(d, d)$ | ( $n 2$, out) | (gnd, float) | $\triangle$ |
|  | $(g, g)$ | (nrip $1, n r i p 1)$ | (vdd, vdd) | $\checkmark$ |
|  | $(s, s)$ | (supply, supply) | (vdd, vdd) | $\checkmark$ |
| (N2, N3) | $(d, d)$ | ( $n 2$, out) | (gnd, float) | $\triangle$ |
|  | $(g, g)$ | (in, ip) | (gnd, vdd) | $x$ |
|  | $(s, s)$ | $(n 1, n 1)$ | (gnd, gnd) | $\checkmark$ |

evaluation result: fail $(\boldsymbol{X})$, pass $(\checkmark)$, warning $(\widehat{\wedge})$
Table 3.3.: Matching and symmetry assertion results for the differential stage from Fig. 3.4a.


Figure 3.6.: Bias circuitry (a) and core circuit (b) of a dual gain amplifier (DGA) with its power-down circuitry highlighted in blue. The subcircuits FOCA1 and FOCA2 are each implemented as the folded cascode amplifier from Fig. 3.7.

| iteration | bias circuitry |  |  |  | core circuit |  |  |  |
| :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: |
|  | $v_{n X 1}$ | $v_{n X 2}$ | $v_{n 6}$ | $v_{n 8}$ | $v_{i n}$ | $v_{o u t}$ | $v_{n 1}$ | $v_{n 2}$ |
| (1) | $v d d$ | gnd | gnd | gnd | $v d d$ | $g n d$ | $g n d$ | $g n d$ |
| $(2)$ | $v d d$ | $g n d$ | $g n d$ | $g n d$ | $v d d$ | $s c$ | $s c$ | $s c$ |

nets $\overline{p w d}$, ground always on gnd; nets pwd, supply always on $v d d$;
$X=3,4,5,7$ in $v_{n X 1}$ and $v_{n X 2}$
Table 3.4.: Voltage propagation results for the dual gain amplifier from Fig. 3.6.
$d p(N 1, N 2)$ differ from each other and hence a matching violation is issued. Two warnings, three fulfilled and one violated matching constraint are reported for the symmetry pairs $(P 1, P 2)$ and $(N 2, N 3)$. In summary, only $33 \%$ of all matching and symmetry constraints are fulfilled for the differential stage in power-down mode.

### 3.5. Experimental Results

### 3.5.1. Dual Gain Amplifier

Fig. 3.6 shows a dual gain amplifier (DGA). Its bias circuit consists of four current mirrors which provide the bias voltages to the core part of the circuit. The core circuit consists of two parallelly connected folded cascode amplifiers, i.e., FOCA1 and FOCA2, which doubles the output drive of the circuit [37]. The schematic of the corresponding folded cascode amplifier is shown in Fig. 3.7. The power-down circuitry of both amplifiers is highlighted in blue in both figures.


Figure 3.7.: Schematic of a folded cascode amplifier (FOCA) which implements subcircuits $F O C A 1$ and $F O C A 2$ of the dual gain amplifier from Fig. 3.6. Its power-down circuitry is highlighted in blue.

| iteration | $v_{i n}$ | $v_{i p}$ | $v_{o u t}$ | $v_{b 1}$ | $v_{b 2}$ | $v_{n 1}$ | $v_{n 2}$ | $v_{n 3}$ | $v_{n 4}$ | $v_{n 5}$ | $v_{6}$ |
| :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: |
| $(1)$ | $v d d$ | $v d d$ | gnd | gnd | gnd | $v d d$ | $v d d$ | $v d d$ | $v d d$ | gnd | gnd |
| $(2)$ | $s c$ | $v d d$ | $s c$ | gnd | gnd | float | $v d d$ | float | $v d d$ | gnd | gnd | $\overline{p w d}$, ground always on $g n d ; p w d$, supply always on $v d d$

Table 3.5.: Voltage propagation results for the folded cascode amplifiers from Fig. 3.7. The results are valid for $F O C A 1$ and $F O C A 2$ of the dual gain amplifier from Fig. 3.6 as these subcircuit are exposed to the same pin voltages.

The power-down verification method from the previous sections has been applied to the dual gain amplifier to detect any errors in its power-down circuitry. Table 3.4 shows the voltage propagation results for the bias and the core circuit of the dual gain amplifier. The pin voltages were given as

$$
\begin{equation*}
V_{P_{D G A}}=\left(v_{i n}, v_{o u t}, v_{p w d}, v_{\overline{p w d}}\right)=(\text { float }, \text { float }, v d d, \text { gnd }) \tag{3.3}
\end{equation*}
$$

and the supply nets as $N_{v d d}=\{$ supply $\}$ and $N_{\text {gnd }}=\{$ ground $\}$. Voltage propagation had to be executed twice, as current paths were detected in the circuit based on the results of the first iteration. The voltage levels of the bias circuitry do not change in the second iteration as nets $n 31, n 41, n 51, n 71$ are tied to $v d d$ by the resistors $R 3-R 6$, nets $n 6, n 8$ to gnd by the diode-connected transistors $N 4, N 6$ and nets $n 32, n 42, n 52, n 72$ to $g n d$ by the power-down transistors NP1, NP2 inside the subcircuits FOCA1 and FOCA2.

In the core circuit, the voltages levels of nets out, $n 1$ and $n 2$ change to $s c$ in the second voltage propagation iteration as short-circuit paths run over them.

Table 3.5 shows the voltage propagation result for the folded cascode amplifiers FOCA1 and $F O C A 2$. The results for both subcircuits are identical as they exhibit the same pin voltages. Hence, the voltage propagation results computed for one of the two folded cascode amplifiers has been reused for the other one which reduces the computational effort of voltage propagation.

Short circuit path computation detects, based on the voltage propagation results of the first iteration, four current paths in the dual gain amplifier in power-down mode:

- $P 1=(F O C A 1 / P P 1, F O C A 1 / N P 3)$
- $P 2=(F O C A 2 / P P 1, F O C A 2 / N P 3)$
- $P 3=(F O C A 1 / P P 1, R 1, R 2, F O C A 2 / N P 3)$
- $P 4=(F O C A 2 / P P 1, R 2, R 1, F O C A 1 / N P 3)$

All of them follow a similar pattern: they origin at power-down transistor PP1 inside FOCA1 or FOCA2, switch to the top-level via their input pin in to net $n 1$ or $n 2$ of the dual gain amplifier, descend again into one of the folded cascode amplifiers via their output pins (eventually first running over $R 1$ and $R 2$ of the dual gain amplifier) and terminate at the ground node after passing NP3.

The internal paths of these short circuit paths are:

$$
\begin{equation*}
N_{s c}=\{n 1, n 4, \text { out }, F O C A<1,2>/ \text { in }, F O C A<1,2>/ \text { out }\} . \tag{3.4}
\end{equation*}
$$

Voltage propagation has been repeated with this additional information; the corresponding results are shown in Tables 3.4 and 3.5, iteration two. No more new current paths have been detected based on these results and hence power-down verification continues with the next step, i.e., matching and symmetry assertion.

Fig. 3.8 shows the dual gain amplifier after power-up transformation and its matching and symmetry conditions identified by structure recognition (App. A) and symmetry computation (App. B). The power-down transistors NP1-NP4 in the bias part have been removed and the nets $n X 1, n X 2$, with $X=3,4,5,7$, have been shorted to $n X^{*}$ to restore the diodeconnection of transistors $N 1, N 2, N 3$ and $N 5$.

nets $\mathrm{nX} 1, \mathrm{nX} 2$ shortened to nX * after removal of NP1-NP4 ( $\mathrm{X}=1,3,5,7$ )
(a) bias circuitry
(b) core circuit

Figure 3.8.: Symmetry and matching constraints of the dual gain amplifier from Fig. 3.6.


Figure 3.9.: Symmetry and matching constraints for $F O C A 1$ and $F O C A 2$ from Fig. 3.7.

The dual gain amplifier itself does not contain any analog basic buildings blocks. However, the transistors $N 1, N 2, N 3$ and $N 5$ of the bias circuit are connected to the transistors $N 1, N 6$, and $N 7$ of $F O C A 1$ and $F O C A 2$ such that they form two simple current mirrors, i.e., $\operatorname{scm}(N 1, F O C A 1 / N 1), \operatorname{scm}(N 2, F O C A 2 / N 1)$, and four level shifters, i.e., $l s(N 3, F O C A 1 / N<6,7>), l s(N 5, F O C A 2 / N<6,7>)$, in total. Furthermore, the voltages at the pins of FOCA1, FOCA2 and the resistors $R 1, R 2$ must be matched in power-down mode.

Fig. 3.9 shows the folded cascode amplifier after power-up transformation and its identified building blocks and symmetry pairs. All power-down transistors can simply be removed from the circuit as none of them disables a diode-configuration or has been inserted in series into a current path of the circuit. The folded cascode amplifier consists of one differential pair $d p(N 2, N 3)$, one cascode current mirror $c c m(P 1-P 4)$ and one wide-swing cascode current mirror $w s c c m(N 4-N 7)$. These basic building blocks also define the symmetry constraints of the folded cascode amplifier.

Furthermore, the information about the simple current mirrors and levels shifters at the bias pins $b 1$ and $b 2$ of $F O C A 1, F O C A 2$ due to external connectivity with the dual gain amplifier has been shared between these circuit hierarchy levels to keep their structural and symmetry information synchronized.

Table 3.6 summarizes the findings of the symmetry and matching assertion for the dual gain amplifier including its folded cascode amplifiers. Overall, $75 \%$ of the matching rules at the analog basic building blocks and about $40 \%$ of rules for the symmetry pairs of the circuit are violated.

Table 3.7 shows the matching assertion results for the dual gain amplifier in detail. Each row corresponds to one matching constraint which has to be evaluated. The first column denotes the building block which generated the matching constraint; the second column the pins of the building block whose voltage levels have to be compared; the third column the nets connected these pins and the fourth column their corresponding voltage levels computed by voltage propagation. The last column denotes the result of the assertion.

| circuit | fulfilled |  | fulfilled |  |
| :--- | :---: | :---: | :---: | :---: |
|  | matching constraints |  | symmetry constraints |  |
|  | total | $\%$ | total | $\%$ |
| dual gain amplifier (DGA) | $6 / 10$ | $60 \%$ | $16 / 22$ | $72.3 \%$ |
| folded cascode amplifier (FOCA) | $4 / 11$ | $36.4 \%$ | $8 / 15$ | $53.3 \%$ |
| overall (DGA, FOCA1, FOCA2) | $8 / 32$ | $25.0 \%$ | $32 / 52$ | $61.5 \%$ |

Table 3.6.: Summary of the matching and symmetry assertion results for the dual gain amplifier (DGA) and its folded cascode amplifier subcircuits FOCA1 and FOCA2 from Fig. 3.6 and 3.7.

| building-blocks | Evaluated |  | Result |  |
| :---: | :---: | :---: | :---: | :---: |
|  | pins | nets |  |  |
| $\operatorname{scm}(N 1, F O C A 1 / N 1)$ | (in,out) | ( $n 31, F O C A 1 / n 1$ ) | (vdd, float) | $\triangle$ |
| scm(N2,FOCA2/N1) | (in,out) | ( $n 41$, FOCA2/n1) | (vdd, float) | $\triangle$ |
| $l s(N 3, F O C A 1 / N 6)$ | (in,out) | ( $n 51, F O C A 1 / n 4$ ) | (vdd, vdd) | $\checkmark$ |
|  | $(s 1, s 2)$ | $(n 6, F O C A 1 / n 5)$ | (gnd, gnd) | $\checkmark$ |
| $l s(N 3, F O C A 1 / N 7)$ | (in, out) | ( $n 51$, FOCA1/out) | (vdd, sc) | $x$ |
|  | $(s 1, s 2)$ | ( $n 6, F O C A 1 / n 6$ ) | (gnd, gnd) | $\checkmark$ |
| $l s(N 5, F O C A 2 / N 6)$ | (in,out) | ( $n 71, F O C A 2 / n 4$ ) | (vdd, vdd) | $\checkmark$ |
|  | $(s 1, s 2)$ | $(n 8, F O C A 2 / n 5)$ | ( $g n d$, gnd) | $\checkmark$ |
| $l s(N 5, F O C A 2 / N 7)$ | (in,out) | ( $n 71, F O C A 1 / o u t)$ | (vdd, sc) | $x$ |
|  | $(s 1, s 2)$ | ( $n 8$, FOCA1/n6) | (gnd, gnd) | $\checkmark$ |

evaluation result: fail $(\boldsymbol{X})$, pass $(\checkmark)$, warning $(\bigwedge)$
Table 3.7.: Matching assertion results for the dual gain amplifier from Fig. 3.6.

| subcircuits | Evaluated |  | Result |  |
| :---: | :---: | :---: | :---: | :---: |
|  | pins | nets |  |  |
| R1, R2 | + | $(n 1, n 2)$ | ( $s c, s c$ ) | $x$ |
|  | - | (out, out) | ( $s c, s c$ ) | $x$ |
| R3, R4 | + | (supply, supply) | (vdd, vdd) | $\checkmark$ |
|  | - | $(n 31, n 41)$ | (vdd, vdd) | $\checkmark$ |
| R5, R6 | + | (supply, supply) | ( $v d d, v d d$ ) | $\checkmark$ |
|  | - | $(n 51, n 71)$ | (vdd, vdd) | $\checkmark$ |
| N1, N2 | $d$ | $(n 31, n 41)$ | (vdd, vdd) | $\checkmark$ |
|  | $g$ | ( $n 32, n 42$ ) | (gnd, gnd) | $\checkmark$ |
|  | $s$ | (ground, ground) | ( $g n d$, gnd) | $\checkmark$ |
| N3, N5 | $d$ | $(n 51, n 71)$ | (vdd, vdd) | $\checkmark$ |
|  | $g$ | ( $n 52, n 72$ ) | (gnd, gnd) | $\checkmark$ |
|  | $s$ | $(n 6, n 8)$ | (gnd, gnd) | $\checkmark$ |
| N4, N6 | $d$ | $(n 6, n 8)$ | (gnd, gnd) | $\checkmark$ |
|  | $g$ | $(n 6, n 8)$ | (gnd, gnd) | $\checkmark$ |
|  | $s$ | (ground, ground) | ( $g n d, g n d$ ) | $\checkmark$ |
| FOCA1, FOCA2 | in | $(n 1, n 2)$ | ( $s c, s c$ ) | $x$ |
|  | $i p$ | (in, in) | (vdd, vdd) | $\checkmark$ |
|  | out | $(n 1, n 2)$ | ( $s c, s c$ ) | $x$ |
|  | $b 1$ | ( $n 32, n 42$ ) | ( $g n d, g n d$ ) | $\checkmark$ |
|  | $b 2$ | $(n 52, n 72)$ | (gnd, gnd) | $\checkmark$ |
| FOCA1 | (in, ip) | ( $n 1, i n$ ) | ( sc, vdd) | $x$ |
| FOCA2 | (in, ip) | ( $n 2, i n$ ) | ( $s c, v d d$ ) | $x$ |

evaluation result: fail $(\boldsymbol{X})$, pass $(\checkmark)$, warning ( $\widehat{\wedge})$
Table 3.8.: Symmetry assertion results for the dual gain amplifier from Fig. 3.6.

Two warnings are issued for the dual gain amplifier as the output pins of the simple current mirrors $\operatorname{scm}(N 1, F O C A 1 / N 1)$ and $\operatorname{scm}(N 2, F O C A 2 / N 1)$ are floating. Furthermore, two errors are reported as the output pins of the level shifters $l s(N 5, F O C A 2 / N 6)$ and $l s(N 5, F O C A 2 / N 7)$ are on a short circuit node. Please note that the devices of these current mirrors and level shifters lie on different hierarchy levels. Matching and symmetry violations at building blocks split over several hierarchy levels are hard to spot manually as such errors are concealed in-between the different levels. The new verification methodology is able to detect such errors automatically.

Table 3.8 shows the detailed evaluation results of the symmetry rules for the dual gain amplifier. The first column lists the checked symmetry pairs, the second column their evaluated subcircuit pins, the third column the nets connected to the respective pins, the fourth column their corresponding voltage levels and the last column the evaluation result of the symmetry assertion. Six symmetry constraints of the dual gain amplifier are violated in power-down mode. Each violation is caused by one of the detected short circuit paths. The rule violations

| building-blocks | Evaluated |  | Result |  |
| :---: | :---: | :---: | :---: | :---: |
|  | pins | nets |  |  |
| $\operatorname{scm}($ Top $/ N<1,2>, N 1)$ | (in, out) | (Top/n<31, 41>, $n 1$ ) | (vdd, float) | $\triangle$ |
| $l s(T o p / N<1,2>, N 6)$ | (in, out) | (Top/n<51, 71>, $n 4$ ) | (vdd, vdd) | $\checkmark$ |
|  | $(s 1, s 2)$ | (Top/n<6, 8>, $n 5$ ) | (gnd, gnd) | $\checkmark$ |
| $l s(T o p / N<1,2>, N 7)$ | (in, out) | (Top/n<51, 71>, out) | (vdd, sc) | $x$ |
|  | $(s 1, s 2)$ | (Top/n<6, $8>, n 6$ ) | $(g n d, g n d)$ | $\checkmark$ |
| $d p(N 2, N 3)$ | (in1, in2) | (in, ip) | ( $s c, v d d$ ) | $x$ |
|  | (out1, out2) | $(n 2, n 3)$ | (vdd, float) | $\triangle$ |
| $w \operatorname{sccm}(N 4-N 7)$ | (in, out) | ( $n 4$, out) | (vdd, sc) | $x$ |
|  | (inner 1, inner 2 ) | $(n 5, n 6)$ | (gnd, gnd) | $\checkmark$ |
| $\operatorname{ccm}(P 1-P 4)$ | (in,out) | ( $n 4$, out) | (vdd, sc) | $x$ |
|  | (inner 1, inner 2 ) | $(n 2, n 3)$ | (vdd, float) | 4 |

Table 3.9.: Matching assertion results for FOCA1 and FOCA2 from Fig. 3.7.
at the differential inputs (in, ip) of FOCA1 and FOCA2 at the bottom of the table indicate that there are further matching and symmetry problems in the underlying folded cascode amplifiers.

These problems are revealed by Tables 3.9 and 3.10 . Table 3.9 shows that only one out of six matching constraints generated by its basic analog building blocks is fulfilled. Three warnings due to floating nodes indicating possible asymmetrical stress and four errors due to short-circuit nodes in the circuit have been issued.

Table 3.10 reports four potentially violated, four violated and eight fulfilled symmetry constraints for each of the folded cascode amplifier in power-down mode.

### 3.5.2. High Input Impedance Differential Amplifier

Fig. 3.10a shows a high input impedance differential amplifier (HIIDA) [38]. Its subcircuits MILLER1, MILLER2 and MILLER3 are implemented as Miller operational transconductance amplifiers (OTAs) whose topology is shown in Fig. 3.10b. The power-down circuitry of these circuits is highlighted in blue.
Power-down verification has been executed for this hierarchical circuit with input voltages $V_{P, H I I D A}=($ in, ip,out, $p w d)=($ float, float, float, $v d d)$. The supply nets are given as $N_{\text {vdd }}=\{$ supply $\}$ and $N_{\text {gnd }}=\{$ ground $\}$.
Voltage propagation and short circuit path computation identify four current paths in the circuit:

- $P_{1}=(M I L L E R 1 / P P 2, R 2, R 3, R 5, R 7)$
- $P_{2}=(M I L L E R 2 / P P 2, R 3, R 5, R 7)$
- $P_{3}=(M I L L E R 3 / P P 2, R 4, R 1, R 2, R 3, R 5, R 7)$

evaluation result: fail $(\boldsymbol{X})$, pass $(\checkmark)$, warning ( $\bigwedge$ )
Table 3.10.: Symmetry assertion results for $F O C A 1$ and $F O C A 2$ from Fig. 3.7.


Figure 3.10.: Schematic of a high input impedance differential amplifier (HIIDA) [38]. The Miller operational transconductance amplifier (OTA) implements its subcircuits MILLER1, MILLER2 and MILLER3 (b). Their power-down circuitry is highlighted in blue.

| top-level (HIIDA) |  | MILLER<1,2> |  | MILLER3 |  |
| :---: | :---: | :---: | :---: | :---: | :---: |
| Net | Voltage | Net | Voltage | Net | Voltage |
| $i n$ | $v d d$ | $i n$ | $s c$ | $i n$ | $s c$ |
| $i p$ | $v d d$ | $i p$ | $v d d$ | $i p$ | $s c$ |
| $n 1$ | $s c$ | $n 1$ | gnd | $n 1$ | gnd |
| $n 2$ | $s c$ | $n 2$ | float | $n 2$ | float |
| $n 3$ | $s c$ | $n 3$ | gnd | $n 3$ | gnd |
| $n 4$ | $s c$ | $n 4$ | gnd | $n 4$ | gnd |
| $n 5$ | $s c$ | $n p 1$ | $v d d$ | $n p 1$ | $v d d$ |
| $n 6$ | $s c$ | $o u t$ | $s c$ | $o u t$ | $s c$ |
| out | $s c$ |  |  |  |  |

For all circuits: $\quad$ ground $=$ gnd, supply $=v d d$ and $p w d=v d d$.
For MILLER1 - MILLER3: $\overline{p w d}=g n d$.
Table 3.11.: Final voltage propagation result for the high input impedance differential amplifier (HIIDA) and the Miller operational transconductance amplifiers (OTAs) from Fig. 3.10.

- $P_{4}=(M I L L E R 3 / P P 4, R 7)$

Hence, voltage propagation has to be repeated with the set of short circuit nets:

$$
\begin{equation*}
N_{s c}=\{M I L L E R 1 / i n, M I L L E R 2 / i n, M I L L E R 3 / i n, M I L L E R 3 / i p, n 1-n 6\} . \tag{3.5}
\end{equation*}
$$

The final voltage propagation result is displayed in Table 3.11. During voltage propagation, intermediate results for MILLER1 and MILLER2 have been reused as both Miller operational transconductance amplifiers are exposed to the same input voltage levels in power-down mode. The results reveal that there are three floating nets in the circuit: net $n 2$ in each of the three Miller operational transconductance amplifiers.

The voltage propagation results are then used to check the matching and symmetry conditions of the high input impedance differential amplifier in power-down mode. Its power-down circuitry first has to be removed in order to be able to identify its analog basic building blocks and symmetry pairs. The result of the power-up transformation for the high input impedance differential amplifier is shown in Fig. 3.11a, the one for the Miller operational transconductance amplifier in Fig. 3.11b.

Based on Fig. 3.11b, structure recognition identifies the differential pair $d p(P 4, P 5)$, the simple current mirror $\operatorname{scm}(N 1, N 2)$ and the simple current mirror bank $s c m(P 1, P 2, P 3)$ in each of the three Miller operational transconductance amplifiers. The high input impedance differential amplifier does not contain any analog basic building blocks on the top-level.

Symmetry computation for the Miller operational transconductance amplifier detects the symmetry pairs $(P 4, P 5)$ and $(N 1, N 2)$. The high input impedance differential amplifier contains the symmetry pairs (MILLER1, MILLER2), $(R 1, R 3),(R 4, R 5)$ and ( $R 6, R 7$ ).


Figure 3.11.: High input impedance differential amplifier (HIIDA) [38] and its Miller operational transconductance amplifier (OTA) subcircuit blocks after power-up transformation.

| Building Blocks | Evaluated |  | Voltages MILLER<1, 2> |  | Voltages MILLER3 |  |
| :---: | :---: | :---: | :---: | :---: | :---: | :---: |
|  | Pins | Nets |  |  |  |  |
| $\operatorname{scm}(P 1, P 2)$ | $d$ | $(n 1, n 2)$ | (gnd, float) | $x$ | (gnd, float) | $x$ |
| $\operatorname{scm}(P 1, P 3)$ |  | ( $n 1$, out) | (gnd, sc) | $x$ | (gnd, sc) | $x$ |
| $\operatorname{scm}(N 1, N 2)$ | $d$ | $(n 3, n 4)$ | ( $g n d, g n d$ ) | $\checkmark$ | (gnd, gnd) | $\checkmark$ |
|  | $d$ | $(n 3, n 4)$ | (gnd, gnd) | $\checkmark$ | (gnd, gnd) | $\checkmark$ |
| $d p(P 4, P 5)$ |  | (in,ip) | ( $s c, v d d$ ) | $x$ | ( $s c, s c$ ) | $x$ |

evaluation result: fail $(\boldsymbol{X})$, pass $(\checkmark)$, warning ( $\widehat{\wedge}$ )
Table 3.12.: Matching assertion results for the Miller operational transconductance amplifiers MILLER1 - MILLER3 from Fig. 3.10b.

Additionally, the voltages at the differential input (in, $i p$ ) of each of the three Miller operational transconductance amplifiers must be matched at the top-level. These results are shown in Fig. 3.11 and correspond to ones presented in [23].

Matching and symmetry assertion evaluates the identified structures against asymmetric stress in power-down mode. Table 3.12 shows the corresponding evaluation results for the basic building blocks of the three Miller operational transconductance amplifiers. The drain pins of the PMOS current mirrors, as well as the gate pins of the differential pair are exposed to different voltage levels in each of the three Miller operational transconductance amplifiers in power-down mode, hence failing the assertion. Only the drain pins of the NMOS simple current mirror and the differential pair are successfully evaluated. Overall, $\frac{9}{15}=60 \%$ of the building blocks matching conditions are violated.

Tables 3.13 and 3.14 show the symmetry assertion results for the high input impedance differential amplifier and its subcircuit blocks MILLER1 - MILLER3. All symmetry checks for the resistors of the high input impedance differential amplifier fail as the detected

| subcircuits | Evaluated |  | Voltages |  |
| :---: | :---: | :---: | :---: | :---: |
|  | Pins | Nets |  |  |
| R1, R3 | plus | $(n 1, n 3)$ | ( $s c, s c$ ) | $x$ |
|  | minus | $(n 2, n 4)$ | ( $s c, s c$ ) | $x$ |
| R4, R5 | plus | $(n 1, n 4)$ | ( $s c, s c$ ) | $x$ |
|  | minus | $(n 5, n 6)$ | $(s c, s c)$ | $x$ |
| R6, R7 | plus | $(n 5, n 6)$ | ( $s c, s c$ ) | $x$ |
|  | minus | (out, ground) | ( $s c, g n d$ ) | $x$ |
| MILLER1, MILLER2 | in | $(n 2, n 3)$ | ( $s c, s c$ ) | $x$ |
|  | $i p$ | (in, ip) | (vdd, vdd) | $\checkmark$ |
|  | out | $(n 1, n 4)$ | $(s c, s c)$ | $x$ |
| MILLER3 | (in, ip) | $(n 5, n 6)$ | $(s c, s c)$ | $x$ |

evaluation result: fail $(\boldsymbol{X})$, pass $(\checkmark)$, warning ( $\bigwedge$ )
Table 3.13.: Symmetry assertion result for the high input impedance differential amplifier from Fig. 3.10a.

| subcircuits | Evaluated |  | Voltages MILLER<1, $2>$ |  | Voltages MILLER3 |  |
| :---: | :---: | :---: | :---: | :---: | :---: | :---: |
|  | Pins | Nets |  |  |  |  |
| P4, P5 |  | $(n 4, n 3)$ | (gnd, gnd) | $\checkmark$ | (gnd, gnd) | $\checkmark$ |
|  |  | (in, ip) | ( $s c, v d d$ ) | $x$ | ( $s c, s c$ ) | $x$ |
|  |  | $n 2$ | float | ¢ | float | $\triangle$ |
| N1, N2 | $d$ | $(n 3, n 4)$ | (gnd, gnd) | $\checkmark$ | (gnd, gnd) | $\checkmark$ |
|  |  | n3 | gnd | $\checkmark$ | gnd | $\checkmark$ |
|  |  |  | gnd | $\checkmark$ | gnd | $\checkmark$ |
|  |  | ation res | t: fail ( $\boldsymbol{X}$ ), | $\checkmark$ ), warn | $g($ |  |
| ) |  |  |  |  |  |  |

Table 3.14.: Symmetry assertion results for the Miller operational transconductance amplifiers from Fig. 3.10b.
short circuit paths are running over them. Furthermore, at least one pin of each Miller operational transconductance amplifier is exposed to voltage level sc. Overall, only one out of ten symmetry conditions is fulfilled in the top-level schematic.

For the Miller operational transconductance amplifiers, three warnings and three errors are reported: the gates of $P 4$ and $P 5$ are exposed to different voltage levels and their source pins are connected to a floating net.

App. C shows how this verification report can be used to revise and debug the power-down circuitry of the high input impedance differential amplifier.
3. Hierarchical Power-Down Verification

## 4. Hierarchical Power-Down Synthesis

The previous chapter presented a method to verify the power-down mode of hierarchical analog circuits. In the following, a method to synthesize faultless power-down circuitry for such circuits is presented.

### 4.1. Overview

Fig. 4.1 gives an overview of the proposed hierarchical power-down synthesis method. It uses the same steps as the method presented in [12]. It, however, makes extensive reuse of intermediate results stored in several different intermediate results libraries.

The main data flow of the method is indicated by thick vertical arrows in the figure. It takes a top-level circuit $C_{\text {top }}$, the supply nets $N_{g n d}, N_{v d d}$, the power-down signals $P_{p w d}$ and the pin voltages $V_{P_{\text {top }}}$ as input. The output is denoted as $C_{\text {top }}^{\prime}\left(P_{\text {top }}^{\prime}, N_{\text {top }}^{\prime}, C_{\text {sub,top }}^{\prime}, t_{\text {top }}^{\prime}\right)$ which corresponds to the top-level circuit augmented by the synthesized power-down circuitry.
The core of the method is the gate shut-off procedure (Sec. 4.3). It uses the gate shutoff pattern from Fig. 1.6a and constraint programming to insert power-down transistors in parallel into $C_{\text {top }}$ to switch off its bias currents whilst achieving a trade-off between the two design goals "maximize matching" (C) and "minimize area" (D). Structure recognition and symmetry analysis are performed to generate the required matching and symmetry constraints for the constraint program (App. A and App. B).

The power-down synthesis method can be used to augment partially designed, yet incomplete, power-down circuitry, with its missing parts. The existing power-down circuitry, especially power-down transistors which have been inserted by the diode or serial rip-up pattern, can interfere with the structure recognition and symmetry analysis methods. E.g., a diode rip-up transistor can prevent the identification of current mirrors by structure recognition which in turn prevents symmetry analysis to accurately model the signal flow through these analog building blocks. Hence, already existing power-down circuitry is removed by the power-up transformation method from Sec. 3.4.1 before the structure recognition and symmetry computation are executed (Sec. 3.4).

The gate shut-off constraint program from Sec. 4.3 can determine a valid power-down mode implementation variant if all current paths of the circuit can be switched off by the gate shut-off pattern. Hence, $C_{t o p}$ is scanned beforehand for problematic currents paths, i.e., definite short circuit paths, by voltage propagation and short-circuit path computation from Secs. 3.2 and 3.3 which are then disrupted by applying the diode or serial rip-up pattern from Fig. 1.6. The gate shut-off constraint program then guarantees the existence of at least one valid power-down mode implementation variant for the ripped-up circuit.

## 4. Hierarchical Power-Down Synthesis



Figure 4.1.: Overview of the power-down synthesis method.

```
Algorithm 5 Rip-up heuristic for hierarchical analog circuits based on [12].
    procedure RipUpHeuristic \(\left(C_{\text {top }}\left(P_{\text {top }}, N_{\text {top }}, C_{\text {sub }, \text { top }}, t_{\text {top }}\right), \mathcal{P}_{\text {sc }}\right)\)
        \(L_{\text {rip }}=\emptyset\)
        for all \(P_{j} \in \mathcal{P}_{s c}\) do
            \(d_{\text {rip }}, c_{\text {rip }}=\) computeRipupPoint \(\left(P_{j}\right)\)
            if not hasResult \(\left(L_{\text {rip }}, d_{\text {rip }}, c_{\text {rip }}\right)\) then
                if isDiodeConnected \(\left(d_{\text {rip }}\right)\) then
                \(c_{\text {rip }}^{\prime}=\) applyDiodeRipup \(\left(d_{\text {rip }}, c_{\text {rip }}\right)\)
                storeResult \(\left(L_{\text {rip }}, d_{\text {rip }}, c_{\text {rip }}, c_{r i p}^{\prime}\right)\)
            else
                \(c_{\text {rip }}^{\prime}=\operatorname{applySerialRipup}\left(d_{\text {rip }}, c_{\text {rip }}\right)\)
                storeResult \(\left(L_{\text {rip }}, d_{\text {rip }}, c_{r i p}, c_{r i p}^{\prime}\right)\)
            end if
            end if
            \(c_{\text {rip }}^{\prime}=\operatorname{findResult}\left(L_{\text {rip }}, d_{\text {sub }}, c_{\text {rip }}\right)\)
            replace \(\left(C_{\text {top }}, c_{\text {rip }}, c_{\text {rip }}^{\prime}\right)\)
        end for
    end procedure
```


### 4.2. Rip-Up Heuristic

Algorithm 5 outlines the rip-up heuristic for hierarchical circuits based on [12]. It takes the top-level circuit $C_{\text {top }}$ and the set of definite short circuit paths $\mathcal{P}_{s c}$ as input. The problematic current paths are thereby detected by voltage propagation and short circuit path computation from Sec. 3.2 and Sec. 3.3, respectively. The algorithm uses the library $L_{\text {rip }}$ to store ripped-up subcircuit blocks in line 2. The method iterates over all current paths $P_{j} \in \mathcal{P}_{s c}$ (line 3) and determines a suitable rip-up position, i.e., a device $d_{r i p}$ on $P_{j}$ inside a subcircuit block $c_{\text {rip }}$, by invoking function computeRipupPoint $\left(P_{j}\right)$ in line 4 . In line 5, function $\operatorname{hasResult}\left(L_{\text {rip }}, d_{\text {rip }}, c_{\text {rip }}\right)$ checks whether intermediate results for the determined rip-up point already exist in library $L_{\text {rip }}$. If that is the case, the algorithm continues by reusing that result in lines 14 and 15 . Otherwise, the current path $P_{j}$ is ripped-up as follows: If $d_{r i p}$ is diode-connected, a diode-rip-up according to Fig. 1.6b is performed (lines 7 and 8). If a path $P_{j}$ contains several diode-connected devices, the device connected to the most transistor gates is chosen for rip-up as more current paths can be switched off with a single gate shut-off transistor afterwards [12].
If $d_{\text {rip }}$ is not diode-connected, the current path has to be switched off by a serial rip-up as shown in Fig. 1.6c (lines 10 and 11). For serial rip-up, the device which is connected to the supply net is chosen as rip-up position and a power-down transistor between the pin connected to the supply net and the supply net itself is inserted. However, other rip-up positions, e.g., at the ground node or even at an internal net of that path, could be chosen as well.

In both cases, the ripped-up circuit $c_{\text {rip }}^{\prime}$ is stored in the library $L_{\text {rip }}$ (line 8 or 11).
Finally, in lines 14 and 15, the rip-up heuristic fetches the ripped-up circuit from $L_{r i p}$ and uses it to replace $c_{\text {rip }}$ at the corresponding position in the top-level circuit.

Fig. 4.2 shows a differential stage series (SDFS). Its subcircuits $D F S 1$ and $D F S 2$ each implement one differential stage. Voltage propagation and short circuit path computation detected the definite short circuit path

$$
\begin{equation*}
P 1=(D F S 2 / P 1, D F S 2 / N 2, D F S 2 / N 1) \tag{4.1}
\end{equation*}
$$

in the circuit which must be ripped up in order to implement the power-down mode of the circuit. The current path $P 1$ is highlighted in red in the circuit schematic.
Algorithm 5 computes transistor $P 1$ of $D F S 2$ as rip-up point. The transistor is diodeconnected and part of the current mirror $\operatorname{scm}(P 1, P 2)$. Hence, disabling the diode configuration of $P 1$ will also switch off the current flow through $P 2$ in power-down mode. The ripped-up version of the differential input stage is stored in $\mathcal{C}$. It is denoted as $D F S 2^{\prime}$ in order to be able distinguish it better from the regular, non ripped-up version. The ripped-up schematic of the differential stage series is shown in Fig. 4.3.

### 4.3. Gate Shut-Off

Algorithm 6 outlines the gate shut-off procedure for hierarchical analog circuits. It takes a circuit netlist $C_{i}$, the pin voltages $V_{P_{i}}$ and the power-down synthesis results library $L_{\text {synth }}$ as

## 4. Hierarchical Power-Down Synthesis



Figure 4.2.: Schematic of a differential stage series (SDFS). Subcircuits DFS1 and DFS2 each implement one differential stage. The definite short circuit path $P 1$, detected by voltage propagation and short-circuit path computation, is highlighted in red.


Figure 4.3.: Ripped-up schematic of the differential series from Fig. 4.2.

```
Algorithm 6 Gate shut-off for hierarchical circuits based on [10]
    procedure GateShutoff \(\left(C_{i}\left(P_{i}, N_{i}, C_{\text {subbi, }}, t_{i}\right), V_{P_{i}}, L_{\text {synth }}\right)\)
        // Additional inputs \(L_{\text {struct }}, L_{\text {sym }}, N_{\text {supply }}\) not shown.
        // Output \(L_{\text {synth }}\) passed by reference and recursively filled in.
        \(V_{N_{i}}=\) voltagePropagation \(\left(C_{i}, V_{P_{i}}\right) \quad / /\) See Algorithm 1.
        // Check if gate shut-off COP from eq. (4.15) has at least one solution.
        if currentThroughDevice \(\left(C_{i}, V_{N_{i}}\right)\) then
            return // Current detected, return.
        end if
        // Only formulate gate shut-off COP if there is a floating node in the circuit.
        if noFloating \(\operatorname{Node}\left(C_{i}, V_{N_{i}}\right)\) then
            \(\mathbf{V}_{N_{i}}^{*}=\left\{V_{N_{i}}\right\}\)
        else
            \(\mathbf{V}_{N_{i}}^{*}=\) solveGateShutof \(\operatorname{COP}\left(C_{i}, V_{N_{i}}\right) \quad / /\) See eq. (4.15).
        end if
        storeResults \(\left(C_{i}, V_{P_{i}}, \mathbf{V}_{N_{i}}^{*}, L_{\text {synth }}\right)\)
        // Descend hierarchy recursively for each implementation variant.
        for all \(\mathbf{v}_{j}^{*} \in \mathbf{V}_{N}^{*}\) do
            for all \(c_{s u b} \in C_{\text {sub }, i}\) do
            if isSubCircuitBlock \(\left(c_{s u b}\right)\) then
                                \(V_{P_{\text {sub }}}=\operatorname{pinVoltages}\left(c_{s u b}, \mathbf{v}_{j}^{*}\right)\)
                                if not hasResult \(\left(c_{\text {sub }}, V_{P_{\text {sub }}}, L_{\text {synth }}\right)\) then
                        gateShutof \(f\left(c_{s u b}, V_{P_{\text {sub }}}, L_{\text {synth }}\right) \quad / /\) Recursive call.
                        end if
            end if
            end for
        end for
    end procedure
```


## 4. Hierarchical Power-Down Synthesis

inputs. It furthermore takes the structure recognition results $L_{\text {struct }}$, the symmetry computation results $L_{\text {sym }}$ and the supply nets $N_{\text {supply }}$ as additional inputs. The results library $L_{\text {synth }}$ is passed by reference and is filled in with all computed power-down mode implementation variants for the given circuit with pin voltages $V_{P_{i}}$ during gate shut-off. The additional inputs are not shown in the pseudo-code for simplicity but are always accessible during the computations.
The library $L_{\text {synth }}$ is organized as follows: An entry $\left(C_{i}, V_{P_{i}}, \mathbf{V}_{N_{i}}^{*}\right)$ contains all valid powerdown mode implementation variants

$$
\begin{align*}
& \mathbf{V}_{N_{i}}^{*}=\left\{\mathbf{v}_{1}^{*}, \mathbf{v}_{2}^{*}, \ldots, \mathbf{v}_{\mid \mathbf{V}_{N_{i}}}^{*}\right\}  \tag{4.2}\\
& \text { with } \mathbf{v}_{j}^{*}=\left(v_{n_{1}}^{*}, v_{n_{2}}^{*}, \ldots, v_{n_{\left|N_{i}\right|}}^{*}\right)  \tag{4.3}\\
& \text { and } v_{n_{k}}^{*} \in\{g n d, \text { pullGnd,vdd, pullVdd }\} \tag{4.4}
\end{align*}
$$

for the given circuit $C_{i}$ with input voltages $V_{P_{i}}$. An implementation variant $\mathbf{v}_{j}^{*} \in \mathbf{V}_{N_{i}}^{*}$ assigns each net $n_{k} \in N_{i}$ to one of the following voltage levels:

- pullGnd, pullVdd: net $n_{k} \in N_{i}$ has to be pulled to gnd or to $v d d$ by a power-down transistor.
- gnd, $v d d$ : the supply or ground voltage inherently propagates to net $n_{k} \in N_{i}$ in powerdown mode. Hence, no power-down transistor has to be inserted at such a net.

Algorithm 6 traverses the circuit hierarchy recursively in a top-down manner. As a first step, voltage propagation is performed to determine all internal nodes of the circuit to which the supply and ground voltage can inherently propagate (line 2 ). The algorithm then checks if there is at least one device which carries a current in power-down mode (lines 3-5) according to the voltage propagation result $V_{N_{i}}$ by equation

$$
\begin{align*}
& \text { currentThroughDevice }\left(C_{i}, V_{N_{i}}\right) \Leftrightarrow \\
& \underset{c_{\text {sub }} \in C_{\text {sub }, i}}{\exists}\left(t_{\text {sub }} \in T_{\text {device }}\right) \wedge \operatorname{on}\left(c_{\text {sub }}, V_{N_{i}}\right) \wedge \operatorname{voltageDrop}\left(v_{+}\left(c_{\text {sub }}\right), v_{-}\left(c_{\text {sub }}\right)\right) . \tag{4.5}
\end{align*}
$$

It expresses that there is a current flow in power-down mode if there is at least one device in the circuit which is conducting (on) and which additionally has a voltage potential difference between its conducting channel in power-down mode. A device is conducting in power-down mode if one of the following four conditions holds:

$$
\operatorname{on}\left(c_{s u b}, V_{N_{i}}\right) \Leftrightarrow \begin{cases}v_{g}=v d d & , \text { if } t_{s u b}=n \operatorname{mos}  \tag{4.6}\\ v_{g}=g n d & , \text { if } t_{\text {sub }}=\text { pmos } \\ v_{+}=v d d \vee v_{-}=g n d & , \text { if } t_{\text {sub }} \in\left\{\text { diode }, \text { nmos }_{d i o}, \text { pmos }_{\text {dio }}\right\} \\ \text { true } & , \text { if } t_{\text {sub }} \in\{\text { resistor }, \text { inductor }\} \\ \text { false } & , \text { else, e.g., } t_{\text {sub }}=\text { cap } .\end{cases}
$$

An nmos or pmos transistor is on if its gate voltage is tied to $v d d$ or $g n d$, respectively. Diodes or diode-connected transistors are conducting if their anode $\left(v_{+}\right)$is on $v d d$ or if their cathode ( $v_{-}$) is on $g n d$. Inductors and resistors are always conducting, capacitors are always off.

There is a voltage potential difference at the conducting device channel if:
voltageDrop $\left(v_{+}, v_{-}\right) \Leftrightarrow\left\{\begin{array}{lc}\left(v_{+}=v d d \wedge v_{-}=g n d\right) \vee & , \text { if } t_{\text {sub }} \in\{\text { resistor, inductor, } \\ \left(v_{+}=g n d \wedge v_{-}=v d d\right) & \text { nmos, pmos }\} \\ v_{+}=v d d \wedge v_{-}=g n d \quad, & \text { if } t_{\text {sub }} \in\left\{\text { diode }, \text { nmos }_{\text {dio }}, \text { pmos }_{\text {dio }}\right\} .\end{array}\right.$
Regular MOSFET transistors, resistors and inductors thereby conduct a current in both directions, diodes and diode-connected devices in only one direction, i.e., from their anode to their cathode pin.

The circuit contains a definite short-circuit path if eq. (4.5) is fulfilled and the later formulated gate shut-off constraint optimization problem (COP) will not have a solution. Hence, Algorithm 6 returns (line 4).

In the next step, the algorithm checks if there are any floating nodes in the circuit (lines 610):

$$
\begin{equation*}
\text { noFloatingNode }\left(C_{i}, V_{N_{i}}\right) \Leftrightarrow \neg \underset{n_{i, j} \in N_{i}}{\exists} v_{n_{i, j}}=\text { float } \tag{4.8}
\end{equation*}
$$

The power-down mode of the circuit is already functional if there is no floating node in the circuit, i.e., all nodes are already tied to either $v d d$ or $g n d$. There is no need to insert additional power-down transistors into the circuit by the gate shut-off COP. In fact, the voltage levels of the circuit cannot be changed by the gate shut-off pattern anymore without creating a new current flow in the circuit. The voltage propagation result $V_{N_{i}}$ would in this case also correspond to the only solution of the gate shut-off COP from eq. (4.15) and hence formulating and solving the COP is skipped by assigning the voltage propagation result to $\mathbf{V}_{\mathbf{N}_{\mathbf{i}}}{ }^{*}$ (line 7).

The gate shut-off COP however must be formulated and solved for the current hierarchy level $C_{i}$ with the given pin voltages $V_{P_{i}}$ if there are any floating nodes left in the circuit (line 9). The thereby obtained solutions $\mathbf{v}_{j}^{*} \in \mathbf{V}_{N_{i}}^{*}$ represent different valid power-down mode implementation variants for the given circuit whilst maintaining a trade-off between design goals matching (C) and area (D). The gate shut-off COP is formulated based on the devices of the current hierarchy level, its subcircuit blocks are ignored for the moment as they will be treated individually later on by recursively descending the circuit hierarchy. The obtained solutions are stored in library $L_{\text {synth }}$.

The algorithm iterates over all solutions $\mathbf{v}_{j}^{*} \in \mathbf{V}_{N_{i}}^{*}$ and all subcircuit blocks $c_{\text {sub }} \in C_{\text {sub }, i}$ of the circuit in the next step (lines 12-21), determines their pin voltages $V_{P_{\text {sub }}}$ based on $\mathbf{v}_{j}^{*}$ and descends into the hierarchy by calling itself recursively (lines 15 and 17). However, the algorithm first checks whether a previously computed intermediate result can be reused by determining whether another subcircuit block of the same type and pin voltages has been encountered during a previous iteration (line 16). Above steps are repeated until the device level of the circuit hierarchy has been reached.

### 4.3.1. Constraint Optimization Problem

The design goals of the power-down mode (A) - (D) are formalized as a constraint program in the following. The presented constraint program unifies the gate shut-off and net dependency

## 4. Hierarchical Power-Down Synthesis

constraint programs from [10] which allows trade-offs between the design goals "maximize matching" and "minimize area". Thereby, Gecode [20] has been used to formulate and solve the unified gate shut-off constraint program which is the same constraint programming framework used by [12].

Constraint programming [19;39] models a mathematical problem with a vector of variables $\mathbf{z}$, a domain $D_{i}$ for each variable $z_{i} \in \mathbf{z}$ and a set of constraints $C$ :

$$
\begin{align*}
& \mathbf{z}=\left(z_{1} z_{2} \ldots z_{n_{z}}\right), \quad z_{i} \in D_{i}, i=1,2, \ldots, n  \tag{4.9}\\
& C=E \cup I=\left\{c_{1}, c_{2}, \ldots, c_{m}\right\} \tag{4.10}
\end{align*}
$$

The domains $D_{i}$ contain either boolean, integer or float values and are either finite or infinite. A constraint $c_{j} \in E$ denotes an equality constraint, i.e., $c_{j}(\mathbf{z})=0$, a constraint $c_{k} \in I$ an inequality constraint, i.e. $c_{j}(\mathbf{z}) \bigcirc 0$, with $\bigcirc \in\{>,<, \geq, \leq,!=, \ldots\}$.
The constraints $c \in C$ form a constraint satisfaction problem (CSP),

$$
\begin{equation*}
\operatorname{CSP}(\mathbf{z}) \Leftrightarrow \underset{j \in E}{\forall} c_{j}(\mathbf{z})=0 \wedge \underset{k \in I}{\forall} c_{k}(\mathbf{z}) \bigcirc 0 \tag{4.11}
\end{equation*}
$$

which must be fulfilled. A solution of eq. (4.11) is a value assignment $\mathbf{z}^{*}$ of $\mathbf{z}$ which fulfils the CSP.

A constraint optimization problem (COP) extends a CSP by an additional vector of target functions $\mathbf{t}(\mathbf{z})$ to

$$
\begin{equation*}
\max \mathbf{t}(\mathbf{z}) \text { s.t. } \operatorname{CSP}(\mathbf{z}) \Leftrightarrow 1 . \tag{4.12}
\end{equation*}
$$

Solving eq. (4.12) yields the best solutions of $\mathbf{t}(\mathbf{z})$ which satisfy the $\operatorname{CSP}(\mathbf{z})$.
The gate shut-off COP is formulated based on the voltage propagation results $V_{N_{i}}$ computed by Algorithm 1 for the given circuit $C_{i}$ with pin voltage $V_{P_{i}}$. The following sets are thereby required as input:

$$
\begin{align*}
& N_{\text {float }}=\left\{n_{i, j} \mid v_{n_{i, j}}=\text { float } \wedge v_{n_{i, j}} \in V_{N_{i}}\right\} \\
& N_{\text {vdd }}^{\prime}=\left\{n_{i, j} \mid v_{n_{i, j}}=v d d \wedge v_{n_{i, j}} \in V_{N_{i}}\right\}  \tag{4.13}\\
& N_{\text {gnd }}^{\prime}=\left\{n_{i, j} \mid v_{n_{i, j}}=g n d \wedge v_{n_{i, j}} \in V_{N_{i}}\right\} \\
& C_{\text {float }}=\left\{c_{\text {sub }} \in C_{\text {sub }, i} \mid t_{\text {sub }} \in T_{\text {device }} \wedge \text { connectedToFloatingNet }\left(c_{\text {sub }}, V_{N_{i}}\right)\right\}
\end{align*}
$$

The set $N_{\text {float }}$ contains all floating nets, the sets $N_{v d d^{\prime}}$ and $N_{\text {gnd }}$ all nets of the circuit to which the ground or supply voltage can inherently propagate, including the supply and ground nets and respective input voltages. The set $C_{\text {float }}$ contains all devices of the circuit which are connected to a floating net in power-down mode by at least one of its corresponding pins:

$$
\begin{equation*}
\text { connectedToFloatingNet }\left(c_{\text {sub }}, V_{N_{i}}\right) \Leftrightarrow \underset{p_{s u b, j} \in P_{s u b}}{\exists} v_{p_{s u b, j}}=\text { float } \wedge v_{p_{s u b}, j} \in V_{N_{i}} \tag{4.14}
\end{equation*}
$$

The supply and ground voltages inherently propagate to some extent to the internal nets of the circuit through conducting devices in power-down mode. The remaining floating nodes of the circuit must be pulled to either $v d d$ or $g n d$ by additional gate shut-off power-down
transistors such that all potential current flows are switched off in the circuit while taking matching and area considerations into account. Hence, the vector of net voltages and their corresponding domains from eqs. (4.3) and (4.4) have been chosen to model the power-down synthesis problem as following constraint optimization problem:

$$
\begin{align*}
\max & w_{1} \cdot \frac{\operatorname{matching}(\mathbf{v})}{n_{\text {match }}}-w_{2} \cdot \frac{\operatorname{area}(\mathbf{v})}{n_{\text {float }}} \text { s.t. } \\
& n o \operatorname{CurrentFlow}(\mathbf{v}) \Leftrightarrow 1 \wedge  \tag{4.15}\\
& n o F l o a t i n g N e t \\
& \text { boundaryConditions }(\mathbf{v}) \Leftrightarrow 1 \wedge \\
&
\end{align*}
$$

The target function aims to find a trade-off between the design goals "maximize matching" (C) and "minimize area" (D). The design goals (C) and (D) are normalized by the number of generated matching constraints $n_{\text {match }}$ and by the number of floating notes $n_{\text {float }}$ in the circuit detected by voltage propagation. The weights $w_{1}, w_{2}$ of the target function can be chosen to either prioritize one of the two goals or to compute all Pareto-optimal solutions. The CSP part ensures that there are no current flows or floating nets in the circuit in powerdown mode and sets the boundary conditions of the constraint program by initializing the supply nets and pins of the given circuit to their corresponding level.

The function

$$
\begin{equation*}
\operatorname{matching}(\mathbf{v}) \Leftrightarrow \text { buildingBlockMatching }(\mathbf{v})+\text { symmetryMatching }(\mathbf{v}) \tag{4.16}
\end{equation*}
$$

consists of two parts: The first term

$$
\begin{equation*}
\text { buildingBlockMatching }(\mathbf{v}) \Leftrightarrow \sum_{b_{j} \in C_{B_{i, \text { analog }}}} \sum_{k \in R_{\text {type }\left(b_{j}\right)}} \operatorname{sameLevel}\left(v_{n_{1}\left(b_{j}, p_{1}\left(r_{k}\right)\right)}, v_{n_{2}\left(b_{j}, p_{2}\left(r_{k}\right)\right)}\right) \tag{4.17}
\end{equation*}
$$

generates block-type specific matching constraints for each identified analog basic building block $b_{j}$ in the circuit [12]. The building block circuit $C_{B_{i, a n a l o g}}$ denotes the structure recognition result from Appendix A. It contains all analog building blocks of circuit $C_{i}$, e.g., current mirrors and differential pairs. A building block type specific rule set

$$
\begin{equation*}
R_{\text {type }\left(b_{j}\right)}=\left\{r_{1}, r_{2}, \ldots, r_{\left|R_{t y p e\left(b_{j}\right)}\right|}\right\} \text { with } r_{k}=\left(p_{k, 1}, p_{k, 2}\right) \text { and } p_{k, 1}, p_{k, 2} \in P_{b_{j}} \tag{4.18}
\end{equation*}
$$

contains tuples of pins which should be matched in power-down mode. E.g., the set

$$
\begin{equation*}
R_{d p}=\left\{r_{1}=\left(\text { in }_{1}, \text { in }_{2}\right), r_{2}=\left(\text { out }_{1}, \text { out }_{2}\right)\right\} \tag{4.19}
\end{equation*}
$$

contains two rules which specify that the input and output pins of a differential pair $d p$ should be exposed to the same voltage levels in power-down mode (see Fig. 3.5a).
The constraint

$$
\begin{align*}
\operatorname{sameLevel}\left(v_{n_{1}}, v_{n_{2}}\right) \Leftrightarrow & {\left[g n d \operatorname{Level}\left(v_{n_{1}}\right) \wedge \operatorname{gnd\operatorname {Level}(v_{n_{2}})]\vee }\right.}  \tag{4.20}\\
& {\left[\operatorname{vdd} \operatorname{Level}\left(v_{n_{1}}\right) \wedge \operatorname{vdd\operatorname {Level}(v_{n_{2}})].}\right.}
\end{align*}
$$

## 4. Hierarchical Power-Down Synthesis

denotes that the voltages at the nets $n_{1}$ and $n_{2}$ should be on the same level. The symbols $g n d$, pullGnd and $v d d$, pullV $d d$ quantitatively represent the same voltage levels in power-down mode,i.e.:

$$
\begin{align*}
& \operatorname{gndLevel}\left(v_{n_{i}}\right) \Leftrightarrow\left(v_{n_{i}}=g n d\right) \vee\left(v_{n_{i}}=\operatorname{pullGnd}\right)  \tag{4.21}\\
& \operatorname{vddLevel}\left(v_{n_{i}}\right) \Leftrightarrow\left(v_{n_{i}}=v d d\right) \vee\left(v_{n_{i}}=\operatorname{pullVdd}\right) . \tag{4.22}
\end{align*}
$$

The functions $n_{1}\left(b_{j}, p_{1}\left(r_{k}\right)\right)$ and $n_{2}\left(b_{j}, p_{2}\left(r_{k}\right)\right)$ used in eq. (4.17) determine the nets of circuit $C_{i}$ which are connected to the pins $p_{1}$ and $p_{2}$ of building block $b_{j}$ which are specified by rule $r_{k}$.

The output pins of differential pair $d p(N 2, N 3)$ of the differential stage $D F S 1$ from Fig. 4.3, e.g., should be matched. This corresponds to matching the voltages at nets DFS1/outn and DFS1/outp:

$$
\begin{equation*}
\operatorname{match}_{d p(D F S 1 / N 2, D F S 1 / N 3), \text { out }}(\mathbf{v}) \Leftrightarrow \operatorname{sameLevel}\left(v_{D F S 1 / n 2}, v_{D F S 1 / n 3}\right) . \tag{4.23}
\end{equation*}
$$

The second term

$$
\begin{equation*}
\operatorname{symmetryMatching}(\mathbf{v}) \Leftrightarrow \sum_{\left(c_{i}, c_{j}\right) \in C_{s y m}} \sum_{\left(p_{i, k}, p_{j, k}\right) \in P_{s y m}} \operatorname{sameLevel}\left(v_{p_{i, k}}, v_{p_{j, k}}\right) \tag{4.24}
\end{equation*}
$$

generates matching constraints for each symmetrical pin pair of each identified symmetry pair in the circuit including pin symmetries due to subcircuit block internal symmetrical signal paths. The input pins in, ip of $D F S 2^{\prime}$ from Fig. 4.3, e.g., should be matched on the top-level schematic. This corresponds to matching the voltages at nets $n 1$ and $n 2$ of the differential stage series:

$$
\begin{equation*}
\operatorname{sameLevel}\left(v_{p_{D F S 2^{\prime}, i n}}, v_{p_{D F S 2^{\prime}, i p}}\right) \Leftrightarrow \operatorname{sameLevel}\left(v_{n 1}, v_{n 2}\right) \tag{4.25}
\end{equation*}
$$

The value of eq. (4.16) corresponds to the number of fulfilled matching and symmetry constraints in power-down mode.

In addition, the optional constraint

$$
\begin{equation*}
\text { symmetricTransistors }\left(v_{1}, v_{2}\right) \Leftrightarrow\left(v_{1}=v_{2}\right) \tag{4.26}
\end{equation*}
$$

can be formulated for each matched net or pin pair in order to enforce that power-down transistors are inserted symmetrically into the circuit.

The area of the power-down circuitry is mainly determined by the number of inserted powerdown transistors. Hence, the constraint

$$
\begin{align*}
& \operatorname{area}(\mathbf{v}) \Leftrightarrow \sum_{n_{i} \in N_{\text {float }}} \operatorname{addTransistor}\left(v_{n_{i}}\right), \text { with }  \tag{4.27}\\
& \text { addTransistor }\left(v_{n_{i}}\right) \Leftrightarrow\left(v_{n_{i}}=\text { pullGnd }\right) \vee\left(v_{n_{i}}=\text { pullVdd }\right) \tag{4.28}
\end{align*}
$$

sums up the number of power-down transistors which have been inserted at the floating nodes of the circuit to fulfil the CSP part, i.e., eqs. (4.29), (4.31), (4.34), of the overall gate shut-off COP, eq. (4.15).

The constraint satisfaction problem

$$
\begin{equation*}
\text { noCurrentFlow }(\mathbf{v}) \Leftrightarrow \underset{c_{\text {sub }} \in C_{\text {float }}}{\forall} \text { on }\left(c_{\text {sub }}, \mathbf{v}\right) \rightarrow \operatorname{sameLevel}\left(v_{+}\left(c_{\text {sub }}\right), v_{-}\left(c_{\text {sub }}\right)\right) \tag{4.29}
\end{equation*}
$$

ensures that there is no current flow in the circuit in power-down mode based on the set $C_{\text {float }}$. This set contains all devices of the circuit through which a current flow would still be possible in power-down mode as these devices are connected to at least one floating node according to the previously voltage propagation result. All other devices have already been checked by eq. (4.5) and do not have to be considered by the COP anymore. The CSP from eq. (4.29) formulates a constraint for each floating device of the circuit which prohibits a current flow through it by either forcing it into the non-conducting state or by ensuring that there is no voltage drop between its conducting channel in power-down mode.

An NMOS transistor, e.g., is off in power-down mode iff its gate pin is exposed to the ground voltage, i.e., $v_{g}=g n d$. It is on iff $v_{g}=v d d$. In the latter case, its drain and source pin must have the same voltage level in power-down mode to ensure that there is no current flow in the circuit.

A device $c_{\text {sub }}$ is conducting if one of the following conditions holds:

NMOS- and PMOS transistors are conducting when their gate pin is on $v d d$ or $g n d$, respectively. Diodes and diode-connected transistors are only conducting in one direction, i.e., they are on if their anode voltage $v_{+}$is on $v d d$ or their cathode pin $v_{-}$is on gnd. Resistors and inductors are treated as always on, other device types are considered to be off. Fig. 3.2 shows the pin names and the propagation behavior of different device types.
The constraint satisfaction problem

$$
\begin{align*}
& \text { noFloatingNet }(\mathbf{v}) \Leftrightarrow \\
& \underset{n_{i} \in N_{\text {float }}}{\forall} \text { addTransistor }\left(v_{n_{i}}\right) \vee  \tag{C1}\\
& \underset{P_{n_{j} \rightarrow n_{i}}, n_{j} \in N_{v d d^{\prime}}}{\exists} \text { on }\left(P_{n_{j} \rightarrow n_{i}}, \mathbf{v}\right) \vee  \tag{C2}\\
& \underset{P_{n_{i} \rightarrow n_{j}}, n_{j} \in N_{\text {gnd }}}{\exists} \text { on }\left(P_{n_{i} \rightarrow n_{j}}, \mathbf{v}\right) \vee  \tag{C3}\\
& \underset{P_{n_{j} \rightarrow n_{i}}, n_{j} \in N_{\text {float }}}{\exists}\left[o n\left(P_{n_{j} \rightarrow n_{i}}, \mathbf{v}\right) \wedge\left(v_{n_{j}}=\text { pullVdd }\right)\right] \vee  \tag{C4}\\
& \underset{P_{n_{i} \rightarrow n_{j}}, n_{j} \in N_{\text {float }}}{\exists}\left[\text { on }\left(P_{n_{i} \rightarrow n_{j}}, \mathbf{v}\right) \wedge\left(v_{n_{j}}=\text { pullGnd }\right)\right] \tag{C5}
\end{align*}
$$

## 4. Hierarchical Power-Down Synthesis

formulates a constraint for each floating net $n_{i}$ of the circuit that ensures that either a power-down transistor is inserted to pull it to $v d d$ or gnd (C1) or that the supply or ground voltage inherently propagates to $n_{i}$ via a conducting device path $P_{n_{j} \rightarrow n_{i}}$ or $P_{n_{i} \rightarrow n_{j}}(\mathrm{C} 2-\mathrm{C} 4)$, respectively.

A path

$$
\begin{align*}
& P_{n_{j} \rightarrow n_{i}}=\left(c_{1}, c_{2}, \ldots, c_{k}\right) \text { with } \\
& n_{+}\left(c_{1}\right)=n_{j}, n_{-}\left(c_{k}\right)=n_{i} \text { and }  \tag{4.32}\\
& \quad \forall \quad n_{-}\left(c_{l}\right)=n_{+}\left(c_{l+1}\right)
\end{align*}
$$

is a series of devices connecting net $n_{j}$ with net $n_{i}$ of the circuit. A path $P_{n_{j} \rightarrow n_{i}}$ allows $v d d$ to propagate from $n_{j}$ to $n_{i}$ if it is conducting in power-down mode, i.e., iff:

$$
\begin{equation*}
\text { on }\left(P_{n_{j} \rightarrow n_{i}}, \mathbf{v}\right) \Leftrightarrow \underset{c_{k} \in P_{n_{j} \rightarrow n_{i}}}{\forall} \text { on }\left(c_{k}, \mathbf{v}\right) \Leftrightarrow 1 . \tag{4.33}
\end{equation*}
$$

Analogously, a conducting path $P_{n_{i} \rightarrow n_{j}}$ allows the ground voltage to propagate from $n_{j}$ to $n_{i}$.

Condition (C2) checks if there is a conducting path from a net $n_{j} \in N_{v d d^{\prime}}$ to floating node $n_{i}$ such that the supply voltage can propagate to $n_{i}$, i.e., in forward direction. Similarly, condition (C3) checks if there is a conducting path from floating node $n_{i}$ to a net $n_{j} \in N_{\text {gnd }}$ such that the ground voltage can propagate to $n_{i}$, i.e., in backwards direction. Condition (C4) checks if there exists a conducting path between floating node $n_{j}$ to $n_{i}$ with $n_{j}$ additionally being pulled to $v d d$ by a power-down transistor such that $v d d$ can propagate to $n_{i}$. Finally, condition (C5) checks if there exists a conducting path between floating node $n_{i}$ to $n_{j}$ with $n_{j}$ additionally being pulled to $g n d$ by a power-down transistor such that $g n d$ can propagate to $n_{i}$. If there is at least one path in the circuit fulfilling one of (C2-C5) then there is no need to add a power-down transistor to $n_{i}$ as $v d d$ or gnd inherently propagates to it.

The boundary conditions model that the ground and supply voltage $v d d$ and $g n d$ can propagate from the nets specified by $N_{g n d}^{\prime}$ and $N_{\text {supply }}^{\prime}$ to the remaining floating nets of the circuit by initially adding a power-down transistor to these nets:

$$
\begin{equation*}
\text { boundaryConditions }(\mathbf{v}) \Leftrightarrow \underset{n_{i} \in N_{v d d}^{\prime}}{\forall}\left(v_{n_{i}}=\text { pullV } d d\right) \wedge \underset{n_{j} \in N_{\text {gnd }}^{\prime}}{\forall}\left(v_{n_{j}}=\text { pullGnd }\right) \tag{4.34}
\end{equation*}
$$

After solving the COP, those transistors are removed again from the circuit.
Table 4.1 shows the voltage propagation results for the ripped-up differential stage series (SDFS) from Fig. 4.3 from which

$$
\begin{equation*}
N_{\text {float }}=\{\text { in, ip, outp, } n 2\}, N_{v d d}^{\prime}=\{p w d, n 1\} \text { and } N_{g n d}^{\prime}=\{\text { outn }\} \tag{4.35}
\end{equation*}
$$

can be determined. There is no device which carries a current in power-down mode. However, there are still floating nodes in the circuit and hence the gate shut-off COP must be formulated and solved for it.

| $C_{i}$ | top-level (SDFS) |  |  |  |  |  | DFS 1 |  |  |  |  |
| :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: |
| $\begin{aligned} & n_{i} \\ & v_{n_{i}} \end{aligned}$ | in <br> float | ip <br> float | $\begin{gathered} \hline \text { outn } \\ \text { gnd } \end{gathered}$ | outp <br> float | $\begin{gathered} n 1 \\ v d d \end{gathered}$ | $\begin{gathered} n 2 \\ \text { float } \end{gathered}$ | in <br> float | ip <br> float | outn <br> vdd | outp <br> float | $n 1$ <br> gnd |
| $C_{i}$ | DFS2 ${ }^{\prime}$ |  |  |  |  |  | global nets |  |  |  |  |
| $n_{i}$ $v_{n_{i}}$ | in <br> float | $\begin{gathered} i p \\ v d d \end{gathered}$ | outn <br> gnd | outp <br> float | $\begin{gathered} n 1 \\ g n d \end{gathered}$ | $n_{\text {rip } 1}$ <br> float | $\begin{gathered} \text { supply } \\ v d d \end{gathered}$ | ground <br> gnd | $p w d$ $v d d$ |  |  |

Table 4.1.: Voltage propagation result for the ripped-up differential stage series from Fig. 4.3.

| $C O P$ | constraint |  |  | comment |
| :---: | :---: | :---: | :---: | :---: |
|  | buildingBlockMatching(v) | $\Leftrightarrow$ | 0 | no basic building block |
|  | symmetryMatching(v) |  | $\begin{aligned} & \operatorname{sameLevel}\left(v_{i n}, v_{i p}\right)+\operatorname{sameLevel}\left(v_{n 1}, v_{n 2}\right)+ \\ & \operatorname{sameLevel}\left(v_{n 1}, v_{n 2}\right)+\operatorname{sameLevel}\left(v_{o u t n}, v_{\text {outp }}\right) \end{aligned}$ | self-symmetry DFS1 <br> self-symmetry DFS2 ${ }^{\prime}$ |
|  | $\operatorname{area}(\mathbf{v})$ |  | $\begin{aligned} & \text { addTransistor }\left(v_{\text {in }}\right)+\text { addTransistor }\left(v_{i p}\right)+ \\ & \text { addTransistor }\left(v_{\text {outp }}\right)+\text { addTransistor }\left(v_{n 2}\right) \end{aligned}$ | $N_{\text {float }}$ from eq. (4.35) |
| $C S P$ | noCurrentFlow(v) | $\Leftrightarrow$ | 1 | no devices |
|  | noFloatingNet(v) |  | addTransistor $\left(v_{i n}\right) \wedge$ addTransistor $\left(v_{i p}\right) \wedge$ addTransistor $\left(v_{\text {outp }}\right) \wedge$ addTransistor $\left(v_{n 2}\right)$ | $N_{\text {float }}$ from eq. (4.35) |
|  | boundaryConditions(v) |  | $\begin{aligned} & \left(v_{o u t n}=\text { pullGnd }\right) \wedge\left(v_{n 1}=\text { pullVdd }\right) \wedge \\ & \left(v_{p w d}=\text { pullVdd }\right) \end{aligned}$ | $N_{v d d}^{\prime}, N_{v d d}^{\prime}$ from eq. (4.35) |

The constraints matching $(\mathbf{v}) \Leftrightarrow$ buildingBlockMatching $(\mathbf{v})+\operatorname{symmetryMatching}(\mathbf{v})$ and $\operatorname{area}(\mathbf{v})$ are equally weighted, i.e., $w_{1}=w_{2}=0.5$.

Table 4.2.: Constraint optimization problem from eq. (4.15) for the differential stage series from Fig. 4.3.

| solution | $v_{i n}^{*}$ | $v_{i p}^{*}$ | $v_{n 1}^{*}$ | $v_{n 2}^{*}$ | $v_{\text {outn }}^{*}$ | $v_{o u t p}^{*}$ | $v_{p w d}^{*}$ | valid |
| :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: |
| $\mathbf{v}_{1}^{*}$ | pullVdd | pullVdd | $v d d$ | pullVdd | gnd | pullGnd | $v d d$ | $\checkmark$ |
| $\mathbf{v}_{2}^{*}$ | pullGnd | pullGnd | vdd | pullVdd | gnd | pullGnd | $v d d$ | $\mathbf{x}$ |

Table 4.3.: Solutions of the constraint optimization problem from Table 4.2. The power-down transistors inserted by the boundaryConditions( $\mathbf{v}$ ) constraint at nets outn, $n 1$ and $p w d$ have already been removed from this table and reverted to $g n d$, $v d d$, $v d d$, respectively.

| $c_{\text {sub }, i}$ | DFS 1 |  |  |  |  |  |  |
| :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: |
| $n_{i}$ | in | ip | outn | outp | $n 1$ | supply | ground |
| $v_{n_{i}}$ | vdd | vdd | vdd | vdd | gnd | vdd | gnd |

Pin voltages: $V_{P, D F S 1}=\left(v_{\text {in }}, v_{\text {ip }}, v_{\text {outn }}, v_{\text {outp }}\right)=(v d d, v d d, v d d, v d d)$
Devices carrying a current by eq. (4.5): N2, N3
Table 4.4.: Voltage propagation results of differential stage $D F S 1$ for solution $\mathbf{v}_{2}^{*}$ of the differential stage series.

Table 4.2 shows the constraint optimization problem (COP) and the constraint satisfaction problem (CSP) part for the top-level schematic of the ripped-up differential stage series. The circuit does not contain any basic analog building blocks on the top-level, hence buildingBlockMatching(v) always evaluates to zero. Its subcircuits DFS1 and DFS2' are self-symmetric, i.e, their differential input and output pins (in, ip and outn, outp, respectively) should be exposed to the same voltage levels in power-down mode. This is captured by constraint symmetryMatching(v) which can have a score between zero (no matching constraints fulfilled) and four (all matching constraints fulfilled). The circuit initially has four floating nodes according to the voltage propagation results from eq. (4.35). Hence, constraint $\operatorname{area}(\mathbf{v})$ denotes that up to four power-down transistors have to be inserted into the top-level schematic to implement its power-down mode. The constraint noCurrentFlow(v) is always fulfilled on the top-level as it does not contain any basic devices. The four nets in, $i p, n 2$ and outp are floating and do not have any connections over conducting devices to one of the nets from sets $N_{v d d}^{\prime}$ or $N_{g n d}^{\prime}$ in power-down mode. Hence, one power-down transistor must be connected to each of these nets to fulfil the constraint noFloatingNet(v). Finally, the boundaryConditions(v) are set according to $N_{v d d}^{\prime}$ and $N_{g n d}^{\prime}$.
Solving this COP yields the two solutions which are shown in Table 4.3. Both solutions fulfil all symmetry constraints and are implemented by four power-down transistors.

The second solution however is not valid as can be seen on the voltage propagation results for $D F S 1$ from Table 4.4: the transistors $N 2$ and $N 3$ carry a current as their gates (in, ip) are on $v d d$ and there is a voltage drop between their drain and source pins, i.e., between $\left(v_{\text {outn }}, v_{n 1}\right)$ and $\left(v_{\text {outp }}, v_{n 1}\right)$, respectively. Hence solution $\mathbf{v}_{2}^{*}$ is discarded by Algorithm 6.

The first solution on the other hand switches $D F S 1 / N 2, D F S 1 / N 3$ off by pulling the inputs $i n$, ip on the top-level to $g n d$. The corresponding voltage propagation results are shown in Table 4.5. The circuit does not contain any device fulfilling eq. (4.5) and no floating nodes have been detected. The power-down mode of $D F S 1$ is already functional and formulating and solving the gate shut-off COP is skipped as the voltage propagation

| $c_{\text {sub, },}$ | DFS 1 |  |  |  |  |  |  |
| :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: |
| $n_{i}$ | in | ip | outn | outp | $n 1$ | supply | ground |
| $v_{n_{i}}$ | gnd | gnd | vdd | vdd | gnd | vdd | gnd |

Pin voltages: $V_{P, D F S 1}=\left(v_{i n}, v_{i p}, v_{\text {outn }}, v_{\text {outp }}\right)=(g n d, g n d, v d d, v d d)$
Devices carrying a current by eq. (4.5): none
Table 4.5.: Voltage propagation results for differential stage DFS1 based on solution $\mathbf{v}_{1}^{*}$ of the differential stage series.

| $c_{\text {sub }, i}$ | $D F S 2$ |  |  |  |  |  |  |  |  |
| :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: |
| $n_{i}$ | in | ip | outn | outp | $n 1$ | nrip1 | supply | ground | pwd |
| $v_{n_{i}}$ | vdd | vdd | gnd | gnd | gnd | float | vdd | gnd | vdd |

Pin voltages: $V_{P, D F S 2}=\left(v_{\text {in }}, v_{\text {ip }}, v_{\text {outn }}, v_{\text {outp }}\right)=(v d d, v d d, g n d, g n d)$
Devices carrying a current by eq. (4.5): none
Table 4.6.: Voltage propagation results for differential stage DFS2 based on solution $\mathbf{v}_{1}^{*}$ of the differential stage series.
result would correspond to the gate shut-off COP result.
Table 4.6 shows the voltage propagation results for $D F S 2^{\prime}$ based on solution $\mathbf{v}_{1}^{*}$ of the differential stage series. Solution $\mathbf{v}_{1}^{*}$ is still valid as there are no current carrying devices in $D F S 2^{\prime}$ for the provided input voltages. The sets

$$
\begin{equation*}
N_{\text {float }}=\{n \text { rip } 1\}, N_{v d d}^{\prime}=\{\text { in, ip, pwd, supply }\}, N_{g n d}^{\prime}=\{\text { outn, outp }, n 1\} \tag{4.36}
\end{equation*}
$$

and

$$
\begin{equation*}
C_{\text {float }}=\{P 1, P 2, P P 1\} \tag{4.37}
\end{equation*}
$$

are determined based on the voltage propagation results. As there is still one floating node in the circuit, the gate shut-off COP must be formulated and solved for $D F S 2^{\prime}$.

Table 4.7 shows the gate shut-off COP for $D F S 2^{\prime}$ and $\mathbf{v}_{1}^{*}$. The building block and symmetry matching constraints are determined by differential pair $d p(N 2, N 3)$ and simple current mirror $\operatorname{scm}(P 1, P 2)$. Their values are solely determined by the voltage propagation results as none of the generated matching constraints depend on floating node nrip1. The value of the area constraint however is determined by nrip 1 . It can be minimized if it could be avoided to add a power-down transistor to it. The three transistors $D F S 2^{\prime} / P 1, D F S 2^{\prime} / P 2$ and $D F S 2^{\prime} / P P 1$ are all connected to nrip1. Hence, a constraint is formulated for each of these transistors to forbid current flow through them. A power-down transistor must be added to nrip 1 or transistor $D F S 2^{\prime} / P P 1$ must be switched on in power-down mode to allow gnd to propagate to nrip1 to ensure that there is no floating node in the circuit anymore. This transistor however is always off in power-down mode as the boundary conditions set the power-down signal $p w d$ to $v d d$. Hence, a power-down transistor to pull nrip 1 to $v d d$ must be inserted into the circuit. The remaining boundary conditions are set according to eq. (4.36).

## 4. Hierarchical Power-Down Synthesis

|  |  |  | constraint | comment |
| :---: | :---: | :---: | :---: | :---: |
| COP | buildingBlockMatching(v) |  | ```sameLevel (voutn},\mp@subsup{v}{outp}{})+\operatorname{sameLevel}(\mp@subsup{v}{in}{},\mp@subsup{v}{ip}{}) sameLevel( (voutn},\mp@subsup{v}{outp}{}``` | $\begin{aligned} & \operatorname{dp}(N 2, N 3) \\ & \operatorname{scm}(P 1, P 2) \end{aligned}$ |
|  | symmetryMatching(v) |  | $\begin{aligned} & \operatorname{sameLevel}\left(v_{\text {outn }}, v_{\text {outp }}\right)+\operatorname{sameLevel}\left(v_{i n}, v_{\text {ip }}\right)+ \\ & \operatorname{sameLevel}\left(v_{\text {outn }}, v_{\text {outp }}\right) \end{aligned}$ | $\begin{aligned} & \operatorname{dp}(N 2, N 3) \\ & \operatorname{scm}(P 1, P 2) \end{aligned}$ |
|  | $\operatorname{area}(\mathrm{v})$ | $\Leftrightarrow$ | addTransistor ( $v_{\text {nrip } 1}$ ) | $N_{\text {float }}$ from eq. (4.36) |
| CSP | noCurrentFlow(v) | $\Leftrightarrow$ | $\left[\right.$ gndLevel $\left.\left(v_{\text {nrip1 } 1}\right) \rightarrow \operatorname{sameLevel}\left(v_{\text {outn }}, v_{\text {supply }}\right)\right] \wedge$ $\left[\operatorname{gnd} \operatorname{Level}\left(v_{\text {nrip1 }}\right) \rightarrow \operatorname{sameLevel}\left(v_{\text {outp }}, v_{\text {supply }}\right)\right] \wedge$ $\left[\operatorname{gndLevel}\left(v_{p w d}\right) \rightarrow \operatorname{sameLevel}\left(v_{\text {outn }}, v_{\text {nrip1 } 1}\right)\right] \Leftrightarrow 1$ | transistor $D F S 2^{\prime} / P 1$ transistor $D F S 2^{\prime} / P 2$ transistor $D F S 2^{\prime} / P P 1$ |
|  | noFloating Net(v) | $\Leftrightarrow$ | $\operatorname{addTransistor}\left(v_{\text {nrip1 }}\right) \vee \operatorname{gndLevel}\left(v_{p w d}\right) \Leftrightarrow 1$ | $N_{\text {float }}$ from eq. (4.36) |
|  | boundaryConditions(v) |  | $\begin{aligned} & \left(v_{\text {outn }}=\text { pullGnd }\right) \wedge\left(v_{\text {outp }}=\text { pullGnd }\right) \wedge \\ & \left(v_{n 1}=\text { pullGnd }\right) \wedge\left(v_{\text {ground }}=\text { pullGnd }\right) \wedge \\ & \left(v_{\text {in }}=\text { pullVdd }\right) \wedge\left(v_{\text {ip }}=\text { pullVdd }\right) \wedge \\ & \left(v_{\text {pwd }}=\text { pullVdd }\right) \wedge\left(v_{\text {supply }}=\text { pullVdd }\right) \Leftrightarrow 1 \end{aligned}$ | $N_{g n d}^{\prime}$ from eq. (4.36) <br> $N_{v d d}^{\prime}$ from eq. (4.36) |

The constraints matching $(\mathbf{v}) \Leftrightarrow$ buildingBlockMatching $(\mathbf{v})+$ symmetryMatching $(\mathbf{v})$ and $\operatorname{area}(\mathbf{v})$ are equally weighted, i.e., $w_{1}=w_{2}=0.5$.

Table 4.7.: Constraint optimization problem from eq. (4.15) for differential stage $D F S 2^{\prime}$ and solution $\mathbf{v}_{1}^{*}$ of the differential stage series from Fig. 4.3.

| solution | $v_{i n}^{*}$ | $v_{i p}^{*}$ | $v_{\text {outn }}^{*}$ | $v_{\text {outp }}^{*}$ | $v_{n 1}^{*}$ | $v_{\text {nrip2 }}^{*}$ | valid | matching | area |
| :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: |
| $\mathbf{v}_{1}^{*}$ | $v d d$ | $v d d$ | gnd | gnd | gnd | pullVdd | $\checkmark$ | 6 | 1 |

Nets supply, pwd on $v d d$; net ground on gnd
Target function value: $0.5 \cdot \frac{6}{6}-0.5 \cdot \frac{1}{1}=0.0$
Table 4.8.: Solutions of the constraint optimization problem from Table 4.7. The powerdown transistors inserted by the boundary conditions have already been reverted to $v d d$ or $g n d$, respectively.


Figure 4.4.: Final synthesized power-down circuitry for the differential stage series from Fig. 4.3.

Solving the COP from Table 4.7 yields one solution which is shown in Table 4.8. One powerdown transistor to pull nrip1 to $v d d$ is inserted to switch off $D F S 2^{\prime} / P 1$ and $D F S 2^{\prime} / P 2$ in power-down mode. The solution fulfils all six matching constraints. Hence, the overall value of the target function is $0.5 \cdot \frac{6}{6}-0.5 \cdot \frac{1}{1}=0.0$.
The power-down mode of the differential stage series has been successfully synthesized as its subcircuits $D S F 1$ and $D F S 2^{\prime}$ each have one valid power-down mode implementation variant for the input voltages provided by top-level solution $\mathbf{v}_{1}^{*}$. The final synthesized power-down circuitry of the differential stage series is shown in Fig. 4.4. It requires five power-down transistors in total to implement it. The power-down transistor $N P 3$ is however redundant as gnd would inherently propagate to the output pin outp of the differential stage series via transistors $D F S 2^{\prime} / N 1$ and $D F S 2^{\prime} / N 3$. This redundant power-down transistor has been inserted into the circuit by the gate shut-off COP because the underlying circuit topologies of subcircuits $D F S 1$ and $D F S 2^{\prime}$ have not been considered while formulating the gate shut-off COP for the differential stage series. More specifically, it has not been considered that gnd can propagate to outp if $D F S 2^{\prime} / N 3$ is switched on in power-down mode.

Such redundant power-down transistors can be automatically detected by a depth-first search. It must be searched for conducting cycles starting from the ground or the supply node. If such a cycle exists and a power-down transistor lies on it, that power-down transistor is redundant and can safely be removed from the circuit.

### 4.3.2. Selection of the Optimal Solution

The individually computed power-down implementation variants for each subcircuit of a hierarchical design have to be combined to an overall solution. This can lead to a high number of implementation variants as all possible solutions have to be enumerated. E.g., consider a hierarchical circuit with two levels and six different subcircuit blocks. The gate

| $\mathbf{v}_{i}^{*}$ | $v_{i n}$ | $v_{i p}$ | $v_{n 1}$ | $v_{n 2}$ | $v_{\text {outn }}$ | $v_{\text {outp }}$ | sub-circuits |
| :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: |
| $\mathbf{v}_{1}^{*}$ | pullGnd | pullGnd | vdd | pullGnd | gnd | pullGnd | $D F S 1 \mathbf{v}_{1}^{*}, D F S 2^{\prime} \mathbf{v}_{1}^{*}$ |
| $\mathbf{v}_{2}^{*}$ | pullGnd | pullGnd | vdd | pullGnd | gnd | pullVdd | $D F S 1 \mathbf{v}_{1}^{*}, D F S 2^{\prime} \mathbf{v}_{2}^{*}$ |
| $\mathbf{v}_{3}^{*}$ | pullGnd | pullGnd | vdd | pullVdd | gnd | gnd | $D F S 1 \mathbf{v}_{2}^{*}, D F S 2^{\prime} \mathbf{v}_{3}^{*}$ |
| $\mathbf{v}_{4}^{*}$ | pullVdd | pullGnd | vdd | gnd | gnd | pullGnd | $D F S 1 \mathbf{v}_{3}^{*}, D F S 2^{\prime} \mathbf{v}_{1}^{*}$ |
| $\mathbf{v}_{5}^{*}$ | pullVdd | pullGnd | vdd | gnd | gnd | pullVdd | $D F S 1 \mathbf{v}_{3}^{*}, D F S 2^{\prime} \mathbf{v}_{2}^{*}$ |

Table 4.9.: Valid solutions of the gate shut-off COP for the differential stage series from Fig. 4.3 if the symmetry and matching information is not propagated between the bottom and the top-level of the circuit hierarchy. The solutions for differential stages $D F S 1$ and $D F S 2^{\prime}$ are shown in Tables 4.10 and 4.11, respectively.
shut-off COP computes four solutions for the top-level and five solutions for each of its subcircuit blocks. In the worst-case, $4 \cdot 5^{6}=62500$ solutions for the hierarchical design need to be examined. For designs with even more subcircuit blocks and hierarchy levels, the number of solutions quickly becomes unmanageably large.

Typically, only a small subset of hierarchical solutions is optimal for the given design. In the following, an efficient strategy to find the optimal solutions for hierarchical circuits is presented. Therefore, two scores based on the optimization targets "maximize matching" and "minimize area" are assigned to each solution $\mathbf{v}_{j}^{*}$ of each circuit $C_{i}$. The flat score

$$
\begin{equation*}
\operatorname{flatScore}\left(C_{i}, \mathbf{v}_{j}^{*}\right)=w_{1} \cdot \frac{\operatorname{matching}\left(\mathbf{v}_{j}^{*}\right)}{n_{\text {match }}}-w_{2} \cdot \frac{\operatorname{area}\left(\mathbf{v}_{j}^{*}\right)}{n_{\text {area }}} \tag{4.38}
\end{equation*}
$$

corresponds to the value of the target function of the gate shut-off COP from eq. (4.15) for solution $\mathbf{v}_{j}^{*}$ and considers only the current hierarchy level $C_{i}$. The target function of the gate shut-off COP is maximized by constraint programming. Hence, solutions with a high flat score should be preferred over solutions with a low flat score as more matching constraints are fulfilled in power-down mode while requiring less chip area. This is especially true for transistor level circuits as they do not contain any subcircuit blocks, i.e., their best solutions have the highest flat score.

The hierarchical score of a solution includes the subcircuit blocks of $C_{i}$ as follows:

$$
\begin{equation*}
\operatorname{hierScore}\left(C_{i}, \mathbf{v}_{j}^{*}\right)=\operatorname{flatScore}\left(C_{i}, \mathbf{v}_{j}^{*}\right)+\sum_{c_{s u b} \in C_{s u b, i}} \max _{v_{s u b, i, j}^{*} \in \mathcal{V}_{s u b, i}^{*}} \operatorname{hierScore}\left(c_{s u b}, \mathbf{v}_{c_{s u b, i, j}}^{*}\right) . \tag{4.39}
\end{equation*}
$$

It is recursively computed: only the best solutions for each subcircuit block of $C_{i}$, i.e., only sub-solutions with the highest hierarchical score for that subcircuit, are selected. These scores are then added to the flat score of the current hierarchy level. In this way, only optimal sub-solutions are propagated from the bottom to the top-level of the circuit hierarchy.

Table 4.9 shows the solutions of the gate shut-off COP for the differential stage series from Fig. 4.3. In this experiment, the information that the nets (in, ip), $(n 1, n 2)$ and (outn,outp) of the differential stage series should be matched in power-down mode has not been propagated from its subcircuits $D F S 1$ and $D F S 2^{\prime}$ to the top-level schematic. The gate shut-off COP has five valid solutions in this case, the other eleven computed solutions have been discarded as they would have caused a short-circuit path in the circuit.

| $\mathbf{v}_{i}^{*}$ | $v_{\text {in }}$ | $v_{i p}$ | $v_{n 1}$ | $v_{\text {outn }}$ | $v_{\text {outp }}$ | $v_{\text {supply }}$ | $v_{\text {ground }}$ |
| :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: |
| $\mathbf{v}_{1}^{*}$ | gnd | gnd | gnd | $v d d$ | gnd | $v d d$ | gnd |
| $\mathbf{v}_{2}^{*}$ | gnd | gnd | gnd | vdd | $v d d$ | $v d d$ | gnd |
| $\mathbf{v}_{3}^{*}$ | $v d d$ | gnd | gnd | $v d d$ | gnd | $v d d$ | gnd |

Table 4.10.: Power-down mode implementation variants for differential stage DFS1 required to implement the solutions of the differential stage series shown in Table 4.9.

| $\mathbf{v}_{i}^{*}$ | $v_{\text {in }}$ | $v_{i p}$ | $v_{n 1}$ | $v_{\text {nrip } 1}$ | $v_{\text {outn }}$ | $v_{\text {outp }}$ | $v_{\text {pwd }}$ | $v_{\text {supply }}$ | $v_{\text {ground }}$ |
| :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: |
| $\mathbf{v}_{1}^{*}$ | gnd | vdd | gnd | pullVdd | gnd | gnd | $v d d$ | $v d d$ | $g n d$ |
| $\mathbf{v}_{2}^{*}$ | gnd | vdd | gnd | pullVdd | gnd | $v d d$ | $v d d$ | $v d d$ | gnd |
| $\mathbf{v}_{3}^{*}$ | vdd | vdd | gnd | pullVdd | gnd | gnd | $v d d$ | $v d d$ | gnd |

Table 4.11.: Power-down mode implementation variants for differential stage $D F S 2^{\prime}$ required to implement the solutions of the differential stage series shown in Table 4.9.

Tables 4.10 and 4.11 show the solutions required to implement the different power-down mode implementation variants of the differential stage series from Table 4.9. The solutions for $D F S 1$ have thereby been computed by voltage propagation as there is no floating node in the circuit after setting the pin voltages of $D F S 1$ according to $\mathbf{v}_{1}^{*}-\mathbf{v}_{5}^{*}$ of the differential stage series. For $D F S 2^{\prime}$, the gate shut-off COP must be solved as nrip 1 is always floating. Solutions $\mathbf{v}_{1}^{*}$ and $\mathbf{v}_{3}^{*}$ of $D F S 1$ and solutions $\mathbf{v}_{1}^{*}$ and $\mathbf{v}_{2}^{*}$ of $D F S 2^{\prime}$ have thereby been reused for different solutions of the differential stage series which saves computational effort.

Table 4.12 shows the flat and hierarchical scores of the COP solutions of the differential stage series from Table 4.9 including the scores for its subcircuits $D F S 1$ and $D F S 2^{\prime}$. Area and matching are equally weighted, i.e., $w_{1}=w_{2}=1$. The flat scores on the top-level circuit are all negative as there is no matching or symmetry information available and three or four power-down transistors are required to implement the respective solution. The underlying sub-circuits $D F S 1$ and $D F S 2^{\prime}$ have mostly positive scores, with the exceptions of $D F S 1 \mathbf{v}_{3}^{*}$ and $D F S 2^{\prime} \mathbf{v}_{2}^{*}$. These solutions do not fulfil any matching constraint and require up to one power-down transistor for their implementation. The best solution for $D F S 1$ is $D F S 1 \mathbf{v}_{2}^{*}$ as it fulfils all six matching constraints in power-down mode and no power-down transistor is inserted into the circuit. Its flat score is

$$
\begin{equation*}
\operatorname{flatScore}\left(D F S 1, \mathbf{v}_{2}^{*}\right)=\operatorname{hierScore}\left(D F S 1, \mathbf{v}_{2}^{*}\right)=6.0 / 6.0=1.0 \tag{4.40}
\end{equation*}
$$

which also corresponds to its hierarchical score as it does not contain any subcircuit blocks. The best solution for $D F S 2^{\prime}$ is $D F S 2^{\prime} \mathbf{v}_{3}^{*}$ with a flat and hierarchical score of 0.0 as it also fulfils all matching constraints and is implemented by one power-down transistor. The solution $S D F S \mathbf{v}_{3}^{*}$ of the differential stage series instantiates the two optimal subcircuit solutions $D F S 1 \mathbf{v}_{2}^{*}$ and $D F S 2^{\prime} \mathbf{v}_{3}^{*}$ which yields the highest hierarchical score of the five top-level solutions:

$$
\begin{equation*}
\operatorname{hierScore}\left(S D F S, \mathbf{v}_{3}^{*}\right)=-0.75+1.0+0.0=0.25 \tag{4.41}
\end{equation*}
$$

This solution is the optimal power-down mode implementation variant for the whole hierarchical circuit. It corresponds to the solution shown in Fig. 4.4 which has been directly

## 4. Hierarchical Power-Down Synthesis

| solution | matching | area | subcircuits | scores |  |
| :---: | :---: | :---: | :--- | :---: | :---: |
|  | $n_{o k} / n_{\text {match }}$ | $n_{\text {pwd }} / n_{\text {float }}$ |  | flat | hier |
| $S D F S \mathbf{v}_{1}^{*}$ | - | $4 / 4$ | $D F S 1 \mathbf{v}_{1}^{*}, D F S 2^{\prime} \mathbf{v}_{1}^{*}$ | -1.0 | -1.0 |
| $S D F S \mathbf{v}_{2}^{*}$ | - | $4 / 4$ | $D F S 1 \mathbf{v}_{1}^{*}, D F S 2^{\prime} \mathbf{v}_{2}^{*}$ | -1.0 | -1.66 |
| $S D F S \mathbf{v}_{3}^{*}$ | - | $3 / 4$ | $D F S 1 \mathbf{v}_{2}^{*}, D F S 2^{\prime} \mathbf{v}_{3}^{*}$ | -0.75 | 0.25 |
| $S D F S \mathbf{v}_{4}^{*}$ | - | $3 / 4$ | $D F S 1 \mathbf{v}_{3}^{*}, D F S 2^{\prime} \mathbf{v}_{1}^{*}$ | -0.75 | -1.08 |
| $S D F S \mathbf{v}_{5}^{*}$ | - | $3 / 4$ | $D F S 1 \mathbf{v}_{3}^{*}, D F S 2^{\prime} \mathbf{v}_{2}^{*}$ | -0.75 | -1.75 |
| $D F S 1 \mathbf{v}_{1}^{*}$ | $2 / 6$ | - | - | 0.33 | 0.33 |
| $D F S 1 \mathbf{v}_{2}^{*}$ | $6 / 6$ | - | - | 1.0 | 1.0 |
| $D F S 1 \mathbf{v}_{3}^{*}$ | $0 / 6$ | - | - | 0.0 | 0.0 |
| $D F S 2^{\prime} \mathbf{v}_{1}^{*}$ | $4 / 6$ | $1 / 1$ | - | 0.66 | -0.33 |
| $D F S 2^{\prime} \mathbf{v}_{2}^{*}$ | $0 / 6$ | $1 / 1$ | - | -1.0 | -1.0 |
| $D F S 2^{\prime} \mathbf{v}_{3}^{*}$ | $6 / 6$ | $1 / 1$ | - | 0.0 | 0.0 |

Area and matching equally weighted, i.e., $w_{1}=w_{2}=1$.
Table 4.12.: Flat and hierarchical score for the power-down mode implementation variants of the differential stage series from Table 4.9.
computed by the COP as the matching and symmetry information of the underlying hierarchy levels has been propagated to the top-level schematic. This example emphasizes the importance of sharing matching conditions between hierarchy levels as this additional information will reduce the solution space of the top-level COP significantly which avoids the exploration of suboptimal and invalid implementation variants and hence reduces the computational effort of the power-down synthesis method.

### 4.3.3. Computational Complexity

The hierarchical synthesis approach splits the power-down synthesis problem for a hierarchical design into smaller sub-problems. This reduces the overall complexity of the power-down synthesis problem compared to the approach using circuit flattening as will be shown in the following.
In general, the cardinality $|\mathcal{L}|$ of the COP's solution space is bound by $4^{|N|}$ as the domain of each net of the circuit contains four different voltage levels. The size of the solution space reduces to $4^{|N|-\left|N_{\text {supply }}\right|-\left|P_{\text {top, known }}\right|}$ as the voltage levels of the supply nets $N_{\text {supply }}$ are known and the user may specify voltages at a subset of the top-level pins $P_{\text {top }, \text { known }} \subseteq P_{\text {top }}$ of the circuit.

The supply nets and the pins of the subcircuits are shorted with the supply nets and internal
nets of the above hierarchy level during circuit flattening, which yields:

$$
\begin{align*}
\left|N_{\text {flat }}\right| & =\underbrace{\left|N_{\text {top }} \cup N_{\text {supply }}\right|}_{\text {top-level and supply nets }}+\underbrace{\sum_{t_{i} \in T} y_{i} \cdot\left(\left|N_{i}\right|-\left|N_{\text {supply }, i}\right|-\left|P_{i}\right|\right)}_{\text {subcircuits, supply nets and pins shorted }} \\
\left|\mathcal{L}_{\text {flat }}\right| & =4^{\left|N_{\text {flat }}\right|-\left|N_{\text {supply }}\right|-\left|P_{\text {top }, \text { known }}\right|}  \tag{4.42}\\
& =4^{\left|N_{\text {top }} \cup N_{\text {suppply }}\right|+\sum_{t_{i} \in T} y_{i} \cdot\left(\left|N_{i}\right|-\mid N_{\text {supply, }, i-\left|-\left|P_{i}\right|\right)-\left|N_{\text {supply }}\right|-\left|P_{\text {top,known }}\right|}\right.} \\
& =4^{\left|N_{\text {top }} \backslash N_{\text {supply }}\right|-\left|P_{\text {top, known }}\right|+\sum_{t_{i} \in T} y_{i} \cdot\left(\left|N_{i}\right|-\left|N_{\text {supply }, i}\right|-\left|P_{i}\right|\right)}
\end{align*}
$$

with $y_{i}$ as the total number of occurrences of a specific subcircuit type $t_{i} \in T$ in the hierarchical circuit. The supply nets are first made globally available on the top-level and are then shorted with their corresponding occurrences in the subcircuit blocks. The solution space of the COP of the flat power-down synthesis approach can be reduced to

$$
\begin{equation*}
\left|\mathcal{L}_{\text {flat }}^{\prime}\right|=4^{\left|N_{\text {floot }, \text { flat }}\right|} \tag{4.43}
\end{equation*}
$$

if it is formulated based on the voltage propagation result for the flattened circuit.
The hierarchical approach presented by this work preserves the circuit's hierarchy and reuses intermediate results whenever possible. Starting with the top-level, it formulates and solves the COP from eq. (4.15) for each hierarchy level and subcircuit individually, which partitions the overall synthesis problem into smaller sub-tasks:

$$
\begin{equation*}
\left|\mathcal{L}_{\text {hier }}\right|=\underbrace{4^{\left|N_{\text {top }}\right|-\left|N_{\text {supply,top }}\right|-\left|P_{\text {top }, \text { known }}\right|}}_{\text {top-level COP }}+\underbrace{\sum_{t_{i} \in T} x_{i} \cdot 4^{\left(\left|N_{i}\right|-\left|N_{\text {suppl }}, i\right|-\left|P_{i}\right|\right)}}_{\text {individual subcircuit COPs }} \tag{4.44}
\end{equation*}
$$

with $x_{i}$ as the number of different pin voltage combinations for which the COP has to be solved for a subcircuit of type $t_{i} \in T$. The supply nets are in this case individually treated for each hierarchy level and subcircuit.

As the COP is formulated based on the floating nodes determined by voltage propagation, eq. (4.44) can be simplified to:

$$
\begin{equation*}
\left|\mathcal{L}_{\text {hier }}^{\prime}\right|=\underbrace{4^{\left|N_{\text {float,top }}\right|}}_{\text {top-level COP }}+\underbrace{\sum_{t_{i} \in T} x_{i} \cdot 4^{\left(\left|N_{\text {float }, i}\right|\right)}}_{\text {individual subcircuit COPs }} \tag{4.45}
\end{equation*}
$$

with

$$
\begin{align*}
& 0 \leq\left|N_{\text {float,top }}\right| \leq\left|N_{\text {top }}\right|-\left|N_{\text {supply }, \text { top }}\right|-\left|P_{\text {top }, \text { known }}\right|  \tag{4.46}\\
& 0 \leq\left|N_{\text {float }, i}\right| \leq\left|N_{i}\right|-\left|N_{\text {supply }, i}\right|-\left|P_{i}\right| .
\end{align*}
$$

The comparison of eqs. (4.42) and (4.44) shows that $\left|\mathcal{L}_{\text {hier }}\right|$ is significantly smaller than $\left|\mathcal{L}_{\text {flat }}\right|$ when the gate shutoff COPs of the flat and hierarchical approach are formulated without considering voltage propagation results. The cardinality of the solution space of the new hierarchical approach is typically still smaller compared to the one of the flat approach when


Figure 4.5.: Analysis and synthesis results for a current mirror operational amplifier [40].
considering voltage propagation results. The flat approach sees all floating nodes at once while the hierarchical approach sees them distributed amongst several hierarchy levels for which it formulates individual COPs. However, it is important to propagate matching and symmetry conditions between the hierarchy levels, as the solution space on the higher levels is less constrained than on device level which could lead to the exploration of an unnecessary large number of sub-optimal solutions.

The differential stage series from Fig. 4.3 has 12 nets in total after circuit flattening. The net voltages of the ground and supply net and the voltage of the power-down signal $p w d$ are thereby already known. The size of the solution space of the flat COP is hence bound by $\left|\mathcal{L}_{\text {flat }}\right|=4^{9}=262144$. Voltage propagation would detect five floating nodes in the flat circuit, i.e., in, ip, n2, DFS2'/nrip1 and outp. Hence the solution space of the flat COP could be reduced to $\left|\mathcal{L}_{\text {flat }}^{\prime}\right|=4^{5}=1024$.
The top-level of the differential stage series has four floating nets according to the voltage propagation results from Table 4.1. The gate shutoff COP solution space of the differential stage series is hence bound by $\left|\mathcal{L}_{S D F S}^{\prime}\right|=4^{4}=256$. The corresponding COP has only one valid solution, hence the gate shut-off COP has to be formulated and solved for the differential stages $D F S 1$ and $D F S 2^{\prime}$ only once. $D F S 1$ has no floating nets according to Table 4.5 and hence $\left|\mathcal{L}_{D F S 1}^{\prime}\right|=4^{0}=1$. $D F S 2^{\prime}$ has one floating net according to Table 4.6 and hence $\left|\mathcal{L}_{D F S 2^{\prime}}^{\prime}\right|=4^{1}=4$. The overall size of the solution space for the hierarchical circuit is then $\left|\mathcal{L}_{\text {hier }}^{\prime}\right|=\left|\mathcal{L}_{S D F S}^{\prime}\right|+\left|\mathcal{L}_{D F S 1}^{\prime}\right|+\left|\mathcal{L}_{D F S 2^{\prime}}^{\prime}\right|=4^{4}+1+4=261$, which is significantly smaller compared to the flat approach.

### 4.4. Experimental Results

In the following, experimental results for five different circuits are presented. The first three results demonstrate the advantages of a unified constraint programming approach compared to the two-stage approach presented by [10]. Afterwards, synthesis results for two hierarchical circuits are presented.
\(\left.\begin{array}{c|ccccccccc}solution \& v_{in} \& v_{i p} \& v_{out} \& v_{v b 1} \& v_{v b 2} \& v_{v b 3} \& v_{n 1} \& v_{n 2} <br>
\hline new \& pullVdd \& pullVdd \& gnd \& pullVdd \& pullVdd \& pullVdd \& gnd \& gnd <br>

{[10]} \& pullVdd \& pullVdd \& pullGnd \& pullVdd \& pullVdd \& pullVdd \& gnd \& gnd\end{array}\right]\)|  |  |  |  |  |  |  |  |  |  |
| :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: |
| solution | $v_{n 3}$ | $v_{n 4}$ | $v_{n 5}$ | $v_{n 6}$ | $v_{n 7}$ | $v_{n 8}$ | $v_{n 9}$ | $v_{n 10}$ |  |
| new | gnd | vdd | $v d d$ | $v d d$ | $v d d$ | gnd | pullGnd | pullGnd |  |
| by $[10]$ | gnd | $v d d$ | $v d d$ | $v d d$ | $v d d$ | gnd | pullGnd | gnd |  |

Table 4.13.: Power-down synthesis results of the new method from Chap. 4 and the approach by [10] for the current mirror operational amplifier.

| building block | matched Nets | eq. (4.20) |  | eq. (4.26) |  |
| :---: | :---: | :---: | :---: | :---: | :---: |
|  |  | $[10]$ | new | $[10]$ | new |
| $d p(N 2, N 3)$ | $($ in, ip $)$ | $\checkmark$ | $\checkmark$ | $\checkmark$ | $\checkmark$ |
|  | $(n 2, n 3)$ | $\checkmark$ | $\checkmark$ | $\checkmark$ | $\checkmark$ |
| $w \operatorname{sccm}(P 1, P 4, P 5, P 8)$ | $(n 4, n 7)$ | $\checkmark$ | $\checkmark$ | $\checkmark$ | $\checkmark$ |
|  | $(n 2$, out $)$ | $\checkmark$ | $\checkmark$ | $\times$ | $\checkmark$ |
| $w \operatorname{sccm}(P 2, P 3, P 6, P 7)$ | $(n 5, n 6)$ | $\checkmark$ | $\checkmark$ | $\checkmark$ | $\checkmark$ |
|  | $(n 3, n 8)$ | $\checkmark$ | $\checkmark$ | $\checkmark$ | $\checkmark$ |
| $w \operatorname{sccm}(N 4, N 5, N 6, N 7)$ | $(n 8$, out $)$ | $\checkmark$ | $\checkmark$ | $\times$ | $\checkmark$ |
|  | $(n 9, n 10)$ | $\checkmark$ | $\checkmark$ | $\times$ | $\checkmark$ |

Table 4.14.: Comparison of the building block matching results for the current mirror operational amplifier for the new synthesis method from Chap. 4 and the approach by [10].

### 4.4.1. Current Mirror Operational Amplifier

Fig. 4.5 shows the schematic of a current mirror operational amplifier [40]. The left side of the figure shows the building blocks and symmetry pairs of the circuit, the right side shows it augmented by additional power-down circuitry computed by the COP from Sec. 4.3 and [10].

Structure recognition identifies one differential pair, $d p(N 2, N 3)$, and three wide-swing cascode current mirrors, $w \operatorname{sccm}(P 1, P 4, P 5, P 8), w s c c m(P 2, P 3, P 6, P 7)$, $w s c c m(N 4-N 7)$ in the circuit. Furthermore, symmetry analysis computes seven symmetrical device pairs in the circuit. The devices are labelled such that $(N i, N i+1)$ with $i=2,4,6$ and $(P j, P j+1)$ with $j=1,3,5,7$ form a symmetry pair.

The new power-down synthesis method and the approach by [10] each compute one implementation variant for the given circuit as shown in Table 4.13. The differences between the two solutions are highlighted in red. The two solutions fulfil all generated basic building block matching constraints according to eq. (4.20) as shown in Table 4.14, i.e., matched pin pairs are exposed to the same voltage level in power-down mode. However, the new approach additionally ensures that the power-down transistors are inserted symmetrically at wide-swing cascode current mirrors $w s c c m(P 1, P 4, P 5, P 8)$ and $\operatorname{wsccm}(N 4, N 5, N 6, N 7)$ by

## 4. Hierarchical Power-Down Synthesis



Figure 4.6.: Basic building blocks and symmetry pairs of a low resistance operational amplifier [41].
fulfilling eq. (4.26). The old approach by [10] does not consider symmetrical placement constraints of power-down transistors at matched structures and hence computes four different implementation variants for the shown voltage values in its second constraint optimization problem. From those four solutions, one is arbitrarily chosen, leading to the shown asymmetric transistor placement. Hence, the constraints generated by eq. (4.26) help to reduce the number of power-down mode implementation variants further.

This effect can also be observed in the evaluation of the symmetry pair matching constraints. In general, the approach by [10] does not consider symmetry pair constraints. The computed solution by [10] still fulfils all generated constraints according to eq. (4.24) which usually cannot be guaranteed. However, the power-down transistors are not placed symmetrically at $(N 6, N 7)$ and $(P 7, P 8)$ compared to the solution of the new approach. Both solutions are implemented by seven power-down transistors.

### 4.4.2. Low Resistance Operational Amplifier

Fig. 4.6 shows the schematic of a low resistance operational amplifier [41]. Structure recognition (App. A) identifies one simple current mirror, one differential pair and three wide-swing cascode current mirrors in the circuit. Furthermore, seven symmetry pairs are detected by symmetry analysis (App. B).

Fig. 4.7 shows two power-down mode implementation variants computed by the new COP. For the first solution, the weights $w_{1}$ and $w_{2}$ have been chosen such that matching is prioritized over area. It fulfils about $80 \%$ of the generated matching and $100 \%$ of the generated symmetry constraints and can be implemented by 15 power-down transistors.

The second solution minimizes the area requirements of the power-down circuitry. It is implemented by 13 transistors, two transistors less compared to the first solution at the cost


Figure 4.7.: Power-down synthesis results for the low resistance operational amplifier from Fig. 4.6.
of sacrificing about $10 \%$ of matching, i.e., the constraint at the drain pins of the simple current mirror $\operatorname{scm}(N 1, N 2)$ is not fulfilled any more.

The solution computed by [10] corresponds to the one shown in Fig. 4.7 as it always maximizes matching before determining the placement of the power-down transistors. Hence, the approach by [10] cannot compute the second solution which might be more relevant for area critical designs.

### 4.4.3. Fully Differential Operational Amplifier

Fig. 4.8 shows several analysis and synthesis results for a fully differential operational amplifier [42]. It contains one differential pair, two bias shifters and two bias shifter banks.

The devices of the circuit are labelled such that $(N i, N i+1)$ with $i=1,3, \ldots, 11,(P j, P j+1)$ with $j=1,3, \ldots, 19$ and $(C 1, C 2)$ form a symmetry pair.

Power-down synthesis has been repeated three times for this circuit with different weights. The first solution (Fig. 4.8a) maximizes matching and fulfils all generated matching conditions. It requires twelve power-down transistors for its implementation which have been placed symmetrically into the operational amplifier.

The second solution (Fig. 4.8b) minimizes the area of the power-down circuitry and requires nine transistors. These transistors are also inserted symmetrically into the circuit, however, only about $45 \%$ percent of the matching conditions are fulfilled.

The last solution (Fig. 4.8c) represents a trade-off between the two design goals "maximize matching" and "minimize area". With two more additional power-down transistors, i.e., 11 in total, about $82 \%$ of the generated matching conditions can be fulfilled, an increase by $37 \%$ compared to the minimum area solution.

## 4. Hierarchical Power-Down Synthesis


(a) Building blocks and symmetry pairs.

(b) Maximize matching.
(c) Minimize area.


(d) Trade-off.

Figure 4.8.: Analysis and synthesis results for a fully differential operational amplifier [42].


Figure 4.9.: Symmetry and matching constraints of the dual gain amplifier from Fig. 3.6.

The second and third solution can only be computed by the unified constraint optimization problem formulated by eq. (4.15), hence, providing a designer more freedom in the selection of suitable power-down circuitry for the target application compared to [10].

### 4.4.4. Dual Gain Amplifier

In the following, the power-down circuitry for the dual gain amplifier from Fig. 4.9 with its two folded cascode amplifiers FOCA1 and FOCA2 from Fig. 4.10 is synthesized. The dual gain amplifier has four problematic current paths in its bias circuitry:

- $P 1=(R 3, N 1)$
- $P 2=(R 4, N 2)$
- $P 3=(R 5, N 3, N 4)$
- $P 4=(R 6, N 5, N 6)$

These have been detected by voltage propagation and short circuit path computation from Secs. 3.2 and 3.3. All paths run over at least one diode-connected transistor. Hence, a diode rip-up is applied to each of them. The transistors $N P 1, N P 2, N P 3$ and $N P 4$ are inserted by the rip-up heuristic into the dual gain amplifier to disable the diode-configurations of $N 1$, $N 2, N 3$ and $N 5$ in power-down mode, respectively. The heuristic thereby chooses $N 3$ and $N 5$ over $N 4$ and $N 6$ as rip-up points as each of these transistors is part of a level shifter. Hence, disabling the diode-configurations of $N 3$ and $N 5$ in power-down mode switches off


Figure 4.10.: Symmetry and matching constraints for $F O C A 1$ and $F O C A 2$ from Fig. 4.9.


Figure 4.11.: Bias circuitry of the dual gain amplifier from Fig. 4.9a after rip-up and gate shut-off.

|  | core circuit |  |  |  |  | ripped-up bias circuit |  |  |  |
| :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: |
| $\mathbf{v}_{i}^{*}$ | $v_{\text {in }}$ | $v_{\text {out }}$ | $v_{n 1}$ | $v_{n 2}$ | $v_{n X 1}$ | $v_{n X 2}$ | $v_{n 6}$ | $v_{n 8}$ |  |
| ${\mathbf{\mathbf { v } _ { \mathbf { 1 } }}}^{*}$ | pullGnd | pullGnd | gnd | gnd | $v d d$ | pullGnd | gnd | gnd |  |
| $\mathbf{v}_{\mathbf{2}}{ }^{*}$ | pullVdd | pullVdd | vdd | vdd | $v d d$ | pullGnd | gnd | gnd |  |

Table 4.15.: Solutions of the gate shut-off COP for the dual gain amplifier for its core circuit and its ripped-up bias circuits from Figs. 3.8b and 4.11.

| $\mathbf{v}_{i}^{*}$ | $v_{i n}$ | $v_{i p}$ | $v_{o u t}$ | $v_{v b 1}$ | $v_{v b 2}$ | $v_{n 1}$ | $v_{n 2}$ | $v_{n 3}$ | $v_{n 4}$ | $v_{n 5}$ | $v_{n 6}$ |
| :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: |
| $\mathbf{v}_{\mathbf{1}}{ }^{*}$ | gnd | gnd | gnd | gnd | gnd | pullVdd | vdd | pullVdd | vdd | gnd | gnd |
| $\mathbf{v}_{\mathbf{2}}{ }^{*}$ | vdd | vdd | vdd | gnd | gnd | vdd | vdd | vdd | vdd | gnd | gnd |

Table 4.16.: Synthesized implementation variants for the folded cascode amplifiers FOCA1 and FOCA2 from Fig. 3.9.
more current paths in the dual gain amplifier. Gate shut-off afterwards inserts power-down transistors NP5-NP8 in each solution of the gate shut-off COP of the dual gain amplifier into the circuit such that $N 1, N 2, N 3$ and $N 5$ are forced into the non-conducting state in power-down mode. The resulting power-down circuitry for the bias circuit of the dual gain amplifier is shown in Fig. 4.11.

The two folded cascode amplifiers $F O C A 1$ and $F O C A 2$ from Fig. 4.10 remain unchanged after rip-up as no problematic current path runs through them.

Tables 4.15 and 4.16 show the power-down mode implementation variants computed by the gate shut-off COP for the ripped-up dual gain amplifier and its folded cascode amplifiers $F O C A 1$ and FOCA2. The symmetry and matching constraints have thereby been generated according to Figs. 4.9 and 4.10. The shown solutions fulfil all generated matching constraints and can be implemented with a minimum number of power-down transistors on their respective hierarchy level.

Fig. 4.12 shows the schematics corresponding to the solutions $\mathbf{v}_{1}^{*}$ and $\mathbf{v}_{2}^{*}$ of the dual gain amplifier (DGA) and the folded cascode amplifier (FOCA), respectively. Their corresponding flat and hierarchical scores are annotated in the table at the bottom of the figure.

Solutions $\mathbf{v}_{1}^{*}$ and $\mathbf{v}_{2}^{*}$ of the dual gain amplifier fulfil all matching constraints and can be implemented with four gate shut-off transistors, yielding a flat score of $\frac{32}{32}-\frac{6}{8}=0.25$ under the assumption that the design goals "maximize matching" and "minimize area" are equally weighted, i.e., $w_{1}=w_{2}=1$.

Both computed solutions for the folded cascode amplifier also fulfil all matching constraints. However, two power-down transistors are required to implement $\mathbf{v}_{1}^{*}$ and none for $\mathbf{v}_{2}^{*}$. Hence, the flat score of $\mathbf{v}_{1}^{*}$ is significantly lower than the one for $\mathbf{v}_{2}^{*}$, i.e., 0.0 compared to 1.0.

The dual gain amplifier solution $\mathbf{v}_{1}^{*}$ instantiates the folded cascode amplifier solution $\mathbf{v}_{1}^{*}$ two times, resulting in an overall hierarchical score of $0.25+2 \cdot 0.0=0.25$. The second solution of the dual gain amplifier implements the power-down circuitry of FOCA1 and FOCA2 with their second solution $\mathbf{v}_{2}^{*}$, yielding a hierarchical score of 2.25 .

Hence, $\mathbf{v}_{2}^{*}$ is the optimal solution for the overall circuit as it can be implemented with fewer power-down transistors compared to solution $\mathbf{v}_{1}^{*}$.

## 4. Hierarchical Power-Down Synthesis



DGA solution $\mathbf{v}_{1}^{*}$


FOCA solution $\mathbf{v}_{1}^{*}$


DGA solution $\mathbf{v}_{2}^{*}$


FOCA solution $\mathbf{v}_{2}^{*}$

| solution | matching | area | subcircuits | scores |  |
| :---: | :---: | :---: | :---: | :---: | :---: |
|  | $n_{\text {ok }} / n_{\text {match }}$ | $n_{\text {pwd }} / n_{\text {float }}$ |  | flat | hier |
| DGA $\mathbf{v}_{1}^{*}$ | $32 / 32$ | $6 / 8$ | 2 x FOCA $\mathbf{v}_{1}^{*}$ | 0.25 | 0.25 |
| DGA $_{2}^{*}$ | $32 / 32$ | $6 / 8$ | 2 x FOCA $\mathbf{v}_{2}^{*}$ | 0.25 | 2.25 |
| FOCA $\mathbf{v}_{1}^{*}$ | $26 / 26$ | $2 / 2$ | - | 0.0 | 0.0 |
| FOCA $\mathbf{v}_{2}^{*}$ | $26 / 26$ | - | - | 1.00 | 1.00 |

Figure 4.12.: Valid power-down mode implementation variants for the dual gain amplifier (DGA) and folded cacscode amplifier (FOCA) from Tables 4.15 and 4.16, and their corresponding flat and hierarchical scores.


Figure 4.13.: Basic building blocks and symmetry pairs of a high input impedance differential amplifier (HIIDA) [38] and its Miller operational transconductance amplifier (OTA) subcircuit blocks.

### 4.4.5. High Input Impedance Differential Amplifier

In Sec. 3.5, the power-down mode of the high input impedance differential amplifier (HIIDA) from Fig. 3.10 has been verified. In the following, fault-free power-down circuitry is synthesized for the original circuit from Fig. 4.13. The supply nets are given as $N_{\text {supply }}=$ $N_{\text {vdd }} \cup N_{\text {gnd }}=\{$ supply, ground $\}$ and the input and output pins are left floating.
In the first step, three definite short circuit paths,

$$
\begin{equation*}
P_{<1,2,3>}=(M I L L E R<1,2,3>/ P 1, M I L L E R<1,2,3>/ R 1), \tag{4.47}
\end{equation*}
$$

are identified. The diode-rip up pattern is applied to these paths as $P 1$ is diode-connected. The rip-up is performed only once for the Miller operational transconductance amplifier (OTA) and the modified circuit is used to replace the remaining two Miller OTAs in the original circuit.

The gate shut-off procedure computes one optimal solution $\mathbf{v}_{1}^{*}$ for the HIIDA, using the matching and symmetry information from Fig. 3.11. Furthermore, a second sub-optimal solution $\mathbf{v}_{2}^{*}$ is provided to emphasize on the importance of propagating structural and symmetry information between circuit hierarchy levels: the symmetry conditions at the differential inputs of the Miller OTAs have not been made available on the top-level for that solution. The synthesized power-down circuitries for each solution are highlighted in blue in the schematics, their corresponding flat and hierarchical scores according to eqs. (4.39) and (4.38) are displayed in the table underneath them.

The first solution is fully symmetric, i.e., all symmetry constraints are fulfilled. The symmetry conditions at the differential inputs of MILLER1, MILLER2 and MILLER3 ensure that in and $i p$ of the high input impedance differential amplifier are pulled to gnd, yielding only one solution for the top-level. The pins of the three Miller OTAs are all exposed to gnd. Hence, gate shut-off has to be performed only once for the Miller OTA which results in Miller

## 4. Hierarchical Power-Down Synthesis



| solution | matching | area | subcircuits | scores |  |
| :---: | :---: | :---: | :---: | :---: | :---: |
|  | $n_{o k} / n_{\text {match }}$ | $n_{p w d} / n_{\text {float }}$ |  | flat | hier |
| HIIDA $\mathbf{v}_{1}^{*}$ | $12 / 12$ | $2 / 2$ | 3 M Miller OTA $\mathbf{v}_{1}^{*}$ | 0.0 | 0.0 |
| HIIDA $\mathbf{v}_{2}^{*}$ | $10 / 12$ | $2 / 2$ | 1x Miller OTA $\mathbf{v}_{1}^{*}$, | -0.167 | -0.475 |
|  |  |  | 2 2 Miller OTA $\mathbf{v}_{2}^{*}$ |  |  |
| Miller OTA $\mathbf{v}_{1}^{*}$ | $13 / 13$ | $1 / 1$ | - | 0.0 | 0.0 |
| Miller OTA $\mathbf{v}_{2}^{*}$ | $11 / 13$ | $2 / 2$ | - | -0.154 | -0.154 |

Figure 4.14.: Synthesis results for the high input impedance differential amplifier (HIIDA) and its Miller operational transconductance amplifiers (OTAs).

OTA solution $\mathbf{v}_{1}^{*}$. The computed solution fulfils all matching and symmetry constraints and its power-down circuitry is implemented by only one gate shut-off transistor which shuts off the PMOS current mirror bank $\operatorname{scm}(P 1, P 2, P 3)$ in power-down mode. Its corresponding flat score is $\frac{13}{13}-\frac{1}{1}=0.0$ and the hierarchical score of solution $\mathbf{v}_{1}^{*}$ of the high input impedance differential amplifier evaluates to $0.0+3 \cdot 0.0=0.0$.

In a second experiment, the information that the differential inputs of the Miller OTAs should be matched was not propagated to the top-level. In this case, a second solution $\mathbf{v}_{2}^{*}$ is computed for the high input impedance differential amplifier which pulls its input pins in and $i p$ to $v d d$. Now, gate shut-off has to be performed twice for the Miller OTAs: the first time with input voltages $V_{P, M I L L E R 3}=\left(v_{i n}, v_{i p}, v_{o u t}\right)=(g n d, g n d, g n d)$ and the second time with $V_{P, M I L L E R<1,2>}=\left(v_{i n}, v_{i p}, v_{\text {out }}\right)=(g n d, v d d, g n d)$. The solution for $V_{P, M I L L E R 3}$ corresponds to Miller OTA solution $\mathbf{v}_{1}^{*}$, the one for $V_{P, M I L L E R<1,2>}$ is shown as Miller OTA solution $\mathbf{v}_{2}^{*}$ on the bottom right of Fig. 4.14. Here, the problem of not propagating matching and symmetry conditions throughout the hierarchy levels becomes obvious: the differential pair $d p(P 4, P 5)$ is stressed and two gate shut-off transistors are required two implement this solution. Its flat score is -0.154 and the hierarchical score of high input impedance differential amplifier solution $\mathbf{v}_{2}^{*}$ computes to -0.475 . This solution is discarded in favor of $\mathbf{v}_{1}^{*}$ during the later selection process, but this solution would have never been computed in the first place if the structural and symmetry information would have been made available between the two hierarchy levels.

Hence, this second experiment demonstrates another advantage of propagating structural information between different hierarchy levels for power-down synthesis: sub-optimal solutions are not further explored which reduces the computational complexity of the implemented algorithms.
4. Hierarchical Power-Down Synthesis

## 5. Library-Free Structure Recognition

The power-down mode verification and synthesis methods presented in Chaps. 3 and 4 make use of the structure recognition method described in App. A. Structure recognition identifies analog basic building blocks which require matching, e.g., differential pairs and current mirrors. Power-down verification and power-down synthesis evaluate the identified matching conditions to detect faulty or to construct faultless power-down circuitry.

The structure recognition method described in App. A uses a predefined building block library which is similar to the one of the enhanced sizing rules method presented in [10]. Fig. 5.1 shows a small excerpt of the building block library presented in App. A.

The array library $L_{\text {array }}$ contains parallelly connected devices of the same type which show the same electrical behavior as a device of their combined device dimensions. The pair library $L_{\text {pair }}$ is organized hierarchically in ranks. Rank one contains analog building blocks which are formed by two arrays, e.g., a simple current mirror and a bias shifter. The higher ranks contain building blocks which are formed by two lower ranked elements, e.g., a cascode current mirror consists of a bias shifter and simple current mirror.

Apart from power-down verification and synthesis, the structure recognition results of this library can also be used to generate sizing constraints for numerical circuit optimization tools [4] and placement constraints for layout generators [17], which improve the quality of the given designs significantly.

In [10], a building block library dedicated to track the power-down signals through inverters, ESD protection circuitry and simple level shifters was presented. The results are used to partition the circuit into its digital power-down circuitry and its original analog circuit topology. This method has been adapted to hierarchical designs as described in Sec. 3.4.1.

These examples illustrate the importance of structure recognition in analog circuit design. Structure recognition has one main limitation: the size and the scope of the provided building block library. The afore mentioned libraries contain up to 24 elements of up to six devices. Additionally, the libraries were hand-crafted: the building blocks, their recognition rules and their organization in ranks had to be manually defined. This process is tedious and error prone as forgetting to specify only one recognition rule can lead to wrong structure recognition results.

The reliable identification of larger circuit topologies, e.g., operational amplifiers or level converters, can play an important role for circuit verification and circuit synthesis. Electrostatic discharge (ESD) events can cause permanent damage at interface circuits between different voltage domains [43]. Hence, detecting circuit parts susceptible to ESD damage in a chip and automated verification checks of its implemented ESD protection circuitry can speed up the overall design process of that chip.


Figure 5.1.: Excerpt of the basic building block library from App. A.

### 5.1. State of the Art

To overcome the limitation of structure recognition, a given building block library could be extended by adding new building blocks manually to it, until a specific topology, e.g., a Miller OTA, can be identified. Instead, automatic approaches could be used.
In [44], new topological features of an analog circuit are extracted using unsupervised learning techniques and generates a hierarchical building block representation of the circuit. It still relies on predefined rule-sets to be able to identify new structures in a circuit.
[45] uses graph convolutional neural networks to cluster the devices of an analog circuit into its functional blocks, e.g., circuit biases, loads and input pairs, based on geometrical information of the circuit's devices in the schematic. It does not generate a hierarchical building block description of the circuit which could be used for structure recognition.
[46] uses a graph convolutional network to extract and classify subblocks according to their type and functionality, e.g., low noise amplifiers (LNAs), operational transconductance amplifiers (OTAs), mixer or oscillators, in a hierarchical design. Analog basic building blocks inside these subblocks are detected by traditional graph isomorphism approaches. The gathered information is used to annotate netlists for circuit optimization and layout generation.

This work presents a new, deterministic method which automatically decomposes a given circuit topology into its building blocks. The generated building block description can be used by structure recognition to identify the analyzed topology in large netlists.

### 5.2. Overview

Algorithm 7 outlines the main procedure of the method. It takes a circuit $C\left(P, N, C_{s u b}, t\right)$, an array library $L_{\text {array }}$ and a pair library $L_{\text {pair }}$ as input. It outputs the library $L_{C}$ which contains all elements required to identify circuit $C$ by structure recognition. The array library must contain the array definitions for all supported device types and configurations in the circuit, e.g., the array library from Fig. A.3. The provided pair library $L_{p a i r}$ can either

```
Algorithm 7 Automatic generation of the building block description of a circuit
    procedure Library Generation \(\left(C\left(P, N, C_{\text {sub }}, t\right), L_{\text {array }}, L_{\text {pair }}\right)\)
        \(L_{C}=L_{\text {array }} \cup L_{\text {pair }}\)
        repeat
            \(C_{B}=\) structureRecognition \(\left(L_{C}, C\right) \quad / /\) See Alg. 14
            \(T=\) findTopLevelStructures \(\left(C_{B}\right)\)
            \(E_{\text {new }}=\) createNewLibraryElements \((T) \quad / /\) See Alg. 8
            extendLibrary \(\left(L_{C}, E_{\text {new }}\right) \quad / /\) See Alg. 9
        until \(|T|=1\)
        return \(L_{C}\)
    end procedure
```



Figure 5.2.: Differential input stage and its generated building blocks.
be empty or contain already predefined ranks and building blocks. In the first case, the pair library of $L_{C}$ is generated from scratch, in the second case, it is augmented by the missing ranks and library elements required to identify circuit $C$.

In the first step, $L_{C}$ is initialized by creating a copy of $L_{\text {array }}$ and $L_{p a i r}$ (line 2). Then, structure recognition is performed on the given circuit to identify the building blocks specified by $L_{C}$ (line 4). The top-level structures $T$, i.e., the building blocks which are not part of another higher ranked block, are extracted from the recognition result $C_{B}$ in line 5. Algorithm 8 then groups the top-level building blocks into a set of new library elements $E_{\text {new }}$ (line 6). It analyzes three neighboring characteristics of possible new building block pairs for that purpose (Sec. 5.3). In line 7, Algorithm 9 extends library $L_{C}$ by the new elements and additional ranks whenever required (Sec. 5.4). The above steps are repeated until $L_{C}$ contains the full building block description of the given circuit, i.e., until the set of top-level structures $T$ contains only one element (lines 3-8).

Fig. 5.2 shows a differential input stage of an operational amplifier. The colored boxes indicate how the devices and building blocks are automatically grouped into new library elements by Algorithms 7-9 until the full differential input stage can be identified by structure recognition. Fig. 5.3 shows the corresponding computed building block library $L_{C}$.


Figure 5.3.: Generated building block library for the differential input stage from Fig. 5.2.

The initial pair library provided to Algorithm 7 was empty, i.e., the pair library to identify the differential input stage has been built from scratch. In the first iteration, the transistors $(N 1, N 2),(P 1, P 2)$ and $(N 3, N 4)$ are grouped into new library elements by Algorithm 8. These elements correspond to the traditional analog basic building blocks "simple current mirror" and "differential pair" and have been labeled accordingly by hand.

In the second iteration, the algorithm groups $[(P 1, P 2),(N 3, N 4)]$ together which combines the transconductance with the load of the input stage. Finally, the bias circuitry ( $N 1, N 2$ ) is connected to this structure to form the given differential input stage.

### 5.3. Creation of New Library Elements

Algorithm 8 groups the identified top-level building blocks $T$ into new library elements. Structure recognition should be able to identify the new elements as unambiguous as possible. The algorithm assigns a "group strength" to each possible new building block pair. The group strength is computed based on the connectivity, substrate type and rank difference of two building blocks according to eq. (5.1). A high group strength indicates an easier identification of the considered building block pair by structure recognition.

Algorithm 8 operates as follows: in line 2, it initializes the set of new library elements $E_{\text {new }}$ empty. Then it determines the set of tentative new building blocks $B$ by forming the Cartesian Product of the top-level structures $T$ in line 3. Pairs which are not connected with each other by at least one net are automatically discarded. The remaining pairs are stored

```
Algorithm 8 Heuristic to discover new building block pairs
    procedure CreatenewLibraryElements \((T)\)
        \(E_{\text {new }}=\emptyset\)
        \(B=(T \times T) \quad / /\) Tentative building blocks.
        \(B_{w}=\operatorname{sortByGroupStrengths}(B)\)
        for all \(\left(b_{i}, b_{j}\right) \in B_{w}\) with decreasing group strengths do
            \(e_{\text {new }}=\) generateRecognitionRules \(\left(b_{i}, b_{j}\right)\)
            \(E_{\text {new }}=E_{\text {new }} \cup e_{\text {new }}\)
            eliminateOverlappingBlocks \(\left(B_{w}, b_{i}, b_{j}\right)\)
        end for
        removeDuplicateElements \(\left(E_{\text {new }}\right)\)
        return \(E_{\text {new }}\)
    end procedure
```

in the set $B_{w}$ together with their group strength in decreasing order (line 4). The group strengths of each tentative building block are thereby determined by eq. (5.1). The algorithm iterates over all elements $\left(b_{i}, b_{j}\right) \in B_{w}$ with decreasing group strengths (line 5), generates the required recognition rules to identify $\left(b_{i}, b_{j}\right)$ and stores the new library element $e_{\text {new }}$ in the set $E_{\text {new }}$ (lines 6 and 7). Afterwards, all tentative building blocks which are overlapping with $\left(b_{i}, b_{j}\right)$ are eliminated from $B_{w}$ in line 8 . These steps are iteratively repeated until $B_{w}$ is empty. Afterwards, duplicate elements are removed from $E_{\text {new }}$ in line 10. Duplicate elements are created when $B_{w}$ contains several copies of non-overlapping, but identically connected tentative building blocks. Finally, the algorithm returns the set of new libary elements $E_{\text {new }}$ (line 11).

The group strength of a tentative building block pair is calculated as follows:

$$
\begin{equation*}
g\left(b_{i}, b_{j}\right)=2 \cdot \operatorname{con}\left(b_{i}, b_{j}\right)+\operatorname{sst}\left(b_{i}, b_{j}\right)-\Delta r\left(b_{i}, b_{j}\right), \tag{5.1}
\end{equation*}
$$

where $\operatorname{con}\left(b_{i}, b_{j}\right)$, sst $\left(b_{i}, b_{j}\right)$ and $\Delta r\left(b_{i}, b_{j}\right)$ denote the connectivity, substrate type weight, and the rank level difference between two blocks $\left(b_{i}, b_{j}\right)$, respectively.

The connectivity weight is defined as:

$$
\begin{equation*}
\operatorname{con}\left(b_{i}, b_{j}\right)=3 \cdot \operatorname{internal}\left(b_{i}, b_{j}\right)+2 \cdot \operatorname{supply}\left(b_{i}, b_{j}\right) \tag{5.2}
\end{equation*}
$$

The function supply $\left(b_{i}, b_{j}\right)$ computes the number of connections between $b_{i}$ and $b_{j}$ via the supply nets of the circuit and internal $\left(b_{i}, b_{j}\right)$ denotes the number of connections via internal nets between these two blocks. Typically, many different devices or building blocks are connected with each other by supply nets. Hence, internal connections between two blocks are valued higher compared to connections over supply nets, i.e., the number of building block internal connections is multiplied by factor 3 , the number of connections over supply nets by factor 2 .

The substrate type weight is defined as:

$$
\operatorname{sst}\left(b_{i}, b_{j}\right)=\left\{\begin{array}{ll}
2 & , \text { same substrate type }(n \text { or } p)  \tag{5.3}\\
1 & , \text { mixed types }(n \text { and } p) \\
0 & , \text { else (one block has } u \text {-type })
\end{array} .\right.
$$



Figure 5.4.: Characteristics and group strength of a simple current mirror.

Analog basic building blocks are typically formed by subblocks of the same substrate type, e.g., differential pairs and current mirrors are solely formed by either NMOS- or PMOStransistors. These blocks are then combined to input and output stages of operational amplifiers by connecting basic building blocks with different substrate types, e.g, an NMOS differential pair to a PMOS simple current mirror as shown in Fig. 5.2. Tentative building blocks with subblocks of the same substrate type get a weight of two, mixed substrate types of one and all others of zero. These scores ensure that new library elements with the same substrate type are preferably combined to a new building block on lower ranks.

The rank difference between two blocks is defined as:

$$
\begin{equation*}
\Delta r\left(b_{i}, b_{j}\right)=\left|\operatorname{rank}\left(b_{i}\right)-\operatorname{rank}\left(b_{j}\right)\right| . \tag{5.4}
\end{equation*}
$$

The function $\operatorname{rank}\left(b_{i}\right)$ returns the rank on which the building block can be located in the generated library $L_{C}$. Algorithm 7 generates a building block library organized in ranks. Tentative building blocks with a low absolute rank difference between its subblocks should be preferred over blocks with a higher rank difference in order to reduce the number of ranks of the generated library.

Please note: the weights of the connectivity and the substrate type scores of eqs. (5.2) and (5.3) have been heuristically chosen. Furthermore, the connectivity weight is multiplied by two in eq. (5.1) as it is the main characteristic for an unambiguous identification: tightly connected blocks are easier to identify by structure recognition.

Fig. 5.4 shows how the group strength of the simple current mirror $(N 1, N 2)$ is computed. The two devices share an internal and a supply net connection, have the same substrate type and are located on the same rank. Hence, the group strength of ( $N 1, N 2$ ) corresponds to:

$$
\begin{equation*}
g(N 1, N 2)=2 \cdot(2+3)+2-|0-0|=12 . \tag{5.5}
\end{equation*}
$$

Table 5.1 shows all tentative building blocks and their assigned group strengths of the differential input stage from Fig. 5.2 for each iteration of Algorithm 7. In the first iteration, seven, in the second two and in the last iteration, only one tentative building block is generated. The group strength of each block is computed by Algorithm 8 according to eq. (5.1).

| $\left(b_{i}, b_{j}\right) \in B_{w}^{\prime}$ | $g\left(b_{i}, b_{j}\right)$ | used |
| :---: | :---: | :---: |
| $(N 1, N 2)$ | 12 | yes |
| $(P 1, P 2)$ | 12 | yes |
| $(N 2, N 3)$ | 8 | no |
| $(N 2, N 4)$ | 8 | no |
| $(N 3, N 4)$ | 8 | yes |
| $(N 3, P 1)$ | 7 | no |
| $(N 3, P 2)$ | 7 | no |
| $(N 4, P 2)$ | 7 | no |

Iteration 1

| $\left(b_{i}, b_{j}\right) \in B_{w}^{\prime}$ | $g\left(b_{i}, b_{j}\right)$ | used |
| :---: | :---: | :---: |
| $[(N 3, N 4),(P 1, P 2)]$ | 13 | yes |
| $[(N 1, N 2),(N 3, N 4)]$ | 7 | no |

Iteration 2

| $\left(b_{i}, b_{j}\right) \in B_{w}^{\prime}$ | $g\left(b_{i}, b_{j}\right)$ | used |
| :---: | :---: | :---: |
| $\{[(N 3, N 4),(P 1, P 2)]$, | 6 | yes |
| $(N 1, N 2)\}$ |  |  |

Iteration 3

Table 5.1.: Weights of the tentative buildings formed in the first iteration of Algorithm 7 for the circuit of Fig. 5.2.

```
Algorithm 9 Insertion of new library elements
    procedure EXTENDLIBRARY \(\left(L_{C}, E_{\text {new }}\right)\)
        for all \(e_{\text {new }}=\left(b_{i}, b_{j}\right) \in E_{\text {new }}\) do
            \(\operatorname{rank}=\operatorname{maxRank}\left(b_{i}, b_{j}\right)+1\)
            if \(\operatorname{rank}>\operatorname{maxRank}\left(L_{C}\right)\) then
                addNewRank \(\left(L_{C}\right.\), rank \()\)
            end if
            addNewElement ( \(e_{\text {new }}\), rank)
        end for
    end procedure
```

Algorithm 8 iterates over the tentative building blocks according to decreasing group strengths. In the first iteration, the recognition rules for $(N 1, N 2)$ are generated and the blocks $(N 2, N 3)$, $(N 2, N 4)$ are removed from $B_{w}$ as they overlap with $(N 1, N 2)$ while having a lower group strength. Similarly, all other blocks except $(N 3, N 4)$ are discarded as they overlap with ( $P 1, P 2$ ).

In the second iteration, the recognition rules for $[(N 3, N 4),(P 1, P 2)]$ are generated as that block has a higher group strength than $[(N 1, N 2),(N 3, N 4)]$.

Finally, $\{[(N 3, N 4),(P 1, P 2)],(N 1, N 2)\}$ are combined to the new library element which corresponds to the building block description of the differential input stage.

### 5.4. Library Extension

Algorithm 9 iteratively inserts the newly created elements $e_{\text {new }}=\left(b_{i}, b_{j}\right) \in E_{\text {new }}$ into the building block library $L_{C}$. It first determines the rank of the new building block by incrementing the maximum rank of $\left(b_{i}, b_{j}\right)$ by one (line 3 ). The algorithm afterwards creates a new library rank for $\left(b_{i}, b_{j}\right)$ if required (line 5) and inserts it into its corresponding rank (line 7).

(a) folded cascode amplifier

(b) load variants

Figure 5.5.: Schematic of a folded cascode amplifier with different current mirror loads.

Algorithm 7 creates two new library elements for the differential input stage from Fig. 5.2 in its first iteration. As the initial pair library was empty, a new rank is initialized and the two elements are inserted into it. In the second and third iteration, one new library element each has been inserted into a new rank of $L_{C}$ by Algorithm 9. The final building block library consists of three ranks with six library elements in total as shown in Fig. 5.3.

### 5.5. Experimental Results

This section presents experimental results for the library-free structure recognition. First, a folded cascode amplifier (FOCA) with different current mirror loads is investigated. It will be shown that the analog basic building block library presented by [4] can be created without prior knowledge of typical analog circuit elements, e.g., current mirrors or differential pairs.

Afterwards, a new library dedicated to identify level shifters is presented. Level shifters are interface circuits which convert signals between different voltage domains of a chip. These circuits are susceptible to ESD damage and need suitable protection [43].
The generated library elements are manually labeled according to their functionality in the circuit afterwards, indicating that the implemented algorithms are able to capture the functional elements of the circuit based on the three simple neighboring characteristics presented in Sec. 5.2.

### 5.5.1. Folded Cascode Amplifier Variants

Fig. 5.5a shows the schematic of a folded cascode amplifier. Its load is implemented by one of the current mirror variants shown in Fig. 5.5b.

In a first experiment, library free structure recognition is performed on the folded cascode amplifier with a cascode current mirror load. The initially provided pair library is empty, i.e., the pair library required to identify this folded cascode amplifier variant is built from scratch. The array library contains the array definitions for resistor, capacitor, normal transistor and diode transistor arrays. The result of the method corresponds to the basic version of the folded cascode amplifier building block library which is then subsequently extended by the library elements required to identify the other variants with various different current mirror loads.

Algorithms 7-9 group the devices of the amplifier in the first iteration to the following building block pairs:

- $(N 1, N 2),(P 1, P 2)$ and the devices in the load form a simple current mirror,
- $(P 3, P 4)$ build a bias shifter,
- $(N 3, N 4)$ are grouped to a differential pair.

These blocks are highlighted in blue in Fig. 5.5.
In the second iteration, highlighted in orange, $[(N 1, N 2),(N 3, N 4)]$ are assembled to a differential input stage and $[(P 1, P 2),(P 3, P 4)]$ to a cascode current mirror.

The algorithm combines that cascode current mirror with the current mirror load in the third iteration to the output stage of the amplifier as shown in red. The tentative pair formed by the differential input stage $d s(N 1-N 4)$ and the PMOS cascode current mirror $c c m(P 1-P 4)$ as well as the tentative pair by $c c m(P 1-P 4)$ and the current mirror load variant have the same group strength. Hence, a tie breaker rule has to be applied to decide which of the two pairs should be kept. In this case the latter pair remains in the computation process as both of its subblocks are connected to the output pin out of the amplifier which indicates that both blocks belong to the output stage of the amplifier.

In the fourth and last iteration of the algorithm, the input and the output stage are combined to form the folded cascode amplifier.

The generated building blocks are shown in the building library from Fig. 5.6 in the colors of the iteration they have been created. The remaining elements of that library are required to identify the other five folded cascode amplifier variants from Fig. 5.6. These elements are generated by executing the presented algorithms for each variant once more with the basic version of the folded cascode amplifier library as input. In this way, the missing elements are successively added to the library as shown in Fig. 5.6. The final library consists of five ranks and 29 building blocks.

The generated library contains furthermore all analog basic building blocks of the library provided by [4]. Hence, the new method was able to "learn" the most common analog basic building blocks by analyzing the given amplifiers using the three simple neighboring characteristics connectivity, substrate type and rank difference.

Finally, it has been shown that the automatic library generation overcomes the main limitation of structure recognition: the scope, complexity and completeness of the provided building block library. The library from Fig. 5.6 is able to identify six folded cascode amplifier variants and can easily be extended by other circuit topologies.

## 5. Library-Free Structure Recognition



Figure 5.6.: Generated building block library for the six folded cascode amplifier variants from Fig. 5.6.

### 5.5.2. Level Shifter Topologies

Level shifter circuits are interface circuits between different voltage domains of a chip: they convert signals from lower voltage to higher voltage domains, e.g., from the digital to the analog domain of an integrated circuit and vice versa. Such interface circuits are susceptible to ESD events and must be protected by suitable protection circuitry [43].

Structure recognition can help to verify the functionality of an ESD protection network by its capability to reliably identify interface circuits in large designs. However, structure recognition requires a building block library containing all used level shifter topologies in a given design. Library-free structure recognition can automatically generate this library.

Figs. 5.7 shows five different level shifter topologies. The basic elements of the differential cascade voltage switch [47] and the pass gate level converter [48] have been used to design more sophisticated topologies, e.g., the pass gate level converter with keeper device [48], the single supply level converter [49] and the even more elaborate single supply true voltage level shifter [50]. These topologies either improve the speed, reduce the power consumption or lead to less routing congestion compared to the conventional differential cascade voltage switch and pass gate level converter topologies.

In the following, the method presented in Sec. 5.2 is used to generate a building block library for structure recognition which is dedicated to level shifter topologies. Starting with a library containing only array definitions, the algorithm first generates the building block description of the differential cascade voltage switch. In a second run, the elements required to identify the pass gate level converter are added to the library. The library is then subsequently extended to include the building block descriptions of the pass gate level converter with keeper device, the single supply level converter and finally by the single supply true voltage level shifter.

The final generated building block library is shown in Fig. 5.8. It consists of six ranks and 24 elements. The building blocks marked with bold letters correspond to one of the circuit topologies shown in Fig. 5.7. The remaining library elements have been labelled according to their functionality in the corresponding circuit. E.g., rank four contains the input stage of the single supply true voltage level converter, rank two a NOR-Gate and the input and output stages of the analyzed level shifter topologies which are further split up into smaller input and output circuitry on rank one. Hence, all of the generated library elements fulfil a specific functionality in the corresponding circuit, i.e., the given circuit topologies have been decomposed into meaningful building blocks by the method presented in Sec. 5.2.

The resulting library can easily be extended by other level shifter topologies when required.


 $0 . \frac{T}{\square}$

(b) pass gate level converter (passGateLC)

(c) passGateLC with keeper device (passGateLCKeeper)

(d) single supply level converter (singleSupplyLC)

(e) single supply true voltage level converter (singleSupplyTVVS)

Figure 5.7.: Five different level shifter topologies [47; 48; 49; 50].





Figure 5.8.: Level converter library generated by Algorithms 7-9.
5. Library-Free Structure Recognition

## 6. Conclusion

The design and verification of analog circuits is still mainly done manually in industrial practice. Designs are hierarchically partitioned into smaller parts until individual subblocks, e.g. operational amplifiers, are assigned to an analog circuit designer. The sized analog blocks are then combined to form the overall system. Extensive evaluation and tweaking of the subblocks is thereby necessary to make the overall system functional.

This problem aggravates if analog blocks are equipped with additional power-down circuitry: the correct power-down mode functionality of the overall system cannot be guaranteed even though it has been correctly implemented for each individual subblocks of the system.

This work presented two new methods which automatically verify or synthesize the powerdown mode of hierarchical circuits. The methods exploit the hierarchical structure of a given design by reusing intermediate results whenever possible. Experimental results have been presented to demonstrate the proposed methods.

The presented power-down verification and synthesis methods use structure recognition and symmetry computation procedures to extract matching constraints of a given circuit. These traditional circuit analysis methods have been adapted to hierarchical analog designs.

Structure recognition is mainly limited by the provided basic building block library. A new approach has been presented which automatically generates the building block description of a given circuit which can be used by structure recognition. The approach partitions the given circuit into its building blocks by analyzing three neighboring characteristics of its devices. It has been shown that the new approach captures the same analog basic building blocks as previous predefined libraries by analyzing operational amplifiers. It also successfully partitioned level shifter circuits into their input and output circuitry which enables the reliable identification of these ESD susceptible structures during ESD verification.
6. Conclusion

## A. Hierarchical Structure Recognition

## A.1. Introduction

Analog circuits are composed of so-called "basic building blocks", e.g., current mirrors or differential pairs. These blocks implement a specific functionality in the circuit, e.g., the differential pair $d p(N 2, N 3)$ of the folded cascode amplifier from Fig. A. 2 converts a differential input voltage at its input pins in, ip into a differential output current at its nets $n 2$, $n 3$. This differential current is then amplified and mirrored to the output of the amplifier by the cascode current mirror $c c m(P 1-P 4)$ and the wide swing cascode current mirror $\operatorname{wsccm}(N 4-N 7)$. The shown folded cascode amplifier implements the subcircuits FOCA1 and FOCA2 of the core circuitry of the dual gain amplifier from Fig. A.1b. FOCA1 and FOCA2 are biased by the circuit shown in Fig. A.1a. The transistors N1, N2 each form a simple current mirror $\operatorname{scm}(N<1,2>, F O C A<1,2>/ N 1)$ with transistor $N 1$ of the folded cascode amplifier to bias the differential input stage of FOCA1 and FOCA2. Similarly, N3 and $N 5$ form level shifters $l s(N<3,5>, F O C A<1,2>/ N<6,7>)$ with transistors $N 6$ and N7 of FOCA1 and FOCA2 to set the bias voltage of their wide swing cascode current mirrors.

Analog basic building blocks require specific operating conditions to fulfill their functionality, e.g., the transistors of a simple current mirror of differential pair must operate in saturation region to mitigate the influence of channel modulation effects [4].
Apart from circuit sizing, this structural information can also be used for circuit verification and synthesis tasks. The power-down verification and synthesis method from Chaps. 3 and 4 consider matching conditions at analog basic building blocks during their execution.

An experienced designer can easily identify analog basic building blocks from a circuit schematic by inspection and use it for sizing or verification tasks. For computers, specialized software is required to extract this information from a schematic, generally known as structure recognition.

## A.2. State of the Art and Contributions

The hierarchical structure recognition method implemented in this work is based on the "sizing rules method" which was originally published in [16]. The presented algorithms have been further refined in $[4 ; 51 ; 10]$. All versions require a predefined building block library $L_{\text {analog }}$ as input. This library is shown in Fig. A.3.
The library is partitioned into an array library $L_{\text {array }}$ and a pair library $L_{\text {pair }}$. Each element of $L_{\text {array }}$ and $L_{\text {pair }}$ defines the recognition rules to identify the corresponding structure in a circuit.

## A. Hierarchical Structure Recognition



Figure A.1.: Dual gain amplifier from Fig. 3.6 without power-down circuitry.


Figure A.2.: Schematic of a folded cascode amplifier which implements subcircuits FOCA1 and $F O C A 2$ of the dual gain amplifier from Fig. A.1.

## Arraylibrary $\mathrm{L}_{\text {array }}$



level shifter (ls)

cascode pair (cp) current mirror load (cml)

oltage reference 1
(vr1)

persistence $=1$, for all building blocks on this rank except for scm, Is and ccp

wide swing cascode current mirror (wsccm)


wilson current mirror (wcm)

cascode current mirror (ccm)
 mirror (4tcm)

rank 2

Figure A.3.: Analog basic building block library $L_{\text {analog }}=L_{\text {array }} \cup L_{\text {pair }}$ based on $[4 ; 51 ; 10]$.

## A. Hierarchical Structure Recognition

A device array consists of one or more parallelly connected devices which exhibit the same electrical behavior as a device of their combined sizes. A normal transistor array ( $n t a$ ), e.g., consists of a set of NMOS- or PMOS-transistors whose drain, gate and source pins are connected in parallel by three different nets. Combining the devices of a circuit to arrays speeds up the recognition process of analog building blocks as fewer devices need to be considered in the later iterations of the algorithm [17]. The recognition rules and an algorithm to efficiently identify arrays in a circuit is presented in [51], Sec. 3.2.2. This algorithm has been re-implemented in this work without any changes.

The pair library $L_{\text {pair }}$ is organized in ranks: a building block pair of rank $i$ consists of two building blocks from the lower ranks $i-1, \ldots, 0$ with the array library being considered as rank zero. The pair library can have an arbitrary number of ranks. A level shifter of rank one, e.g., consists of a normal and a diode transistor array, a Wilson current mirror of a simple current mirror and a normal transistor array.

The sizing rules method iteratively scans the circuit for the provided library elements by starting at the lowest and ending at the highest rank. Ambiguities during the recognition process are resolved by dominance graphs in [4;51]. Ambiguities occur when an identified array or pair could be assigned to two different higher ranked library elements, e.g., a normal transistor array could be assigned to a differential pair and a simple current mirror at the same time. The relations defined by the dominance graph decide in such cases which of the identified pairs should be kept in the recognition process. The other pairs are then discarded. In above example, the simple current mirror would be kept and the differential pair discarded as the devices of a simple current mirror are tightly connected and hence unambiguously identifiable.

This mechanism has been simplified by [10] by assigning an expiration date, a so-called "persistence", to each element of the pair library $L_{\text {pair }}$. The persistence defines for how many ranks an identified pair should be kept in the iterative recognition process. A pair is discarded if the currently processed rank minus the rank of the pair exceeds its persistence. E.g., a cascode pair of rank one has a persistence of one. All cascode pairs which have not become part of a wide swing cascode current mirror are discarded after all elements of rank two have been identified in the circuit.

The sizing rules method by [10] supports hierarchical circuits by circuit flattening. The circuit hierarchy is not maintained rendering it unsuitable for industrial and IP based designs. Intermediate results cannot be reused to speed up the structure recognition algorithm.

Another approach to support hierarchical designs based on the sizing rules method is presented in [23]. It traverses the circuit hierarchy recursively, flattens it partially which makes required information for structure recognition available in-between the different hierarchy levels of the circuit. This method however does still not reuse intermediate results.

The work by [52] simplifies structure recognition by encoding the elements of the analog basic building block library based on their connectivity. The circuit is then scanned for pairs with the same encoding as the library elements. Hierarchical circuits are flattened at lower hierarchy levels; the subcircuits at higher levels are separated from each other and structure recognition is performed independently for each them. However, it is not stated how this partitioning is done.


Figure A.4.: Overview of the hierarchical structure recognition method.

In this work, structure recognition has been modified to support hierarchical circuits by partial circuit flattening (Fig. 1.8c). Similar to [23], this approach first makes connectivity information required for structure recognition available in-between the different circuit hierarchy levels. This step decouples the hierarchy levels from each other which would, in comparison to [23], allow the parallelization of the recognition process. However, this parallelization has not been implemented yet. The decoupling of the hierarchy levels furthermore splits up the whole recognition process into smaller sub-problems and has to be executed only once for each circuit type in the partially flattened design which hence reduces the computational complexity of the overall problem.

The algorithms and procedures of the sizing rules method have been further enhanced as follows: the state-of-the-art method [10] merges the recognition results of each rank into one data structure. It will be shown that organizing the results in ranks will further simplify and decrease the runtime of the pair recognition and ambiguity resolution procedures.

## A.3. Overview

Fig. A. 4 gives an overview of the new hierarchical structure recognition method. It takes a top-level circuit $C_{\text {top }}$ and the set $\mathcal{C}$ which contains all circuit definitions instantiated by subcircuits of $C_{\text {top }}$ as input.

The circuit is partially flattened (App. A.4) by recursively propagating connectivity information between the hierarchy levels similarly to [23]. Structure recognition however is not performed yet. The resulting set of partially flattened circuits $\mathcal{C}^{\prime}$ is input to the actual structure recognition method (App. A.5).

The hierarchy levels are now decoupled from each other. Hence, structure recognition only has to iterate over each element of $\mathcal{C}^{\prime}$ instead of recursively descending into the hierarchy. This enables the parallelization of the method for hierarchical designs. However, parallel execution was not implemented in this work. Each circuit type is inherently processed only once, the circuit hierarchy is maintained and the recognition results can be easily back annotated to the original circuit.

The output of the algorithm is the structure recognition results library $L_{\text {struct }}$ which contains the identified building blocks of each circuit type $t_{i}^{\prime}$.

```
Algorithm 10 Partial circuit flattening.
    procedure PartialCircuitFlattening \(\left(C_{\text {top }}\left(P_{\text {top }}, N_{\text {top }}, C_{\text {sub,top }}, t_{\text {top }}\right), \mathcal{C}\right)\)
        \(\mathcal{C}^{\prime}=\mathcal{C}\)
        \(C_{\text {top }}^{\prime}=C_{\text {top }}\left(P_{\text {top }}, N_{\text {top }}, C_{\text {sub }, \text { top }}, t_{\text {top }}\right)\)
        recursivePartialCircuitFlattening \(\left(C_{\text {top }}^{\prime}\right) \quad / / \operatorname{Modifies} \mathcal{C}^{\prime}\) inherently
        return \(\mathcal{C}^{\prime}\)
    end procedure
```

```
Algorithm 11 Recursive partial circuit flattening.
    procedure RecursivePartialCircuitFlattening \(\left(C_{i}\left(P_{i}, N_{i}, C_{\text {sub }, i}, t_{i}\right)\right)\)
        for all \(c_{s u b} \in C_{s u b, i}\) do
            if \(i s S u b C i r c u i t B l o c k\left(c_{s u b}\right)\) then
                partialCircuitFlattening \(\left(c_{s u b}\right)\) // Recursive call
                propagateConnectivity \(\left(C_{i}, c_{s u b}\right)\) // See Algorithm 12
            end if
        end for
    end procedure
```


## A.4. Partial Circuit Flattening

The devices forming a basic building block can be distributed between several hierarchy levels for hierarchical designs. E.g., the transistors $N 1$ and $N 2$ of the dual gain amplifier from Fig. A. 1 each form a simple current mirror with the transistor $N 1$ of the folded cascode amplifiers FOCA1 and FOCA2 from Fig. A.2.

This connectivity information must be propagated between the two hierarchy levels such that structure recognition is able to identify these two simple current mirrors.

Algorithms 10 to 12 flatten a given circuit hierarchy partially by augmenting each hierarchy level with the devices connected to its respective external pins. Algorithm 10 initializes $C_{\text {top }}^{\prime}$ and $\mathcal{C}^{\prime}$ as copies of $C_{\text {top }}$ and $\mathcal{C}$ in lines 2 and 3 . Afterwards, it calls Algorithm 11 in line 4 to flatten the circuit hierarchy partially in a recursive manner.

Algorithm 11 first descends to the bottom level of the hierarchy by calling itself recursively for each subcircuit of $C_{i}$ in lines 2 and 4. There, in line 5, it calls Algorithm 12 which makes the connectivity information between the circuit $C_{\text {high }}$ and its subcircuit $C_{\text {low }}$ available as follows: it determines a set of devices $C_{\text {con,high }} \subseteq C_{\text {sub,high }}$ which are externally connected to the pins $P_{\text {low }}$ of the given subcircuit (line 2). Furthermore, the algorithm determines a set

```
Algorithm 12 Sharing connectivity information between two hierarchy levels.
    procedure PROPAGATECONNECTIVITY \(\left(C_{\text {high }}, C_{\text {low }}\right)\)
        \(/ / C_{\text {high }}=C_{\text {high }}\left(P_{\text {high }}, N_{\text {high }}, C_{\text {sub,high }}, t_{\text {high }}\right)\)
        \(/ / C_{\text {low }}=C_{\text {low }}\left(P_{\text {low }}, N_{\text {low }}, C_{\text {sub }, \text { low }}, t_{\text {low }}\right)\)
            \(C_{\text {con }, \text { high }}=\) findHigherConnectedDevices \(\left(P_{\text {low }}, C_{\text {high }}\right) \quad / / C_{\text {con,high }} \subseteq C_{\text {sub,high }}\)
        \(C_{\text {con,low }}=\) findLowerConnectedDevices \(\left(P_{\text {low }}, C_{\text {low }}\right) \quad / / C_{\text {con,low }} \subseteq C_{\text {sub }, \text { low }}\)
        \(C_{\text {sub,low }}=C_{\text {sub,low }} \cup C_{\text {con,high }}\)
        \(C_{\text {sub }, \text { high }}=C_{\text {sub }, \text { high }} \cup C_{\text {con,low }}\)
    end procedure
```



Figure A.5.: Partially flattened dual gain amplifier from Fig. A.1.


Figure A.6.: Partially flattened folded cascode amplifier from Fig. A.2.
of devices $C_{\text {con,low }} \subseteq C_{\text {sub,low }}$ which are internally connected to the pins $P_{\text {low }}$ (line 3 ). Then, in lines 4 and 5 , the higher levels devices $C_{\text {con,high }}$ are inserted into the set of subcircuits of $C_{\text {low }}$ and the lower-level devices $C_{\text {con, low }}$ into $C_{\text {sub }, \text { high }}$. The connectivity information is propagated in both directions in order to keep the affected hierarchy levels synchronized.

After processing the device level, Algorithm 12 ascends the hierarchy again, repeating the steps described above for every subcircuit in the currently investigated hierarchy level. The algorithms directly operate on the circuits contained by $\mathcal{C}^{\prime}$. Hence, all results are automatically available in $\mathcal{C}^{\prime}$ which is the return value of Algorithm 10 in line 5.

Figs. A. 5 and A. 6 show the dual gain amplifier and the folded cascode amplifier from Figs. A. 5 and A. 6 after partial circuit flattening. The transistors $N 1$ and $N 2$ of the dual gain amplifier are connected to transistor $N 1$ inside the folded cascode amplifiers FOCA1 and FOCA2, respectively. Furthermore, transistors $N 3$ and $N 5$ on the top-level are connected to $N 6$ and $N 7$ of FOCA1 and FOCA2, respectively. Hence, this connectivity information has been

```
Algorithm 13 Structure recognition for hierarchical analog circuits.
    procedure HIERARCHICALSTRUCTURERECOGNITION \(\left(\mathcal{C}^{\prime}, L_{\text {analog }}, N_{\text {supply }}\right)\)
        \(L_{\text {struct }}=\emptyset\)
        for all \(C_{i}^{\prime}\left(P_{i}^{\prime}, N_{i}^{\prime}, C_{\text {sub }, i}^{\prime}, t_{i}^{\prime}\right) \in \mathcal{C}^{\prime}\) do \(\quad / /\) Prallel execution possible.
            \(C_{B, t_{i}^{\prime}}=\) structureRecognition \(\left(C_{i}^{\prime}, L_{\text {analog }}, N_{\text {supply }}\right) \quad / /\) See Algorithm 14
            \(L_{\text {struct }}=L_{\text {struct }} \cup\left(t_{i}^{\prime}, C_{B, t_{i}^{\prime}}\right)\)
        end for
        return \(L_{\text {struct }}\)
    end procedure
```

made available to both hierarchy levels. Please note: the resistors of the dual gain amplifier which are also connected to pins $b 1$ and $b 2$ of the folded cascode amplifiers have not been inserted into the corresponding lower-level schematics as analog basic building blocks are formed by MOSFET or bipolar transistors only. Hence, this connectivity information is not relevant for structure recognition and does not need to be propagated in-between hierarchy levels.

## A.5. Structure Recognition

Algorithm 13 outlines the new hierarchical structure recognition method. It takes the set of partially flattened circuits $\mathcal{C}^{\prime}$ from the preprocessing step, the basic building block library $L_{\text {analog }}$ from Fig. A. 3 and the supply nets of the circuits $N_{\text {supply }}$ as inputs.

In line 2 , it first initializes the results library $L_{\text {struct }}$ empty. Then, in line 3, the algorithm simply iterates over all elements of the set $\mathcal{C}^{\prime}$ and calls Algorithm 14, the main structure recognition procedure which is dedicated to flat circuits (line 4). This is only possible because the hierarchy levels have been decoupled from each other by partial circuit flattening. The flat algorithm ignores all subcircuit blocks of $C_{i}^{\prime}$ as all required structural information of other hierarchy levels connected to $C_{i}^{\prime}$ has been made available to it during partial circuit flattening. The results of the flat structure recognition method are inserted into $L_{\text {struct }}$ in line 5.

The final output of Algorithm 13 is the structure recognition results library $L_{\text {struct }}$ which contains all identified building blocks of each circuit of type $t_{i}^{\prime}$ (line 7).

Algorithm 14 shows the enhanced structure recognition method based on [10]. It takes a circuit $C_{i}$, the array and pair libraries, $L_{\text {array }}$ and $L_{\text {pair }}$, and the supply nets $N_{\text {supply }}$ as inputs.

The structure recognition results for $C_{i}$ are organized in ranks. The set $C_{B, i}$ contains one building block circuit $C_{B, i, r_{j}}$ for each rank of the provided building block library $L_{\text {analog. }}$. A building block circuit $C_{B, i, r_{j}}$ contains all identified building blocks of rank $j$ of circuit $C_{i}$ plus their connectivity. This representation reduces the complexity of the pair recognition algorithm by [17] and simplifies the arbitration mechanisms introduced by [10] which will be explained later in this section.

The structure recognition algorithm first initializes the set of building block circuits $C_{B, i}$ empty (line 2). Then, the arrays of circuit are identified in lines $3-7$, using the array recognition from [17]. The resulting array circuit $C_{B, i, r_{0}}$ is stored in $C_{B, i}$.

```
Algorithm 14 Enhanced analog structure recognition algorithm based on [10].
    procedure structureRecognition \(\left(C_{i}, L_{\text {array }}, L_{\text {pair }}, N_{\text {supply }}\right)\)
        \(C_{B, i}=\emptyset \quad / /\) Set of building block circuits, organized in ranks.
        // Recognition of arrays
        \(C_{B, i, r_{0}}=\emptyset \quad / /\) Array circuit initialized empty.
        for all array \(\in L_{\text {array }}\) do
            \(\left.C_{B, i, r_{0}}=C_{B, i, r_{0}} \cup \operatorname{arrayRecognition(array,} C_{i}\right) \quad / /\) Similar to [17], Sec. 3.2.1.
        end for
        \(C_{B, i}=C_{B, i} \cup C_{B, i, r_{0}}\)
        // Recognition of pairs
        for all \(r_{j} \in L_{\text {pair }}\) do // Iterate over all ranks.
            \(C_{B, i, r_{j}}=\emptyset\)
            for all pair \(\in r_{j}\) do // Iterate over all pairs of the current rank.
                \(C_{B, i, r_{j}}=C_{B, i, r_{j}} \cup\) pairRecognition(pair, \(\left.C_{B, i}\right) \quad / /\) See Alg. 15 and 16.
            end for
            \(C_{B, i}=C_{B, i} \cup C_{B, i, r_{j}}\)
            // Arbitration simplified compared to [10]
            self Arbitration \(\left(C_{B, i}\right) \quad / /\) Algorithm 18
            dominanceRelations \(\left(C_{B, i}\right) \quad / /\) Algorithm 19
        end for
        return \(C_{B, i}\)
    end procedure
```

```
Algorithm 15 Pair recognition by [17].
    procedure PairRecognition \(\left(p, C_{i}, \mathcal{B}\right)\)
        \(B B=\emptyset, T B B=\emptyset\)
        for all \(n \in N_{i}\) do
            \(T B B_{1}=\operatorname{findBlocks}\left(p_{c, 1}, n, \mathcal{B}\right)\)
            \(T B B_{2}=\) findBlocks \(\left(p_{c, 2}, n, \mathcal{B}\right)\)
            \(T B B=T B B \cup\left(T B B_{1} \times T B B_{2}\right)\)
        end for
        for all \(t b b \in T B B\) do
            if rulesFulfilled \((t b b, p)\) then
                \(B B=B B \cup\{t b b\}\)
            end if
        end for
        return \(B B\)
    end procedure
```

```
Algorithm 16 Enhanced pair recognition.
```

Algorithm 16 Enhanced pair recognition.

```
    procedure PAIRRECOGNITION \(\left(p, C_{B}\right)\)
```

    procedure PAIRRECOGNITION \(\left(p, C_{B}\right)\)
    \(C_{B, r_{c, 1}}, C_{B, r_{c, 2}}=\operatorname{find}\left(C_{B}, p\right)\)
    \(C_{B, r_{c, 1}}, C_{B, r_{c, 2}}=\operatorname{find}\left(C_{B}, p\right)\)
    \(B B=\emptyset, T B B=\emptyset\)
    \(B B=\emptyset, T B B=\emptyset\)
    for all \(n \in N_{r_{c, 1}}\) do
    for all \(n \in N_{r_{c, 1}}\) do
                            \(T B B_{1}=\operatorname{findBlocks}\left(p_{c, 1}, n, C_{B, r_{c, 1}}\right)\)
                            \(T B B_{1}=\operatorname{findBlocks}\left(p_{c, 1}, n, C_{B, r_{c, 1}}\right)\)
                            \(T B B_{2}=\) findBlocks \(\left(p_{c, 2}, n, C_{B, r_{c, 2}}\right)\)
                            \(T B B_{2}=\) findBlocks \(\left(p_{c, 2}, n, C_{B, r_{c, 2}}\right)\)
                                \(T B B=T B B \cup\left(T B B_{1} \times T B B_{2}\right)\)
                                \(T B B=T B B \cup\left(T B B_{1} \times T B B_{2}\right)\)
        end for
        end for
        for all \(t b b \in T B B\) do
        for all \(t b b \in T B B\) do
            if rulesFulfilled \((t b b, p)\) then
            if rulesFulfilled \((t b b, p)\) then
                        \(B B=B B \cup\{t b b\}\)
                        \(B B=B B \cup\{t b b\}\)
        end if
        end if
        end for
        end for
        return \(B B\)
        return \(B B\)
    end procedure
    ```
    end procedure
```

In the next step, the algorithm iterates over each rank $r_{j}$ of the pair library $L_{p a i r}$ in ascending order (line 8), initializing a new building block circuit $C_{B, i, r_{j}}$ for each rank (line 9). The algorithm then iterates over each pair of that rank, identifies and inserts the detected pairs into $C_{B, i, r_{j}}$ which in turn is inserted into $C_{B, i}$ after all library elements of that rank have been identified (lines 10, 11 and 13). Ambiguities encountered during the recognition process are resolved afterwards in lines 14 and 15 by applying a self-arbitration mechanism and dominance relations for the recognition results of the current rank. Finally, in line 17, Algorithm 14 returns the building block circuit $C_{B, i}$ which contains all analog structures identified in the given circuit $C_{i}$.

Pairs are identified by a set of recognition rules [17]. These can be summarized as follows:

- block type rules: each subblock of a pair must have a specific building block type, e.g., the type of the subblocks of a cascode current mirror must be "level shifter" and "simple current mirror".
- substrate type rules: each subblock must have a specific substrate type, e.g., a differential pair always consists of two MOSFETs with the same substrate type, i.e., two NMOS or two PMOS transistors. Analog building blocks are typically formed by two subblocks of the same substrate type.
- connection rules: define which pins of two subblocks have and do not have to be connected with each other via a net. E.g., the gate pin of a normal transistor array has to be connected to the drain pin of a diode transistor array to form a simple current mirror. Furthermore, their sources have to be connected and any other connections between the two arrays are not allowed.
- supply net rules: define which pins of the two subblocks have and do not have to be connected to a supply or ground net of the circuit. E.g., a differential is not allowed to be connected to the ground or supply net of a circuit. Additionally, rules for specific supply rails, e.g., the positive supply or ground rail can be specified.

An algorithm to efficiently identify pairs in a circuit using the first two rules is published in [17], Sec. 3.1.1. This algorithm has been further refined in this work as explained in the

```
Algorithm 17 Self-arbitration by [10].
    procedure SElfARbitration \((\mathcal{B})\)
        \(k=\max \operatorname{Rank}(\mathcal{B})\)
        repeat
            removed \(=0\)
            for all \(b \in \mathcal{B}\) do
                    if not hasParent(b)
                    and expired \((b, k)\) then
                    \(C_{B}=C_{B} \backslash\{b\}\)
                    removed \(=1\)
                    end if
            end for
        until removed \(=0\)
    end procedure
```

```
Algorithm 18 Enhanced self-arbitration.
    procedure PAIRRECOGNITION \(\left(C_{B}\right)\)
        \(k=\operatorname{maxRank}\left(C_{B}\right)\)
        for all \(i=k-1\) to \(i=1\) do
            for all \(b \in C_{B, r_{i}}\) do
                if not hasParent(b)
                    and expired \((b, k)\) then
                        \(C_{B}=C_{B} \backslash\{b\}\)
                end if
            end for
        end for
    end procedure
```

following.
Algorithms 15 and 16 compare the two versions of the pair recognition algorithm. Both algorithms iterate over a set of nets (line 4), try to find all occurrences of the arrays or pairs $p_{c, 1}$ and $p_{c, 2}$ which form the currently investigated pair $p$ at these nets (lines 5 and 6 ), build the Cartesian product $T B B$ of the found structures (line 7) and check whether all other recognition rules are fulfilled for the created tentative building block tuples (lines 9-13). Finally, in line 14, the algorithms return all building block pairs $B B$ fulfilling all recognition rules provided by $p$.

The differences between the two versions lie in the set of nets the two algorithms are iterating over and in the sets in which they are searching for the subblocks $p_{c, 1}$ and $p_{c, 2}$. The previous version iterates over all nets $n \in N_{i}$ of the given circuit $C_{i}$ (line 4, Algorithm 15). The enhanced version only iterates over the nets $n \in N_{r_{c, 1}}$ of the building block circuit $C_{B, r_{c, 1}}$ (line 4, Algorithm 16) which is determined in line 2 of the enhanced pair recognition algorithm. $C_{B, r_{c, 1}}$ thereby corresponds to the building block circuit of rank $r_{c, 1}$ on which pairs of type $p_{c, 1}$ can be found. A net is only inserted into a building block circuit if a valid pair connected to it has been identified during structure recognition. Hence, the number of nets for a building block circuit is less than or equal to the overall number of nets in the circuit, i.e., $\left|N_{B, r_{j}}\right| \leq\left|N_{i}\right|$. This effect becomes more significant for higher ranked building block circuits. The overall runtime of the pair recognition algorithm improves as it has to iterate over a set of nets which is continuously getting smaller for higher ranks.

The old version furthermore merged all identified building blocks into the data structure $\mathcal{B}$, i.e., a net could be connected to several building blocks of different ranks at the same time. This leads to a higher computational complexity of the function findBlocks $\left(p_{c, 1 / 2}, n, \mathcal{B}\right)$ which tries to find building block $p_{c, 1 / 2}$ at net $n$. The new data structure separates the recognition results for each rank, i.e., a net of a building block circuit of rank $r_{k}$ is only connected to building blocks of the same rank, hence reducing the lookup time of the function findBlocks $\left(p_{c, 1 / 2}, n, C_{B, r_{c, 1}}\right)$.
Algorithms 17 and 18 outline the old [10] and new version of the self-arbitration process using the persistence mechanism. Self-arbitration is used to resolve ambiguities early in the

```
Algorithm 19 Removal of dominated building blocks.
    procedure DOMINANCERELATIONS \(\left(C_{B}\right)\)
        \(D=\) findDominanceRelations \(\left(\operatorname{maxRank}\left(C_{B}\right)\right)\)
        for all \(d \in D\) do
            \(B_{M}=\) findDominatingBuildingBlocks \(\left(C_{B}, d\right)\)
            \(B_{S}=\) findDominatedBuildingBlocks \(\left(C_{B}, d_{i}\right)\)
            for all \(\left(b_{m}, b_{s}\right) \in\left(B_{M} \times B_{S}\right)\) do
                if \(\operatorname{overlap}\left(b_{m}, b_{s}\right)\) then
                    remove \(\left(C_{i}, b_{s}\right)\)
                end if
            end for
        end for
    end procedure
```

recognition process and is called whenever the identification of each library element of the given rank has been completed. The persistence of a building block defines for how many iterations after its identification it will remain in the structure recognition process without becoming part of a higher ranked building block. E.g., a differential pair has a persistence of one. Hence, an identified differential pair must become part of a differential stage on the next rank. Otherwise it will be discarded.

The implementation of [10] uses the set $\mathcal{B}$ which contains all arrays and pairs of the circuit. The algorithm iterates over each building block $b \in \mathcal{B}$ and checks whether it is has to be discarded (lines 5 and 7). A block is discarded from $\mathcal{B}$ in line 8 if it has not become part of a higher ranked building block and if its persistence expired [10]:

$$
\operatorname{expired}\left(b, r_{k}\right)= \begin{cases}1 & , \text { if } r_{k}>r_{b}+\operatorname{persistence}(b)  \tag{A.1}\\ 0 & , \text { else }\end{cases}
$$

The rank $r_{k}$ thereby corresponds to the highest rank of an identified structure of $\mathcal{B}$ and is determined by function $\max \operatorname{Rank}(\mathcal{B})$ in line 2 . Removing an element from $\mathcal{B}$ might affect the result of eq. (A.1) for other already processed building blocks, i.e., previously valid building blocks might have become invalid after a higher ranked structure has been removed from $\mathcal{B}$. Hence, the self-arbitration mechanism is repeated until no more elements can be removed from $\mathcal{B}$ (lines 3 and 12).

In the new implementation, self-arbitration has to iterate only once over all identified building blocks, starting from the pairs stored in the highest ranked building block circuit down to the array circuit (line 3). Hence, the runtime of the arbitration procedure has been improved compared to [10].

Algorithm 19 shows a second mechanism to resolve ambiguities during the recognition process by dominance relations. A dominance relation specifies one building block type which dominates another set of building block types. A dominated building block is removed from $C_{B}$ if it overlaps with another building block dominating it (lines 7 and 8). For each rank, Algorithm 19 fetches a predefined set of dominance relations and enforces them on the specified building block types (lines 2 and 3-11).

The dominance relations presented by [10] are limited to cross coupled pairs. In this version, the definition of dominance relations has been generalized, providing a fine-grained control
mechanism to resolve building block ambiguities. Furthermore, the application of dominance relations has been simplified compared to the implementation of [10] as the requested building block types can be directly looked up in the corresponding building block circuits.

## A.6. Experimental Result

## Dual Gain Amplifier

## Bias and Core Circuitry

Fig. A. 7 shows the structure recognition results for the bias and circuitry of the dual gain amplifier (DGA) from Fig. A.5. The upper left shows the array circuit $C_{B, D G A_{b i a s}, r_{0}}$ of the bias circuitry. It contains four resistor, six diode and six normal transistor arrays.

The upper right shows the array circuit $C_{B, D G A_{\text {core }, r_{0}}}$ of the core circuit. Two resistor arrays have been identified. The folded cascode amplifier subcircuits FOCA1 and FOCA2 have been drawn in dashed lines as structure recognition ignores subcircuit blocks. Net in is also dashed as it is not connected to one of the resistor arrays and hence not part of $C_{B, D G A_{\text {core },}, r_{0}}$.

The lower part of Fig. A. 7 shows the building block circuit $C_{B, D G A_{b i a s}, r_{1}}$ which contains all elements of rank one of the bias circuitry. It consists of two simple current mirrors and four level shifters. The core part does not contain any elements from rank one is hence empty.

The building block circuit of the complete amplifier $C_{B, D G A, r_{1}}=C_{B, D G A_{\text {bias }}, r_{1}} \cup C_{B, D G A_{\text {core } e}, r_{1}}$ has four fewer nets than the combined array circuits $C_{B, D G A, r_{0}}=C_{B, D G A_{\text {bias }}, r_{0}} \cup C_{B, D G A_{\text {core } e}, r_{0}}$. Pair recognition is called for each net and each library element of the current rank. Rank two of $L_{\text {analog }}$ contains six elements. Hence, $4 \cdot 6=24$ executions of the pair recognition algorithm are saved with the new data structure compared to $[17 ; 10]$ as the circuit on rank one contains four fewer nets than the original circuit.

## Folded Cascode Amplifier

Figs. A. 8 and A. 9 show the structure recognition results for the folded cascode amplifier (FOCA) from Fig. A.6. The array circuit $C_{B, F O C A, r_{0}}$ contains eleven normal and two diode transistor arrays. On rank one, three simple current mirrors, three level shifters, one differential pair, one voltage reference two and three cascode pairs are identified. On the next rank, one cascode current mirror, one improved Wilson current mirror and a differential stage are recognized. Please note: the differential stage is only used to validate the previously identified differential pair and is discarded afterwards again as it does not generate any additional constraints for the power-down verification or power-down synthesis procedures from Chaps. 3 and 4. The cascode pairs $c p(N 1, N 2)$ and $c p(N 1, N 3)$ overlap with the differential pair $d p(N 2, N 3)$ and did not become part of a higher ranked structure. Hence, self-arbitration removes them from $C_{B, F O C A, r_{1}}$ which is indicated by dashed blue lines in the figure. The final recognition results of the folded cascode amplifier are summarized on the bottom of Fig. A.9.

## A. Hierarchical Structure Recognition



Bias circuitry of the dual gain amplifier:
building block circuit $C_{B, D G A_{\text {bias }}, r_{0}}$

Core circuit of the dual gain amplifier: building block circuit $C_{B, D G A_{\text {core }}, r_{0}}$


scm: simple current mirror Is: level shifter
Bias circuitry of the dual gain amplifier:
building block circuit $C_{B, D G A_{\text {bias }}, r_{1}}$

Core circuit of the dual gain amplifier:
building block circuit $C_{B, D G A_{\text {core },}, r_{1}}$

Figure A.7.: Structure recognition results for the bias circuitry of the dual gain amplifier (DGA) from Fig. A.5a.


Folded cascode amplifier: building block circuit $C_{B, F O C A, r_{0}}$


Folded cascode amplifier: building block circuit $C_{B, F O C A, r_{1}}$
Figure A.8.: Structure recognition results of rank zero and rank one for the folded cascode amplifier (FOCA) from Fig. A.6.

## A. Hierarchical Structure Recognition



Folded cascode amplifier: building block circuit $C_{B, F O C A, r_{2}}$


Folded cascode amplifier: final results $C_{B, F O C A}$
Figure A.9.: Structure recognition results of rank two and final results for the folded cascode amplifier (FOCA) from Fig. A.6.

## B. Hierarchical Symmetry Computation

## B.1. Introduction

Symmetry is a widely used design principle in analog circuit design: fully differential amplifiers are built by two identical circuit halves, their layout is symmetrical which improves the accuracy, power-supply and common mode rejection ratio of the amplifier and makes it less susceptible to manufacturing variations, cross talk, DC coupling and substrate noise [53; 54; 55; 17].

The symmetry information inside a circuit can be further used to constrain the solution space of numerical circuit optimization tools, e.g., Wicked [56], which improves their runtime and the quality of the computed results [57]. Symmetries must also be considered in the circuit's layout to minimize the influence of parasitic mismatch on its performances [58; 59].

## B.2. State of the Art and Contributions

The automatic identification of symmetries in netlists and layouts is an ongoing research topic. Many methods thereby follow a graph-based approach. The method by [58] additionally uses a sensitivity analysis, [53] grows two symmetric connection trees around a center element, [60; 61] uses weighted bipartite graph matching, [62] adapted the Gemini II algorithm [64] and $[54 ; 63]$ analyze signal paths for symmetry computation in their respective graph models. All of these methods require a fully symmetric netlist as inputs, the netlists for $[58 ; 60 ; 61]$ must be sized. The methods $[53 ; 60 ; 61 ; 62 ; 54 ; 63]$ claim to be able to handle near symmetries by netlist transformations. However, it is often not explained in detail how this can be achieved.

The authors of $[55 ; 65]$ extract symmetries from a circuit layout using a graph model.
The methods presented by $[24 ; 59 ; 57]$ introduced a signal flow graph library which models the signal flow of the analog basic building blocks from [4] qualitatively. The signal flow graph of the given circuit is then obtained by merging the graph models of its basic building blocks which are identified by structure recognition, e.g., [51]. The methods by [24; 59] recursively track the signal flow from the inputs to the outputs of the circuit whilst identifying symmetry pairs in the traversed edges. The method by [57] formulates and solves a constraint optimization problem based on this signal flow graph to identify symmetries in the core, i.e., signal processing, and bias part of the circuit. The method is able to identify multiple symmetry axes in the design.

The method by [66] implements similar ideas to [24; 59], but additionally uses machine learning algorithms to identify approximate graph symmetries based on the graph edit distance metric.

## B. Hierarchical Symmetry Computation



Figure B.1.: Overview of the hierarchical symmetry computation method.
[52] identifies analog basic building blocks by vector encodings. The same publication presents a method which identifies symmetries in the circuit by tracking devices with identical encodings alongside signal paths similarly to [24; 59].
[67] performs a Kolmogorov-Smirnov test to measure the similarity of the eigenvalue distribution of two graphs to extract symmetries from the circuit.
Most of the mentioned methods operate on device level. The methods [66; 52; 67] claim to support hierarchical designs; they however state that they have to flatten hierarchical designs in a preprocessing step.
[23] presents a hierarchical symmetry computation approach based on [57]. It handles circuit hierarchies by traversing them recursively, alternately executing structure recognition, signal flow graph generation and symmetry computation. This method does not reuse intermediate results and cannot be parallelized due to its recursive nature.

This work presents a new hierarchical symmetry computation method. It uses the hierarchical structure recognition results from App. A. In contrast to [23], the circuit's hierarchy levels are already decoupled due to partial circuit flattening in the structure recognition process. This enables the parallelization of symmetry computation which has, however, not been implemented yet. Furthermore, intermediate results are reused whenever possible.
For symmetry computation itself, an algorithm similar to [24; 59] has been implemented. However, the following adjustments have been made: the algorithm traces the signal paths not only in forward, but also in backward direction allowing to identify symmetries in the bias and feedback paths of the given circuit. Additional edge attributes are introduced which define more accurate matching conditions. Approximate matching conditions for single nodes allow a greater flexibility during symmetry computation.

## B.3. Overview

Fig. B. 1 gives an overview of the implemented hierarchical symmetry computation method. It takes a structure recognition results library $L_{\text {struct }}$, the enhanced signal flow graph library $L_{E S F G}$, the supply nets $N_{\text {supply }}$ and the pin type library $L_{p i n}$ as inputs. In the first step, it extends the signal flow graph library $L_{E S F G}$ to $L_{E S F G}^{\prime}$ based on the structure recognition results $L_{\text {struct }}$ of the given circuit. $L_{E S F G}^{\prime}$ then contains one signal flow graph for each circuit type used to form the hierarchical design.

An enhanced signal flow graph (ESFG) models how the input signals of a circuit traverse its building blocks to its outputs [24; 59].

The symmetry computation is performed by an algorithm similar to $[24 ; 59]$ for each element of $L_{E S F G}^{\prime}$. That algorithm recursively tracks symmetrical edges alongside the signal paths of the given graph. The (differential) input, (differential) output and bias pins of a circuit $C_{i}$, specified by the library $L_{\text {pin }}$, serve as starting or end point for the computations.

## B.4. Signal Flow Graph Generation

Analog basic building blocks process a given (differential) input signal in various different ways, e.g., a simple current mirror amplifies and copies a reference current from its input to its output pin; a differential pair converts a differential input voltage into a differential output current.

This behavior is qualitatively modelled in [59] by a so called "enhanced signal flow graph" $G(N, E)$. The set $N$ contains the nodes of the graph which correspond to a subset of the nets in the circuit. The set $E$ contains directed edges $e_{j}=\left(n_{+}, n_{-}\right)$which model a signal flow between its start and end its nodes, $n_{+}$and $n_{-}$, respectively. Each edge has an attribute

$$
\begin{equation*}
\operatorname{att}\left(e_{j}\right)=\left(p_{+}, p_{-}, b b, s s t\right) \tag{B.1}
\end{equation*}
$$

which contains information about the building block $b b$ from which the edge has been generated. The substrate type of $b b$ is denoted as $s s t$ and the pins of $b b$ at which the edge originates and terminates are written as $p_{+}$and $p_{-}$, respectively. The substrate type is either of type $n$ or $p$, i.e., the building block is entirely formed by NMOS or PMOS transistors, or of type $u$, which is used for passive devices or a mixture of NMOS and PMOS transistors.

The attributes of two edges $e_{k}, e_{l}$ are identical if the following equation holds [17]:

$$
\begin{array}{rlr}
\operatorname{att}\left(e_{k}\right)==\operatorname{att}\left(e_{l}\right) \Leftrightarrow & p_{+}\left(e_{k}\right)==p_{+}\left(e_{l}\right) \wedge & \text { ("same start pin") } \\
& p_{-}\left(e_{k}\right)=p_{-}\left(e_{l}\right) \wedge & \text { ("same end pin") } \\
& \operatorname{type}\left(b b\left(e_{k}\right)\right)==\operatorname{type}\left(b b\left(e_{l}\right)\right) \wedge & \text { ("same building block type") }  \tag{B.2}\\
& \operatorname{sst}\left(b b\left(e_{k}\right)\right)==\operatorname{sst}\left(b b\left(e_{l}\right)\right) & \text { ("same substrate type") }
\end{array}
$$

This condition is crucial to identify symmetries in a signal flow graph as will be shown in Sec. B. 5 .

The graph model $G(N, E)$ from [51] for flat circuits has been extended for hierarchical circuits to $G_{i}\left(P_{i}, N_{i}, E_{i}, t_{i}\right)$. The symbol $t_{i}$ denotes a circuit type from set $\mathcal{C}$ (see eq. (2.1)) and the set of pins $P_{i} \subseteq N_{i}$ represents possible connections to external signal flow graphs.

Fig. B. 2 shows the signal flow graph library for the analog basic building blocks from Fig. A.3. The signal flow of a cross coupled pair, e.g., is modelled by two antiparallel edges between its input and its output pin.

The shown library is almost identical to the one presented in [17]. However, the following adjustments have been made: only elements without a persistence in the structure recognition library are included; signal flows between the inner pins of more sophisticated current

## B. Hierarchical Symmetry Computation


normal transistor array (nta)

simple current mirror (scm)

wide swing cascode current mirror (wscem)


diode transistor array (dta)

cross coupled pair (ccp)

wilson current mirror (wcm)


resistor array (rta), capacitor array (cta)


cascode current mirror (ccm)


Figure B.2.: Signals flow graph library $L_{E S F G}$ for the analog basic building blocks of Fig. A. 3 according to [17].

(a) signal flow graph
$G_{D, F O C A}$

| edge | attributes |  |  |  |
| :---: | :---: | :---: | :---: | :---: |
|  | $p_{+}$ | $p_{-}$ | $b b$ | sst |
| $e_{1}$ | in | out | $\operatorname{scm}(T O P / N<1,2>, N 1]$ | $n$ |
| $e_{2}$ | $s$ | out 1 | $d p(N 2, N 3)$ | $n$ |
| $e_{3}$ | $s$ | out 2 | $d p(N 2, N 3)$ | $n$ |
| $e_{4}$ | in 1 | out 1 | $d p(N 2, N 3)$ | $n$ |
| $e_{5}$ | in 2 | out 2 | $d p(N 2, N 3)$ | $n$ |
| $e_{6}$ | inner 1 | inner 2 | $\operatorname{ccm}(P 1-P 4)$ | $p$ |
| $e_{7}$ | in | out | $c c m(P 1-P 4)$ | $p$ |
| $e_{8}$ | inner 1 | in | $c c m(P 1-P 4)$ | $p$ |
| $e_{9}$ | inner 2 | out | $c c m(P 1-P 4)$ | $p$ |
| $e_{10}$ | in | out | $w s c c m(N 4-N 7)$ | $n$ |
| $e_{11}$ | in | out | $l s(T O P / N<3,5>, N 6)$ | $n$ |
| $e_{12}$ | in | out | $l s(T O P / N<3,5>, N 7)$ | $n$ |

(b) edge attributes

Figure B.3.: Detailed signal flow graph and corresponding edge attributes of the folded cascode amplifier (FOCA) from Fig. A.6. The structure recognition results used for signal flow graph generation are shown in Fig. A.9.
mirror structures are considered in order to support folded cascode structures. The dashed edges of the current mirrors from rank two of $L_{E S F G}$ (see Fig. B.2) are only kept if at least one non-optional edge of another building block starts or ends at the inner pins of these current mirrors. Additionally, identically colored edges of building blocks of the same type are considered symmetrical even if they originate and end at different pins of such blocks. E.g., a differential pair converts a differential input voltage at the pins (in1,in2) into a differential output current at the pins (out1,out2). Hence, the edges $e_{1}=($ in 1, out 1 ) and $e_{2}=($ in 2 , out 2 ) are symmetric and drawn with the same color.

The signal flow graph of a circuit is generated by merging the signal flow graphs of each of its top-level building blocks.

Fig. B. 3 shows the signal flow graph and the corresponding edge attributes for the folded cascode amplifier (FOCA) from Fig. A.6. The circuit contains one simple current mirror, one differential pair, two level shifters, one cascode and one wide-swing cascode current mirror as top-level building blocks (see Fig. A.9). The signal flow through the wide-swing cascode current mirror $w \operatorname{sccm}(N 4-N 7)$ is modelled only by edge $e_{10}$ between the nodes $n 4$ and out of the graph as no other non-optional edge starts or ends at its inner pins. The optional edges from the two level shifters which would have been inserted at nodes $n 5$ and $n 6$ have also been removed from the final graph due to the same reason.

The signal flow at the inner pins of the cascode current mirror has to be captured in the graph as these pins are connected to the output pins of the differential pair via edges.

When encountering a subcircuit block during graph generation, its underlying detailed signal

## B. Hierarchical Symmetry Computation



Figure B.4.: Abstract signal flow graph $G_{A, F O C A}$ of the folded cascode amplifier from Fig. A. 6.
flow graph has to be abstracted such that its internal behavior is simplified yet qualitatively captured on the hierarchy level containing the subcircuit block. Therefore, an edge between each input and each output pin of the subcircuit block is generated. Bias pins are thereby treated as inputs, as small changes in the bias current or bias voltage signal are only reflected at the outputs of the circuit. Hence, each subcircuit of $\mathcal{C}$ is associated to an abstract and a detailed graph model $G_{A, i}$ and $G_{D, i}$, respectively. The set of pins $P$ and the set of nets $N$ are identical for all abstract signal flow graphs $G_{A, i}$. Furthermore, the abstract graph model for analog basic building blocks corresponds to its detailed graph model.

Fig. B. 4 shows the abstract signal flow graph $G_{A, F O C A}$ of the folded cascode amplifier from Fig. B.3. The folded cascode amplifier has one differential input, in and $i p$, two bias pins, $b 1$ and $b 2$, and one output pin out. Hence, the abstract graph model contains four edges in total. Each of these edges is pointing from an input or a bias pin to the output. The edges from the differential input pins are drawn in blue as they are considered to be symmetric.

The abstract graph model of the folded cascode amplifier is then used to construct the detailed signal flow graph of the dual gain amplifier from Fig. A.5. The corresponding signal flow graph is shown in Fig. B.5. It consists of 16 nodes and 30 edges in total. The core circuit instantiates the abstract folded cascode amplifier graph model $G_{A, F O C A}$ two times and contains two edges each for the resistor arrays $\operatorname{rta}(R 1)$ and $r t a(R 2)$. The bias circuitry consists of four transistor arrays which corresponds to eight edges in the graph, two simple current mirrors, i.e., two edges, and two level shifters each of them generating two edges. The attributes of the edges are shown at the bottom of Fig. B.5.

Algorithm 20 shows how the signal flow graph for hierarchical circuits is generated. It takes the structure recognition results library $L_{\text {struct }}$ and the signal flow graph library $L_{E S F G}$ from Fig. B. 2 as input. $L_{E S F G}$ is iteratively extended by the abstract and detailed graph models for all building block circuits $C_{B, i}$ of $L_{\text {struct }}$. The algorithm first initializes $L_{E S F G}^{\prime}$ as a copy of the signal flow graph library $L_{E S F G}$ from Fig. B. 2 (line 2). Then, it iterates over the elements $C_{B, i}$ of $L_{s t r u c t}$ and initializes an empty detailed graph model $G_{D, i}$ for each of it (lines 3 and 4). This graph model is filled in by merging the abstract graph model $G_{A, b b}$ of each of its top-level building blocks into it (lines 6-9). A top-level building block is either a subcircuit block of $C_{B, i}$ or an analog basic building block which is not part of a higher ranked building block from the structure recognition library (Fig. A.3). If an abstract graph model for a subcircuit block is not present in $L_{E S F G}^{\prime}$ yet, it is automatically created by function findOrCreateAbstractGraphModel (bb, $\left.L_{E S F G}^{\prime}, L_{p i n}\right)$ as described in the previous paragraphs and stored in $L_{E S F G}^{\prime}$ (line 7). After all top-level blocks of $C_{B, i}$ have

(a) detailed signal flow graph

| bias circuitry |  |  |  |  | core circuit |  |  |  |  |
| :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: |
| edge | attribute |  |  |  | edge |  |  | tribute |  |
|  | $p_{+}$ | $p_{-}$ | $b b$ | sst |  | $p_{+}$ | $p_{-}$ | $b b$ | sst |
| $e_{1}$ | $d$ | $s$ | $d t a(N 4)$ | $n$ | $e_{19}$ | $b 2$ | out | FOCA1 | $u$ |
| $e_{2}$ | $s$ | $d$ | $d t a(N 4)$ | $n$ | $e_{20}$ | $b 2$ | out | FOCA2 | $u$ |
| $e_{3}$ | $d$ | $s$ | dta(N6) | $n$ | $e_{21}$ | b1 | out | FOCA1 | $u$ |
| $e_{4}$ | $s$ | $d$ | $d t a(N 6)$ | $n$ | $e_{22}$ | b1 | out | FOCA2 | $u$ |
| $e_{5}$ | $s 1$ | in | $l s(N 3, F O C A 1 / N 6)$ | $n$ | $e_{23}$ | ip | out | FOCA1 | $u$ |
| $e_{6}$ | s1 | in | $l s(N 5, F O C A 2 / N 6)$ | $n$ | $e_{24}$ | ip | out | FOCA2 | $u$ |
| $e_{7}$ | + | - | $r t a(R 5)$ | $u$ | $e_{25}$ | in | out | FOCA1 | $u$ |
| $e_{8}$ | - | + | $r t a(R 5)$ | $u$ | $e_{26}$ | in | out | FOCA2 | $u$ |
| $e_{9}$ | + | - | $r t a(R 6)$ | $u$ | $e_{27}$ | + | - | $r t a(R 1)$ | $u$ |
| $e_{10}$ | - | + | $r t a(R 6)$ | $u$ | $e_{28}$ | - | + | $r t a(R 1)$ | $u$ |
| $e_{11}$ | + | - | $r t a(R 3)$ | $u$ | $e_{29}$ | $+$ | - | $r t a(R 2)$ | $u$ |
| $e_{12}$ | - | + | $r t a(R 3)$ | $u$ | $e_{30}$ | - | + | $r t a(R 2)$ | $u$ |
| $e_{13}$ | $+$ | - | $r t a(R 4)$ | $u$ |  |  |  |  |  |
| $e_{14}$ | - | + | $r t a(R 4)$ | $u$ |  |  |  |  |  |
| $e_{15}$ | in | out | $l s(N 3, F O C A 1 / N<6,7>)$ | $n$ |  |  |  |  |  |
| $e_{16}$ | in | out | $l s(N 5, F O C A 2 / N<6,7>)$ | $n$ |  |  |  |  |  |
| $e_{17}$ | in |  | $\operatorname{scm}(N 1, F O C A 1 / N 1)$ | $n$ |  |  |  |  |  |
| $e_{18}$ | in | out | $\operatorname{scm}(N 2, F O C A 2 / N 1)$ | $n$ |  |  |  |  |  |

(b) attributes

Figure B.5.: Signal flow graph and edge attributes of the dual gain amplifier from Fig. A.5. The structure recognition results for signal flow graph generation of the bias circuitry are shown in Fig. A.7. The abstract signal flow graph for FOCA1 and $F O C A 2$ is shown in Fig. B.4.

```
Algorithm 20 Hierarchical signal flow graph generation.
    procedure BUILDSIGNALFLOWGRAPH \(\left(L_{\text {struct }}, L_{E S F G}, L_{\text {pin }}\right)\)
        \(L_{E S F G}^{\prime}=L_{E S F G}\)
        for all \(C_{B, i} \in L_{\text {struct }}\) do
            \(G_{D, i}=\emptyset\)
            \(T_{i}=\) findTopLevelStructures \(\left(C_{B, i}\right)\)
            for all \(b b \in T_{i}\) do
                \(G_{A, b b}=\) findOrCreateAbstractGraphModel(bb, \(\left.L_{E S F G}^{\prime}, L_{p i n}\right)\)
                \(G_{D, i}=G_{D, i} \cup G_{A, b b}\)
            end for
            \(L_{E S F G}^{\prime}=L_{E S F G}^{\prime} \cup G_{D, i}\)
        end for
        return \(L_{E S F G}^{\prime}\)
    end procedure
```

```
Algorithm 21 Symmetry computation for analog circuits.
    procedure SYMMETRYCOMPUTATION \(\left(L_{E S F G}, L_{\text {pin }}, N_{\text {supply }}\right)\)
        \(L_{\text {sym }}=\emptyset\)
        for all \(G_{D, i} \in L_{G}\) do
            \(P_{\text {diff }, i n, i}=\) findDifferentialInputs \(\left(L_{p i n}, G_{D, i}\right)\)
            \(P_{\text {single }, i n, i}=\) findSingleInputs \(\left(L_{p i n}, G_{D, i}\right)\)
            \(P_{\text {bias }, i}=\) findBiasPins \(\left(L_{p i n}, G_{D, i}\right)\)
            for all \(\left(p_{j}, p_{k}\right) \in P_{d i f f, i n, i}\) :
                    diff NodeSymmetries \(\left(p_{j}, p_{k}, G_{D, i}, L_{\text {sym }}, P_{\text {bias }, i}, N_{\text {supply }}\right) \quad / /\) See Alg. 22
            for all \(p_{l} \in P_{\text {single }, i n, i}\) :
                    singleNodeSymmetries \(\left(p_{l}, G_{D, i}, L_{\text {sym }}, P_{\text {bias }, i}, N_{\text {supply }}\right) \quad / /\) See Alg. 23
        end for
        return \(L_{\text {sym }}\)
    end procedure
```

been processed, the detailed graph model $G_{D, t_{i}}$ is also stored in $L_{E S F G}^{\prime}$ (line 10). Finally, Algorithm 20 returns the library $L_{E S F G}^{\prime}$ in line 12 all which contains all generated signal flow graphs.

The signal flow graph generation for a hierarchy level is decoupled from all other hierarchy levels in the design as the partial circuit flattening method from Sec. A. 4 made the required hierarchical information to model the signal flow available on each hierarchy level. Hence, signal flow graph generation can be parallelized.

## B.5. Signal Path Analysis

Algorithms 21-26 show the implemented symmetry computation method for analog circuits. As shown in the previous section, partial circuit flattening allows to decouple the circuit hierarchy levels for signal flow graph generation. The same is true for symmetry computation: Algorithm 21 simply iterates over every element of a given enhanced signal flow graph library $L_{E S F G}$ in line 3 and executes Algorithms 22 and 26 in lines 7 and 10 which perform the actual computations for the given graph $G_{D, i}$ without requiring to traverse the circuit hierarchy.

The library $L_{p i n}$ contains the following information for each circuit type $t_{i}$ :

- a set of differential input pins $P_{d i f f, i n, i} \subseteq\left(P_{i} \times P_{i}\right)$,
- a set of single input pins $P_{\text {single }, i n, i} \subseteq P_{i}$,
- and a set of bias pins $P_{\text {bias }, i} \subseteq P_{i}$.

The single and differential input pins carry the (differential) input signals processed by the circuit. The bias pins provide voltages or currents which set the operating point of the circuit and do not process any signals in general.

Algorithm 21 uses the sets $P_{d i f f, i n, i}$ and $P_{\text {single }, i n, i}$, which are determined by functions findDifferentialInputs $\left(L_{p i n}, G_{D, i}\right)$ and findSingleInputs $\left(L_{p i n}, G_{D, i}\right)$ in lines 4 and 5, as starting points of Algorithms 22 and 26. These track the (differential) input signals through the graph alongside symmetrical edges recursively. The recursive search stops when a bias pin or a supply net of the circuit has been reached. The output pins of the circuit have been explicitly excluded as termination criterion as there might be feedback paths in the circuit which have to be traced back.

Algorithms 22 and 26 partition a given detailed signal flow graph $G_{D}$ into the following five sets:

- $N_{s}$ : this set contains all nodes of the graph which are not considered symmetric to any other node of the graph. However, incoming and outgoing edges of such nodes can be pairwise symmetrical.
- $E_{s}$ : this set contains all edges which do not yield any symmetry condition. They simply transfer a signal from one node to another.
- $N_{d i f f}$ : this set contains all symmetrical node pairs of the graph. Two nodes are symmetric if it is possible to pairwise match their incoming and outgoing edges.
- $E_{\text {diff }}$ : this set contains all symmetrical edge pairs in the graph. Two edges are symmetric if their attributes match, i.e., eq. (B.2) holds, and if they origin or end at the same node or a symmetrical node pair.
- $E_{c l}$ : this set stores all conversion links of a graph, i.e., edges which convert a differential signal into a single ended signal. Conversion links are self-symmetric, i.e., their corresponding devices must be sized identically and layed out symmetrically.

These sets are stored in the library $L_{\text {sym }}$ which is the output of Algorithm 21.
Fig. B. 6 shows several different scenarios of node and edge connectivity which can be encountered in a signal flow graph during symmetry computation. Their corresponding partitioning is annotated under each of them.

The nodes $n 1$ and $n 2$ are symmetric as it is possible to pairwise match their incoming and outgoing edges.

Node $n 5$ is a merge point in the graph, i.e., two symmetrical edges end in $n 5$ which corresponds to merging a differential signal into a single ended one.

The branch node $n 6$ splits a single ended signal into a differential one by two symmetrical edges.

A special case of signal conversion occurs when a differential signal is merged from one node into the other by one self-symmetric edge. Such edges are called "conversion links" from now on. In the example, edge $e_{\text {out }, n 9,1}$ converts the differential signal at nets ( $n 9, n 10$ ) into a single ended one. The merge point is $n 10$ as the conversion link ends there.

$N_{s}=\emptyset$
$E_{s}=\emptyset$
$N_{\text {diff }}=\{(n 1, n 2)\}$
$E_{\text {diff }}=\left\{\left(e_{i n, n 1, i,}, e_{i n, n 2, i}\right) \mid i=1,2, \ldots, k\right\} \cup$ $\left\{\left(e_{\text {out }, n 1, i}, e_{\text {out }, n 2, i}\right) \mid i=1,2, \ldots, k\right\}$
merge point

$N_{s}=\{n 5\}$
$E_{s}=\left\{e_{\text {out }, n 5,1}\right\}$
$N_{d i f f}=\{(n 3, n 4)\}$
$E_{\text {diff }}=\left\{\left(e_{\text {out }, n 3,1}, e_{\text {out }, n 4,1}\right)\right\}$
$E_{c l}=\emptyset$
$E_{c l}=\emptyset$
branch point


$$
\begin{aligned}
& N_{s}=\{n 6\} \\
& E_{s}=\left\{e_{\text {out }, n 6,1}\right\} \\
& N_{\text {diff }}=\{(n 7, n 8)\} \\
& E_{\text {diff }}=\left\{\left(e_{\text {out }, n 6,1}, e_{\text {out }, n 6,3}\right),\right. \\
&\left.\left(e_{\text {out }, n 7,1}, e_{\text {out }, n 8,1}\right)\right\}
\end{aligned}
$$

$$
E_{c l}=\emptyset
$$



$$
\begin{aligned}
& N_{s}=\{n 10\} \\
& E_{s}=\emptyset \\
& N_{d i f f}=\{(n 9, n 10)\} \\
& E_{\text {diff }}=\left\{\left(e_{\text {in,n9,1 }}, e_{i n, n 10,1}\right)\right\} \\
& E_{c l}=\left\{e_{o u t, n 9,1}\right\}
\end{aligned}
$$

Figure B.6.: Different node and edge types encountered during symmetry computation. Symmetry conditions are indicated by blue arrows.

```
Algorithm 22 Symmetry computation for differential nodes.
    procedure DIFFNODESYMMMETRIES \(\left(n_{i}, n_{j}, G_{D}\right)\)
        // Additional inputs \(L_{\text {sym }}, P_{\text {bias }}\) and \(N_{\text {supply }}\) not shown.
        \(N_{m}=\operatorname{detectConversionLinks}\left(n_{i}, n_{j}, G_{D}\right) \quad / /\) Alg. 24.
        if \(N_{m} \neq \emptyset\) then
            \(N_{s}, N_{\text {diff }}=\) matchEdges \(\left(E_{i n, n_{i}}, E_{i n, n_{i}}, G_{D}\right) \quad / / \mathrm{Alg} .25\).
            for all \(\left(n_{k}, n_{l}\right) \in N_{\text {diff }}\) : diffNodeSymmetries \(\left(n_{k}, n_{l}, G_{D}\right)\) // Recursion.
            for all \(n_{p} \in N_{m} \cup N_{s}\) : singleNodeSymmetries \(\left(n_{p}, G_{D}\right)\) // Alg. 23.
        else
            \(N_{s}, N_{\text {diff }}=\) edgeSymmetries \(\left(n_{i}, n_{j}, G_{D}\right) \quad / /\) Alg. 25.
            for all \(n_{p} \in N_{s}:\) singleNodeSymmetries \(\left(n_{p}, G_{D}\right) \quad / /\) Alg. 23.
            for all \(\left(n_{k}, n_{l}\right) \in N_{\text {diff }}: \operatorname{diffNodeSymmetries}\left(n_{k}, n_{l}, G_{D}\right) \quad / /\) Recursion.
        end if
    end procedure
```

```
Algorithm 23 Recursive symmetry computation for single nodes.
    procedure SINGLENODESYMMETRIES \(\left(n_{i}, G_{D}\right)\)
        // Additional inputs \(L_{\text {sym }}, P_{\text {bias }}\) and \(N_{\text {supply }}\) not shown.
        if \(\operatorname{isBiasPin}\left(n_{i}\right)\) or isSupplyNet \(\left(n_{i}\right)\) then
            return
        end if
        \(N_{s}, N_{\text {diff }}=\operatorname{edgeSymmetries}\left(n_{i}, G_{D}\right) \quad / /\) Alg. 26.
        for all \(\left(n_{k}, n_{l}\right) \in N_{d i f f}: \operatorname{diffNodeSymmetries}\left(n_{k}, n_{l}, G_{D}\right) \quad / /\) Alg. 22.
        for all \(n_{j} \in N_{s}\) : singleNodeSymmetries \(\left(n_{j}, G_{D}\right)\) // Recursion.
    end procedure
```

Algorithms 22 and 26 require a known symmetrical node pair ( $n_{i}, n_{j}$ ) or an unsymmetrical single node $n_{s}$ as starting point for symmetry computation. From there, the signal paths of the graph are tracked recursively alongside incoming and outgoing symmetrical edge pairs. Typically used starting points are the (differential) input pins of the given circuit which are provided by the library $L_{\text {pin }}$. However, any other internal known symmetrical node pair or unsymmetrical node of the circuit would be suitable. The pin library $L_{\text {pin }}$ must thereby be specified by the designer.

Algorithm 22 discovers symmetries at a differential node pair $\left(n_{i}, n_{j}\right)$ as follows: It first detects all conversion links between $n_{i}$ and $n_{j}$ by calling Algorithm 24 in line 2. The detected conversion links are thereby removed from $G_{D}$. Furthermore, this method returns the merge point of the detected conversion link and stores it in the set $N_{m}$. If $N_{m}$ is not empty, i.e., there is a conversion link between $n_{i}$ and $n_{j}$, symmetries in the merged single ended signal are computed by calling Algorithm 23 (line 6). Before that however, symmetries in the incoming edges of $\left(n_{i}, n_{j}\right)$ have to be detected by calling Algorithm 25 in line 4.

If there is no conversion link between $\left(n_{i}, n_{j}\right)$, then all edge symmetries in the incoming and outgoing edges of the two nodes have to be computed (lines 8-10). The sets $N_{s}$ and $N_{d i f f}$, computed by Algorithm 25 in line 8, contain the next node (pair) which has to be investigated by either calling Algorithm 23 or itself recursively.
Algorithm 23 determines further symmetries at a single node $n_{i}$ of graph $G_{D}$. It operates as follows: it calls Algorithm 26 which partitions the incoming and outgoing edges into the


Figure B.7.: A current mirror bank and its corresponding signal flow graph. A unique edge mapping does not exist.

```
Algorithm 24 Detection of conversion links.
    procedure DETECTCONVERSIONLINKS \(\left(n_{i}, n_{j}, G_{D}\right)\)
        // Results \(N_{m}, E_{c l}, E_{d i f f}\) are stored in \(L_{s y m}\).
        \(N_{m}=\emptyset\)
        \(E_{c l}=\) findConnectingEdges \(\left(n_{i}, n_{j}\right)\)
        \(G_{D}=G_{D} \backslash E_{c l}\)
        for all \(e_{k} \in E_{c l}\) do
            if getEndNode \(\left(e_{k}\right)==n_{i}\) and \(E_{\text {out }, n_{i}}==\emptyset: \quad N_{m}=N_{m} \cup n_{i}\)
            if getEndNode \(\left(e_{k}\right)==n_{j}\) and \(E_{o u t, n_{j}}=\emptyset: \quad N_{m}=N_{m} \cup n_{j}\)
        end for
        if both \(n_{i}, n_{j} \in N_{m}\) then // Antiparallel edges.
            \(N_{m}=\emptyset\)
        end if
        return \(N_{m}\)
    end procedure
```

sets $E_{s}$ and $E_{\text {diff }}$ (line 5). Furthermore, that algorithm determines the next single nodes $N_{s}$ and node pairs $N_{d i f f}$ to investigate based on these two sets. Algorithm 23 iterates over $N_{s}$ and $N_{\text {diff }}$ either calling itself recursively or calling Algorithm 22. The algorithm stops immediately when $n_{i}$ is a bias pin or a supply net of the circuit as these net types in general do not carry any signal information.

Additionally, bias pins are often input to a reference current of a current mirror bank which copies and distributes those currents to different branches of the circuit. The attributes of the generated current mirror bank edges have all identical attributes which could lead to falsely identified symmetry pairs in the circuit. This is another reason why bias pins are not taken as start point for symmetry computation. Instead, the edges in the signal flow graph are traced back to the bias and supply pins of the circuit to determine symmetries in the bias part of the circuit.

Algorithm 24 shows how conversion links between two nodes $n_{i}$ and $n_{j}$ are detected in a graph $G_{D}$. In line 2, the algorithm initializes the set of merge points $N_{m}$ empty. Then it tries to find edges connecting $n_{i}$ with $n_{j}$ by calling function findConnectingEdges $\left(n_{i}, n_{j}\right)$ in line 3, i.e., edges which have $n_{i}$ and $n_{j}$ as start and end node or vice versa, respectively. These edges are removed from the graph $G_{D}$ and determined self-symmetric (line 4).

Afterwards, the algorithm computes the merge point of the two nodes for each detected conversion link $E_{c l}$ in lines 5-8: a signal merges from node $n_{i}$ into node $n_{j}$ if there is an

```
Algorithm 25 Computation of edge symmetries for differential nodes.
    procedure EDGESYMMETRIES \(\left(n_{i}, n_{j}, G_{D}\right)\)
        \(N_{s, i n}, N_{\text {diff }, i n}=\) matchEdges \(\left(E_{i n, n_{i}}, E_{i n, n_{j}}, G_{D}\right)\)
        \(N_{s, \text { out }}, N_{\text {diff }, \text { out }}=\) matchEdges \(\left(E_{\text {out }, n_{i}}, E_{\text {out }, n_{j}}, G_{D}\right)\)
        return \(N_{s, i n} \cup N_{s, \text { out }}, N_{d i f f, \text { in }} \cup N_{d i f f, \text { out }}\)
    end procedure
    procedure matchEdges \(\left(E_{n_{i}}, E_{n_{j}}, G_{D}\right)\)
        // Results \(N_{s}, N_{d i f f}, E_{d i f f}\) are stored in \(L_{s y m}\).
        \(N_{s}, N_{\text {diff }}, E_{\text {diff }}=\emptyset\)
        for all \(e_{k} \in E_{n_{i}}\) do
            \(e_{l}=\) findMatchingEdge \(\left(E_{n_{j}}, e_{k}\right)\)
            \(E_{d i f f}=E_{d i f f} \cup\left(e_{k}, e_{l}\right)\)
            \(n_{p}=\operatorname{getOtherEnd}\left(e_{k}, n_{i}\right), n_{q}=\operatorname{getOtherEnd}\left(e_{l}, n_{j}\right)\)
            if \(n_{p}==n_{q}\) then // Merge point detected.
                    \(N_{s}=N_{s} \cup\left\{n_{p}\right\}\)
            else // New node symmetry detected.
                \(N_{\text {diff }}=N_{\text {diff }} \cup\left(n_{p}, n_{q}\right)\)
            end if
        end for
        \(G_{D}=G_{D} \backslash E_{\text {diff }}\)
        return \(N_{s}, N_{\text {diff }}\)
    end procedure
```

edge connecting $n_{i}$ with $n_{j}$ (line 6). Furthermore, the set of outgoing edges $E_{\text {out }, n_{i}}$ must be empty after removing $E_{c l}$ from the graph. Otherwise, the signal continues to be processed at $n_{i}$. The same conditions hold for an edge merging a signal from $n_{j}$ into $n_{i}$ (line 7).

A special case occurs when both $n_{i}$ and $n_{j}$ have been inserted into $N_{m}$ : this can happen if the two nodes are connected by anti-parallel edges (line 9). Then, no signal merging in the conventional sense took place and $N_{m}$ is emptied again in line 10.
Finally, the algorithm returns the detected merge points $N_{m}$ (line 12).
Algorithm 25 identifies new merge points $N_{s}$, symmetrical node pairs $N_{\text {diff }}$ and symmetrical edge pairs $E_{\text {diff }}$ based on the incoming (line 2), i.e., $E_{n_{i}, i n}$ and $E_{n_{j}, \text { in }}$, and the outgoing (line 3), i.e., $E_{n_{i}, \text { out }}$ and $E_{n_{j}, \text { out }}$, edges of a given node pair $\left(n_{i}, n_{j}\right)$. In this stage of the algorithm, the conversion links between $n_{i}$ and $n_{j}$ have already been removed from the graph $G_{D}$. Hence, it must be possible to match all remaining edges pairwise. The algorithm thereby uses the function matchEdges $\left(E_{n_{i}}, E_{n_{j}}, G_{D}\right)$ shown at the bottom of Algorithm 25. That function initializes the set of merge points $N_{s}$, symmetrical nodes $N_{d i f f}$ and symmetrical edges $E_{\text {diff }}$ empty (line 7). Then in lines 8 and 9, it iterates over all edges of $e_{k} \in E_{n_{i}}$ and tries to find an edge $e_{l} \in E_{n_{j}}$ which has the same attributes as $n_{j}$ according to (B.2). The detected symmetrical edge pair $\left(e_{k}, e_{l}\right)$ is stored in $E_{d i f f}$ in line 10 and the next nodes $n_{p}$ and $n_{q}$ which have to be investigated are determined by the function:

$$
\operatorname{getOtherEnd}(e, n)= \begin{cases}n_{+} & , \text {if } n==n_{-},  \tag{B.3}\\ n_{-} & , \text {if } n==n_{+}\end{cases}
$$

```
Algorithm 26 Computation of edge symmetries for single nodes.
    procedure EDGESYMMETRIES \(\left(n_{i}, G_{D}\right)\)
        \(N_{s, i n}, N_{\text {diff }, i n}=\) matchEdges \(\left(E_{i n, n_{i}}, G_{D}\right)\)
        \(N_{s, \text { out }}, N_{\text {diff }, \text { out }}=\operatorname{matchEdges}\left(E_{\text {out }, n_{i}}, G_{D}\right)\)
        return \(N_{s, i n} \cup N_{s, \text { out }}, N_{d i f f, \text { in }} \cup N_{d i f f, \text { out }}\)
    end procedure
    procedure matchEdges \(\left(E_{n_{i}}, G_{D}\right)\)
        // Results \(N_{s}, N_{d i f f}, E_{s}, E_{d i f f}\) are stored in \(L_{s y m}\).
        \(N_{s}, N_{\text {diff }}, E_{s}, E_{\text {diff }}=\emptyset\)
        for all \(e_{j} \in E_{n_{i}}\) do
            if hasMatchingEdge \(\left(E_{n_{i}}, e_{j}\right)\) then // Branch point detected.
                \(e_{k}=\) findMatchingEdge \(\left(E_{n_{i}}, e_{j}\right)\)
                    \(E_{\text {diff }}=E_{\text {diff }} \cup\left(e_{j}, e_{k}\right)\)
                    \(\left(n_{p}, n_{q}\right)=\) getOtherEnds \(\left(e_{j}, e_{k}\right)\)
                    \(N_{\text {diff }}=N_{\text {diff }} \cup\left(n_{p}, n_{q}\right)\)
            else // Simple signal transfer.
            \(N_{s}=N_{s} \cup\) getOtherEnd \(\left(e_{j}, n_{i}\right)\)
            \(E_{s}=E_{s} \cup e_{j}\)
            end if
        end for
        \(G_{D}=G_{D} \backslash\left(E_{s} \cup E_{d i f f}\right)\)
        return \(N_{s}, N_{\text {diff }}\)
    end procedure
```

in line 11. This function determines whether $n$ corresponds to the start or the end node of $e$ and returns its corresponding other node.

In the next step, the algorithm determines whether $\left(e_{k}, e_{l}\right)$ merges a differential signal into a single ended one by evaluating whether $n_{p}$ and $n_{q}$ correspond to the same node (line 12). If a merge point has been detected, $n_{p}$ is stored in $N_{s}$ (line 13); otherwise, $\left(n_{p}, n_{q}\right)$ is inserted into $N_{\text {diff }}$ as a new node symmetry has been detected (line 15). Finally, in lines 18 and 19, the identified edge symmetries $E_{d i f f}$ are removed from $G_{D}$ and the algorithm returns $N_{s}$ and $N_{d i f f}$.

Algorithm 26 computes the symmetries in the incoming and outgoing edges in lines 2 and 3 for a single node $n_{i}$, similarly to Algorithm 25 for differential node pairs. The function match Edges $\left(E_{n_{i}}, G_{D}\right)$ shown at the bottom of the algorithm first initializes the sets $N_{s}$, $N_{d i f f}, E_{s}$ and $E_{d i f f}$ empty (line 7). Then, it iterates over all edges $e_{j} \in E_{n_{i}}$ and tries to find another edge $e_{k} \in E_{n_{i}}$ which has the same attributes as $e_{j}$ by calling function hasMatchingEdge $\left(E_{n_{i}}, e_{j}\right)$ (lines 8 and 9 ). If that is the case, the corresponding edge pair is inserted into $E_{\text {diff }}$ and the new differential node pair $\left(n_{p}, n_{q}\right)$ into $N_{\text {diff }}$ (lines 10-13). If no matching edge is found for $e_{j}$, then the signal simply flows through $e_{j}$ to its other end node (lines 15-16). That node and $e_{j}$ are inserted into $N_{s}$ and $E_{s}$, respectively.

Finally, the algorithm removes the edges in $E_{s}$ and $E_{\text {diff }}$ from $G_{D}$ and returns the node sets $N_{s}$ and $N_{d i f f}$ in lines 19 and 20.


Figure B.8.: Symmetry computation on the graph model of the folded cascode amplifier from Fig. B.3. Each iteration shows intermediate steps of Algorithms 21-26.

## B. Hierarchical Symmetry Computation

## B.6. Experimental Result

The implemented algorithms are illustrated at the example of the signal flow graph of the folded cascode amplifier from Fig. B. 3 in the following. The starting point is the differential input pin pair (in,ip), the bias pins of the circuit are given as $b 1$ and $b 2$, the output pin as out. The nodes which are currently processed in each step are marked by double circles. Underneath each step, the computed intermediate results are shown.

In the first step, Algorithm 22 detects the differential edge and differential node symmetries $\left(e_{4}, e_{5}\right)$ and $(n 2, n 3)$ in the graph, respectively. The edges $e_{4}$ and $e_{5}$ are symmetric because their attributes fulfil eq. B.2: they originate and end at the input and output pins of the differential pair $d p(N 2, N 3)$. The two edges are afterwards removed from the graph and Algorithm 22 calls itself recursively for $(n 2, n 3)$. Here, the algorithm discovers the conversion link $e_{8}$, the edge symmetries $\left(e_{2}, e_{3}\right),\left(e_{6}, e_{7}\right)$, the merge point $n 1$ and the symmetrical node pair ( $n 4$, out). Node $n 3$ is not treated as merge point as there is still another outgoing edge at node $n 2$ after the removal of $e_{8}$ from the graph. Algorithm 22 calls itself recursively again for ( $n 4$, out) where it detects the conversion links $e_{9}$ and $e_{10}$, the edge symmetry ( $e_{11}, e_{12}$ ) and node out as merge point of the differential input signal $\left(e_{6}, e_{7}\right)$. Furthermore, edge $e_{1}$ has been classified by Algorithm 23 as an edge which does not yield any symmetry conditions and $b 2$ as branch point of $e_{11}$ and $e_{12}$. In the last step, symmetry computation terminates as all edges of the graph have been processed.
Fig. B. 9 shows the final symmetry computation results for the dual gain amplifier from Fig. A.5, Fig. B. 10 the results for its folded cascode amplifier subcircuits FOCA1 and FOCA2. The results for the dual gain amplifier have been computed with the input pin in and the signal flow graph from Fig. B. 5 as inputs.
A method to annotate the identified conversion links and symmetrical edge pairs back to device level is described in [17], Sec. 5.3.3. A similar method has been implemented for this work which is additionally supporting the back-annotation to subcircuit block level.

Overall, nine symmetrical device pairs and one symmetrical subcircuit block pair have been identified on the top-level of the dual gain amplifier. Six more device symmetries have been found in each of the folded cascode amplifiers FOCA1 and FOCA2.

## B.7. Discussion and Limitations

The implemented method is greedy, i.e., it accepts any edge pair as symmetric as long as their attributes fulfil eq. (B.2) and if they origin or end at a symmetrical node pair. Hence, wrong results can be produced when ambiguities are encountered during symmetry computation as shown on the left of Fig. B.11: the four outgoing edges of $n 1$ all have identical attributes. However, only nodes $(n 2, n 3)$ and $(n 4, n 5)$ are symmetrical in the circuit. Hence, the greedy approach can result in falsely identified symmetry pairs as shown in the center of the figure. A mechanism which resolves such ambiguities has been presented by [24]. This mechanism has not been implemented yet.

Conversion links typically consist of only one edge in the graph. However, there might be scenarios where a signal is merged into another one by a series of edges as shown on the right side of Fig. B.11. The edges of such conversion paths must be either all self-symmetric



(a) bias circuitry
(b) core circuit

Figure B.9.: Symmetry computation results for the bias circuitry (a) and the core circuit (b) of the dual gain amplifier from Fig. A.5.


Figure B.10.: Symmetry computation results for the folded cascode amplifier from Fig. A.6.


Figure B.11.: Ambiguities and conversion paths in a signal flow graph.


Figure B.12.: Multiple symmetry axes in a graph.
or point symmetric around the center of such a path. Conversion paths can be detected by a depth-first search algorithm [68]. However, this mechanism has not been implemented yet.

Finally, multiple symmetry axes in a circuit as shown in Fig. B. 12 cannot be detected yet. The presented algorithms can identify the node symmetries (in1,in2), (in3,in4) and ( $n 1, n 2$ ), but not that $[(i n 1, i n 2),(i n 3, i n 4)]$ are also symmetric.

The work by [17], Sec. 5.1.5, overcomes this problem by a so-called "symmetry equivalence transformation". It combines detected node and edge symmetries into graph components which each represent one of the detected symmetries. In the example on the right, nodes $i n 1 *, i n 3 *$ and $n 1 *$ are the symmetry equivalents for $(i n 1, i n 2),(i n 3, i n 4)$ and $(n 1, n 2)$. The symmetry computation algorithm is then rerun for the resulting equivalence graph which reveals further symmetries and symmetry axes in the circuit. These two steps are repeated until no more new symmetries are discovered in the circuit. However, also this method has not been implemented yet.
B. Hierarchical Symmetry Computation

## C. Manual Power-Down Mode Revision

The verification results computed by the power-down verification method from Chap. 3 can be used to revise and resolve the detected errors in the power-down circuitry of a given circuit. In the following, three heuristics are presented to analyze the provided error report.

The design errors are handled according to their severeness, i.e., short circuit paths are handled before dealing with matching and symmetry violations and floating nets. Finally, redundant power-down transistors are identified and removed from the circuit.

After taking one or more of the following measures, it must be verified that no new design errors are introduced by inserting or replacing a power-down transistor by re-running the verification method.

## Step 1: Short Circuit Paths

Short circuit paths can be classified into three different types: definite, potential and induced [9].

All devices of a definite short circuit path are conducting in power-down mode, i.e., the gates of its transistors are connected to a net with voltage level gnd for PMOS and $v d d$ for NMOS transistors, respectively. A definite short circuit path can be shut off as follows: if it runs over one or more power-down transistors, removing at least one of those transistors will switch it off. The net which has been previously pulled to $v d d$ or $g n d$ by the removed power-down transistor takes the opposite voltage level automatically. If it does not run over a power-down transistor, a net on that path has to be split up according to the rip-up patterns from Fig. 1.6.

A current path is a potential short circuit path if the gate of at least one of its transistors is connected to a floating net. The state of such a transistor is not predictable and could cause a current flow in the circuit. By inserting an additional power-down transistor to force at least one of those transistors into the non-conducting state will switch the potential current flow off.

Induced short circuit paths are caused by other potential or definite short circuit paths which run over at least one of the gates of the induced path. Therefore, all potential and definite short circuit paths should be fixed first. Afterwards, the induced current flows either disappeared or turned into definite or potential ones. In the latter case, additional measures have to be taken as described above.

The detected current paths in the dual gain amplifier from Fig. 3.6 are definite and run over the power-down transistors $P P 1$ and NP3 inside the folded cascode amplifiers FOCA1 and FOCA2. Hence, removing one of these transistors from the circuit will shut them off. Removing PP1 however would allow the ground voltage to propagate to the input pin in of the folded cascode amplifiers which would stress their differential pair $d p(N 2, N 3)$. Hence,


Step 1: no changes


Step 1: no changes

additional matching constraints: $s c m(N 1, D G A / N<1,2>)$, $\mathrm{Is}(\mathrm{N} 6, \mathrm{DGA} / \mathrm{N}<3,5>)$, Is(N7, DGA/N<3,5>)
Step 1: NP3 removed
Figure C.1.: Measures taken to fix the short circuit paths of the dual gain amplifier and its folded cascode amplifier subcircuits from Figs. 3.6 and 3.7.

NP3 must be removed from the circuit. The corresponding modified power-down circuitry is shown in Fig. C.1.

## Step 2: Matching and Symmetry Violations

A matching or a symmetry violation is reported whenever the voltage levels at the pins of a matched or symmetrical subcircuit pair differ from each other.

If one of the pins is connected to a floating net and the other to a net with voltage level $v d d$ or gnd, then a power-down transistor can be added to pull the floating net to the corresponding voltage level.

If both nets have different defined voltage levels, i.e., $v d d$ and $g n d$, it can be checked whether one of those nets is pulled to $v d d$ or gnd by a power-down transistor. That transistor can be replaced by another power-down transistor which pulls the net to the opposite voltage level.

If both nets have different defined voltage levels and both are not connected to a powerdown transistor, then it is not possible to resolve the symmetry violation by the above stated measures. In those cases, it must be decided by the designer whether the affected subcircuits are critical for the operation of the circuit and whether the error can be waived or not. If the error cannot be waived, additional measures, e.g. isolating the affected subcircuits in power-down mode by pass gates, must be considered.

By removing NP3 from the power-down circuitry of the folded cascode amplifiers in the previous step, all matching and symmetry violations reported in Sec. 3.5.1 have been resolved. The supply voltage propagates over the top level to the output pins of the folded cascode amplifiers which then fulfill the matching constraints at their cascode and wide-swing cascode current mirrors. Furthermore, $N 2$ and $N 3$ are conducting. The supply voltage can now propagate to $n 1$ and $n 3$ which sets the previously floating nets to a defined voltage level. The matching constraints at the simple current mirrors and level shifters which are each formed by one device of the bias circuitry of the dual gain amplifier and by one device of its folded cascode amplifiers are also fulfilled. The resistors $R 3$ and $R 4$ pull the nets $n 31$ and $n 41$, i.e., the input pins of the simple current mirrors, to $v d d$ which corresponds to the voltage level at their output pins, i.e., $v d d$ at net $n 1$ of $F O C A 1$ and $F O C A 2$. It can be shown that the constraints at the level shifters hold analogously.

Hence, no measures have to be taken for the two circuits as shown in Fig. C.2. A subsequent verification run does not report any short circuit paths, floating nodes or matching violations in power-down mode.

## Step 3: Floating Nets and Redundant Power-Down Transistors

After steps one and two have been successfully executed, the remaining floating nets of the circuit can either be pulled to $v d d$ or gnd by power-down transistors or they can be left floating to reduce the area occupied by the power-down circuitry. This must be carefully decided by the designer.

Furthermore, power-down transistors which can be removed from the power-down circuitry whilst maintaining its functionality can be identified as follows: a power-down transistor is


Step 2: no changes


Step 2: no changes


Step 2: no changes
Figure C.2.: Measures taken to resolve matching and symmetry violations in the dual gain amplifier and its folded cascode amplifier subcircuits from Figs. 3.6 and 3.7.


Step 3: no changes


Step 3: insertion of $P P 1$ and $P P 2$

$\longleftrightarrow$ : matching constaints from building blocks or symmetry pairs
additional matching constraints: $\mathrm{scm}(\mathrm{N} 1, \mathrm{DGA} / \mathrm{N}<1,2>)$, Is(N6, DGA/N<3,5>), Is(N7, DGA/N<3,5>)
Step 3: removal of $P P 1$ and $P P 2$
Figure C.3.: Measures taken to reduce the area of the power-down circuitry of the dual gain amplifier and its folded cascode amplifier subcircuits from Figs. 3.6 and 3.7.
redundant at net $n$ of the circuit, if there exists another conducting path from $n$ to the supply or ground net of the circuit which does not run over the examined power-down transistor, i.e., gnd or $v d d$ would inherently propagate to $n$.

In the dual gain amplifier from Fig. C.2, the input pins $i p$ of the folded cascode amplifiers $F O C A 1$ and FOCA2 are connected by net $i n$. Hence, pulling this net to $v d d$ by a powerdown transistor allows to remove $P P 2$ from the folded cascode amplifiers. Furthermore, the pins in and out of FOCA1 and FOCA2 are connected with each other over the resistors $R 1$ and $R 2$. Hence, inserting a power-down transistor at the output pin of the dual gain amplifier allows the supply voltage to propagate to the afore-mentioned pins of the folded cascode amplifiers. Hence, PP1 can be removed again from FOCA1 and FOCA2. Fig. C. 3 illustrates the above taken measures. Overall, two power-down transistors can be removed from the hierarchical design whilst still maintaining its functionality.

## List of Figures

1.1. Simplified block diagram of a wireless speaker. ..... 3
1.2. Schematics of a simple digital (a) and a simple analog circuit (b). ..... 4
1.3. Two digital power-saving techniques. ..... 5
1.4. Basic principle of the analog power-down mode (a) and the source circuit from Fig. 1.2b with its power-down circuitry (b) highlighted in blue. ..... 6
1.5. Power-down mode implementation variants considering device matching. ..... 6
1.6. Shut-off patterns used for power-down mode implementation [11]. ..... 8
1.7. Potential current flows in a source circuit (a), faulty power-down mode im- plementation for a source circuit series (b). ..... 10
1.8. Data flows of the state-of-the-art (a) and the proposed power-down verification and power-down synthesis method (b) for hierarchical analog circuits. The diagram on the right (c) shows the data flow for the new structure recognition and symmetry computation methods. ..... 11
2.1. Different device types and their corresponding pins. ..... 15
3.1. Overview of the hierarchical power-down verification method. ..... 18
3.2. Static DC behavior for different device types. ..... 19
3.3. Overview of the symmetry assertion procedure [13]. ..... 25
3.4. Differential stage before (a) and after (b) power-down up transformation. ..... 26
3.5. Matching rules for: (a) analog basic building blocks [10], (b) symmetry pairs in signal paths and (c) symmetrical pin pairs of subcircuit blocks due to underlying signal path symmetries. ..... 27
3.6. Bias circuitry (a) and core circuit (b) of a dual gain amplifier (DGA) with its power-down circuitry highlighted in blue. The subcircuits FOCA1 and $F O C A 2$ are each implemented as the folded cascode amplifier from Fig. 3.7. ..... 29
3.7. Schematic of a folded cascode amplifier (FOCA) which implements subcircuits FOCA1 and FOCA2 of the dual gain amplifier from Fig. 3.6. Its power-down circuitry is highlighted in blue. ..... 30
3.8. Symmetry and matching constraints of the dual gain amplifier from Fig. 3.6. ..... 32
3.9. Symmetry and matching constraints for FOCA1 and FOCA2 from Fig. 3.7. ..... 33
3.10. Schematic of a high input impedance differential amplifier (HIIDA) [38]. The Miller operational transconductance amplifier (OTA) implements its subcir- cuits MILLER1, MILLER2 and MILLER3 (b). Their power-down cir- cuitry is highlighted in blue. ..... 37
3.11. High input impedance differential amplifier (HIIDA) [38] and its Miller op- erational transconductance amplifier (OTA) subcircuit blocks after power-up transformation. ..... 39
4.1. Overview of the power-down synthesis method. ..... 44
4.2. Schematic of a differential stage series $(S D F S)$. Subcircuits $D F S 1$ and $D F S 2$ each implement one differential stage. The definite short circuit path $P 1$, detected by voltage propagation and short-circuit path computation, is high- lighted in red. ..... 46
4.3. Ripped-up schematic of the differential series from Fig. 4.2. ..... 46
4.4. Final synthesized power-down circuitry for the differential stage series from Fig. 4.3 ..... 59
4.5. Analysis and synthesis results for a current mirror operational amplifier [40]. ..... 64
4.6. Basic building blocks and symmetry pairs of a low resistance operational am- plifier [41]. ..... 66
4.7. Power-down synthesis results for the low resistance operational amplifier from Fig. 4.6. ..... 67
4.8. Analysis and synthesis results for a fully differential operational amplifier [42]. ..... 68
4.9. Symmetry and matching constraints of the dual gain amplifier from Fig. 3.6. ..... 69
4.10. Symmetry and matching constraints for $F O C A 1$ and $F O C A 2$ from Fig. 4.9. ..... 70
4.11. Bias circuitry of the dual gain amplifier from Fig. 4.9a after rip-up and gate shut-off. ..... 70
4.12. Valid power-down mode implementation variants for the dual gain amplifier (DGA) and folded cacscode amplifier (FOCA) from Tables 4.15 and 4.16, and their corresponding flat and hierarchical scores. ..... 72
4.13. Basic building blocks and symmetry pairs of a high input impedance dif- ferential amplifier (HIIDA) [38] and its Miller operational transconductance amplifier (OTA) subcircuit blocks. ..... 73
4.14. Synthesis results for the high input impedance differential amplifier (HIIDA) and its Miller operational transconductance amplifiers (OTAs). ..... 74
5.1. Excerpt of the basic building block library from App. A. ..... 78
5.2. Differential input stage and its generated building blocks. ..... 79
5.3. Generated building block library for the differential input stage from Fig. 5.2. ..... 80
5.4. Characteristics and group strength of a simple current mirror. ..... 82
5.5. Schematic of a folded cascode amplifier with different current mirror loads. ..... 84
5.6. Generated building block library for the six folded cascode amplifier variants from Fig. 5.6. ..... 86
5.7. Five different level shifter topologies [ $47 ; 48 ; 49 ; 50$ ]. ..... 88
5.8. Level converter library generated by Algorithms 7-9. ..... 89
A.1. Dual gain amplifier from Fig. 3.6 without power-down circuitry. ..... 94
A.2. Schematic of a folded cascode amplifier which implements subcircuits FOCA1 and $F O C A 2$ of the dual gain amplifier from Fig. A.1. ..... 94
A.3. Analog basic building block library $L_{\text {analog }}=L_{\text {array }} \cup L_{\text {pair }}$ based on [4;51; 10]. 95
A.4. Overview of the hierarchical structure recognition method. ..... 97
A.5. Partially flattened dual gain amplifier from Fig. A.1. ..... 99
A.6. Partially flattened folded cascode amplifier from Fig. A.2. ..... 99
A.7. Structure recognition results for the bias circuitry of the dual gain amplifier (DGA) from Fig. A.5a. ..... 106
A.8. Structure recognition results of rank zero and rank one for the folded cascode amplifier (FOCA) from Fig. A.6. ..... 107
A.9. Structure recognition results of rank two and final results for the folded cas- code amplifier (FOCA) from Fig. A.6. ..... 108
B.1. Overview of the hierarchical symmetry computation method. ..... 110
B.2. Signals flow graph library $L_{E S F G}$ for the analog basic building blocks of Fig. A. 3 according to [17]. ..... 112
B.3. Detailed signal flow graph and corresponding edge attributes of the folded cascode amplifier (FOCA) from Fig. A.6. The structure recognition results used for signal flow graph generation are shown in Fig. A.9. ..... 113
B.4. Abstract signal flow graph $G_{A, F O C A}$ of the folded cascode amplifier from Fig. A.6.114
B.5. Signal flow graph and edge attributes of the dual gain amplifier from Fig. A.5. The structure recognition results for signal flow graph generation of the bias circuitry are shown in Fig. A.7. The abstract signal flow graph for FOCA1 and $F O C A 2$ is shown in Fig. B.4. ..... 115
B.6. Different node and edge types encountered during symmetry computation. Symmetry conditions are indicated by blue arrows. ..... 118
B.7. A current mirror bank and its corresponding signal flow graph. A unique edge mapping does not exist. ..... 120
B.8. Symmetry computation on the graph model of the folded cascode amplifier from Fig. B.3. Each iteration shows intermediate steps of Algorithms 21-26. ..... 123
B.9. Symmetry computation results for the bias circuitry (a) and the core circuit (b) of the dual gain amplifier from Fig. A.5. ..... 125
B.10.Symmetry computation results for the folded cascode amplifier from Fig. A.6. ..... 126
B.11.Ambiguities and conversion paths in a signal flow graph. ..... 126
B.12.Multiple symmetry axes in a graph ..... 127
C.1. Measures taken to fix the short circuit paths of the dual gain amplifier and its folded cascode amplifier subcircuits from Figs. 3.6 and 3.7. ..... 130
C.2. Measures taken to resolve matching and symmetry violations in the dual gain amplifier and its folded cascode amplifier subcircuits from Figs. 3.6 and 3.7. . ..... 132
C.3. Measures taken to reduce the area of the power-down circuitry of the dual gain amplifier and its folded cascode amplifier subcircuits from Figs. 3.6 and 3.7. . ..... 133

List of Figures

## List of Tables

2.1. Set of circuits $\mathcal{C}$ used to construct the source circuit series from Fig. 1.7. ..... 16
3.1. Voltage propagation results for the source circuit series (SCS) of Fig. 1.7b. ..... 21
3.2. Voltage propagation results for the source circuits $S C 1$ and $S C 2$ from Fig. 1.7a. ..... 21
3.3. Matching and symmetry assertion results for the differential stage from Fig. 3.4a. ..... 28
3.4. Voltage propagation results for the dual gain amplifier from Fig. 3.6. ..... 29
3.5. Voltage propagation results for the folded cascode amplifiers from Fig. 3.7. The results are valid for $F O C A 1$ and $F O C A 2$ of the dual gain amplifier from Fig. 3.6 as these subcircuit are exposed to the same pin voltages. ..... 30
3.6. Summary of the matching and symmetry assertion results for the dual gain amplifier (DGA) and its folded cascode amplifier subcircuits FOCA1 and $F O C A 2$ from Fig. 3.6 and 3.7 ..... 34
3.7. Matching assertion results for the dual gain amplifier from Fig. 3.6. ..... 34
3.8. Symmetry assertion results for the dual gain amplifier from Fig. 3.6. ..... 35
3.9. Matching assertion results for $F O C A 1$ and $F O C A 2$ from Fig. 3.7. ..... 36
3.10. Symmetry assertion results for $F O C A 1$ and $F O C A 2$ from Fig. 3.7. ..... 37
3.11. Final voltage propagation result for the high input impedance differential amplifier (HIIDA) and the Miller operational transconductance amplifiers (OTAs) from Fig. 3.10. ..... 38
3.12. Matching assertion results for the Miller operational transconductance ampli- fiers MILLER1 - MILLER3 from Fig. 3.10b. ..... 39
3.13. Symmetry assertion result for the high input impedance differential amplifier from Fig. 3.10a. ..... 40
3.14. Symmetry assertion results for the Miller operational transconductance am- plifiers from Fig. 3.10b. ..... 40
4.1. Voltage propagation result for the ripped-up differential stage series from Fig. 4.3. ..... 55
4.2. Constraint optimization problem from eq. (4.15) for the differential stage series from Fig. 4.3. ..... 55
4.3. Solutions of the constraint optimization problem from Table 4.2. The power- down transistors inserted by the boundaryConditions(v) constraint at nets outn, $n 1$ and pwd have already been removed from this table and reverted to $g n d, v d d, v d d$, respectively. ..... 56
4.4. Voltage propagation results of differential stage $D F S 1$ for solution $\mathbf{v}_{2}^{*}$ of the differential stage series. ..... 56
4.5. Voltage propagation results for differential stage $D F S 1$ based on solution $\mathbf{v}_{1}^{*}$ of the differential stage series. ..... 57
4.6. Voltage propagation results for differential stage $D F S 2$ based on solution $\mathbf{v}_{1}^{*}$ of the differential stage series. ..... 57
4.7. Constraint optimization problem from eq. (4.15) for differential stage $D F S 2^{\prime}$ and solution $\mathbf{v}_{1}^{*}$ of the differential stage series from Fig. 4.3 ..... 58
4.8. Solutions of the constraint optimization problem from Table 4.7. The power- down transistors inserted by the boundary conditions have already been re- verted to $v d d$ or $g n d$, respectively. ..... 58
4.9. Valid solutions of the gate shut-off COP for the differential stage series from Fig. 4.3 if the symmetry and matching information is not propagated between the bottom and the top-level of the circuit hierarchy. The solutions for differ- ential stages $D F S 1$ and $D F S 2^{\prime}$ are shown in Tables 4.10 and 4.11, respectively. 60 ..... 60
4.10. Power-down mode implementation variants for differential stage $D F S 1$ re- quired to implement the solutions of the differential stage series shown in Table 4.9. ..... 61
4.11. Power-down mode implementation variants for differential stage $D F S 2^{\prime}$ re- quired to implement the solutions of the differential stage series shown in Table 4.9. ..... 61
4.12. Flat and hierarchical score for the power-down mode implementation variants of the differential stage series from Table 4.9. ..... 62
4.13. Power-down synthesis results of the new method from Chap. 4 and the ap- proach by [10] for the current mirror operational amplifier. ..... 65
4.14. Comparison of the building block matching results for the current mirror operational amplifier for the new synthesis method from Chap. 4 and the approach by [10]. ..... 65
4.15. Solutions of the gate shut-off COP for the dual gain amplifier for its core circuit and its ripped-up bias circuits from Figs. 3.8b and 4.11. ..... 71
4.16. Synthesized implementation variants for the folded cascode amplifiers FOCA1 and FOCA2 from Fig. 3.9. ..... 71
5.1. Weights of the tentative buildings formed in the first iteration of Algorithm 7 for the circuit of Fig. 5.2. ..... 83

## Bibliography

[1] M. Horowitz, T. Indermaur, and R. Gonzalez, "Low-power digital design," in Proceedings of 1994 IEEE Symposium on Low Power Electronics, 1994, pp. 8-11.
[2] R. Gonzalez, B. Gordon, and M. Horowitz, "Supply and threshold voltage scaling for low power cmos," IEEE Journal of Solid-State Circuits, vol. 32, no. 8, pp. 1210-1216, 1997.
[3] T. Kuroda, T. Fujita, S. Mita, T. Nagamatsu, S. Yoshioka, K. Suzuki, F. Sano, M. Norishima, M. Murota, M. Kako, M. Kinugawa, M. Kakumu, and T. Sakurai, "A 0.9-v, 150$\mathrm{mhz}, 10-\mathrm{mw}, 4 \mathrm{~mm} / \mathrm{sup} 2 /, 2-\mathrm{d}$ discrete cosine transform core processor with variable threshold-voltage (vt) scheme," IEEE Journal of Solid-State Circuits, vol. 31, no. 11, pp. 1770-1779, 1996.
[4] T. Massier, H. Graeb, and U. Schlichtmann, "The sizing rules method for cmos and bipolar analog integrated circuit synthesis," IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, vol. 27, no. 12, pp. 2209-2222, 2008.
[5] E. Maricau and G. Gielen, "Transistor aging-induced degradation of analog circuits: Impact analysis and design guidelines," in 2011 Proceedings of the ESSCIRC (ESSCIRC), Sep. 2011, pp. 243-246.
[6] C. Michael, H. Wang, C. S. Teng, J. Shibley, L. Lewicki, C.-M. Shyu, and R. Lahri, "Mismatch drift: a reliability issue for analog MOS circuits," in 30th Annual Proceedings Reliability Physics 1992, March 1992, pp. 81-84.
[7] Y. Chen, J. Zhou, S. Tedja, Frank Hui, and A. S. Oates, "Stress-induced mosfet mismatch for analog circuits," in 2001 IEEE International Integrated Reliability Workshop. Final Report (Cat. No.01TH8580), 2001, pp. 41-43.
[8] S. Blieck and E. Janssens, "Software check for power-down mode of analog circuits," in ESSCIRC '96: Proceedings of the 22nd European Solid-State Circuits Conference, 1996, pp. 404-407.
[9] M. Zwerger and H. Graeb, "Short-circuit-path and floating-node verification of analog circuits in power-down mode," in 2012 International Conference on Synthesis, Modeling, Analysis and Simulation Methods and Applications to Circuit Design (SMACD), 2012, pp. 241-244.
[10] M. Zwerger, "Verification and synthesis of analog power-down circuits," Ph.D. dissertation, Technical University of Munich, 2017.
[11] M. Zwerger, M. Neuner, and H. Graeb, "Power-down circuit synthesis for analog/mixedsignal," in 2015 IEEE/ACM International Conference on Computer-Aided Design (ICCAD), 2015, pp. 656-663.
[12] ——, "Analog power-down synthesis," IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, vol. 36, no. 12, pp. 1954-1967, 2017.
[13] M. Zwerger and H. Graeb, "Detection of asymmetric aging-critical voltage conditions in analog power-down mode," in 2015 Design, Automation Test in Europe Conference Exhibition (DATE), 2015, pp. 1269-1272.
[14] M. Zwerger, G. Shrivastava, and H. Graeb, "Power-down synthesis for analog circuits including switch sizing," in 2016 13th International Conference on Synthesis, Modeling, Analysis and Simulation Methods and Applications to Circuit Design (SMACD), 2016, pp. 1-4.
[15] M. Neuner, M. Zwerger, and H. Graeb, "Power-down schematic synthesis for analog/mixed-signal circuits," in ANALOG 2016; 15. ITG/GMM-Symposium, 2016, pp. 1-6.
[16] H. Graeb, S. Zizala, J. Eckmueller, and K. Antreich, "The sizing rules method for analog integrated circuit design," in IEEE/ACM International Conference on Computer Aided Design. ICCAD 2001. IEEE/ACM Digest of Technical Papers (Cat. No.01CH37281), 2001, pp. 343-349.
[17] M. Eick, "Structure and signal path analysis for analog and digital circuits," Ph.D. dissertation, Technical University of Munich, 2013.
[18] M. Zwerger, P. Vlachas, and H. Graeb, "A fast analytical approach for static powerdown mode analysis," in 2015 IEEE International Conference on Electronics, Circuits, and Systems (ICECS), 2015, pp. 1-4.
[19] R. Barták, "Constraint programming: In pursuit of the holy grail," in Week of Doctoral Students (WDS99), June 1999, pp. 1-4.
[20] G. T. C. Schulte and M. Z. Lagerkvist, Modeling and Programming with Gecode, 6th ed., www.gecode.org, June 2020.
[21] Cadence Design Systems, Inc, https://www.cadence.com/, 2020.
[22] J. DeLisle, "What is analog ic design?" All About Circuits, Tech. Rep., Dec. 2020, online; accessed 18.03.202021. [Online]. Available: https://www.allaboutcircuits.com/ technical-articles/what-is-analog-ic-design/
[23] D. Sridharan, "Automatic structural analysis for hierarchical designs," Master's thesis, Technical University of Munich, 2013.
[24] K. Lu, "Symmetry Recognition for Automatic Sizing of Analog Integrated Circuits," Master's thesis, Technische Universität München, Sep. 2009.
[25] M. Neuner and H. Graeb, "Power-down mode verification for hierarchical analog circuits," in 2019 16th International Conference on Synthesis, Modeling, Analysis and Simulation Methods and Applications to Circuit Design (SMACD), 2019, pp. 125-128.
[26] ——, "Verification and revision of the power-down mode for hierarchical analog circuits," Integration, vol. 73, pp. 1-9, 2020.
[27] ——, "Synergetic algorithm for power-down synthesis," in 2020 European Conference on Circuit Theory and Design (ECCTD), 2020, pp. 1-5.
[28] -_, "Hierarchical analog power-down synthesis," in 2020 IEEE International Conference on Electronics Circuits and Systems (ICECS), 2020.
[29] M. Neuner, I. Abel, and H. Graeb, "Library-free Structure Recognition for Analog Circuits," in Design, Automation and Test in Europe (DATE), Feb. 2021.
[30] I. Abel, "Eine Datenstruktur zur Identifikation von Funktionsblöcken in analogen Schaltungen," Bachelor Thesis, Technical University of Munich, Chair of Electronic Design Automation, Tech. Rep., 2016.
[31] ——, "Automatische Regelgenerierung zur Erkennung von Schaltungsstrukturen," Research Internship, Technical University of Munich, Chair of Electronic Design Automation, Tech. Rep., Mar. 2018.
[32] E. Haardt, "Power-Down-Verifikation hierarchischer analoger Schaltungen," Jan. 2019.
[33] ——, "Sizing Constraints for Rail-to-Rail Amplifiers," Master's thesis, Oct. 2020.
[34] A. Dangi, "Framework for Symmetry Computation in Analog Circuits," DAAD WISE internship report, Technical University of Munich, Chair of Electronic Design Automation, Tech. Rep., 2017.
[35] S. Muralitharan, "Automatic Symmetry Constraint Generation and their Application to Wicked," DAAD WISE internship report, Technical University of Munich, Chair of Electronic Design Automation, Tech. Rep., 2018.
[36] E. Shi, "ESD Verification at Circuit Level based on Structural Analysis," TUM PREP internship report, Technical University of Munich, Chair of Electronic Design Automation, Tech. Rep., 2018.
[37] Analog Devices, "Paralleling amplifiers increases output drive," http://www.surfline. com/surf-news/maldives-surf-access-controversy-update_75296/, 2020, online; accessed 23.11.2020.
[38] G. B. Clayton, Operational Amplifiers, 5th ed. Newnes, Aug 2003.
[39] T. W. Francesca Rossi, Peter van Beek, Handbook of Constraint Programming, 2nd ed. Elsevier, 2006, vol. Volume 2.
[40] K. M. T. C. Carusone, D. Johns, Analog Integrated Circuit Design. New Jersey, USA: John Wiley \& Sons, Inc., 2011.
[41] P. E. Allen and D. R. Holberg, CMOS Analog Circuit Design, 3rd ed. Oxford University Press, 2012.
[42] Y. Wenger and B. Meinerzhagen, "Implementation of a fully-differential operational amplifier with wide-input range, high-impedance common-mode feedback," in PRIME, July 2019, pp. 1-4.
[43] H. Hung, M. Ker, S. Chen, and C. Chuang, "Abnormal esd damages occur in interface circuits between different power domains in nd-mode mm esd stress," in IPFA, Jul 2006, pp. 163-166.
[44] H. Li, F. Jiao, and A. Doboli, "Analog circuit topological feature extraction with unsupervised learning of new sub-structures," in 2016 Design, Automation Test in Europe Conference Exhibition (DATE), 2016, pp. 1509-1512.
[45] K. Settaluri and E. Fallon, "Fully automated analog sub-circuit clustering with graph convolutional neural networks," in 2020 Design, Automation Test in Europe Conference Exhibition (DATE), 2020, pp. 1714-1715.
[46] K. Kunal, T. Dhar, M. Madhusudan, J. Poojary, A. Sharma, W. Xu, S. M. Burns, J. Hu, R. Harjani, and S. S. Sapatnekar, "Gana: Graph convolutional network based automated netlist annotation for analog circuits," in 2020 Design, Automation Test in Europe Conference Exhibition (DATE), 2020, pp. 55-60.
[47] M. Lanuzza, P. Corsonello, and S. Perri, "Low-power level shifter for multi-supply voltage designs," TCSII, vol. 59, no. 12, pp. 922-926, Dec 2012.
[48] S. H. Kulkarni and D. Sylvester, "High performance level conversion for dual v/sub dd/ design," TVLSI, vol. 12, no. 9, pp. 926-936, Aug 2004.
[49] Q. A. Khan, S. K. Wadhwa, and K. Misri, "A single supply level shifter for multi-voltage systems," in VLSID, Jan 2006, pp. 4 pp.-.
[50] R. Garg, G. Mallarapu, and S. P. Khatri, "A single-supply true voltage level shifter," in DATE, Apr 2008, pp. 979-984.
[51] M. Eick and H. Graeb, "A versatile structural analysis method for analog, digital and mixed-signal circuits," in 2012 International Conference on Synthesis, Modeling, Analysis and Simulation Methods and Applications to Circuit Design (SMACD), 2012, pp. 1-4.
[52] H. Liu, S. Li, Z. Wu, and L. Li, "Vector coding method for symmetry detection in the analog circuits," in 2020 IEEE 14th International Conference on Anti-counterfeiting, Security, and Identification (ASID), 2020, pp. 82-86.
[53] M. E. Kole, J. Smit, and O. E. Herrmann, "Modeling symmetry in analog electronic
circuits," in Proceedings of IEEE International Symposium on Circuits and Systems ISCAS '94, vol. 1, 1994, pp. 315-318 vol.1.
[54] Qingsheng Hao, Song Chen, Xianlong Hong, Yi Su, Sheqin Dong, and Zhiyi Qu, "Constraints generation for analog circuits layout," in 2004 International Conference on Communications, Circuits and Systems (IEEE Cat. No.04EX914), vol. 2, 2004, pp. 1334-1338 Vol.2.
[55] Sambuddha Bhattacharya, N. Jangkrajarng, Roy Hartono, and C. . R. Shi, "Hierarchical extraction and verification of symmetry constraints for analog layout automation," in ASP-DAC 2004: Asia and South Pacific Design Automation Conference 2004 (IEEE Cat. No.04EX753), 2004, pp. 400-405.
[56] MunEDA GmbH, "WiCkeD," http://www.muneda.com/, 2020.
[57] M. Eick and H. E. Graeb, "Mars: Matching-driven analog sizing," IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, vol. 31, no. 8, pp. 11451158, 2012.
[58] E. Charbon, E. Malavasi, and A. Sangiovanni-Vincentelli, "Generalized constraint generation for analog circuit design," in Proceedings of 1993 International Conference on Computer Aided Design (ICCAD), 1993, pp. 408-414.
[59] M. Eick, M. Strasser, K. Lu, U. Schlichtmann, and H. E. Graeb, "Comprehensive generation of hierarchical placement rules for analog integrated circuits," IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, vol. 30, no. 2, pp. 180193, 2011.
[60] B. G. Arsintescu, "A method for analog circuits visualization," in Proceedings International Conference on Computer Design. VLSI in Computers and Processors, 1996, pp. 454-459.
[61] B. G. Arsintescu and S. A. Spanoche, "Global stacking for analog circuits," in Proceedings EURO-DAC '96. European Design Automation Conference with EURO-VHDL '96 and Exhibition, 1996, pp. 392-397.
[62] S. Y. Su Yi, S. D. Sheqin Dong, Q. H. Qingsheng Hao, X. H. Xiangqing He, and X. H. Xianlong Hong, "Automated analog circuits symmetrical layout constraint extraction by partition," in ASIC, 2003. Proceedings. 5th International Conference on, vol. 1, 2003, pp. 166-169 Vol.1.
[63] Zhe Zhou, Sheqin Dong, Xianlong Hong, Qingsheng Hao, and Song Chen, "Analog constraints extraction based on the signal flow analysis," in 2005 6th International Conference on ASIC, vol. 2, 2005, pp. 825-828.
[64] C. Ebeling, "Geminiii: a second generation layout validation program," in [1988] IEEE International Conference on Computer-Aided Design (ICCAD-89) Digest of Technical Papers, 1988, pp. 322-325.
[65] S. Bhattacharya, N. Jangkrajarng, and C. . R. Shi, "Multilevel symmetry-constraint generation for retargeting large analog layouts," IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, vol. 25, no. 6, pp. 945-960, 2006.
[66] K. Kunal, J. Poojary, T. Dhar, M. Madhusudan, R. Harjani, and S. S. Sapatnekar, "A general approach for identifying hierarchical symmetry constraints for analog circuit layout," in 2020 IEEE/ACM International Conference On Computer Aided Design (ICCAD), 2020, pp. 1-8.
[67] M. Liu, W. Li, K. Zhu, B. Xu, Y. Lin, L. Shen, X. Tang, N. Sun, and D. Z. Pan, "S3det: Detecting system symmetry constraints for analog circuits with graph similarity," in 2020 25th Asia and South Pacific Design Automation Conference (ASP-DAC), 2020, pp. 193-198.
[68] R. Sedgewick and K. Wayne, Algorithms, 4th ed. Addison-Wesley Professional, 2011.


[^0]:    ${ }^{1}$ Preliminary works: $[9 ; 13 ; 11 ; 14 ; 15 ; 12]$
    ${ }^{2}$ The structure recognition method by [10] is based on the "sizing rules method" which was originally published in [16]. The sizing rules method has been subsequently enhanced and extended by $[4 ; 17 ; 10]$

