#### EMICRO/SIM 2013

XV Escola de Microeletrônica Sul / 28° Simpósio Sul de Microeletrônica Porto Alegre, 29 de abril a 3 de maio de 2013

# Simultaneous Gate Sizing and Vth Assignment using Lagrangian Relaxation and Delay Sensitivities

<u>Guilherme Flach</u>, Tiago Reimann, Gracieli Posser Marcelo Johann, Ricardo Reis



**PROGRAMA DE** 

PÓS-GRADUAÇÃO EM MICROELETRÔNICA

SIM/EMICRO 2013 Gate Sizing

### What is gate sizing?

Commonly a digital circuit is assembled using standard cells from a library.

Cell Library



## What is gate sizing?

Circuit description is then mapped to this library.

**Cell Library** cell types cell sizes Vth low Vth medium Vth high **Gate Sizing** SIM/EMICRO 2013

### What is gate sizing?

 Is the proper selection of gate types in order to trade-off power and timing requirements.

Power → Minimized Frequency → Met

 $min \quad Power(x)$   $s.t. \quad Timing(x) \leq T$ 

Power → Minimized Frequency → Met

 $min \quad Timing(x)$ s.t.  $Power(x) \leq P$ 

where *x* is the vector with the selected cell type for each cell

#### Gate Sizing

#### Discrete

- Also known as gate selection
- NP-Complete
- No exact solution can be found in reasonable time (general case)

#### Continuous

 Optimum solutions can be found in feasible time but relying on simplified delay models.

# Problem Formulation – ISPD Contest 2012

$$min$$
  $Leakage(x)$   $s.t.$   $Timing(x) \leq T$ 

#### **ISPD Contest 2013**

International Symposium on Physical Design

- Static power (leakage) is the main concern.
- Static Timing Analysis used as measure.
- Simplified wire model: lumped capacitance.

#### **Leakage Power**

Power consumption when circuit is on but not "working". Gets worse as dimensions shrinks

#### Lagrangian Relaxation

$$min$$
  $Leakage(x)$   $s.t.$   $Timing(x) \leq T$ 

Very hard to be solved!

#### **Lagrangian Relaxation**

Hard constraints become part of the objective function.

A little bit easier... heuristics still needed.

# Problem Formulation – KKT Conditions

Due to KKT conditions to optimality, we got...

$$min \ Leakage(x) + \lambda(Timing(x) - T)$$

quivalent

C. P. Chen, C. C.-N. Chu, and D. F. Wong Fast and exact simultaneous gate and wire sizing by lagrangian relaxation

IEEE Trans. on Computer-Aided Design

 $\min \ \ Leakage(x) + \lambda delay_k$  where delay, is the delay of timing ark k



#### Problem Formulation - After All

 $min \ Leakage(x) + \lambda delay_k$ 



SIM/EMICRO 2013

#### Approach

- min  $Leakage(x) + \lambda delay_k$
- for each cell in topological order
  - try every cell type and chose the one which locally best trades-off leakage and delay.



SIM/EMICRO 2013

# Why only local effects are taken into account?

- Well, a change in a cell type may affect several timing arc delays.
- But an incremental STA would be required for each cell change to keep timing updated.



SIM/EMICRO 2013

#### But isn't that too bad?

 Not totally, since the most effect occurs on the first levels.



SIM/EMICRO 2013

# **Greedy Algorithm**



SIM/EMICRO 2013



SIM/EMICRO 2013



SIM/EMICRO 2013

# **Greedy Algorithm**



SIM/EMICRO 2013

# **Greedy Algorithm**



SIM/EMICRO 2013

# **Greedy Algoritm**



SIM/EMICRO 2013

# **Greedy Algorithm**



SIM/EMICRO 2013

# **Greedy Algorithm**



SIM/EMICRO 2013

#### Can we do better?

- Dealing with the cells other than that in the vicinity.
- Propagate back the timing arc delays sensitivities.



SIM/EMICRO 2013

### Timing Arc Delay Sensitivities

- Idea: how a change in the middle of circuit affect the overall circuit time
  - without requiring to execute STA



SIM/EMICRO 2013

### Timing Arc Delay Sensitivities

- for timing arc in reverse topological order
  - compute and propagate its sensitivity in delay due to a change in its input slew



### Timing Arc Delay Sensitivities

- Precise timing information is still only locally computed...
  - but sensitivities provides a way to see what may happen globally;
  - an no incremental STA required.



SIM/EMICRO 2013

#### Power and Timing Recovery

- After selecting cells via Lagrangian:
  - sweep all cells trying to fix any timing violation left
    - usually our Lagrangian model provides a small timing violation
  - sweep all cells trying to decrease leakage power
- In these methods, an incremental STA is used for each cell change.

#### Results

- Smallest leakage power for every ISPD 2012 benchmark.
  - 23k cells circuit sized in less than 1min.
  - 861K cells circuit sized in less than 50min.

|                   | # of  | Leakage Power (W) |       |       | Power Saved (%) |          | Runtime (m) |       |       | OurLPRS Reduction (%) |          |
|-------------------|-------|-------------------|-------|-------|-----------------|----------|-------------|-------|-------|-----------------------|----------|
| Benchmark         | Comb. | Hu                | Li    | Our-  | Compared        | Compared | Hu          | Li    | Our-  | Compared              | Compared |
|                   | Cells | [1]               | [2]   | LPRS  | to [1]          | to [2]   | [1]         | [2]   | LPRS  | to [1]                | to [2]   |
| DMA_slow          | 23K   | 0.145             | 0.153 | 0.132 | 8.73            | 13.50    | 9.90        | 0.60  | 0.79  | 92.02                 | -31.67   |
| DMA_fast          |       | 0.299             | 0.281 | 0.238 | 20.29           | 15.19    | 13.90       | 0.60  | 0.92  | 93.38                 | -53.33   |
| pci_bridge32_slow | 30K   | 0.111             | 0.111 | 0.096 | 13.31           | 13.31    | 10.20       | 1.20  | 0.87  | 91.47                 | 27.50    |
| pci_bridge32_fast |       | 0.183             | 0.167 | 0.136 | 25.51           | 18.37    | 13.00       | 1.20  | 0.92  | 92.92                 | 23.33    |
| des_perf_slow     | 102K  | 0.614             | 0.671 | 0.570 | 7.14            | 15.03    | 70.10       | 6.00  | 25.31 | 63.89                 | -321.83  |
| des_perf_fast     |       | 1.842             | 1.93  | 1.395 | 24.27           | 27.73    | 82.70       | 6.60  | 16.37 | 80.21                 | -148.03  |
| vga_lcd_slow      | 148K  | 0.351             | 0.375 | 0.328 | 6.61            | 12.59    | 87.50       | 7.80  | 5.67  | 93.52                 | 27.31    |
| vga_lcd_fast      |       | 0.471             | 0.46  | 0.413 | 12.22           | 10.12    | 45.60       | 10.20 | 8.37  | 81.64                 | 17.94    |
| b19_slow          | 213K  | 0.583             | 0.604 | 0.564 | 3.28            | 6.64     | 213.90      | 10.20 | 9.15  | 95.72                 | 10.29    |
| b19_fast          |       | 0.771             | 0.784 | 0.717 | 7.06            | 8.61     | 206.50      | 12.00 | 11.75 | 94.31                 | 2.08     |
| leon3mp_slow      | 540K  | 1.341             | 1.4   | 1.334 | 0.53            | 4.72     | 1,274.00    | 43.80 | 38.98 | 96.94                 | 11.00    |
| leon3mp_fast      |       | 1.487             | 1.64  | 1.443 | 2.99            | 12.04    | 1,323.20    | 54.60 | 46.62 | 96.48                 | 14.62    |
| netcard_slow      | 861K  | 1.77              | 1.78  | 1.763 | 0.41            | 0.97     | 299.90      | 48.00 | 34.39 | 88.53                 | 28.35    |
| netcard_fast      |       | 1.861             | 2.18  | 1.841 | 1.07            | 15.55    | 1,096.90    | 88.80 | 47.41 | 95.68                 | 46.61    |
| Avg.              | -     | 0.845             | 0.895 | 0.784 | 9.53            | 12.45    | -           | -     | -     | -                     | -        |
| Sum (h)           | -     | -                 | -     | -     | -               | -        | 79.12       | 4.86  | 4.13  | 94.79                 | 15.12    |

#### Conclusions

- Lagrangian Relaxation fits well to model the discrete gate sizing problem.
  - Mainly when simplified by KKT conditions
- A change in a cell type may affect almost the whole circuit.
  - However the most effects are nearby the changed cell.
  - The remaining effect can be estimated propagating back the delay sensitivities of timing arcs.

#### EMICRO/SIM 2013

XV Escola de Microeletrônica Sul / 28° Simpósio Sul de Microeletrônica Porto Alegre, 29 de abril a 3 de maio de 2013

# Simultaneous Gate Sizing and Vth Assignment using Lagrangian Relaxation and Delay Sensitivities

<u>Guilherme Flach</u>, Tiago Reimann, Gracieli Posser Marcelo Johann, Ricardo Reis

gaflach@inf.ufrgs.br



**PROGRAMA DE** 

PÓS-GRADUAÇÃO EM MICROELETRÔNICA

SIM/EMICRO 2013 Gate Sizing