1. (5 points) Draw the transistor-level diagram of a two-input CMOS NAND gate.

   ![Transistor-Level Diagram of a Two-Input CMOS NAND Gate]

2. (10 points) Write a VHDL entity (3 points) and architecture (7 points) of a D latch with the generics, TPDQ, which reflects the time for a change on D to appear at the outputs and TGDQ, which reflects the time for a change on the gate input, D, to appear at the outputs.

   ```vhdl
   entity DLATCH is
       generic (TPDQ, TPGQ : time);
       port (D, G : in std_logic;
             Q, QB : out std_logic);
   end DLATCH;

   architecture DLATCH of DLATCH is
       signal QTEMP : std_logic;
   begin
       process (D, G)
       begin
           if (G'event) then
               if (G = '1') then
                   QTEMP <= D after TPGQ;
               end if;
           elsif (D'event) then
               if (G = '1') then
                   QTEMP <= D after TPDQ;
               end if;
           end if;
       end process;
       Q <= QTEMP;
       QB <= not QTEMP;
   end DLATCH;
   ```

3. (1 point) **Scheduling** consists of assigning each operation to a control step.
4. (1 point) The **utilization** of a component is the ratio of the busy time for the component to the execution time for the process.

5. (5 points) If the NRE costs for FPGA and ASIC circuits are $25,000 and $575,000, respectively, and the cost of individual parts for FPGA and ASIC circuits are $22 and $7, respectively, what is the break-even manufacturing volume for these two types of circuits?

Let \( x \) be the volume

\[
25,000 + (22x) = 575,000 + (7x)
\]

\[
15x = 550,000
\]

\[
x = 36,667
\]

6. (10 points) Consider the dataflow graph shown below. As part of the cluster partitioning algorithm, determine which two nodes should be merged and show the dataflow graph that results from the merger.

[Diagram of a dataflow graph with nodes 1, 2, 3, 4, 5 and edges showing common neighbors.]  

<table>
<thead>
<tr>
<th>Common Neighbors</th>
<th>Count</th>
</tr>
</thead>
<tbody>
<tr>
<td>(1, 2)</td>
<td>2</td>
</tr>
<tr>
<td>(1, 3)</td>
<td>3</td>
</tr>
<tr>
<td>(1, 4)</td>
<td>1</td>
</tr>
<tr>
<td>(1, 5)</td>
<td>2</td>
</tr>
<tr>
<td>(2, 3)</td>
<td>2</td>
</tr>
<tr>
<td>(2, 5)</td>
<td>2</td>
</tr>
<tr>
<td>(3, 4)</td>
<td>1</td>
</tr>
<tr>
<td>(3, 5)</td>
<td>2</td>
</tr>
</tbody>
</table>

So, 1 and 3 have the most common neighbors, merge them.

7. (4 points) List the four types of paths that must be considered when doing timing analysis of sequential circuits.

- **primary inputs to primary outputs**
- **primary inputs to inputs of storage elements**
- **outputs of storage elements to primary outputs**
- **outputs of storage elements to inputs of storage elements**

8. (1 point) **Synthesis** is the process of transforming a behavioral description into a structural gate-level circuit.

9. (1 point) **Routing** is the process of making the connections between standard cells.
10. (10 points) For the data lifetime chart shown, use the left edge algorithm to obtain an efficient register allocation.

<table>
<thead>
<tr>
<th></th>
<th>A</th>
<th>B</th>
<th>C</th>
<th>D</th>
<th>E</th>
<th>F</th>
<th>G</th>
<th>H</th>
<th>I</th>
<th>J</th>
<th>K</th>
<th>L</th>
</tr>
</thead>
<tbody>
<tr>
<td>S1</td>
<td>X</td>
<td></td>
<td>X</td>
<td></td>
<td>X</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
</tr>
<tr>
<td>S2</td>
<td>X</td>
<td></td>
<td>X</td>
<td></td>
<td>X</td>
<td></td>
<td>X</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
</tr>
<tr>
<td>S3</td>
<td>X</td>
<td>X</td>
<td>X</td>
<td>X</td>
<td>X</td>
<td>X</td>
<td>X</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
</tr>
<tr>
<td>S4</td>
<td>X</td>
<td>X</td>
<td>X</td>
<td>X</td>
<td>X</td>
<td>X</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
</tr>
<tr>
<td>S5</td>
<td>X</td>
<td>X</td>
<td>X</td>
<td>X</td>
<td>X</td>
<td>X</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
</tr>
<tr>
<td>S6</td>
<td>X</td>
<td>X</td>
<td>X</td>
<td>X</td>
<td>X</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
</tr>
<tr>
<td>S7</td>
<td>X</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
</tr>
</tbody>
</table>

Sorting by earliest start and then by earliest end times,

<table>
<thead>
<tr>
<th></th>
<th>G</th>
<th>J</th>
<th>D</th>
<th>I</th>
<th>A</th>
<th>K</th>
<th>B</th>
<th>F</th>
<th>H</th>
<th>E</th>
<th>L</th>
<th>C</th>
</tr>
</thead>
<tbody>
<tr>
<td>S1</td>
<td>X</td>
<td>X</td>
<td>X</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
</tr>
<tr>
<td>S2</td>
<td>X</td>
<td>X</td>
<td>X</td>
<td>X</td>
<td>X</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
</tr>
<tr>
<td>S3</td>
<td>X</td>
<td>X</td>
<td>X</td>
<td>X</td>
<td>X</td>
<td>X</td>
<td>X</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
</tr>
<tr>
<td>S4</td>
<td>X</td>
<td>X</td>
<td>X</td>
<td>X</td>
<td>X</td>
<td>X</td>
<td>X</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
</tr>
<tr>
<td>S5</td>
<td>X</td>
<td>X</td>
<td>X</td>
<td>X</td>
<td>X</td>
<td>X</td>
<td>X</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
</tr>
<tr>
<td>S6</td>
<td>X</td>
<td>X</td>
<td>X</td>
<td>X</td>
<td>X</td>
<td>X</td>
<td>X</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
</tr>
<tr>
<td>S7</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
</tr>
</tbody>
</table>

Doing the allocation

<table>
<thead>
<tr>
<th></th>
<th>R1</th>
<th>R2</th>
<th>R3</th>
<th>R4</th>
<th>R5</th>
<th>R6</th>
<th>R7</th>
</tr>
</thead>
<tbody>
<tr>
<td>S1</td>
<td>G</td>
<td>J</td>
<td>D</td>
<td>I</td>
<td>A</td>
<td></td>
<td></td>
</tr>
<tr>
<td>S2</td>
<td>G</td>
<td>J</td>
<td>D</td>
<td>I</td>
<td>A</td>
<td>B</td>
<td>F</td>
</tr>
<tr>
<td>S3</td>
<td>K</td>
<td>J</td>
<td>D</td>
<td>I</td>
<td>A</td>
<td>B</td>
<td>F</td>
</tr>
<tr>
<td>S4</td>
<td>K</td>
<td>H</td>
<td>D</td>
<td>I</td>
<td>A</td>
<td>B</td>
<td>F</td>
</tr>
<tr>
<td>S5</td>
<td>K</td>
<td>H</td>
<td>E</td>
<td>L</td>
<td>A</td>
<td>B</td>
<td>F</td>
</tr>
<tr>
<td>S6</td>
<td>C</td>
<td>H</td>
<td>E</td>
<td>L</td>
<td>A</td>
<td>B</td>
<td>F</td>
</tr>
<tr>
<td>S7</td>
<td>C</td>
<td></td>
<td>L</td>
<td></td>
<td></td>
<td></td>
<td></td>
</tr>
</tbody>
</table>

11. (6 points) Translate the following VHDL use (a) block statement(s) instead of a process:

```vhdl
process (S1, S4, DI, S3, Q)
begin
  if (S1 = '1' and S4 = '1') then
    Q <= DI after FFDEL;
  elsif (S1 = '0' and S4 = '0') then
    Q <= "00000000" after FFDEL;
  end if;
  if (S3 = '1') then
    DO <= Q after BUFDEL;
  else
    DO <= "ZZZZZZZZ" after BUFDEL;
  end if;

B1 : block
begin
  Q <= DI after FFDEL when (S1 = '1' and S4 = '1') else
      "00000000" after FFDEL when (S1 = '0' and S4 = '0');
  DO <= Q after BUFDEL when (S3 = '1') else
       "ZZZZZZZZ" after BUFDEL;
end block B1;
```

12. (9 points) (a) (5 points) Write a single VHDL model which represents an AND gate with an arbitrary number of inputs, N. (b) (4 points) Use that model as a component in an entity that represents a four input AND gate with inputs a, b, c, d and output f.

```
entity N_AND is
  generic (N : integer);
  port (I : in std_logic_vector(N-1 downto 0);
        O : out std_logic);
end N_AND;

architecture N_AND of N_AND is
begin
  process(I)
  variable TEMP : std_logic := '1';
  begin
    for j in I'range loop
      TEMP := TEMP and I(j);
    end loop;
    O <= TEMP;
  end process;
end N_AND;

entity UPPER is
end UPPER;

architecture UPPER of UPPER is
  component N_AND is
    generic (N : integer);
    port (I : in std_logic_vector(N-1 downto 0);
          O : out std_logic);
  end component;
  signal a, b, c, d, f : std_logic;
begin
  U1 : N_AND generic map (N => 4)
    port map (I(0) => a,
              I(1) => b,
              I(2) => c,
              I(3) => d,
              O => f);
end UPPER;
```

13. (12 points) A sequential network with one input and one output is used to stretch the first two bits of a 4-bit sequence as follows:

<table>
<thead>
<tr>
<th>Input</th>
<th>Output</th>
</tr>
</thead>
<tbody>
<tr>
<td>00dd</td>
<td>0000</td>
</tr>
<tr>
<td>01dd</td>
<td>0011</td>
</tr>
<tr>
<td>10dd</td>
<td>1100</td>
</tr>
<tr>
<td>11dd</td>
<td>1111</td>
</tr>
</tbody>
</table>

After every four bits, the network resets. Model this network in VHDL as a Moore state machine.

a. (3 points) Write an entity. b. (9 points) Write an architecture.

```
entity STRETCH is
  port (X, CLK, RST : in std_logic;
        Z : out std_logic);
end STRETCH;
```
architecture STRETCH of STRETCH is
  type STATE is (S0, S1, S2, S3, S4, S5, S6, S7, S8);
  signal CURRENT_STATE, NEXT_STATE : STATE;
  signal TEMP : std_logic;
begin
  process(X, CURRENT_STATE)
  begin
    case CURRENT_STATE is
      when S0 => if (X = '0') then -- initial state
        NEXT_STATE <= S1; -- pick state to go to based on the
      else
        NEXT_STATE <= S2; -- first input
      end if;
      when S1 => TEMP <= X; -- have received a ‘0’, store next
        NEXT_STATE <= S3 -- input, output a ‘0’, go to the
      when S2 => TEMP <= X; -- state where we output another ‘0’
        NEXT_STATE <= S4; -- have received a ‘1’, store next
      when S3 => if (TEMP <= '0') then -- output a ‘0’, pick state to go to
        NEXT_STATE <= S5; -- based on the value of TEMP
      else NEXT_STATE <= S6; end if;
      when S4 => if (TEMP <= '0') then -- output a ‘1’, pick state to go to
        NEXT_STATE <= S5; -- based on the value of TEMP
      else NEXT_STATE <= S6; end if;
      when S5 => NEXT_STATE <= S7; -- output first ‘0’ for second input
      when S6 => NEXT_STATE <= S8; -- output first ‘1’ for second input
      when S7 => if (X = '0') then -- output second ‘0’, pick state to
        NEXT_STATE <= S1; -- go to based on input (now first
      else
        NEXT_STATE <= S2; -- bit of next four)
      end if;
      when S8 => if (X = '0') then -- output second ‘1’, pick state to
        NEXT_STATE <= S1; -- go to based on input (now first
      else
        NEXT_STATE <= S2; -- bit of next four)
      end if;
    end case;
  end process;
  process(CLK, RST)
  begin
    if (RST = '1')
      then CURRENT_STATE <= S0;
    elsif (CLK'event and CLK = '1') then
      CURRENT_STATE <= NEXT_STATE;
    end if;
  end process;
  process(CURRENT_STATE)
  begin
    case CURRENT_STATE is
      when S0|S1|S3|S5|S7 => Z <= '0';
      when S2|S4|S6|S8 => Z <= '1';
    end case;
  end process;
end STRETCH;
Consider the following VHDL code:

```vhdl
-- Entity declaration
entity SCHED2 is
  port (A, B, C, D, E, F: in INTEGER;
        CLK : in BIT;
        W, X, Y: out INTEGER);
end SCHED2;

-- Architecture declaration
architecture HIGH_LEVEL of SCHED2 is
  signal V, Z: INTEGER;
begin
  X <= ((A - B) * Z) * (C + V);
  Y <= ((A * B) + Z) * E + F;
  Z <= (C * D) + D * (E + F);
  W <= (A/F + C*C + D* (A - B)) / V;
  V <= (C * (B + D*E)) + F;
end HIGH_LEVEL;
```

16. (15 points) The following task refers to the VHDL code above. Assume that all operations are done in an ALU module and there are two ALU modules available. Derive a list schedule using the critical path priority metric for the operations.

List priority \{3, 4, 2, 8, 10, 11, 5, 7, 9, 12, 13, 1, 6, 14, 16, 19, 15, 17, 18, 21, 20, 22, 23\}

<table>
<thead>
<tr>
<th>Step</th>
<th>Ready</th>
<th>Schedule</th>
</tr>
</thead>
<tbody>
<tr>
<td>1</td>
<td>{1, 2, 3, 4, 5, 6, 7, 8}</td>
<td>{3, 4}</td>
</tr>
<tr>
<td>2</td>
<td>{1, 2, 5, 6, 7, 8, 10, 11}</td>
<td>{2, 8}</td>
</tr>
<tr>
<td>3</td>
<td>{1, 5, 6, 7, 9, 10, 11}</td>
<td>{10, 11}</td>
</tr>
</tbody>
</table>
17. (10 points) Write a VHDL model for a shift register module that includes a 16-bit shift register, a controller, and a 4-bit down counter. The shifter can shift a variable number of bits depending on a count provided to the shifter module. Inputs to the module are a number N (indicating a shift count) in the range 1 to 15, a 16-bit vector par_in, a clock and a start signal, St. When St = ‘1’, N is loaded in the down counter, and par_in is loaded into the shift register. Then the shift register does a left shift N times and the controller returns to the start state. Assume that St is only ‘1’ for one clock time. All operations are synchronous on the falling edge of the clock.

entity SHIFT is
port (PAR_IN : in std_logic_vector(15 downto 0);
   CLK, ST : std_logic;
   N : in integer range 1 to 15;
   PAR_OUT : out std_logic_vector(15 downto 0));
end SHIFT;
architecture SHIFT of SHIFT is
begin
  process(CLK)
  variable TEMP : std_logic_vector(15 downto 0);
  variable COUNT : integer range 1 to 15;
  variable READY : std_logic;
  begin
    if (CLK'event and CLK = '0') then
      if (ST = '1' and READY = '1') then
        COUNT := N;
        TEMP := PAR_IN;
        READY := '0';
      else
        if (COUNT > 0) then
          COUNT := COUNT - 1;
          TEMP := TEMP(14 downto 0) & '0';
        else
          READY := '1';
        end if;
      end if;
    end if;
  end process;
P AR_OUT <= TEMP;
end SHIFT;

18. (10 points) A Mealy sequence detector detects a sequence of four consecutive 1 inputs. The detector has a single binary input, X, and a single binary output, Z. Signal Z should be logic 1 if and only if the last four inputs were all logic 1. Here is an example input-output sequence:

| X  | 010111111101101011110 |
| Z  | 0000011110000000000010 |

Model this detector in VHDL.

defined entity FOUR_ONES is
  port (CLK, RST, X : in std_logic;
        Z : out std_logic);
end FOUR_ONES;

architecture FOUR_ONES of FOUR_ONES is
begin
  process(CLK, RST, X)
  variable TEMP : std_logic_vector(3 downto 0);
  begin
    if (RST = '1') then
      TEMP := "0000";
    elsif (CLK'event and CLK = '1') then
      TEMP := TEMP(2 downto 0) & X;
      if (TEMP = "1111") then
        Z <= '1';
      else
        Z <= '0';
      end if;
    end if;
  end process;
end FOUR_ONES;