Electronic Design

  
Reprints     Printer-Friendly    Email this Article    RSS        Font Size     What's This?


[Design View / Design Solution]
Optimize Timing Margins For Your High-Speed Interface

David Mahashin, Kannan Srinivasagam  |   ED Online ID #9611  |   February 17, 2005


High-speed digital buses have evolved dramatically over the past decade. Not only are they faster, but they're also changing how systems clock data. To improve data throughput, emerging synchronous digital buses are sending data multiple times per cycle via an array of clocking schemes. This article presents a framework for understanding how source-synchronous clocking can optimize timing margins for high-speed interfaces.

Timing budget is the account of timing requirements or timing parameters necessary for a system to function properly. For synchronous systems to work, timing requirements must fit within one clock cycle. A timing-budget calculation involves many factors, including hold-time requirements and maximum operating frequency requirements. By calculating a timing budget, the limitations of conventional clocking methods can be seen.

Let's use Figure 1 as an example for a system with standard clocking. The figure shows a memory controller interfacing with an SRAM. Both the SRAM and memory controller receive clock signals from the same clock source. It's assumed that clock traces are designed to match the trace delays. The relevant timing parameters are:

  • tSU (setup time) of memory controller (Fig. 2)
  • tH (hold time) of memory controller (Fig. 2)
  • tPD (propagation delay) of board trace
  • tCO (clock to output delay) of SRAM (Fig. 2)
  • tDOH (output data hold time) of SRAM (Fig. 2)
  • tSKEW (clock skew) of clock generator (Fig. 2)
  • tJIT (cycle-to-cycle jitter) of clock generator (Fig. 2)
  • tCYC (cycle time) of clock generator

    The maximum-frequency calculation gives the minimum cycle time of the system if the worst-case input setup time, clock to output time, propagation delay, clock skew, and clock jitter are considered. The maximum frequency is given by:

    tCO(max, SRAM) + tPD(max) + tSU(max, CTRL) + tSKEW(max, CLK) + tJIT(max, CLK) < tCYC

    The hold-time calculation verifies that the system outputs data too fast, violating input hold time of the receiving device in the system. In this case, the worst-case condition occurs when the data is driven out at the earliest possible time. The formula is given by:

    tCO(min, SRAM) + tPD(min) - tSKEW(min, CLK) - tJIT(min, CLK) > tH(max, CTRL)

    Now let's assume the following values for the timing parameters of our SRAM and memory controller. In this case, we will use a high-speed SRAM with a double-data-rate (DDR) interface, where data is driven by the SRAM with every rising and falling edge of the clock.

  • tSU = 0.5 ns
  • tH = 0.4 ns
  • tCO = 0.45 ns
  • tDOH* = -0.45 ns
  • tSKEW = ±0.2 ns
  • tJIT = ±0.2 ns

    *tDOH < 0 means data becomes invalid before the subsequent rising/falling edge of the clock (DDR device).

    The minimum hold-time requirement is calculated as:

    tDOH + tPD - tSKEW - tJIT > tH  -0.45 ns + tPD - 0.2 ns - 0.2 ns > 0.4 ns   -0.85ns + tPD > 0.4 ns  tPD > 1.25 ns

    Assuming that the delay per inch of an FR4 board trace is 160 ps/in., the trace length from SRAM to memory controller must be at least 7.82 in. Using 1.2 ns for tPD, the maximum operating frequency is calculated below. Because the SRAM has a DDR interface, the timing budget is based on a half cycle:

    tCO + tPD + tSU + tSKEW + tJIT < tCYC/2  0.45 ns + 1.25 ns + 0.5 ns + 0.2 ns + 0.2 ns < tCYC/2  2.6 ns < tCYC/2  5.2 ns < tCYC  192 MHz > fCYC 

    With a 7.82-in. FR4 trace length and typical timing parameters, the timing budget requirements are met for an operating frequency of up to 192 MHz. In systems that have limited board space, the 7.82-in. minimum trace-length constraint becomes a difficult requirement to satisfy in systems.

    If it isn't possible to introduce a trace delay, the memory controller can satisfy the hold-time requirement by using a delay-locked loop/phase-locked loop (DLL/PLL) to phase-shift the clock signal to capture data at an earlier time (Fig. 3). The memory controller will have to resynchronize captured data with the system clock. Using this method will introduce additional PLL/DLL jitter, which decreases the system's maximum operating frequency. With the added delay of the PLL, the minimum hold-time requirement becomes:

    tDOH + tPD(trace) + tPLL/DLL_DELAY - 
    tSKEW - tJIT > tH

    tCO + tPD + tSU + tSKEW + tJIT
    + tJIT_PLL/DLL < tCYC/2, where tJIT_PLL/DLL
    is jitter introduced by the DLL/PLL.

    Clock skew, clock jitter, and trace propagation delay can significantly limit system performance, even with the fastest SRAMs and ASICs/FPGAs available.

    As mentioned earlier, the trace delay is approximately 160 ps/in. if an FR4 board is used. This is a significant number considering how the data-valid window at high frequencies has become 2 ns (e.g., for a 250-MHz, double-data-rate (DDR) device) and lower. Skew between the clock signals can also significantly reduce timing margins. We shall see that source-synchronous clocks can significantly reduce propagation delay, skew, and jitter, making timing closure more attainable.


    <-- prev. page     [1] 2     next page -->

    Reprints   Printer-Friendly  Email this Article  RSS    Font Size   What's This?


  • In EDA, A Year Of Mergers, Failed And Otherwise
  • 2008 BEST Electronic Design Winners
  • Engineers Rely On Internet For Product Info
  • Rochester Electronics Establishes New Design and Technology Group
  • November 17, 2008
  • Custom Sources Light Way To 22-nm IC Lithography
  • Software Turns Scopes Into Vector RF Signal Analyzers
  • Couple’s $15 Million Gift Advances Rice Engineering Education
    1) Build A Smart Battery Charger Using A Single-Transistor Circuit
    (268 views today)
    2) Ten Top Design Skills For Tough Times
    (199 views today)
    3) Consumer Electronics Series: AMD Live! Home Cinema Platform
    (190 views today)
    4) Easily Convert Decimal Numbers To Their Binary And BCD Formats
    (164 views today)
    5) FPGA Costs Half A Buck
    (110 views today)
    ALL TOP 20



    Reader Comments

    good. nice introduction

    Anonymous -May 24, 2006   (Article Rating: )

    POST YOUR COMMENTS HERE
    Name:

    Email:
    Your Comments:

    Enter the text from the image below


    Please refresh the page if you have trouble reading this text.

  • Search Electronic Design
         
      
     
    Web Seminar
    Sponsored By:
    Title: Read Pacing: A Performance Enhancing Feature of PCI Express Gen 2 Switch Devices
    Speakers: 
    Date: 07/01/08
    Register: 

    Electronic Design Europe Electronic Design China EEPN Power Electronics Auto Electronics Microwaves & RF
    Mobile Dev & Design Schematics Find Power Products Military Electronics EE Events Related Resources