# **PolarFire Family Memory Controller User Guide** #### Introduction (Ask a Question) This document describes the memory controller solutions in the PolarFire® device family. The PolarFire family consists of the following FPGA devices. **PolarFire FPGAs** Microchip's PolarFire® FPGAs are the fifth-generation family of non-volatile FPGA devices, built on state-of-the-art 28 nm non-volatile process technology. PolarFire FPGAs deliver the lowest power at mid-range densities. PolarFire FPGAs lower the cost of mid-range FPGAs by integrating the industry's lowest power FPGA fabric, lowest power 12.7 Gbps transceiver lane, built-in low power dual PCI Express\* Gen 2 (EP/RP), and, on select data security (S) devices, an integrated low-power crypto co-processor. **FPGAs** PolarFire SoC Microchip's PolarFire SoC FPGAs are the fifth-generation family of non-volatile SoC FPGA devices, built on state-of-the-art 28 nm non-volatile process technology. The PolarFire SoC family offers industry's first RISC-V based SoC FPGAs capable of running Linux®. It combines a powerful 64-bit 5x core RISC-V Microprocessor Subsystem (MSS), based on SiFive's U54-MC family, with the PolarFire FPGA fabric in a single device. **RT PolarFire FPGAs** Microchip's RT PolarFire® FPGAs combine our 60 years of space flight heritage with the industry's lowest-power PolarFire FPGA family to enable new capabilities for space and mission-critical applications. RT PolarFire FPGA family includes RTPF500T, RTPF500TL, RTPF500TLS, RTPF500ZLS, RTPF500ZTL, RTPF500ZTL, RTPF500ZTLS, and RTPF500ZTLS devices. The following table lists the memory controller IP solutions in the PolarFire family. These memory controller solutions address the high-speed data transfer requirements for a wide range of applications and code execution. **Table 1.** Memory Controller IP Solutions in the PolarFire Family | Memory Controller | PolarFire® FPGA<br>(MPF) | PolarFire SoC<br>FPGA (MPFS) | RT PolarFire <sup>®</sup><br>FPGA (RTPF) | Description | |-----------------------------------------------------------------------------|--------------------------|------------------------------|------------------------------------------|----------------------------------------------------------------------| | Embedded MSS DDR Controller (see 2. MSS DDR Memory Controller) | _ | <b>✓</b> | _ | Supports DDR3, DDR3L, DDR4,<br>LPDDR3, and LPDDR4 memory<br>devices. | | PolarFire DDR IP <sup>1</sup> (see 3. Fabric DDR Subsystem) | ✓ | ✓ | ✓ | Supports DDR3, DDR3L, DDR4, and LPDDR3 memory devices. | | PolarFire QDR IP <sup>1</sup> (see 6. QDR Memory Controller) | ✓ | ✓ | ✓ | Supports QDR II+ and QDR II+ Xtreme memory devices. | | PolarFire Octal DDR PHY <sup>1</sup> (see 3.8. Octal DDR PHY-Only Solution) | ✓ | ✓ | ✓ | Supports xSPI (JESD251), HyperBUS, and ONFI memory devices. | #### Important: 1. These IPs belong to the PolarFire family and can be seamlessly used in RT PolarFire and PolarFire SoC designs. #### References - For more information about the PolarFire SoC MSS, see PolarFire SoC FPGA MSS Technical Reference Manual. - For information about configuring the MSS DDR Controller, see Standalone MSS Configurator User Guide for PolarFire SoC. - For more information about the clocking resources, see PolarFire Family Clocking Resources User Guide. - For more information about SmartDebug, see SmartDebug User Guide. - For more information about PolarFire board design, see PolarFire FPGA Board Design User Guide. - For more information about PolarFire board design, see RT PolarFire Board Design User Guide. - For more information about PolarFire SoC board design, see PolarFire SoC Board Design Guidelines User Guide. # **Table of Contents** | mu | roducti | on | | |----|--------------|--------------------------------------------|-----| | 1. | Acror | nyms | 5 | | 2. | MSS I | DDR Memory Controller | 6 | | | 2.1. | Training Logic for Embedded DDR Controller | | | 2 | Cabri | c DDR Subsystem | - | | 3. | | Features | | | | 3.1.<br>3.2. | Performance | | | | 3.2.<br>3.3. | Resource Utilization | | | | 3.4. | Functional Description | | | | 3.5. | DDR Subsystem Ports | | | | 3.6. | Functional Timing Diagrams | | | | 3.7. | DDR PHY-Only Solution Integration | 35 | | | 3.8. | Octal DDR PHY-Only Solution | 54 | | | 3.9. | Implementation | | | | 3.10. | Functional Examples | 101 | | 4. | Initial | ization Sequence | 107 | | | 4.1. | DDR3 | 107 | | | 4.2. | DDR4 | | | | 4.3. | LPDDR3 | 110 | | | 4.4. | LPDDR4 (For PolarFire SoC FPGA Only) | 111 | | 5. | Traini | ing Logic | 113 | | 6. | QDR | Memory Controller | 117 | | | 6.1. | Features | 117 | | | 6.2. | Performance | 117 | | | 6.3. | Functional Description | 118 | | | 6.4. | Functional Timing Diagrams | 125 | | | 6.5. | Implementation | | | | 6.6. | Functional Example | | | | 6.7. | PCB Recommendations | | | 7. | Polar | Fire Board Design Recommendations | 144 | | | 7.1. | DDR3 | 144 | | | 7.2. | LPDDR3 | 147 | | | 7.3. | DDR4 | 149 | | 8. | Polar | Fire SoC Board Design Recommendations | 152 | | | 8.1. | DDR3 | 152 | | | 8.2. | LPDDR3 | 155 | | | 8.3. | DDR4 | 157 | | | 8.4. | LPDDR4 | 159 | | 9. | Debu | gging Techniques | 165 | | | 9.1. | Debug Tools | | | | 9.2. | Design Debug During Simulation | 166 | |-----|----------|--------------------------------------|-----| | | 9.3. | Hardware Debug | 166 | | 10. | Apper | dix: Supported Memory Configurations | 167 | | 11. | Apper | dix: Timing Parameters | 168 | | 12. | Apper | dix: Fabric DDR Placement Locations | 169 | | 13. | Revision | on History | 177 | | Mic | rochip | FPGA Support | 181 | | Mic | | Information | | | | The M | icrochip Website | 181 | | | | ct Change Notification Service | | | | Custo | mer Support | 181 | | | Micro | chip Devices Code Protection Feature | 181 | | | Legal | Notice | 182 | | | Trader | marks | 182 | | | | y Management System | | | | World | wide Sales and Service | 184 | # 1. Acronyms (Ask a Question) Table 1-1. Acronyms | Acronym | Description | |---------|------------------------------------------------------------------| | QDR | Quad Data Rate | | SRAM | Synchronous Random Access Memory | | 1K | 1024-bits | | Word | 8-bit, 9-bit data, or 18-bit data unit, depending on memory size | | DDR | Double Data Rate | # 2. MSS DDR Memory Controller (Ask a Question) The PolarFire SoC MSS includes an embedded DDR controller to address the memory solution requirements for a wide range of applications with varying power consumption and efficiency levels. The DDR controller along with other blocks external to MSS form the MSS DDR subsystem that can be configured to support DDR3, DDR3L, DDR4, LPDDR3, and LPDDR4 memory devices. For more information about the features and functional blocks of the MSS DDR Controller, see PolarFire SoC FPGA MSS Technical Reference Manual. For information about configuring the MSS DDR Controller, see Standalone MSS Configurator User Guide for PolarFire SoC. For debugging LPDDR4 and DDR4 training issues, use Memory Log Analyzer GUI. # 2.1 Training Logic for Embedded DDR Controller (Ask a Question) The training logic performs the following operations. - Clock training - Write leveling - · Read leveling - I/O calibration For more information, see 5. Training Logic. For information about initialization sequence of DDR3, DDR4, LPDDR3, and LPDDR4, see 4. Initialization Sequence. # 3. Fabric DDR Subsystem (Ask a Question) The Fabric DDR subsystem is made up of the following soft and hard blocks: - PolarFire DDR controller (soft) - Training logic (soft) - I/O lane (hard) - Phase-Locked Loop (PLL) (hard) The following memory interface solutions can be created as shown in the following figures: - Fabric DDR subsystem—consists of PolarFire DDR controller (PF\_DDR IP), PLL, I/O lane, and training logic. - DDR PHY-Only solution—consists of PLL, I/O lane, and training logic. This solution is intended for third-party DDR controller implementations. - Octal DDR PHY-Only Solution—I/O lane, IOD logic, PLL, and soft fabric logic. This solution is intended for memory devices which use an 8-bit serial interface in DDR mode. For example, xSPI (JESD251), HyperBUS, and ONFI. Figure 3-1. Fabric DDR Subsystem Figure 3-2. DDR PHY-Only Solution Figure 3-3. Octal DDR PHY-Only Solution The Fabric DDR subsystem accepts read and write commands using the AMBA Advanced Extensible Interface 4 (AXI3/4) or using a native interface. The subsystem translates the AXI interface commands for the off-chip DDR SDRAM. It can also automatically perform DDR initialization, refresh, and ZQ calibration functions. The Fabric DDR subsystem and the PHY-Only solution can be configured using the DDR Configurator from Libero System-on-Chip (SoC) > IP catalog. One of the two configuration modes can be selected: - Preset configuration—allows speed-based selection of memory devices for presetting memory initialization and timing parameters. - User configuration—allows manual configuration of memory initialization and timing parameters. # 3.1 Features (Ask a Question) The following are the features of the Fabric DDR subsystem: - Integrated hard PHY. - Configurable to support DDR4, DDR3, DDR3L, and LPDDR3 memory devices, and Dual In-line Memory Modules (DIMM). The subsystem supports: - DDR3 and DDR4 additive latency modes. - DDR4 temperature-controlled refresh modes. - Up to 1600 Mbps (800 MHz DDR) data rates. - Supports 16-bit, 32-bit (40-bit with ECC), and 64-bit (72-bit with ECC) DDR SDRAM data buses. - Supports a maximum of 16 memory banks. - Includes bank management logic that monitors the status of each SDRAM bank. - Queue-based user interface that enables the DDR subsystem to optimize performance and throughput. - Supports look-ahead precharge and auto-precharge modes. - Supports automatic generation of initialization, ZQ calibration, and refresh sequences. - Supports issue of commands with or without SDRAM auto-precharge. - Supports 2T timing on SDRAM control signals (2T timing is always enabled). - Includes multi-burst capability, allowing requests with bursts longer than that of the native memory burst. - Single-Error Correction and Double-Error Detection (SECDED) Error-Correcting Code (ECC) protection. ## 3.2 Performance (Ask a Question) The following table lists DDR3, DDR4, and LPDDR3 speeds. Table 3-1. DDR Performance | Memory Type | I/O Type | Speed (-1) | Speed (STD) | |-------------|----------|------------|-------------| | DDR3 | HSIO | 1333 | 1067 | | | GPIO | 1067 | 800 | | DDR4 | HSIO | 1600 | 1333 | | LPDDR3 | HSIO | 1333 | 800 | **Note:** For DDR timing closure, use the SmartTime reports from Libero SoC. The following table lists DDR3, DDR4, and LPDDR3 throughput. Table 3-2. Throughput | Memory | Fabric<br>Interface | Fabric<br>Interface | DDR<br>Memory | Theoretical<br>Throughput | Actual Throug | ghput | Actual<br>Throughpo | ut (%) | |---------------------------|---------------------|---------------------|---------------|-----------------------------|---------------|-------------|---------------------|-------------| | | Frequenc<br>y (MHz) | Data Width | Data Width | Max<br>Throughput<br>(MB/s) | Write (MB/s) | Read (MB/s) | Write (%) | Read<br>(%) | | DDR3 | 166.66 | 256 | 32 | 5344 | 4945.19 | 4987.06 | 92.5% | 93.3% | | DDR3 (AXI4) | 166.66 | 256 | 32 | 5344 | 5338.89 | 4866.08 | 99.9% | 91.2% | | DDR4 (NI <sup>1</sup> ) | 200 | 256 | 32 | 6400 | 5916.5 | 5866.9 | 92.4% | 91.6% | | DDR4 (AXI4) | 200 | 256 | 32 | 6400 | 6395.9 | 5558.12 | 99.9% | 86.84<br>% | | LPDDR3 (NI <sup>1</sup> ) | 166.66 | 256 | 32 | 5344 | 5034.4 | 4917.6 | 94.2% | 92% | | LPDDR3 (AXI4) | 166.66 | 256 | 32 | 5344 | 5340.59 | 4681.22 | 99.9% | 87.5% | #### Notes: - 1. NI refers to native interface. - 2. To get high DDR4 throughput, set tCCD\_L to 4 on the Memory Timing tab of the PF\_DDR4 Configurator. ## 3.3 Resource Utilization (Ask a Question) The following table provides resource utilization data for the DDR3 subsystem with AXI3 and AXI4 interfaces. Table 3-3. DDR3 Resource Utilization | Interface Type | DDR Width | AXI Width | Maximum<br>Number of<br>LUT4 | Maximum<br>Number of DFF | Maximum<br>Number of<br>µSRAM | Maximum<br>Number of<br>LSRAM | |----------------|-----------|-----------|------------------------------|--------------------------|-------------------------------|-------------------------------| | AXI3 | 16 | 64 | 10155 | 8373 | 57 | 6 | | | | 128 | 10375 | 9095 | 69 | 6 | | | 32 | 64 | 14409 | 12107 | 107 | 2 | | | | 128 | 14818 | 12884 | 93 | 9 | | | | 256 | 15322 | 14346 | 115 | 9 | | | 40 | 64 | 19282 | 13596 | 119 | 2 | | | | 128 | _ | _ | _ | _ | | | | 256 | 20289 | 15828 | 127 | 9 | | | 64 | 512 | 25354 | 24849 | 67 | 58 | | | 72 | 512 | 32280 | 26575 | 87 | 56 | | AXI4 | 16 | 64 | 10496 | 8626 | 33 | 18 | | | | 128 | 10715 | 9348 | 33 | 22 | | | 32 | 64 | 14736 | 12354 | 45 | 21 | | | | 128 | 15083 | 13152 | 45 | 32 | | | | 256 | 15682 | 14614 | 45 | 39 | | | 40 | 64 | 19500 | 13843 | 57 | 21 | | | | 128 | 20109 | 14664 | 57 | 32 | | | | 256 | 20615 | 16112 | 57 | 39 | | | 64 | 512 | 25655 | 25102 | 69 | 72 | | | 72 | 512 | 32495 | 26827 | 83 | 72 | | Native | 16 | 64 | 7316 | 4456 | 19 | 2 | | | | 128 | 7316 | 4456 | 19 | 2 | | | | 256 | 7316 | 4456 | 19 | 2 | | | | 512 | 7316 | 4456 | 19 | 2 | | | 32 | 64 | 11015 | 6982 | 29 | 2 | | | | 128 | 11015 | 6982 | 29 | 2 | | | | 256 | 11015 | 6982 | 29 | 2 | | | | 512 | 11015 | 6982 | 29 | 2 | | | 40 | 64 | 15228 | 8133 | 38 | 2 | | | | 128 | 15228 | 8133 | 38 | 2 | | | | 256 | 15228 | 8133 | 38 | 2 | | | | 512 | 15228 | 8133 | 38 | 2 | | | 64 | 512 | 18340 | 12053 | 53 | 2 | | | 72 | 512 | 24563 | 13204 | 62 | 2 | The following table provides resource utilization data for the DDR4 subsystem with AXI3 and AXI4 interfaces. Table 3-4. DDR4 Resource Utilization | Interface Type | DDR Width | AXI Width | Maximum<br>Number of<br>LUT4 | Maximum<br>Number of DFF | Maximum<br>Number of<br>µSRAM | Maximum<br>Number of<br>LSRAM | |----------------|-----------|-----------|------------------------------|--------------------------|-------------------------------|-------------------------------| | AXI3 | 16 | 64 | 11215 | 8867 | 57 | 6 | | | | 128 | 11429 | 9589 | 69 | 6 | | | 32 | 64 | 16265 | 12883 | 107 | 2 | | | | 128 | 16602 | 13660 | 93 | 9 | | | | 256 | 17209 | 15122 | 115 | 9 | | | 40 | 64 | 21878 | 14498 | 119 | 2 | | | | 128 | 22439 | 15283 | 105 | 9 | | | | 256 | 22911 | 16730 | 127 | 9 | | | 64 | 512 | 28688 | 26209 | 67 | 58 | | | 72 | 512 | 36868 | 28092 | 87 | 56 | | AXI4 | 16 | 64 | 11578 | 9120 | 33 | 18 | | | | 128 | 11828 | 9842 | 33 | 22 | | | 32 | 64 | 16556 | 13130 | 45 | 21 | | | | 128 | 16966 | 13928 | 45 | 32 | | | | 256 | 17547 | 15390 | 45 | 39 | | | 40 | 64 | 22197 | 14775 | 57 | 21 | | | | 128 | 22764 | 15581 | 57 | 32 | | | | 256 | 23262 | 17028 | 57 | 39 | | | 64 | 512 | 28920 | 26454 | 69 | 72 | | | 72 | 512 | 37117 | 28341 | 83 | 72 | | Native | 16 | 64 | 8424 | 4950 | 19 | 2 | | | | 128 | 8424 | 4950 | 19 | 2 | | | | 256 | 8424 | 4950 | 19 | 2 | | | | 512 | 8424 | 4950 | 19 | 2 | | | 32 | 64 | 12884 | 7758 | 29 | 2 | | | | 128 | 12884 | 7758 | 29 | 2 | | | | 256 | 12884 | 7758 | 29 | 2 | | | | 512 | 12884 | 7758 | 29 | 2 | | | 40 | 64 | 17878 | 9048 | 38 | 2 | | | | 128 | 17878 | 9048 | 38 | 2 | | | | 256 | 17878 | 9048 | 38 | 2 | | | | 512 | 17878 | 9048 | 38 | 2 | | | 64 | 512 | _ | _ | _ | _ | | | 72 | 512 | 28619 | 14716 | 62 | 2 | # 3.4 Functional Description (Ask a Question) The Fabric DDR subsystem translates the AXI/native interface requests into command sequences required by SDRAM devices. The DDR controller module then issues these commands to the PHY module, which sends and receives data to and from the DDR SDRAM. The training logic manages DDR PHY interface (DFI) 3.1 requests between the I/O lane and the DDR controller. The PLL generates the clocks required by the DDR subsystem. For more information about the PLL, see PolarFire Family Clocking Resources User Guide. #### 3.4.1 DDR Controller (Ask a Question) The Fabric DDR controller is a soft IP core that consists of the following blocks: - Control and timing block—contains the main controller logic. - Initialization control block—performs the initialization sequence after system reset (RESET\_N) is deactivated or when dynamic reinitialization control (CTRLR\_INIT) is pulsed. - Bank management block—keeps track of the last opened row and bank to minimize command overhead. - Refresh/ZQ calibration control block—performs automatic refresh/ZQ calibration commands to maintain data integrity. - Queue control block—allows new requests to be accepted on every clock cycle as long as the queue is not full. - Data control block—handles multiplexing and de-multiplexing of data flowing to and from the DDR SDRAM devices. - Multi-burst block—allows requests longer than the programmed memory burst length. Also handles requests with starting addresses not aligned on a burst boundary, breaking the requests up as necessary to prevent data wrapping. The queue-based implementation enables the Fabric DDR controller to optimize throughput and efficiency by looking ahead into the queue to perform precharges before the read and write commands are issued. Configure the queue depth to 3 or 4 using the Command queue depth option on DDR Configurator > Controller tab. For DDR4/LPDDR3, the default queue depth is 3. The core also supports SECDED ECC for 40-bit and 72-bit data buses. SECDED ECC detects and corrects single-bit errors, and detects but does not correct double-bit errors. It cannot detect more than two bit errors in the data bus. The following figure shows the functional blocks of the Fabric DDR controller. Figure 3-4. DDR Controller Block Diagram The following sections describe the key functions of the DDR controller. - 3.4.1.1. Periodic Functions - 3.4.1.2. Look-Ahead Operations (Precharge and Auto-Precharge) - 3.4.1.3. Bank Management - 3.4.1.4. Multi-Burst Capability (Native-Interface Mode Only) - 3.4.1.5. Data Ordering with Non-Aligned Starting Addresses - 3.4.1.6. Auto-Precharge - 3.4.1.7. ECC #### 3.4.1.1 Periodic Functions (Ask a Question) SDRAM devices require periodic functions, such as refresh and ZQ calibration, to maintain data integrity. #### 3.4.1.1.1 Refresh (Ask a Question) The DDR controller automatically issues refresh commands to the attached SDRAM devices, with no user intervention required. A refresh never interrupts an SDRAM read or write burst. However, if the subsystem determines that the refresh period has elapsed at a point concurrent with or prior to a queued read or write request, the request is held off until the refresh is performed. #### 3.4.1.1.2 ZQ Calibration (Ask a Question) SDRAM devices require periodic calibration of output driver impedance and ODT values to minimize variations in voltage and temperature. This calibration is known as ZQ calibration. The DDR controller includes full support for ZQ calibration, including automatic calibration during the memory initialization sequence and periodic automatic calibrations after initialization. ZQ calibration can be initiated either automatically or manually. The DDR Configurator provides an option to enable automatic ZQ calibration and to set automatic ZQ calibration period. Depending on the type of SDRAM memory used in the device, four kinds of ZQ calibration are supported: ZQ calibration long (ZQCL), ZQ calibration short (ZQCS), ZQ calibration reset (ZQRESET), and ZQ calibration initialize (ZQINIT). The following table lists the types of ZQ calibration commands issued in various SDRAM memory devices. | | | _ | | |------------|----------------|-------------|-------------| | Table 3-5. | ZQ Calibration | Support Per | Memory Type | | Memory | Calibration Type | Time the ZQ Command is Issued | |------------|------------------|-----------------------------------------------------------------------------------------------------------------------------------------------------------------| | DDR3, DDR4 | ZQCL | During initialization sequence. | | | ZQCS | Periodically, after initialization (ZQCS is enabled by default but ZQCL can be selected for periodic ZQ calibration) periodic calibration is optional for DDR3. | | LPDDR3 | ZQCL | During initialization sequence. | | | ZQCS | Periodically, after initialization (ZQCS is enabled by default but ZQCL can be selected for periodic ZQ calibration). | | | ZQRESET | When ZQ must be reset to default output impedance. | The PolarFire device family supports I/O calibration requests from the Fabric DDR controller, and calibrate only the I/Os that are used with the Fabric DDR controller. #### 3.4.1.2 Look-Ahead Operations (Precharge and Auto-Precharge) (Ask a Question) The DDR controller contains logic that precharges and auto-precharges based on upcoming read or write requests pending in the queue. The DDR subsystem looks into the queue and determines whether a requires a precharge command before a read or write command can be issued. When such a requirement is detected, the subsystem issues the necessary precharge command on the SDRAM bus to maximize the data bandwidth efficiency. The look-ahead auto-precharge logic issues an auto-precharge command (concurrent with the current read or write command) when it detects that a precharge to that bank is necessary for a read or write request currently pending in the command queue. #### **Important:** - 1. By default, Look-Ahead operations are not enabled, they can be enabled using DDR configurator. Look-Ahead operations effect the timing closure of the DDR subsystem. - 2. SDRAM commands are issued only for every 4 memory clocks. The timing diagrams shown in this document are only for general understanding and do not illustrate the same. The following figures show a sample timing diagram of the look-ahead precharge feature. In this sequence, two requests are issued to two different banks. The first request is issued to a bank that only requires an activate command. The second request is issued to a bank that requires a precharge and an activate. The first activate corresponding to the first request is issued at clock cycle 2. The write command associated with the first request does not occur until clock cycle 5 due to the tRCD requirement of SDRAM devices. Figure 3-5. Sample Read and Write Sequence With Look Ahead Precharge (Full Wave) Figure 3-6. Sample Read and Write Sequence With Look Ahead Precharge (Zoomed) The following points summarize a read-write sequence with Look-Ahead Precharge: - 1. The L\_W\_REQ signal is asserted along with the L\_ADDR signal, and the L\_B\_SIZE signal is set to 32. - 2. The L\_W\_REQ signal is deasserted, indicating no other write requests are required. - 3. Due to the write request, the subsystem asserts the row address (A), bank address (BA), and chip select (CS\_N) using the Look-Ahead Precharge command to open the bank at the requested row. - 4. L\_D\_REQ transitions a clock cycle after which, the values that enable the Look-Ahead Precharge appear on the COMMAND signal as highlighted in Figure 3-6. - 5. The subsystem issues the write command with a column address corresponding to the request. - 6. The subsystem issues the next write command with the corresponding column address. - 7. The written data begins to appear on the SDRAM bus on the DQ lines. - 8. L\_W\_VALID is asserted, and written data appears on the native interface, and then L\_W\_VALID is deasserted. 9. The read operation follows the same sequence. The following figures show a sample timing diagram without the Look-Ahead Precharge feature. Figure 3-7. Sample Read and Write Sequence Without Look Ahead Precharge (Full Wave) Figure 3-8. Sample Read and Write Sequence Without Look Ahead Precharge (Zoomed) The following points summarize a read-write sequence without Look-Ahead Precharge: - 1. The L\_W\_REQ signal is asserted along with the L\_ADDR signal, and the L\_B\_SIZE signal is set to 32. - 2. The L\_W\_REQ signal is deasserted, indicating no other write requests are required. - 3. Due to the write request, the subsystem asserts the row address (A), bank address (BA), and chip select (CS\_N) using the look-Ahead Precharge command to open the bank at the requested row. - 4. L\_D\_REQ transitions a clock cycle after which, the values that disable the Look-Ahead Precharge appear on the COMMAND signal as highlighted in Figure 3-8. - 5. The subsystem issues the write command with a column address corresponding to the request. - 6. The subsystem issues the next write command with the corresponding column address. - 7. The written data begins to appear on the SDRAM bus on the DQ lines. - 8. L\_W\_VALID is asserted, and written data appears on the native interface, and then L\_W\_VALID is deasserted. - 9. The read operation follows the same sequence. #### 3.4.1.3 Bank Management (Ask a Question) The DDR controller uses various bank management techniques to monitor the status of each SDRAM bank. Banks are only opened or closed when necessary, minimizing access delays. Up to sixteen DDR4 memory banks and eight DDR3/LPDDR3 memory banks can be managed at one time. Read and write requests are issued with minimal idle time between commands, limited only by DDR timing specifications. This timing mechanism results in minimal delay between requests, enabling up to 100% memory throughput for sequential accesses (not including refresh and ZQ calibration commands). #### 3.4.1.4 Multi-Burst Capability (Native-Interface Mode Only) (Ask a Question) The DDR controller can be configured with multi-burst capability if the native-interface mode is selected in the DDR Configurator. This feature extends the allowable range of requests at the burst size (L\_B\_SIZE) port to accommodate requests larger than the programmed burst length. It also handles requests with starting addresses not aligned to a burst boundary, breaking them up as necessary to prevent wrapped or non-sequential data access that may otherwise occur. For more information, see 3.4.1.5. Data Ordering with Non-Aligned Starting Addresses. The following figure shows a sample write sequence with the multi-burst capability enabled. In this sequence, a write request is issued with a starting address (L\_ADDR) 0x08 and a request size (L\_B\_SIZE) of 32 bytes. Figure 3-9. Sample Write Sequence Demonstrating Multi-Burst Operation The following points summarize a write sequence with multi-burst capability: - 1. The L\_W\_REQ signal is first asserted along with the L\_ADDR signal, and the L\_B\_SIZE signal is set to 32. - 2. The L\_W\_REQ signal is deasserted, indicating no other write requests are required. - 3. As a result of the write request, the subsystem asserts the row address (A), bank address (BA), and chip select (CS\_N) using the activate command to open the bank at the requested row. - 4. The subsystem issues the write command with a column address corresponding to the request. - 5. The subsystem issues the next write command with the corresponding column address. - 6. The written data begins to appear on the SDRAM bus on the DQ lines. - 7. L\_W\_VALID is asserted, and written data appears on the native interface, and then L\_W\_VALID is deasserted. The following figure shows a sample read sequence with the multi-burst capability enabled. In this example, a read request is issued with a starting address (L\_ADDR) 0x08 and a request size (L\_B\_SIZE) of 32 bytes. Figure 3-10. Sample Read Sequence Demonstrating Multi-Burst Operation The following points summarize a read sequence with multi-burst capability: 1. The L\_R\_REQ signal is first asserted along with the L\_ADDR signal, and the L\_B\_SIZE signal is set to 32. - 2. The L\_R\_REQ signal is deasserted, indicating no other read requests are required. - 3. As a result of the read request, the subsystem asserts the row address (A), bank address (BA), and chip select (CS\_N) using the activate command to open the bank at the requested row. - 4. The subsystem issues the read command with a column address corresponding to the request. - 5. The subsystem issues the next read command with the corresponding column address. - 6. The read data begins to appear on the SDRAM bus on the DQ lines. - 7. L\_R\_VALID is asserted, and read data appears on the native interface, and then L\_R\_VALID is deasserted. #### 3.4.1.5 Data Ordering with Non-Aligned Starting Addresses (Ask a Question) For write and read requests with a starting address aligned to the programmed burst length, data flow to and from SDRAM devices is always sequential. Each successive data phase within a burst corresponds to a higher address value. If the starting address is not aligned to the programmed burst length, the data will not be sequential. At a minimum, a wrap occurs, because only the locations within a burst range can be accessed with the exact ordering depending on the lower three bits of the address. The multi-burst capability automatically deals with these issues, off-loading the burden from the user logic. #### 3.4.1.6 Auto-Precharge (Ask a Question) Read and write commands can be issued to SDRAM devices with or without auto-precharge. If the auto-precharge option is selected, the SDRAM device automatically closes (precharges) the bank being read from or written to at the end of the transaction. Any subsequent access to this bank does not require a precharge command from the DDR subsystem for the transaction to be closed. The L\_AUTO\_PCH signal is used to issue read and write requests with auto-precharge to the SDRAM. For write requests, the L\_AUTO\_PCH signal is asserted along with the L\_W\_REQ signal, and for read requests, the L\_AUTO\_PCH signal is asserted along with the L\_R\_REQ signal. It is acceptable to permanently tie the L\_AUTO\_PCH signal input high or low, as shown in Figure 3-9 and Figure 3-10. The auto-precharge option is useful in situations where the requested read or write addresses are random. With random address sequences, banks of the exact row required by a subsequent request may not be open. If auto-precharge is not used for the previous access to a bank, subsequent transactions to that bank require the bank to be closed (precharged), causing a delay in the transaction execution. If auto-precharge is used with the previous access to the bank, the bank is closed and ready to be opened to the desired row. #### **3.4.1.7 ECC** (Ask a Question) When ECC is enabled, the DDR controller computes an 8-bit ECC for every 64-bit data to support SECDED. A write operation computes and stores ECC along with the data, and a read operation reads and checks the data against the stored ECC. Therefore, when ECC is enabled, single or double-bit errors might be received when reading uninitialized memory locations. To prevent this, all memory locations must be written to before being read. For a non 64-bit write operation, the DDR controller performs a read-modify-write (RMW) operation as follows: - Reads the data from DDR memory (4 or 8 DDR memory bursts) - Modifies the read data with the user data - Computes the ECC and writes to DDR memory The DDR subsystem uses status signals to indicate a single-bit error (ECC\_ERROR\_1BIT) or a double-bit error (ECC\_ERROR\_2BIT) along with the error position (ECC\_ERROR\_POS) to the fabric. #### 3.4.2 Clocking Structure (Ask a Question) Both the DDR subsystem and the DDR PHY-only solution require a dedicated PLL to generate the clocks, which are then distributed throughout the subsystem using HS\_IO\_CLK routes, dedicated pads, and fabric clock routing. This PLL generates aligned clocks for all sub-blocks for smooth operation and synchronous communication with the user logic. The PLL generates the following required clocks: - REF\_CLK— This clock is routed to the PHY for clocking the DDR memory device. - HS\_IO\_CLK— This clock routed to I/O lanes and the training logic. - HS\_IO\_CLK\_270— HS\_IO\_CLK phase shifted by 270. This clock is also routed to I/O lanes and the training logic. - SYS\_CLK— This clock is routed to the DDR controller, training logic, and user logic in the fabric. The HS\_IO\_CLK and REF\_CLK clocks are generated with the same frequency and phase. The REF\_CLK to SYS\_CLK ratio is 4:1. The following illustration shows the clocking structure of the DDR subsystem. Figure 3-11. DDR Clocking Structure **Note:** The dedicated clock output pad, CCC\_xx\_PLLy\_OUTz, is selected depending on the DDR subsystem placement constraint, where xx = SE/NE/NW/SW, y = 0/1, and z = 0/1. #### 3.4.3 Integrated PHY (Ask a Question) The integrated PHY, which consists of the I/O lane and the training logic provides a physical interface to DDR3, DDR4, and LPDDR3 SDRAM devices. It receives commands from the DDR controller and generates the DDR memory signals required to access the external DDR memory. The training logic in the PHY manages DFI 3.1 training requests between the I/O lane and the DDR controller. For more information about the DDR PHY-only integration, see 3.7. DDR PHY-Only Solution Integration. For more information about Octal DDR PHY-Only Solution, see 3.8. Octal DDR PHY-Only Solution. Figure 3-12. DDR PHY #### 3.4.3.1 I/O Lane (Ask a Question) An I/O lane contains 12 I/Os, a lane controller, I/O gearing logic, and a set of high-speed, low-skew clock resources. The I/O gearing logic in a lane enables easy data transfer between the high-speed I/O pad and the lower-speed FPGA core. The logic is used to either gear up the data rate from the FPGA fabric to the memory device or gear down the data rate from the memory device to the FPGA fabric. For information about non-memory interface usage, see PolarFire FPGA and PolarFire SoC FPGA User I/O User Guide. The lane controller contains the logic for managing the read and write DQS signals and provides the lane clocks. The following table lists the number of lanes required for the DDR memory interface. Table 3-6. DDR Memory Interface Data Lanes | Memory Type | DQ Width | Number of I/O Lanes | | | |-------------|----------|-----------------------|------------|--| | | | Address/Command Lanes | Data Lanes | | | DDR3/4 | x16 | 3 | 2 | | | DDR3/4 | x32 | 3 | 4 | | | DDR3/4 | x40 | 3 | 5 | | | continued | | | | | | |-------------|----------|-----------------------|------------|--|--| | Memory Type | DQ Width | Number of I/O Lanes | | | | | | | Address/Command Lanes | Data Lanes | | | | DDR3/4 | x64 | 3 | 8 | | | | DDR3/4 | x72 | 3 | 9 | | | | LPDDR3 | x16 | 2 | 2 | | | | LPDDR3 | x32 | 2 | 4 | | | Each data lane uses one I/O lane with 12 I/O pads each—two of the I/O pads are used for the DQS, eight for the DQ bits, and one for the Data Mask (DM). One I/O pad is left as spare. **Important:** I/O usage depends on the configuration of the DDR memory, not all I/Os in the address/command lanes are used. Some I/Os in these lanes can be reused as normal I/Os. For data lanes, all 12 I/Os in the lane are used. #### 3.4.3.2 Training Logic for Fabric DDR Controller (Ask a Question) The training logic manages the DFI 3.1 training requests between the integrated PHY and DDR controller modules, and performs the following operations. - Clock training - Write leveling - Read leveling - I/O calibration For more information, see 5. Training Logic. For information about initialization sequence of DDR3, DDR4, and LPDDR3, see 4. Initialization Sequence. ## 3.5 DDR Subsystem Ports (Ask a Question) The Fabric DDR subsystem ports are categorized into the following groups: - AXI3/4 slave interface signals—required when the DDR subsystem is configured in AXI3/4 mode. An AXI master is implemented in the FPGA fabric to perform DDR memory transactions by interfacing with the DDR subsystem. - Native interface signals—required when the DDR subsystem is configured in native interface mode. To perform DDR memory transactions, custom logic must be implemented in the FPGA fabric to interface with the DDR subsystem. Logic connected to the native interface must operate synchronously with the SYS\_CLK. - Refresh controls—required for user-initiated refresh control. - ZQ controls—required for user-initiated ZQ calibration control. - RE-INIT controls—required for re-initializing DDR memories. - SDRAM interface signals—required for connecting to the DDR SDRAM. The following figure shows the DDR subsystem ports. Figure 3-13. Fabric DDR Subsystem Ports In addition to ports shown in the preceding figure, the Fabric DDR subsystem also has some generic ports, see the following table. Table 3-7. Generic Signals | Signal Name | Direction | Description | |----------------------|-----------|------------------------------------------------------------------------------------------------------------------------------------------------------------------------------| | SYS_CLK | Output | Clock for user logic generated by the embedded PLL. All native interface signals are synchronous to this clock. Always on the global clock network. | | SYS_RESET_N | Input | Active-low asynchronous system reset. SYS_RESET_N must be generated by ANDing the DEVICE_INIT_DONE and BANK_X_CALIB_STATUS signals of the PF_INIT_MONITOR IP <sup>1</sup> . | | | | BANK_X refers to the BANK where Fabric DDR subsystem is placed. | | PLL_REF_CLK | Input | Reference clock to the PLL. | | PLL_LOCK | Output | Lock signal generated by the PLL to indicate that the PLL is locked on to the PLL_REF_CLK signal. | | CTRLR_READY | Output | Signal that is deasserted when in reset or after CTRLR_INIT is asserted, then asserted after initialization and training sequences (if applicable) are completed. | | ALERT_N (ERR_OUT_N) | Input | This signal is asserted by the DDR memory when it detects a CA parity error or a CRC error. It is an input from DDR3 RDIMM or LRDIMM, and DDR4 components or DIMMs. | | stat_ca_parity_error | Output | This signal is asserted when an assertion is detected on the ALERT_N pin. Each occurrence of a write CRC error or CA parity error causes the assertion of the ALERT_N pin. | | | | The value is cleared on read. | #### Note: 1. For more information about the PF\_INIT\_MONITOR IP, see PolarFire Family Power-Up and Resets User Guide. The DDR Controller complies with the AXI3/4 protocol. The AXI3/4 slave interface provides the following features: - Supports all AXI allowed burst sizes, types, and lengths. - Handles all AXI wrapping conditions. - Supports 64-bit, 128-bit, 256-bit, and 512-bit AXI interfaces. For more information about the AXI3/4 protocol, see the AMBA AXI and ACE Protocol Specification. The following table lists the AXI3/4 slave interface signals. All AXI interface signals are active-high and are synchronous to SYS\_CLK. **Table 3-8.** AXI3/4 Slave Interface Signals | Signal Name | Direction | Description | |------------------------------------|-----------|--------------------------------------------------------| | Write Address Channel | | | | AXIO_AWID[X-1:0] <sup>2</sup> | Input | Write address ID | | AXIO_AWADDR[31:0] | Input | Write address | | AXIO_AWLEN[M:0] | Input | Burst length. M = 7 for AXI4 IF and M = 3 for AXI3 IF. | | AXIO_AWSIZE[2:0] | Input | Burst size. | | AXIO_AWBURST[1:0] | Input | Burst type. | | AXI0_AWLOCK[1:0] | Input | Lock type.Not supported. | | AXI0_AWCACHE[3:0] | Input | Memory type. Not supported. | | AXI0_AWPROT[2:0] | Input | Protection type. Not supported. | | AXI0_AWVALID | Input | Write address valid. | | AXIO_AWREADY | Output | Write address ready. | | Write Data Channel | | | | AXI0_WID[X-1:0] | Input | Write ID tag. Supported only in AXI3. | | AXIO_WDATA[N-1:0] <sup>1</sup> | Input | Write data. | | AXI0_WSTRB[N/8 - 1:0] <sup>1</sup> | Input | Write strobes. | | AXIO_WLAST | Input | Write last. | | AXIO_WVALID | Input | Write valid. | | AXIO_WREADY | Output | Write ready. | | Write Response Channel | | | | AXIO_BID[X-1:0] | Output | Response ID tag. | | AXIO_BRESP[1:0] | Output | Write response. | | AXI0_BVALID | Output | Write response valid. | | AXIO_BREADY | Input | Response ready. | | Read Address Channel | | | | AXIO_ARID[X-1:0] <sup>2</sup> | Input | Read address ID. | | AXIO_ARADDR[31:0] | Input | Read address. | | AXIO_ARLEN[M:0] | Input | Burst length. M = 7 for AXI4 IF and M = 3 for AXI3 IF. | | AXIO_ARSIZE[2:0] | Input | Burst size. | | AXIO_ARBURST[1:0] | Input | Burst type. | | AXIO_ARLOCK[1:0] | Input | Lock type. Not supported. | | AXIO_ARCACHE[3:0] | Input | Memory type. Not supported. | | AXIO_ARPROT[2:0] | Input | Protection type. Not supported. | | AXIO_ARVALID | Input | Read address valid. | | AXIO_ARREADY | Output | Read address ready. | | Read Data Channel | | | | AXI0_RID[X-1:0] <sup>2</sup> | Output | Read ID tag. | | AXIO_RDATA[N-1:0] | Output | Read data. | | AXIO_RRESP[1:0] | Output | Read response. | | AXIO_RLAST | Output | Read last. | | AXIO_RVALID | Output | Read valid. | | AXIO_RREADY | Input | Read ready. | | | | 1 | #### Notes: - 1. N can be configured as 64,128, 256, or 512 using the Fabric DDR subsystem configurator. - 2. 'X' can be configured between 1 to 8 using the Fabric DDR Configurator. **Important:** While performing DDR4 memory transfers using AXI interface, user must ensure not to exceed the 4 KB data limit as per the AXI specification. User can modify burst length (AxLEN) or burst size (AxSIZE) without exceeding the 4 KB limit. The following table lists the native interface signals. All control signals are active-high and are synchronous to SYS\_CLK. Table 3-9. Native Interface Signals | Signal Name | Direction | Description | | |-------------------|-----------|------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|--| | L_ADDR[38:0] | Input | Native interface address sizes: DDR4 = 39 bits, DDR3 = 36 bits, and LPDDR3 = 36 bits | | | L_B_SIZE[10:0] | Input | Native interface burst length in terms of bytes. It must be in multiples the native interface bus width. | | | L_R_REQ | Input | Native interface read request. | | | L_W_REQ | Input | Native interface write request. | | | L_AUTO_PCH | Input | When asserted along with L_R_REQ or L_W_REQ, causes the command to be issued as read with auto-precharge and write with auto-precharge, respectively. | | | L_BUSY | Output | Specifies that the subsystem is busy and is not accepting new requests. A command is accepted on any clock cycle where L_R_REQ or L_W_REQ is set, and L_BUSY is low. If L_BUSY is high when L_R_REQ or L_W_REQ is set, the request may be kept asserted (along with the desired L_ADDR, L_B_SIZE and L_AUTO_PCH values) until L_BUSY goes low. | | | L_D_REQ | Output | Requests data on the native interface write data bus (L_DATAIN) during a write transaction. Asserts one clock cycle prior to when data is required. | | | L_D_REQ_LAST_P0 | Output | Requests the last data on the native interface write data bus<br>This signal is used along with L_D_REQ | | | L_R_VALID | Output | Data-valid indication for data on the native interface read data bus (L_DATAOUT). | | | L_R_VALID_LAST_P0 | Output | Data-valid indication for the last data on the native interface read data bus This signal is used along with L_R_VALID | | | L_DATAIN[N:0] | Input | Input data bus. This data bus is eight times the width of the SDRAM device data bus. Memory width (bits): 16, 32, 64. Input data bus (bits): 128, 256, 512. | | | L_DATAOUT[N:0] | Output | Output data bus. This data bus is twice the width of the SDRAM device data bus. Memory width (bits): 16, 32, 64. Output data bus (bits): 128, 256, 512. | | | L_DM_IN[N:0] | Input | Individual byte masks during data write. Memory width (bits): 16, 32, 64. Data mask bus (bits): 16, 32, 64. | | The following table lists the refresh control signals. To expose these signals, select the **Enable User Refresh Controls** check box from DDR3/LPDDR3 configurator > Controller tab > Efficiency. Table 3-10. Refresh Control Signals | Signal Name | Direction | Description | |-------------|-----------|--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------| | L_REF_REQ | Input | User-initiated refresh control. Causes a refresh command to be issued at the next opportunity. This signal is only used when manual control of refresh is desired. The DDR Configurator provides an option (Enable User Refresh Controls) to expose the user-initiated refresh control (L_REF_REQ) signal. tREFI parameter in the DDR Configurator specifies the period between refreshes. | | L_REF_ACK | Output | Refresh acknowledge. Asserted for one clock cycle when a refresh command is being issued. Typically used to determine when a refresh was last issued in user-controlled refresh. Can be ignored when the subsystem is automatically generating refreshes. | The following table lists the ZQ calibration control signals. To expose these signals, enable User ZQ Calibration Controls in the DDR Configurator Memory Timing tab. Table 3-11. ZQ Calibration Control Signals | Signal Name | Direction | Description | |--------------|-----------|-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------| | L_ZQ_CAL_REQ | Input | Causes user-initiated ZQCS or ZQCL requests to be issued at the next opportunity. The ZQCS or ZQCL are issued to the ranks corresponding to the asserted bits. ZQ calibration can be initiated either automatically or manually. The DDR Configurator provides an option to enable automatic ZQ calibration and to set the automatic ZQ calibration period. | | L_ZQ_CAL_ACK | Output | ZQ calibration acknowledge. Asserted for a single clock cycle when a ZQ calibration command is issued to memory devices. Used for ZQ calibrations initiated by asserting the L_ZQ_CAL_REQ signal. | The following table describes the reinitialization control signal. To expose this signal, enable RE-INIT Controls in the DDR Configurator Controller tab. Table 3-12. Reinitialization Control Signal | Signal Name | Direction | Description | |-------------|-----------|--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------| | CTRLR_INIT | Input | Causes the subsystem to reissue the initialization sequence to the SDRAM. The initialization begins when a low-to-high transition is detected on the input. The controller always issues the initialization sequence (including the startup delay) after a reset, regardless of this signal's state. If run-time reinitialization is not required, the signal can be tied low. | The following table lists the ECC status signals. Table 3-13. ECC Status Signals | Signal Name | Direction | Description | |--------------------|-----------|----------------------------------------------------------------------------------------------------| | ECC_ERROR_1BIT | Output | Active when a 1-bit error is detected on the data being presented on the AXI RDATA/L_DATAOUT port. | | ECC_ERROR_2BIT | Output | Active when a 2-bit error is detected on the data being presented on the AXI RDATA/L_DATAOUT port. | | ECC_ERROR_POS[6:0] | Output | Indicates bit position of the error in a 64-bit data output for a 1-bit error. | The following table lists the SDRAM interface signals. Table 3-14. SDRAM Interface Signals | Signal Name | Direction | Description | |-------------|-----------|------------------------------------------------| | CK | Output | Differential clock pair forwarded to SDRAM. | | CK_N | Output | Differential clock pair forwarded to SDRAM. | | RESET_N | Output | SDRAM reset. Supported only for DDR3 and DDR4. | | continu | ed | | |------------------------|---------------|-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------| | Signal Name | Direction | Description | | A[15:0] | Output | Address bus. Sampled during the active, precharge, read, and write commands. Also provides the mode register value during MRS commands. Bus width for LPDDR3 is 10 bits, DDR3 is 16 bits, and DDR4 is 14 bits. | | BA[2:0] | Output | Bank address. Sampled during active, precharge, read, and write commands to determine which bank the command is to be applied to. Supported only for DDR3 and DDR4. For DDR4, bus width is 2 bits. For DDR3, bus width is 3 bits. | | BG[1:0] | Output | DDR bank group address for DDR4 only. | | CS_N | Output | SDRAM chip select. | | CKE | Output | SDRAM clock enable. Held low during initialization to ensure SDRAM DQ and DQS outputs are in the hi-Z state. | | RAS_N | Output | SDRAM row address strobe command. Supported only for DDR3 and DDR4. | | CAS_N | Output | SDRAM column access strobe command. Supported only for DDR3 and DDR4. | | WE_N | Output | SDRAM write-enable command. Supported only for DDR3 and DDR4. | | ODT | Output | On-die termination control. ODT is asserted during reads and writes according to the ODT activation settings in the DDR Configurator. | | PAR | Output | Command and address parity output. Supported only for DDR4. | | ALERT_N<br>(ERR_OUT_N) | Input | It is asserted by the DDR memory when it detects a CA parity error or a CRC error. It is an input from DDR3 RDIMM or LRDIMM, and DDR4 components or DIMMs. | | DQ | Bidirectional | SDRAM data bus. Supports 16-bit, 32-bit, 39-bit, 64-bit, and 72-bit DDR SDRAM data buses. | | DM/DM_N | Output | Write data mask. DM for DDR3/LPDDR3 and DM_N for DDR4. | | DQS | Bidirectional | Strobes data into the SDRAM devices during writes and into the DDR subsystem during reads. | | DQS_N | Bidirectional | Complimentary DQS. | | SHIELD | Output | Pads must be connected to the Ground. They are placed between the data lanes for improving signal integrity. <b>Note:</b> SHIELD signals are available only in the fabric DDR subsystem because it does not have dedicated I/O Banks to interface with the DDR memory. SHIELD pins are not applicable to all device and package combinations. | # 3.5.1 DDR PHY-Only Solution Ports (Ask a Question) The DDR PHY-only solution ports are divided into the following groups: - DFI interface—signals required when the DDR subsystem is configured for DDR-PHY mode. This interface allows a third-party DDR controller hosted in the fabric to connect directly to the PHY. - SDRAM interface—signals connected to the DDR SDRAM. The following illustration shows the DDR PHY-only solution ports. Figure 3-14. DDR PHY-Only Solution Ports # 3.5.2 DFI Interface (Ask a Question) The DDR memory controller interface solution leverages the DDR PHY interface (DFI 3.1) for connections between the controller and the PHY. The control signal, write data, read data update, status, and training interfaces are listed in the following tables. Use the PolarFire FPGA DDR3 PHY configurator to expose the DFI interface. For more information about the interface, see the DFI 3.1 Specification available on the DFI Group DDR PHY website. Table 3-15. DFI Control Signal Interface (synchronous to SYS\_CLK) | City Control Signar III | | _ : | |-------------------------|-----------|----------------------------------------------------| | Signal Name | Direction | Description | | DFI_ADDRESS_P0[15:0] | Input | DFI Address bus | | DFI_ADDRESS_P1[15:0] | Input | DFI Address bus | | DFI_ADDRESS_P2[15:0] | Input | DFI Address bus | | DFI_ADDRESS_P3[15:0] | Input | DFI Address bus | | DFI_BANK_P0[2:0] | Input | DFI bank bus (only for DDR3 and DDR4) | | DFI_BANK_P1[2:0] | Input | DFI bank bus (only for DDR3 and DDR4) | | DFI_BANK_P2[2:0] | Input | DFI bank bus (only for DDR3 and DDR4) | | DFI_BANK_P3[2:0] | Input | DFI bank bus (only for DDR3 and DDR4) | | DFI_CAS_N_P0 | Input | DFI column address strobe (only for DDR3 and DDR4) | | DFI_CAS_N_P1 | Input | DFI column address strobe (only for DDR3 and DDR4) | | DFI_CAS_N_P2 | Input | DFI column address strobe (only for DDR3 and DDR4) | | DFI_CAS_N_P3 | Input | DFI column address strobe (only for DDR3 and DDR4) | | DFI_CKE_P0[0] | Input | DFI clock enable | | DFI_CKE_P1[0] | Input | DFI clock enable | | DFI_CKE_P2[0] | Input | DFI clock enable | | DFI_CKE_P3[0] | Input | DFI clock enable | | DFI_CS_N_P0[0] | Input | DFI chip select | | DFI_CS_N_P1[0] | Input | DFI chip select | | DFI_CS_N_P2[0] | Input | DFI chip select | | DFI_CS_N_P3[0] | Input | DFI chip select | | DFI_ODT_P0[0] | Input | DFI on-die termination control | | DFI_ODT_P1[0] | Input | DFI on-die termination control | | DFI_ODT_P2[0] | Input | DFI on-die termination control | | DFI_ODT_P3[0] | Input | DFI on-die termination control | | DFI_RAS_N_P0 | Input | DFI row address strobe (only for DDR3 and DDR4) | | DFI_RAS_N_P1 | Input | DFI row address strobe (only for DDR3 and DDR4) | | DFI_RAS_N_P2 | Input | DFI row address strobe (only for DDR3 and DDR4) | | DFI_RAS_N_P3 | Input | DFI row address strobe (only for DDR3 and DDR4) | | DFI_RESET_N_P0 | Input | DFI reset | | DFI_RESET_N_P1 | Input | DFI reset | | DFI_RESET_N_P2 | Input | DFI reset | | DFI_RESET_N_P3 | Input | DFI reset | | DFI_WE_N_P0 | Input | DFI write enable (only for DDR3 and DDR4) | | DFI_WE_N_P1 | Input | DFI write enable (only for DDR3 and DDR4) | | DFI_WE_N_P2 | Input | DFI write enable (only for DDR3 and DDR4) | | DFI_WE_N_P3 | Input | DFI write enable (only for DDR3 and DDR4) | Table 3-16. DFI Write Data Interface | Signal Name | Direction | Description | |---------------------------------------|-----------|-----------------------------------------------------------------| | DFI_WRDATA_CS_N_P0[0] | Input | DFI write data chip select | | DFI_WRDATA_CS_N_P1[0] | Input | DFI write data chip select | | DFI_WRDATA_CS_N_P2[0] | Input | DFI write data chip select | | DFI_WRDATA_CS_N_P3[0] | Input | DFI write data chip select | | DFI_WRDATA_EN_P0[63:0] <sup>1</sup> | Input | DFI write data and data mask enable | | DFI_WRDATA_EN_P1[63:0] | Input | DFI write data and data mask enable | | DFI_WRDATA_EN_P2[63:0] | Input | DFI write data and data mask enable | | DFI_WRDATA_EN_P3[63:0] | Input | DFI write data and data mask enable | | DFI_WRDATA_MASK_P0[15:0] <sup>1</sup> | Input | DFI write data byte mask | | DFI_WRDATA_MASK_P1[15:0] <sup>1</sup> | Input | DFI write data byte mask | | DFI_WRDATA_MASK_P2[15:0] <sup>1</sup> | Input | DFI write data byte mask | | DFI_WRDATA_MASK_P3[15:0] <sup>1</sup> | Input | DFI write data byte mask | | DFI_WRDATA_P0[127:0] <sup>1</sup> | Input | These signals transfer write Data from memory controller to PHY | | DFI_WRDATA_P1[127:0] <sup>1</sup> | Input | These signals transfer write Data from memory controller to PHY | | DFI_WRDATA_P2[127:0] <sup>1</sup> | Input | These signals transfer write Data from memory controller to PHY | | DFI_WRDATA_P3[127:0] <sup>1</sup> | Input | These signals transfer write Data from memory controller to PHY | #### Note: 1. Depends on DQ width (see Table 3-22 and Table 3-23). Table 3-17. DFI Read Data Interface | Signal Name | Direction | Description | |---------------------------------------|-----------|---------------------------| | DFI_RDDATA_CS_N_P0[0] | Input | DFI read data chip select | | DFI_RDDATA_CS_N_P1[0] | Input | DFI read data chip select | | DFI_RDDATA_CS_N_P2[0] | Input | DFI read data chip select | | DFI_RDDATA_CS_N_P3[0] | Input | DFI read data chip select | | DFI_RDDATA_EN_P0[63:0] <sup>1</sup> | Input | DFI read data enable | | DFI_RDDATA_EN_P1[63:0] <sup>1</sup> | Input | DFI read data enable | | DFI_RDDATA_EN_P2[63:0] <sup>1</sup> | Input | DFI read data enable | | DFI_RDDATA_EN_P3[63:0] <sup>1</sup> | Input | DFI read data enable | | DFI_RDDATA_VALID_W0[7:0] <sup>1</sup> | Output | DFI read data valid | | DFI_RDDATA_VALID_W1[7:0] <sup>1</sup> | Output | DFI read data valid | | DFI_RDDATA_VALID_W2[7:0] <sup>1</sup> | Output | DFI read data valid | | DFI_RDDATA_VALID_W3[7:0] <sup>1</sup> | Output | DFI read data valid | | DFI_RDDATA_W0[127:0] <sup>1</sup> | Output | DFI read data | | DFI_RDDATA_W1[127:0] <sup>1</sup> | Output | DFI read data | | DFI_RDDATA_W2[127:0] <sup>1</sup> | Output | DFI read data | | DFI_RDDATA_W3[127:0] <sup>1</sup> | Output | DFI read data | ## Note: 1. Depends on DQ width (see Table 3-22 and Table 3-23). Table 3-18. DFI Write Calibration Interface | Signal Name | Direction | Description | |-------------------------------------|-----------|---------------------------------------------------------------------------------------------------------------------------------------------------------------------| | CAL_L_BUSY | Input | Write Calibration Busy. Indicates this interface is not accepting new commands. A command is accepted when CAL_L_R_REQ or CAL_L_W_REQ is set and CAL_L_BUSY is low. | | CAL_L_DATAOUT[511:0] <sup>1</sup> | Input | Write calibration data output from the controller Depends on DQ width. | | CAL_L_D_REQ | Input | Write calibration data request from the controller. | | CAL_L_R_VALID | Input | Write calibration read valid from the controller. | | CAL_L_DATAIN[511:0] <sup>1</sup> | Output | Write calibration data input to the controller Depends on DQ width. | | CAL_L_DM_IN[63:0] <sup>1</sup> | Output | Write calibration data mask input to the controller Depends on DQ width. | | CAL_L_R_REQ | Output | Write calibration read request to the controller. | | CAL_L_W_REQ | Output | Write calibration write request to the controller. | | CAL_SELECT | Output | Write Calibration Select to the controller. | | CAL_INIT_ACK <sup>2</sup> | Input | Calibration initialization bus available handshake from DDR controller. | | CAL_INIT_CS[1:0] <sup>2</sup> | Output | Calibration initialization bus chip select. | | CAL_INIT_MR_ADDR[7:0] <sup>2</sup> | Output | Calibration initialization bus mode register write address. | | CAL_INIT_MR_DATA[17:0] <sup>2</sup> | Output | Calibration initialization bus mode register write data. | | CAL_INIT_MR_MASK[17:0] <sup>2</sup> | Output | Calibration initialization bus mode register write mask. | | CAL_INIT_MR_W_REQ <sup>2</sup> | Output | Calibration initialization bus mode register write request. | #### **Notes:** - 1. Depends on DQ width (see Table 3-22 and Table 3-23). - 2. These ports are only for DDR4. - 3. The DRAM interface is same as Table 3-14 for DDR PHY only solution. Table 3-19. DFI Training Interface | Signal Name | Direction | Description | |---------------------|-----------|----------------------------------------------| | DFI_RDLVL_CS_N[0] | Input | Chip select for read data eye training | | DFI_RDLVL_EN | Input | PHY read data eye training enable | | DFI_RDLVL_GATE_EN | Input | PHY read gate training enable | | DFI_WRLVL_CS_N[0] | Input | Chip select for write leveling | | DFI_WRLVL_EN | Input | PHY write leveling logic enable | | DFI_WRLVL_STROBE | Input | Initiates capture of write level response | | DFI_RDLVL_RESP[7:0] | Output | Indicates data eye or gate training complete | | DFI_WRLVL_RESP[7:0] | Output | Indicates PHY has completed write leveling | Table 3-20. DFI Status Interface | Signal Name | Direction | Description | |-------------------|-----------|-----------------------------| | DFI_INIT_START | Input | DFI setup stabilization | | DFI_INIT_COMPLETE | Output | PHY initialization complete | Table 3-21. DFI Clock and Miscellaneous | Signal Name | Direction | Description | |-------------|-----------|---------------------------------------| | PLL_LOCK | Output | PLL lock indicator | | PLL_REF_CLK | Input | PLL reference clock | | SYS_CLK | Output | PLL system clock output to controller | | continued | | | | |-----------------------|-----------|--------------------------------------------------------|--| | Signal Name | Direction | Description | | | SYS_RESET_N | Input | Global reset input | | | SYNC_SYS_RST_N | Output | Synchronized reset output | | | CTRLR_READY_IN | Input | Controller ready status from controller to PHY | | | CTRLR_READY_OUT | Output | Controller ready status from PHY to controller | | | DFI_TRAINING_COMPLETE | Output | PHY output to controller to indicate training complete | | ## 3.6 Functional Timing Diagrams (Ask a Question) This section describes the write and read operations of the Fabric DDR subsystem in native interface and AXI4 interface modes. #### 3.6.1 Native Interface Transactions (Ask a Question) This section describes the read/write rules and sequences in the native interface. For information about native interface signals, see Table 3-9. **Important:** During the simulation, users can find the difference between Core configuration timings (JEDEC\*) and simulation results. This is due to the internal architecture of the controller (T4), which includes a 4-stage queue so that commands to memory are issued at every 4 clock cycles. #### 3.6.1.1 SDRAM Writes (Ask a Question) SDRAM writes are requested in the native interface by asserting the L\_W\_REQ signal high, and by driving the starting address and the burst size on the L\_ADDR and L\_B\_SIZE signals respectively. A write can be requested with auto-precharge by asserting the L\_AUTO\_PCH signal along with the L\_W\_REQ signal. The following are the rules for write requests in the native interface: - A write is accepted by the subsystem on any clock cycle where the L\_W\_REQ signal is asserted while the L\_BUSY signal is deasserted. After L\_BUSY is deasserted by the controller, L\_W\_REQ may remain asserted to request a follow-on write transaction. L\_W\_REQ may remain asserted over any number of clock periods to generate any number of cascaded write requests. - The values of the L\_ADDRL\_B\_SIZE, and L\_AUTO\_PCH signals are captured when the L\_BUSY signal is low. It is acceptable to permanently tie the L\_B\_SIZE and L\_AUTO\_PCH signals to fixed values. - The L W REQ signal cannot be asserted while the L R REQ signal is asserted. - The data request (L\_D\_REQ) signal is asserted one clock prior to the data signal at the L\_DATAIN bus and the subsystem masks the information at the L\_DM\_IN bus. - The timing relationship between an initial L\_W\_REQ, L\_D\_REQ, and L\_BUSY or between L\_BUSY assertions and de-assertions as a result of multiple cascaded writes varies depending on the status of the banks being accessed, the configuration port settings, and the refresh and initialization status. **Note:** The user logic must not rely on any fixed timing relationship between the L\_W\_REQ, L\_D\_REQ, and L\_BUSY signals. #### 3.6.1.1.1 Sample Write Sequence (Ask a Question) The following figure shows a sample write sequence consisting of single write request. Figure 3-15. Example Single Write Sequence The following points summarize a write sequence: - 1. The L\_W\_REQ signal is first asserted along with the L\_ADDR signal, and the L\_B\_SIZE signal is set to 32. - 2. If the L\_BUSY signal is not asserted in clock cycle 1 (indicating that the request is accepted by the subsystem), the L\_W\_REQ signal remains asserted. The L\_ADDR and L\_B\_SIZE signals are updated for the next write request. - 3. The L\_W\_REQ signal is deasserted, indicating that no other write requests are required. - 4. As a result of the write request, the subsystem asserts the row address (A), bank address (BA), and chip select (CS\_N) using the activate command to open the bank at the requested row. - 5. As a result of the write request, the subsystem issues a write command with the appropriate column address. In response to the next write request, the subsystem issues another write command with the corresponding column address. - 6. The subsystem requests data in the native interface by asserting the L\_D\_REQ signal. In this sequence, all the writes are to the same bank and row. If the next write request targets a different bank, the DDR subsystem may have to issue a precharge and/or activate commands prior to issuing write commands. #### 3.6.1.2 SDRAM Reads (Ask a Question) SDRAM reads are requested in the native interface by asserting the L\_R\_REQ signal, and by driving the starting address and burst size on the L\_ADDR and L\_B\_SIZE signals respectively. The L\_AUTO\_PCH signal may also be asserted along with the L\_R\_REQ signal to issue the read as a read with auto-precharge. A read can be requested with auto-precharge by asserting the L\_AUTO\_PCH signal along with the L\_R\_REQ signal. The following are the rules for read requests at in the native interface: - A read is accepted by the subsystem on any clock cycle where the L\_R\_REQ signal is asserted while the L\_BUSY signal is deasserted. After L\_BUSY is deasserted by the subsystem, the L\_R\_REQ signal may remain asserted to request a follow-on read transaction. The L\_R\_REQ signal may remain asserted over any number of clock periods to generate any number of cascaded read requests. - The values of the L\_ADDRL\_B\_SIZE, and L\_AUTO\_PCH signals are captured when the L\_BUSY signal is low. The L\_B\_SIZE and L\_AUTO\_PCH signals can be tied to fixed values, if desired. - The L\_R\_REQ signal cannot be asserted while the L\_W\_REQ signal is asserted. - The read data valid (L\_R\_VALID) signal is asserted when valid data is available at the L\_DATAOUT bus. - The timing relationship between an initial L\_R\_REQ, L\_R\_VALID, and L\_BUSY or between L\_BUSY assertions and de-assertions as a result of multiple cascaded reads varies depending on the status of the banks being accessed, configuration port settings, refresh status, and initialization status. The user logic must not rely on any fixed timing relationship between the L\_R\_REQ, L\_R\_VALID, and L\_BUSY signals. #### 3.6.1.2.1 Sample Read Sequence (Ask a Question) The following figure shows a sample read sequence consisting of single read request. Figure 3-16. Example Sample Read Sequence The following points summarize a read sequence: - 1. The L\_R\_REQ signal is first asserted along with the L\_ADDR signal, and the L\_B\_SIZE signal is set to 32. - 2. In the previous clock cycle, the l\_busy signal was not asserted, indicating that the subsystem accepted the first request on that cycle. The L\_R\_REQ signal remains asserted. The L\_ADDR and L\_B\_SIZE signals are updated for the next read request. - 3. The L R REQ signal is deasserted, indicating no other read requests are required. - 4. As a result of the read request, the subsystem asserts the row address (A), bank address (BA), and chip select (CS\_N) using the activate command to open the bank at the requested row. - 5. The subsystem issues a read command with a column address corresponding to the request at the respective clock cycle. - 6. The subsystem issues the next read command with the corresponding column address. - 7. Read data begins to appear on the SDRAM bus on the DQ lines. - 8. L\_R\_VALID is asserted, and read data appears in the native interface. - 9. L\_R\_VALID is deasserted. In this sequence, all read requests are to the same bank and row. If the next read was to different banks or rows, the core may have issued precharge and/or activate commands prior to issuing read commands. #### 3.6.2 AXI Transactions (Ask a Question) The following timing diagrams illustrate the operation of the DDR controller with the AXI interface. The Micron DDR3 16-bit memory model is used to perform the read and write transactions. Figure 3-17. AXI Single Write Transaction Figure 3-18. AXI Burst Write Transaction (INCR-2) Figure 3-20. AXI Burst Read Transaction (INCR-2) # 3.7 DDR PHY-Only Solution Integration (Ask a Question) This section describes how to integrate third-party DDR memory controllers with PolarFire DDR PHY. The PHY top-level behavior on both DFI and DRAM interfaces are detailed in this section. The PolarFire DDR PHY-only solution is DFI 3.1 complaint and supports DDR3, DDR3L, DDR4, and LPDDR3. The PolarFire DDR PHY-only solution includes a PLL (CCC – Clock Conditioning Circuit) and an integrated PHY as shown in Figure 3-2. The integrated PHY includes I/O lanes and training logic, I/O lanes are used for command/data. The following figure shows the top-level I/O of the DDR PHY. The signals are grouped and color coded for easy visualization. Figure 3-21. DDR3 PHY-Only Top-Level I/O As shown in the preceding figure, DFI signals **DFI\_INIT\_START** and **DFI\_INIT\_COMPLETE** are part of the status interface according to the DFI 3.1 specification. These signals are shown as part of the training interface because the PHY performs the clock training from SYS\_CLK to HSIO\_CLK and from HSIO\_CLK to REF\_CLK during the initialization. # 3.7.1 Controller and PHY Integration (Ask a Question) This section describes the PHY top-level pin list and how they interconnect to other modules in a DDR subsystem. The DDR3 PHY is compliant with DFI 3.1 and can be easily integrated with controllers that comply with DFI 3.1. All of the pin list, description, behavior, and timing are as per DFI 3.1. For more information, see the DFI 3.1 specification. The following figure shows how PHY integrates with a memory controller for control, write and read interfaces. Figure 3-22. DDR PHY DFI Interface The following figure shows how the training IP inside the DDR PHY integrates with a memory controller. The training IP includes write calibration. Figure 3-23. DDR PHY Training IP Interface with Write Calibration The following figure shows how DDR PHY interfaces with DRAM, PLL, and Miscellaneous. Figure 3-24. DDR PHY Interface with DRAM, PLL, and Miscellaneous # 3.7.2 PHY Configurator (Ask a Question) The PolarFire DDR PHY IP core is delivered with a configurator, which includes parameters like bus width, density, organization and frequency of operation. The port widths on DFI read and write interfaces depend on the DQ bus width selected for the design. The following tables list the DFI read and write interfaces and their corresponding DQ bus width on the DRAM interface. Table 3-22. DFI Port Widths Vs DRAM DQ Bus Width (16-Bit Interface) | DFI Signal name | Interface | DRAM DQ Width | Remarks | |--------------------------|-----------|---------------|-------------------------------------------------------------------| | dfi_rddata_w0[31:0] | READ | DQ[15:0] | dfi_rddata_w* width is [31:0] | | dfi_rddata_w1[31:0] | | DQS[1:0] | for 16 bit DRAM interface. $(4 \times 32 = 128 \text{ bit data})$ | | dfi_rddata_w2[31:0] | | DQS_N[1:0] | transferred to controller in a | | dfi_rddata_w3[31:0] | | | single SYS_CLK) | | dfi_rddata_valid_w0[1:0] | | | dfi_rddata_valid width is 1:0 | | dfi_rddata_valid_w1[1:0] | | | for two byte lanes. | | dfi_rddata_valid_w2[1:0] | | | | | dfi_rddata_valid_w3[1:0] | | | | | dfi_rddata_cs_n_p0** | | | cs_n_p** width is 1. These | | dfi_rddata_cs_n_p1** | | | signals are always asserted in the Libero® SoC design. | | dfi_rddata_cs_n_p2** | | | the Libero 30C design. | | dfi_rddata_cs_n_p3** | | | | | dfi_rddata_en_p0[1:0]* | | | *dfi_rddata_en_p* is 15:0. Out | | dfi_rddata_en_p1[1:0]* | | | of this only 1:0 are used for two byte lanes. | | dfi_rddata_en_p2[1:0]* | | | two byte laries. | | dfi_rddata_en_p3[1:0]* | | | | | dfi_wrdata_p0[31:0] | WRITE | DQ[15:0] | dfi_wrdata_p* width is [31:0] | | dfi_wrdata_p1[31:0] | | DQS[1:0] | for 16 bit DRAM interface. $(4 \times 32 = 128 \text{ bit Data})$ | | dfi_wrdata_p2[31:0] | | DQS_N[1:0] | transferred from controller in | | dfi_wrdata_p3[31:0] | | | a single SYS_CLK) | | dfi_wrdata_mask_p0[3:0] | | | Data on dfi_wrdata is valid | | dfi_wrdata_mask_p1[3:0] | | | only when wrdata_mask is '0' (4 bytes of wrdata[31:0] have | | dfi_wrdata_mask_p2[3:0] | | | 4 bits of mask) | | dfi_wrdata_mask_p3[3:0] | | | | | dfi_wrdata_cs_n_p0** | | | cs_n_p** width is 1. These | | dfi_wrdata_cs_n_p1** | | | signals are always asserted in Libero SoC design | | dfi_wrdata_cs_n_p2** | | | Libero 30C design | | dfi_wrdata_cs_n_p3** | | | | | dfi_wrdata_en_p0[1:0]* | | | *dfi_wrdata_en_p* is 15:0. | | dfi_wrdata_en_p1[1:0]* | | | Out of this only 1:0 are used for two byte lanes. | | dfi_wrdata_en_p2[1:0]* | | | Tor two byte farles. | | dfi_wrdata_en_p3[1:0]* | | | | Table 3-23. DFI Port Width Vs DRAM DQ Bus Width – 32-Bit Interface | DFI Signal name | Interface | DRAM DQ Width | Remarks | | | | | | |--------------------------|-----------|---------------|------------------------------------------------------------|--|--|--|--|--| | dfi_rddata_w0[63:0] | READ | _ | dfi_rddata_w* width is [63:0] | | | | | | | dfi_rddata_w1[63:0] | | - | for 32 bit DRAM interface.<br>(4 x 64 = 256 bit Data | | | | | | | dfi_rddata_w2[63:0] | | _ | transferred to controller in a | | | | | | | dfi_rddata_w3[63:0] | | _ | single SYS_CLK) | | | | | | | dfi_rddata_valid_w0[3:0] | | _ | dfi_rddata_valid width is 3:0 | | | | | | | dfi_rddata_valid_w1[3:0] | | - | for four byte lanes. | | | | | | | dfi_rddata_valid_w2[3:0] | | DQ[31:0] | | | | | | | | dfi_rddata_valid_w3[3:0] | | DQS[3:0] | | | | | | | | dfi_rddata_cs_n_p0** | | DQS_N[3:0] | cs_n_p** width is 1. These | | | | | | | dfi_rddata_cs_n_p1** | | - | signals are always asserted in Libero SoC design. | | | | | | | dfi_rddata_cs_n_p2** | | _ | Libero 30C design. | | | | | | | dfi_rddata_cs_n_p3** | | - | | | | | | | | dfi_rddata_en_p0[3:0]* | | _ | *dfi_rddata_en_p* is 31:0. Out | | | | | | | dfi_rddata_en_p1[3:0]* | | - | of this only 3:0 are used for four byte lanes. | | | | | | | dfi_rddata_en_p2[3:0]* | | _ | Tour byte laries. | | | | | | | dfi_rddata_en_p3[3:0]* | | - | | | | | | | | dfi_wrdata_p0[63:0] | _ | _ | dfi_wrdata_p* width is [63:0] | | | | | | | dfi_wrdata_p1[63:0] | _ | _ | for 32 bit DRAM interface.<br>(4 x 64 = 256 bit Data | | | | | | | dfi_wrdata_p2[63:0] | _ | _ | transferred from controller in | | | | | | | dfi_wrdata_p3[63:0] | _ | _ | a single SYS_CLK) | | | | | | | dfi_wrdata_mask_p0[7:0] | _ | _ | Data on dfi_wrdata is valid | | | | | | | dfi_wrdata_mask_p1[7:0] | _ | _ | only when wrdata_mask is '0' (8 bytes of wrdata[63:0] have | | | | | | | dfi_wrdata_mask_p2[7:0] | _ | _ | 8 bits of mask) | | | | | | | dfi_wrdata_mask_p3[7:0] | _ | DQ[31:0] | | | | | | | | dfi_wrdata_cs_n_p0** | WRITE | DQS[3:0] | cs_n_p** width is 1. These | | | | | | | dfi_wrdata_cs_n_p1** | _ | DQS_N[3:0] | signals are always asserted in Libero SoC design | | | | | | | dfi_wrdata_cs_n_p2** | _ | _ | Libero 30c design | | | | | | | dfi_wrdata_cs_n_p3** | _ | _ | | | | | | | | dfi_wrdata_en_p0[3:0]* | _ | _ | *dfi_wrdata_en_p* is 31:0. | | | | | | | dfi_wrdata_en_p1[3:0]* | _ | _ | Out of this only 3:0 are used for four byte lanes. | | | | | | | dfi_wrdata_en_p2[3:0]* | _ | _ | Tor Tour byte lattes. | | | | | | | dfi_wrdata_en_p3[3:0]* | _ | _ | | | | | | | The configuration of the controller and the DDR PHY must be matched. For example, if the controller is configured for 32-bit DQ width, PHY also must be configured for the 32-bit DQ width. The DDR3 PHY is instantiated by selecting the **Generate PHY Only** check box in the PolarFire DDR3 configurator > **General** Tab settings. When this check box is selected as shown in the following figure, the remaining tabs are grayed out because these settings are pertaining to the controller. Figure 3-25. DDR3 PHY-Only Configurator The following parameters are configurable: - Protocol - Memory Clock Frequency - PLL Reference Clock - User Logic Clock Rate - · CCC PLL Clock Multiplier - · DQ Width - Enable DM - Enable ECC # 3.7.3 Initialization and Bring-Up of the DDR Subsystem (Ask a Question) The DDR subsystem initializes in the following sequence: - 1. DFI Initialization (DFI\_INIT\_START & DFI\_INIT\_COMPLETE) - 2. DRAM Initialization - 3. ZQCL - 4. Write Leveling - 5. DQS Gate Training - 6. Read Data Eye Training - 7. Write Calibration - 8. Simple Burst Write - 9. Simple Burst Read Note: Any DFI compliant controller must follow this sequence. # 3.7.3.1 DFI Initialization (Ask a Question) In this step, the PHY de-asserts the **DFI\_INIT\_COMPLETE** signal (1'b0) to indicate that the PHY is not ready to accept DFI commands. After the RESET release, the PHY goes through the following PLL Lock and Clock Training sequence, which does not require any controller or user intervention. - 1. Initializes the PLL and waits for the lock signal. - 2. Switches SYS\_CLK to PLL HSIO\_CLK/4 clock. - 3. Sets the IOA buffer Vref to 50%. **Note:** $V_{REF}$ is set either internally or externally by a pin at device power-up. - 4. Aligns HSIO\_CLK to SYS\_CLK. - 5. Aligns REF\_CLK (CK0/CK0\_N) to ADD/CMD bus. This completes the DFI and IOD initialization. The Training IP asserts the **DFI\_INIT\_COMPLETE** signal (1) indicating the controller to start the DDR training procedure. The PHY is now ready to accept DFI commands. **Note:** After dfi\_init\_complete assertion, the controller must keep DFI\_RESET\_N\_P\* / DFI\_CKE\_P\* low for 200 µs, and then drive DFI\_RESET\_N\_P\* high. After that, the controller must wait for another 500 µs and assert DFI\_CKE\_P\* high. This is required as per the JEDEC specification. # 3.7.3.2 DRAM Initialization (Ask a Question) After the first step, the controller starts the JEDEC compliant initialization sequence. The controller initializes the DRAM devices by writing to their mode registers. The DRAM device is initialized by issuing MRS command to mode registers as per the JEDEC specification. For example, the MRS command is issued on the DRAM interface by asserting CS\_N=0, RAS\_N=0, CAS\_N=0, and WE\_N=0. The Mode register address is sent on BA pins and the Mode register data is sent on DRAM ADDR pins. The controller must drive these signals accordingly on the DFI interface. The latency in the PHY is identical on all of the control interface signals. The controller must meet the timing of the DRAM interface on the DFI interface. Figure 3-26 shows the DFI to DRAM latency and is an example of how the memory controller drives the DFI interface to obtain the desired functionality and timing on the DRAM interface. In the DFI Specification, there is a single timing parameter (tCTRL\_DELAY, which needs to be set in the DDR controller) for all of the control interface. The controller must implement this DFI timing inside it. The latency seen in Figure 3-26 is the minimal latency observed from DFI command to DRAM command. For this example tCTRL\_DELAY was set to 0. Any tCTRL\_DELAY value set by the controller is added to the latency shown in Figure 3-26. **Note:** The DDR3 example is used to show the simulation. Figure 3-26. DFI To DRAM Control Interface Timing Relationship ### 3.7.3.3 ZQ Calibration (Ask a Question) After the second step, the controller performs the ZQ calibration. This command takes 512 REF\_CLK cycles to complete the ZQ calibration. After issuing this command, the controller must wait for 512 REF\_CLK cycles. The ZQCL command is issued (by asserting CS\_N=0, WE\_N=0, and DRAM ADDR=0x400 for DDR3). The controller drives these signals accordingly on the DFI interface. Figure 3-27 shows ZQCL on DFI followed by ZQCL on DRAM interface. There is a single timing parameter (tCTRL\_DELAY) for all of the control Interface. The latency seen Figure 3-27 is minimal latency observed from the DFI command to DRAM command. For this example, tCTRL\_DELAY was set to 0. Any tCTRL\_DELAY value set by the controller is added to the latency shown in Figure 3-27. Figure 3-27. ZQCL On DFI Followed By ZQCL On DRAM INTERFACE ### 3.7.3.4 Write Leveling (Ask a Question) After the third step, the controller enters the write leveling mode. Any DFI compliant controller must follow this Write Leveling sequence to complete the training step: - 1. To enter the Write Leveling mode, the controller must issue Mode Register Set (MRS) to MR1 for DDR3/DDR4, MR2 for LPDDR3, and set the write leveling enable bit (DFI\_WRLVL\_EN = 1). - 2. After detecting DFI\_WRLVL\_EN =1, the Training IP starts the training algorithm and aligns CLK to DQS. Write leveling is a JEDEC defined method used to detect 0 to 1 transition on CLK by DQS. - 3. DQS samples the CLK value and feeds it back on the prime DQ on the DRAM DQ byte lane. - 4. When the prime DQ value transitions from 0 to 1, CLK is aligned to DQS within the JEDEC specified timing limits. The Training IP performs this procedure on all byte lanes concurrently. - 5. The controller sends continuous strobe (DFI\_WRLVL\_STROBE) pulses to capture DFI\_WRLVL\_RESP. DFI\_WRLVL\_STROBE initiates the capture of DFI\_WRLVL\_RESP within the PHY DQ bus. - 6. The Training IP asserts DFI\_WRLVL\_RESP = 1 indicating the completion of Write Leveling to the controller. - **Note:** In the Fast Simulation Mode, PHY ignores the WRLVL request without responding. The controller must bypass the write leveling, read gate leveling, and read eye training (Read Leveling) stages. - 7. When the controller receives the DFI\_WRLVL\_RESP asserted signal, it de-asserts the write to mode register MR1 and also de-asserts DFI\_WRLVL\_EN to disable write leveling enable bit. - 8. The delay values detected for each byte lane are stored in the PHY lane controller. The following figure shows the controller sampling DFI\_WRLVL\_RESP and disabling DFI\_WRLVL\_EN. Figure 3-28. DFI Write Leveling Complete # 3.7.3.5 DQS Gate Training (Ask a Question) After the fourth step, Controller initiates the DOS Gate Training in the following sequence: - 1. The controller initiates the DQS Gate Training by asserting DFI\_RDLVL\_GATE\_EN. The controller must write 0 to MR1 (for DDR3/4)/MR2 (for LPDDR3) bit 7 by issuing the MRS command for disabling the Write Leveling of Step 4. - 2. The controller writes 1 to MR3 bit 2 for enabling data on DQ bus flow from MPR (Multi-Purpose Register) for DDR3 and DDR4 SDRAMs. - 3. The controller must assert DFI\_RDLVL\_GATE\_EN = 1 and wait for dfi\_rdlvl\_resp. After detecting DFI\_RDLVL\_GATE\_EN = 1, the Training IP starts the DQS Gate Training algorithm. - 4. The Training IP positions the DQS\_GATE mask to enable READ\_DQ and READ\_DQS as shown in the following figure. Figure 3-29. DQS Gate Training Conceptual View When the DQS GATE Mask is positioned correctly, the read data matches the pattern selected in the MPR register. Then, the Training IP will assert DFI\_RDLVL\_RESP. After sampling DFI\_RDLVL\_RESP = 1, the controller can de-assert DFI\_RDLVL\_GATE\_EN = 0. For DDR3/4, the controller must write 0 to MR3 bit 2 for restoring normal data flow on the DQ bus. (MR3 bit 2 = 0 > Normal Data flow; MR3 bit 2 = 1 > Data flow from MPR) Otherwise, DQS strobe pulses will be missing and the read data will not match the pattern selected in the MPR register. **Note:** For DDR3/4, the controller must issue regular READ commands to read the data from MPR. For LPDDR3, the controller must issue mode register read to MR32 or MR40. The time between reads must at least be 62 SYS\_CLK cycles long so that the Training IP can stabilize. The DFI READ time during the MPR reads must be the same as the normal data read operations. During the DQS Gate training, the timing of DFI\_RDDATA\_VALID is not trained. ### 3.7.3.6 Read Data Eye Training (Ask a Question) After the fifth step, the controller initiates The READ Data Eye training in the following sequence. - 1. The controller reads a continuous data pattern of '00' followed by 'FF' from DRAM. This is same as enabling MPR and reading a fixed pattern from MPR for DDR3/4. DDR3 supports only one pattern in MPR. - 2. Enabling the data flow for DDR3/4 from MPR is similar to what is done in DQS Gate training. For DDR3/4, the controller must set MR3 bit 2 to 1 (data flow from MPR) by issuing the MRS command. - 3. The controller must assert DFI\_RDLVL\_EN = 1 to initiate READ DATA EYE training. The Training IP sweeps the input tap delay, reads the data and compares it with the expected results. - 4. The left and right margins of the passing data are stored for all the bits in the byte lane. The left and right margins of each bit are analyzed and the appropriate delay value is applied to each bit to maximize the valid data window. - 5. After the completion of READ Data eye training, the Training IP asserts DFI\_RDLVL\_RESP. After sampling DFI\_RDLVL\_RESP = 1, the controller can de-assert DFI\_RDLVL\_EN = 0. For DDR3/4, the controller must write 0 to MR3 bit 2 for restoring the normal data flow on the DQ bus. **Important:** For DDR3/4, the controller must issue the regular 'READ' commands to read data from the MPR register. For LPDDR3, the controller must issue mode register read to MR32 or MR40. The following figure shows READ DATA EYE before and after training. Figure 3-30. Read Data Eye Before And After Training # 3.7.3.7 Write Calibration (Ask a Question) The write leveling described in step 4 addresses only timing. It does not address latency. In the fly-by topology, there is a latency from the first DRAM device to the last DRAM device. Write calibration is not defined in the DFI Specification. This implementation of Write calibration is specific to PolarFire family designs. Correcting for latency is accomplished by pushing both DQ and DQS by a full clock cycle (memory clock, CKO/CKO\_N). The Training IP handles the Write calibration. The Training IP writes and reads known patterns through the controller and pushes DQ/DQS by a full clock cycle until Writes and Reads match. This step is also called as coarse correction because DQS is pushed by a full clock cycle. The Write calibration is done on a per byte lane basis. The Write calibration is handled by a group of signals called CAL\_TRAINING interface. After the completion of READ Data eye training (Step 6), the Training IP asserts the DFI\_TRAINING\_COMPLETE signal. The Write Calibration sequence is as follows. - 1. Training IP issues TRAINING\_COMPLETE. - 2. Controller asserts CTRLR READY IN. - 3. For DDR3/4, the controller writes to 'MR3' '0x0' to restore normal data flow. This restores the DQ data flow in the normal mode. - Controller de-asserts CAL\_L\_BUSY (0x0). Training IP asserts CAL\_SELECT (0x1, ~CAL\_L\_BUSY). - 5. Training IP issues CAL\_L\_W\_REQ to issue 'WRITE' Command. - 6. Controller issues CAL L D REQ one clock before Data required. - 7. Training IP provides 'CAL\_L\_DATAIN' & 'CAL\_L\_DM\_IN' in the following clock. - 8. Controller issues 'WRITE' command on DFI to be written to DRAM. - After waiting for 19 SYS\_CLK clock cycles, the Training IP issues CAL\_L\_R\_REQ to issue 'READ' command. - 10. Controller Reads Data from DRAM and presents it on 'CAL\_L\_DATAOUT' with 'CAL\_L\_R\_VALID'. - 11. Training IP compares DATA written and read. - 12. If DATA written and read do not match, Training IP pushes DQ/DQS by full clock cycle at a time, until DATA written matches DATA read. - 13. Upon successful completion of this sequence, the Training IP asserts CTRLR\_READY\_OUT and de-asserts CAL\_SELECT in the following clock. - 14. Controller will de-assert CAL\_L\_BUSY (0x1) to indicate Write calibration complete. The following figures show the Write calibration sequence. Figure 3-31. Write And Read Of Known Patterns During Write Calibration Figure 3-32. Write And Read Sequence During Write Calibration Figure 3-33. Write Calibration Sequence Figure 3-34. Write Calibration Complete ### 3.7.3.8 Simple Burst Write (Ask a Question) After Write Calibration, the DDR subsystem is ready for a normal operation. Typically, the controller asserts a signal like CTRLR\_READY to let the host system know that the DDR subsystem is now ready for normal DDR operations. This step describes a simple Burst Write operation along the DFI interface. This step describes how the controller must drive the DFI interface for a simple Burst Write operation and the associated timing parameters. There are two timing parameters associated with the WRITE Command, tphy\_wrdata, and tphy\_wrlat. Figure 3-35 shows how these timing parameters are used by the controller. The controller must be aware, must parameterize and hold constant all of the PHY related timing during operations. The controller must know the PHY protocol related timing defined in the DFI 3.1 specification. For more information on timing, see the DFI 3.1 sequence. Figure 3-35. DFI Write Burst Example In the frequency ratio (4:1) system, the controller must follow the phase relationship and timing. The following examples for CWL = 7 & CWL = 8 where the controller must calculate the phase to drive WRDATA\_EN, WRDATA, and WRDATA\_MASK. If CWL = 7 and DFI\_WRDATA\_EN {P3} are asserted in two DFI clocks (SYS\_CLK) after issuing the Write command. DFI\_WRDATA\_EN {P0, P1, P2} are asserted in the third clock cycle. WRDATA & WRDATA\_MASK are issued in the next phase after the issuing WRDATA\_EN. If CWL = 8 and DFI\_WRDATA\_EN {P0, P1, P2, P3} are asserted in three DFI clocks (SYS\_CLK) after issuing the Write command. WRDATA & WRDATA\_MASK are issued in the next phase after issuance of WRDATA\_EN. The following figures show simple burst write sequence for 16 and 32 bit interfaces. Figure 3-36. Activate And Write Commands On DFI & DRAM (16-Bit Interface) Figure 3-37. WRDATA On DFI & DRAM - DFI 128 Bits (4 X 32) Translate To 16-Bit BL8 On DRAM Figure 3-38. WRDATA On DFI & DRAM - DFI 256 BITS (4 X 64) Translate To 32-Bit BL8 On DRAM # 3.7.3.9 Simple Burst Read (Ask a Question) This step describes simple burst read operation. Similar to WRITE, there are two Timing parameters the controller must be aware of: trddata\_en and tphy\_rdlat. After issuing DFI READ command on control interface, Controller must assert DFI\_RDDATA\_EN after trddata\_en clocks. The PHY responds to the READ command and DFI\_RDDATA\_EN with DFI\_RDDATA and DFI\_RDDATA\_VALID. The following figure shows the sequence of events for a simple burst READ operation. Figure 3-39. DFI Read Burst Example Figure 3-40. RDDATA On DFI - 16-Bit BL8 On DRAM Translate To DFI 128 BITS (4 X 32) Figure 3-41. RDDATA On DFI - 32-Bit BL8 On DRAM Translate To DFI 256 BITS (4 X 64) Data on DFI\_RDDATA is valid only when DFI\_RDDATA\_VALID is asserted. After receiving READ command, PHY responds with Read Data and Read Data Valid signals within (trddata\_en + tphy\_rdlat) cycles. PHY latencies associated with DFI-to-DRAM and DRAM-to-DFI are added to DFI timing, trddata\_en and tphy\_rdlat. For 16-bit interface, 128-bits of Data (16 x 8 (BL8)) is transferred to Controller on a single DFI clock cycle. The timing parameters trddata\_en and tphy\_rdlat together define maximum number of cycles from assertion of READ command to assertion of DFI\_RDDATA\_VALID signal. DFI dictates a timing relationship between DFI\_RDDATA\_EN and DFI\_RDDATA\_VALID specified by tphy\_rdlat. DFI does not dictate exact number of cycles. Typically, DFI\_RDDATA\_VALID can assert any time prior to max delay. DFI\_RDDATA\_VALID is trained and DFI\_RDDATA is valid only when DFI\_RDDATA\_VALID is asserted. ## 3.7.4 Timing (Ask a Question) The following figure shows a typical DDR3 protocol sequence and behavior of DFI and DRAM interfaces. All the DDR protocol timing must be taken care by the controller. PHY is just an I/O and is responsible for timing parameters like setup time, hold time, clock duty cycle, and jitter. The PHY Timing Parameters are defined in the DFI 3.1 specification. The following figure shows the continuous multi-burst operation on WRITE and READ, and includes commands like PRECHARGE and REFRESH. Figure 3-42. Continuous Multi-Burst Write And Read On DFI/DRAM # 3.8 Octal DDR PHY-Only Solution (Ask a Question) Libero SoC Catalog includes the PolarFire Octal DDR PHY IP, that allows the generation of a PHY with following hard and soft blocks: - I/O - IOD logic - PLL - Soft fabric logic The Octal DDR PHY supports the xSPI (JESD251), HyperBUS, and ONFI memories. These memories use an 8-bit serial interface operating Double Data Rate (DDR) mode. These memories are used in applications such as the automotive semiconductor market, Industrial Ethernet, AI, Wearable device market, and IoT market. The Octal DDR PHY-Only solution supports up to 250 MHz operations and is available in GPIO and HSIO banks. The Octal DDR PHY configurator supports up to eight shared parallel interfaces and up to four ranks per interface. The PHY relies on input static delay settings and static timing analysis for interface I/O timing closure. The interface is not intended to be a trained interface. # 3.8.1 Configurator (Ask a Question) The following figure shows the main GUI window of the Octal DDR PHY Configurator. Figure 3-43. Octal DDR PHY Configurator Main GUI The following table lists the Octal DDR PHY Configurator options and parameters. **Table 3-24.** Configurator Options and Parameters | Parameter | Range | Description | |-------------------------------------------|---------------------|-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------| | PROTOCOL | XSPI | Protocol used to build the PHY | | | ONFI | | | | HYPERBUS | | | | APMEMORY_PSRAM | | | CLOCK_FREQ | 1-250 MHz | Memory clock frequency | | CLOCK_TYPE | SINGLE_ENDED | 1 clock | | | DIFFERENTIAL | True differential clock pair | | | PSEUDO_DIFFERENTIAL | Two single ended clocks in phase opposition | | Enable Differential DQS | | This option is available for ONFI, HyperBus, and APMemory PSRAM protocols. When this option is selected, differential DQS bidirectional ports are exposed for DQS IOD. This option is required for interfacing to an external flash or NVDDR RAM at more than 200 MHz. | | NUMBER_OF_DQ | 4 or 8 | Number of DQ bits | | EXPOSE_DQ_RAW_DATA | 0-1 | When 1, this adds I*_DQ_RAW_DATA output to the fabric. Direct connection of DQ input to the fabric bypassing the PHY. It can be used to sample input data in 1-1-1 mode for xSPI. | | FABRIC_CLOCK_RATIO | FULL | In full rate, the fabric run at the memory frequency. | | CCC_PLL_MULTIPLIER | 1-39 | Multiplication factor from the external reference clock to the memory frequency. | | CCC_PLL_REF_FREQ | _ | Resulting reference clock frequency | | DQS_DELAY_TAP | 1 - 255 | Delay tap set on DQS for read.<br>Libero default value of 70 is optimized for 200 MHz<br>performance if PCB routing is closely matched.<br>Value below 20 fails in simulation. | | INTERFACE_NB | 1-8 | Number of shared interface | | RANK_NB | 1-4 | Number of rank per interface | | I{INTERFACE#}R{RANK#}_{IO}_ENAB<br>LED | 0-1 | For all I/O except DQ/DQS: Expose the I/O and corresponding signals. It can be used to disable I/O that are optional in the protocol (RESET_N for instance) or that are shared between interfaces/ranks. This parameter is set using the related I/O Sharing tab in the Configurator. | | I{INTERFACE#}R{RANK#}_{IO}_IN_DQ<br>_LANE | 0-1 | For all I/O except DQ/DQS: Indicates that, the corresponding I/O will be placed in the DQ Lane. P and R will fail if an I/O is placed in DQ/DQS lane if this parameter is not set. By default, we assume all I/O other than DQ/DQS outside of the DQ/DQS lane. This parameter is set using the related I/O Placement tab in the Configurator. | The **I/O Sharing** tab sets the (I{INTERFACE#}R{RANK#}\_{IO}\_ENABLED parameter for CK, CS\_N, INT\_N, and RESET\_N/RSTO pins allowing the user to deselect a port of the interface. Once set, that port would not be in the generated netlist. This is used to deselect the not used in the design or a port that is tied-off (that is, INT# input of HypeRAM), or for multi-rank deselect ports that are shared between ranks. Figure 3-44. I/O Sharing Configurator Tab The I{INTERFACE#}R{RANK#}\_{IO}\_IN\_DQ\_LANE parameter is set with the **I/O Placement** tab in the configurator. It is used to indicate that a CK, CS\_N, and RESET ports are in the DQ lane enabling Libero to correctly optimize the connections. Figure 3-45. I/O Placement Configurator Tab Figure 3-46. Generated Octal DDR PHY SmartDesign Component Table 3-25. Port List | Port | Width | Direction | Visibility | Description | Notes | |----------|-------|-----------|------------|-------------------------------------------------------------------------------------------------------------------------|-------| | General | | | | | | | REF_CLK | 1 | IN | _ | Reference clock<br>to the CCC.<br>It must be at<br>CCC_PLL_REF_FREQ<br>frequency. | _ | | ARST_N | 1 | IN | _ | Asynchronous reset<br>to the PHY. Note<br>that, while ARST_N<br>is asserted, the CCC<br>inside the PHY is<br>powerdown. | _ | | PLL_LOCK | 1 | OUT | _ | Internal CCC PLL lock. | _ | | continued | | | | | • | |-------------------------------------------|-------|-----------|----------------------------------------------------------------------------------------------------------------------------------------------------------|----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|-----------------------------------------------------------| | Port | Width | Direction | Visibility | Description | Notes | | SYS_CLK | 1 | OUT | | System clock. All input/output of the PHY from/to the fabric are synchronous to this clock (unless mentioned otherwise). SYS_CLK is generated by OUT0 of the CCC through CLKINT. | | | READY | 1 | OUT | _ | When high indicates<br>that the PHY<br>initialization is<br>complete and can<br>receive commands. | _ | | Output | | | | Clock can be any of: C, SCK, CK, CLK depending on protocol. | _ | | I{INTERFACE#}R{RANK#<br>}_{CLOCK} | 1 | OUT | I{INTERFACE#}R{RANK#}_{CLOCK}_E<br>NABLED== 1. This parameter is set<br>using the related I/O Sharing tab in<br>the Configurator. | Output pad to the memory, DDR. | CCC OUT1<br>through<br>HS_IO_CLK,<br>90deg phase<br>shift | | I{INTERFACE#}R{RANK#<br>}_{CLOCK}_N | 1 | OUT | I{INTERFACE#}R{RANK#}_{CLOCK}_E<br>NABLED== 1. Protocol dependent.<br>This parameter is set using the<br>related I/O Sharing tab in the<br>Configurator. | Output pad N side to the memory, DDR. | CCC OUT1<br>through<br>HS_IO_CLK,<br>90deg phase<br>shift | | I{INTERFACE#}R{RANK#<br>}_{CLOCK}_DATA | 2 | IN | I{INTERFACE#}R{RANK#}_{CLOCK}_E<br>NABLED== 1. This parameter is set<br>using the related I/O Sharing tab in<br>the Configurator. | <ul> <li>2-bit port.</li> <li>00: Drives 0 out</li> <li>01: Drives clock pattern out</li> <li>10: Drives inverted clock pattern out</li> <li>11: Drives 1 out</li> </ul> | SYS_CLK | | I{INTERFACE#}R{RANK#<br>}_{CLOCK}_OE_DATA | 2 | IN | I{INTERFACE#}R{RANK#}_{CLOCK}_E<br>NABLED == 1. This parameter is set<br>using the related I/O Sharing tab in<br>the Configurator. | Output enable data<br>from the fabric to<br>the I/O pad. See the<br>timing section for<br>the latency through<br>the PHY. | SYS_CLK | | Command | | | | Output can be any<br>of: S_N, CS_N, CE_N,<br>RESET_N, ALE, CLE,<br>WR_N, WP_N. | | | I{INTERFACE#}R{RANK#<br>}_{OUTPUT} | 1 | OUT | I{INTERFACE#}R{RANK#}_{OUTPUT}_<br>ENABLED == 1. This parameter is set<br>using the related I/O Sharing tab in<br>the Configurator. | Output pad to the memory, SDR. | CCC OUT0<br>through<br>HS_IO_CLK | | continued | | | | | | | |--------------------------------------------|-------|-----------|------------------------------------------------------------------------------------------------------------------------------------|-------------------------------------------------------------------------------------------------------------------------------------------------------------------|----------------------------------|--| | Port | Width | Direction | Visibility | Description | Notes | | | I{INTERFACE#}R{RANK#<br>}_{OUTPUT}_DATA | | IN | I{INTERFACE#}R{RANK#}_{OUTPUT}_<br>ENABLED== 1. This parameter is set<br>using the related I/O Sharing tab in<br>the Configurator. | - | SYS_CLK | | | I{INTERFACE#}R{RANK#<br>}_{OUTPUT}_OE_DATA | 1 | IN | I{INTERFACE#}R{RANK#}_{OUTPUT}_<br>ENABLED== 1. This parameter is set<br>using the related I/O Sharing tab in<br>the Configurator. | Output enable data<br>from the fabric to<br>the I/O pad. See the<br>timing section for<br>the latency through<br>the PHY. | SYS_CLK | | | <b>Asynchronous Inputs</b> | | | | | | | | I{INTERFACE#}R{RANK#<br>}_{INPUT} | 1 | IN | I{INTERFACE#}R{RANK#}_{INPUT}_E<br>NABLED == 1. This parameter is set<br>using the related I/O Sharing tab in<br>the Configurator. | Input pad from the memory. | ASYNC | | | I{INTERFACE#}R{RANK#<br>}_{INPUT}_DATA | 1 | OUT | I{INTERFACE#}R{RANK#}_{INPUT}_E<br>NABLED == 1. This parameter is set<br>using the related I/O Sharing tab in<br>the Configurator. | Input data to the fabric. | ASYNC | | | DQS | | | | | | | | I{INTERFACE#}_{DQS} | 1 | INOUT | _ | Inout pad from/to the memory, DDR. | CCC OUT0<br>through<br>HS_IO_CLK | | | I{INTERFACE#}_{DQS}_<br>N | 1 | INOUT | Protocol dependent | Inout pad N side from/to the memory, DDR. | CCC OUT0<br>through<br>HS_IO_CLK | | | I{INTERFACE#}_{DQS}_T<br>X_DATA | 2 | IN | protocol !=XSPI | Output data from<br>the fabric to the<br>I/O pad. See the<br>timing section for<br>the latency through<br>the PHY. | SYS_CLK | | | I{INTERFACE#}_{DQS}_<br>OE_DATA | 2 | IN | _ | Output enable data<br>from the fabric to<br>the I/O pad. See the<br>timing section for<br>the latency through<br>the PHY. | SYS_CLK | | | I{INTERFACE#}_{DQS}_<br>RX_DATA | 1 | OUT | _ | Input data from the I/O pad to the fabric. Asynchronous. It is not gated by the GATE signal. Note that, this signal will toggle when DQS is driven by the fabric. | ASYNC | | | continued | | | | | | |----------------------------------|---------|-----------|-----------------|-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|----------------------------------| | Port | Width | Direction | Visibility | Description | Notes | | I{INTERFACE#}_{DQS}_GATE | | | | DQS gating signal to the PHY. It is used to gate DQS going into the PHY RX FIFO. The gate must be low when the memory does not drive strobes (must be low whenever DQS is 'bZ or one DQS is driven by the FPGA or when DQS from the memory is not strobe. See timing section for the latency. | SYS_CLK | | DQ | | | | | | | I{INTERFACE#}_DQ | 4 or 8 | INOUT | | Inout pad to<br>the memory, DDR<br>INOUT except for<br>XSPI where it is IN<br>DQS can be any<br>of: DQS, DS, and<br>RWDS depending on<br>protocol. | CCC OUTO<br>through<br>HS_IO_CLK | | I{INTERFACE#}_DQ_TX_<br>DATA | 8 or 16 | IN | protocol !=XSPI | Output data from<br>the fabric to the<br>I/O pad. See the<br>timing section for<br>the latency through<br>the PHY. | SYS_CLK | | I{INTERFACE#}_DQ_OE_<br>DATA | 4 or 8 | IN | | Output enable data<br>from the fabric to<br>the I/O pad. See the<br>timing section for<br>the latency through<br>the PHY. | SYS_CLK | | I{INTERFACE#}_DQ_RX_<br>DATA | 8 or 16 | OUT | | Input data from the I/O pad to the fabric. Synchronous. See the timing section for the latency through the PHY. | SYS_CLK | | I{INTERFACE#}_RX_DAT<br>A_VALID | 1 | OUT | _ | Indicate that received data are valid. | SYS_CLK | | I{INTERFACE#}_DQ_RX_<br>RAW_DATA | 4 or 8 | OUT | | Input data from the I/O pad to the fabric. Asynchronous. This port will also toggle when DQ is driven by the fabric. | ASYNC | Figure 3-47. Command Timing Figure 3-48. DQ/DQS Receive **Note:** DQS preamble width differs depending on the protocol and configuration of the memory device. ### 3.8.2 Timing Constraints (Ask a Question) The configurator generates a timing constraint for the internal CCC. Users can add or modify the I/O delay a timing constraint based on the part, and board layout. DQ input delay can be changed using **Libero SoC** > **I/O Editor**. # 3.8.3 Place and Route Pin-Out Rules (Ask a Question) Consider the following points during Place and Route: - I/O Bridging is allowed. - I/Os cannot share a lane with another independent IF. - All DQ must be in the same lane. - DQS/RWDS must be on the DQ lane on the P side I/O with the DQS function. - The N side I/O with the DQS function must not be used. It is re-used internally to create the DQS Gate, and no other signal can be connected to pin. - Addr/command/CLK can be placed anywhere. However, if they are placed in the DQ lane, the parameter I{INTERFACE#}R{RANK#} {IO} IN DQ LANE must be set to one. Asynchronous input can be placed anywhere. **Note:** For a burst of multiple of two strobes cycle, any burst shorted than two cycles will not trigger RX\_DATA\_VALID and corrupt the receive FIFO. Any burst longer than two cycles but not multiple of two will keep RX\_DATA\_VALID high and corrupt the receive FIFO. # 3.9 Implementation (Ask a Question) This section describes how to use the DDR subsystem in the design. It includes information about selecting memory devices and configuring the subsystem to interface with the chosen devices. ### 3.9.1 Selecting the Memory Device (Ask a Question) The Fabric DDR subsystem supports DDR3, DDR3L, DDR4, and LPDDR3 memory devices. The memory that is used with the system depends on the application requirements. No single memory device can fulfill all system requirements; therefore, the trade-offs associated with each memory type must be considered for a balanced design. Consider the following factors while selecting the memory device: - Cost - · Bandwidth and speed - · Power consumption - Bus width - Latency - Capacity The following table lists the memory parameters for DDR3, DDR3L, DDR4, and LPDDR3 memory devices. Based on the system requirements (bandwidth, speed, bus width, density, and cost), the appropriate memory device is selected. **Table 3-26.** Memory Selection Parameters | Parameter | DDR3 | DDR3L | DDR4 | LPDDR3 | |-------------------------|---------------------|----------------------|---------------------|---------| | DDR clock | 666 | 666 | 800 | 666 | | Data width (in bits) | 16, 32, 64 | 16, 32, 64 | 16, 32, 64 | 16, 32 | | Burst length (in bytes) | 8 | 8 | 8 | 8 | | Number of banks | 8 | 8 | 8,16 | 8 | | Density | 512 Mb-8192 Mb | 512 Mb-8192 Mb | 2 Gb-16 Gb | 32 Gb | | I/O standard | SSTL-15 Class I, II | SSTL-135 Class I, II | POD-12 Class I, II* | HSUL-12 | **Note:** HSTL 12 IO standard is used for DDR4 address, command, and control signals. # 3.9.1.1 DDR4 Optimization Guidelines (Ask a Question) The following points provides guidance in selecting the proper speed bin memory component when using the PolarFire DDR PHY with the DDR4 Controller IP from Libero SoC: - 1. The PolarFire DDR4 interface can operate up to 1600 Mbps. To achieve maximum efficiency at this speed, a 2400 Mbps speed bin DDR4 SDRAM must be used. Maximum efficiency is achieved with the tCCD\_L = 4 (4 clock cycles) parameter setting. The following calculations can be used as examples to determine the memory component speed bin to be selected when running at a user's desired memory clock rate: - For DDR4-2400 memory components, tCCD L must be ≥ 5 ns. - 4 clock cycles \* (1/800 MHz) = 5 ns, hence, an 800 MHz memory clock can be used to achieve 1600 Mbps data rate. - For DDR4-2133 memory components, tCCD L must be ≥ 5.355 ns - 4 clock cycles \* (1/746 MHz) = 5.36 ns, hence, a 746 MHz memory clock can be used to achieve 1492 Mbps data rate. - For DDR4-1866 memory components, tCCD\_L must be ≥ 5.355 ns - 4 clock cycles \* (1/746 MHz) = 5.36 ns, hence, a 746 MHz memory clock can be used to achieve 1492 Mbps data rate. - 2. tCCD\_L delay values are obtained from Micron's 8Gb\_DDR4\_SDRAM datasheet. - 3. The tCCD\_L delay values are the same for DDR4-2133 and DDR4-1866. - 4. In the above calculations, a slower memory clock can also be used. These examples may not apply if the embedded DDR PHY is interfaced to a proprietary DDR Controller IP. In this case, the user must develop appropriate set of optimized parameters. ### 3.9.2 Configuring the DDR3 Subsystem (Ask a Question) The PF\_DDR3 macro located in the Libero IP catalog must be instantiated in SmartDesign to access the DDR3 memory from the FPGA fabric. The DDR3 Configurator, shown in Figure 3-51, configures the DDR3 subsystem. It supports the following modes: Preset configuration—allows selection from a list of memory vendors and devices to preset all of the memory initialization and timing parameters into the DDR3 Configurator, as shown in Figure 3-49. **Important:** For DDR3, preset configuration is supported for 1333, 1600, 1866, and 2133 at 666.67 MHz (minimum tCK $\geq$ 1.5 ns). For DDR4, preset configuration is supported for 1600, 1866, 2133, 2400, 2666, 2933, and 3200 at 800 MHz (minimum tCK $\geq$ 1.25 ns). Figure 3-49. DDR3 Configurator—Preset Configuration • User configuration—allows manual configuration of all memory initialization and timing parameters. This can be saved as a preset configuration, as shown in Figure 3-50. **Important:** See the DDR3 vendor datasheets before configuring DDR3 parameters. Figure 3-50. User-Defined Configuration The DDR3 Configurator provides the following configuration options. - 3.9.2. General - 3.9.2. Memory Initialization - 3.9.2. Memory Timing - 3.9.2. Controller - 3.9.2. Miscellaneous # General The following figure shows the **General** tab. Figure 3-51. PF\_DDR3 Configurator—General Tab As shown in the preceding figure, the **General** tab provides the following options. - Top - **Protocol**: By default, this is set to DDR3. - **Generate PHY only**: Enable this option to create a PHY-only DDR3 subsystem. - **Clock**: To set the DDR3 memory clock frequency and user logic clock rate. The user clock frequency is automatically populated. - Memory Clock Frequency (MHz): Valid range 0 to 667 MHz. - CCC PLL Clock Multiplier: Valid range 1 to 49. - CCC PLL Reference Clock Frequency (MHz): Depends upon the setting of CCC PLL Clock Multiplier. - User Logic Clock Rate: QUAD - User Clock Frequency: 166.5 MHz - Topology- To set the topology of the memory system, including the address bits, bank configurations, DM modes, and ECC. - Memory Format: COMPONENT or type of DIMM (UDIMM, RDIMM, LRDIMM, or SODIMM) - **DQ Width**: 16-bit, 32-bit, or 64-bit **Important:** AXI width must be larger than the data width and clock rate. SDRAM Number of Ranks: 1 - Enable address mirroring on odd ranks: Disabled - **DQ/DQS group size**: 8 (fixed) - Row Address width: Value must be within 12-16 - Column Address Width: Value must be within 5-12 Bank Address width: 3 (fixed) - Enable DM: OFF or DM - **Enable Parity/Alert**: When the memory format is RDIMM or LRDIMM, this option can be selected to support the Alert signal from DDR memory when it detects a CA parity error. - Enable ECC: Disabled Number of clock outputs: 1 (fixed) ### **Memory Initialization** The **Memory Initialization** tab sets the DDR3 mode register configuration according to the JEDEC specification. In the Fabric DDR3 subsystem architecture, these parameters are passed to the DDR3 subsystem IP, which then performs the DDR3 initialization sequence and configures the mode registers. The following figure shows the **Memory Initialization** tab. Figure 3-52. Memory Initialization Tab As shown in the preceding figure, the **Memory Initialization** options provides the following options. - Mode Register 0 - Read Burst Type: Sequential or Interleaved - Burst length: Fixed BL8 **Important:** On-the-fly BC4 or BL8 option for burst length mode is not supported for QUAD clock rate. - Memory CAS Latency: Value must be within 5-14 - Mode Register 1 - **ODT Rtt Nominal Value**: Any one of the following options can be selected: - ODT Disabled - RZQ/4 - RZQ/2 - RZQ/6 - RZQ/12 for non-write - RZQ/8 for non-write - **Memory Additive CAS Latency**: Any one of the following options can be selected: - Disabled - CL-1 - CL-2 - **Output Drive Strength**: Any one of the following options can be selected: - RZQ/6 - RZQ/7 - Mode Register 2 - **Self Refresh Temperature**: Any one of the following options can be selected: - Normal - Extended - **Memory Write CAS Latency**: Value must be within 5–12 - **Partial Array Self Refresh**: Any one of the following options can be selected: - Full - Half - Quarter - 1/8 - Half (reversed) - Quarter (reversed) - 1/8 (reversed) - **Dynamic ODT (Rtt\_WR)**: Any one of the following options can be selected: - Dynamic ODT off - RZQ/4 - RZQ/2 # **Memory Timing** The following figure shows the **Memory Timing** tab. Figure 3-53. Memory Timing Tab As shown in the preceding figure, the **Controller** options provides the following options. - Timing parameters dependents on speed bin - tRAS (ns): By default, preset configuration for the MT41K1G8SN-125 part is selected. Preset values for different configurations according to the JEDEC standard are supported. See the speed bin and its corresponding value in the vendor data sheet. - tRCD (ns): By default, preset configuration for the MT41K1G8SN-125 part is selected. Preset values for different configurations according to the JEDEC standard are supported. See the speed bin and its corresponding value in the vendor data sheet. - tRP (ns): By default, preset configuration for the MT41K1G8SN-125 part is selected. Preset values for different configurations according to the JEDEC standard are supported. See the speed bin and its corresponding value in the vendor data sheet. - tRC (ns): By default, preset configuration for the MT41K1G8SN-125 part is selected. Preset values for different configurations according to the JEDEC standard are supported. See the speed bin and its corresponding value in the vendor data sheet. - tWR (ns): By default, preset configuration for the MT41K1G8SN-125 part is selected. Preset values for different configurations according to the JEDEC standard are supported. See the speed bin and its corresponding value in the vendor data sheet. - tFAW (ns): By default, preset configuration for the MT41K1G8SN-125 part is selected. Preset values for different configurations according to the JEDEC standard are supported. See the speed bin and its corresponding value in the vendor data sheet. ### Timing parameters dependents on speed bin and clock frequency - tWTR (cycles): By default, preset configuration for the MT41K1G8SN-125 part is selected. Preset values for different configurations according to the JEDEC standard are supported. See the speed bin and its corresponding value in the vendor data sheet. - tRRD (ns): By default, preset configuration for the MT41K1G8SN-125 part is selected. Preset values for different configurations according to the JEDEC standard are supported. See the speed bin and its corresponding value in the vendor data sheet. - tRTP (ns): By default, preset configuration for the MT41K1G8SN-125 part is selected. Preset values for different configurations according to the JEDEC standard are supported. See the speed bin and its corresponding value in the vendor data sheet. ### Timing parameters dependents on operating condition tREFI (us): By default, preset configuration for the MT41K1G8SN-125 part is selected. Preset values for different configurations according to the JEDEC standard are supported. See the speed bin and its corresponding value in the vendor data sheet. ### Timing parameters dependents on speed bin and page size tRFC (ns): By default, preset configuration for the MT41K1G8SN-125 part is selected. Preset values for different configurations according to the JEDEC standard are supported. See the speed bin and its corresponding value in the vendor data sheet. # Others Timing parameters - tZQinit (cycles): By default, 512 is selected. The value must be within [512, 4095] range. - **ZQ Calibration Type**: Select Short or Long. By default, Short is selected. - **tZQCS (cycles)**: By default, 64 is selected. The value must be within [64, 2047]. - **tZQoper (cycles)**: This value is fixed to 256. - Enable User ZQ Calibration Controls: Enable or Disable. By default, this option is disabled. - Automatic ZQ Calibration Period (us): By default, 200 μs is selected. For every 200 μs, ZQ calibration is triggered based on the ZQCL or ZQCS cycles selected. ### Important: By default, preset configuration for the MT41K1G8SN-125 part is selected. Preset values for different configurations according to the JEDEC standard are supported. See the speed bin and its corresponding value in the vendor datasheet. ### Controller The following figure shows the **Controller** tab. Figure 3-54. Controller Options As shown in the preceding figure, the **Controller** options provides the following options. - **Instance Select**: Enables selection of the DDR3 subsystem instance number that is used to unify the controller instance. The PolarFire family supports a maximum of six DDR3 subsystem instances. - **Instance Number**: Up to eight instances can be generated. - User Interface: - **Fabric Interface**: Options are AMBA AXI3/4 and native interface. The following table lists the supported AXI bus data widths with respect to the DQ widths. Table-3-27. Supported AXI Bus Data Width | DQ Width | Supported AXI Data Bus Width | |----------|------------------------------| | x16 | 64, 128 | | x32 | 64, 128, 256 | | x64 | 512 | Data Bus Turnaround—provides additional bus turnaround time between different SDRAM ranks. The turnaround time is the number of clock cycles it takes to change the access from one rank to another rank for back-to-back memory accesses (read to read, read to write, write to write, and write to read). The AXI interface address is mapped based on the type of the Address Ordering selected in the PF\_DDR3 configurator. For example, if Chip-Row-Bank-Col is selected, and if a row address width and column address width is configured for 13 and 11, the AXI address is mapped as shown in the following table. Table-3-28. AXI Address Mapping | AXI<br>Address | 31 | 30 | 29 | 28 | 27 | 26 | 25 | 24 | 23 | 22 | 21 | 20 | 19 | 18 | 17 | 16 | 15 | 14 | 13 | 12 | 11 | 10 | 9 | 8 | 7 | 6 | 5 | 4 | 3 | 2 | 1 0 | |-------------------|----|----|----|-----|-----|-----|----|----|----|----|----|----|----|----|----|----|-----|-----|-----|-----|----|----|------------|----|----|----|----|----|----|----|-----| | Column<br>Address | | | | | | | | | | | | | | | | | | | | C10 | C9 | C8 | <b>C</b> 7 | C6 | C5 | C4 | С3 | C2 | C1 | C0 | | | Bank<br>Address | | | | | | | | | | | | | | | | | BA2 | BA1 | BA0 | | | | | | | | | | | | | | Row<br>Address | | | | R12 | R11 | R10 | R9 | R8 | R7 | R6 | R5 | R4 | R3 | R2 | R1 | R0 | | | | | | | | | | | | | | | | **Important:** The address mapping shown in the preceding table is applicable to Native interface also. - AXI Width: 64, 128, 256, or 512AXI ID Width: Valid range is 1-8 - **Efficiency**: Provides control over memory refresh, precharge, and address ordering options. Queue depth can be set to 3 or 4 for DDR3 using the Command queue depth option on DDR3 Configurator > Controller tab. - Enable Activate/Precharge look-head: Check box toggle option - Command queue depth: 3 or 4 - **Enable User Refresh Controls**: Check box toggle option - **Address Ordering**: Any one of the following options is supported: - Chip-Bank-Row-Col - Chip-Row-Bank-Col - Misc - **Enable RE-INIT Controls**: When selected, the configurator exposes the CTRLR\_INIT signal. Initialization begins when a low-to-high transition is detected on this signal. - **ODT Activation Setting on Write**: Provides options to enable ODT on write operations. - **Enable Rank0 ODT0**: By default, this option is enabled and can be disabled. - Enable Rank0 ODT1: Not supported - Enable Rank1 ODT0: Not supported - **Enable Rank1 ODT1**: By default, this option is enabled and cannot be disabled. - **ODT Activation Setting on Read**: Provides options to enable ODT on read operations. - **Enable Rank0 ODT0**: By default, this option is disabled and can be enabled. - Enable Rank0 ODT1: Not supported - Enable Rank1 ODT0: Not supported - Enable Rank1 ODT1: Not supported #### Miscellaneous The following figure shows the **Misc.** tab. Figure 3-55. Misc. Tab The **Misc.** tab is used to select the simulation mode and expose the debug interface. As shown in the preceding figure, the **Misc.** options provides the following options. ## Simulation Options - Simulation mode: - Fast (skip training and settling time): In this mode, the DDR3 subsystem skips the DDR3 training sequence and asserts CTRLR\_READY without waiting for the settling time. CTRLR READY is asserted within 8 µs. - **Training (skip settling time)**: In this mode, the DDR3 subsystem performs the DDR3 training sequence but asserts CTRLR\_READY without waiting for settling time. CTRLR\_READY is asserted within 240 µs. - **Full (training and settling time)**: In this mode, the DDR3 subsystem performs the DDR3 training sequence and also waits for settling time before asserting CTRLR\_READY. In this mode, CTRLR\_READY is asserted within 10 ms. ## Throughput Options Pipe Lining: Adds pipeline registers to Training IP (TIP) write and read data path to improve static timing closure. ## 3.9.3 Configuring the LPDDR3 Subsystem (Ask a Question) The PF\_LPDDR3 macro located in the Libero IP catalog must be instantiated in SmartDesign to access the LPDDR3 memory from the FPGA fabric. The following figure shows the LPDDR3 Configurator that configures the LPDDR3 subsystem. It supports the following modes: Preset configuration—Allows selection from a list of memory vendors and devices to preset all of the memory initialization and timing parameters into the DDR Configurator, as shown in Figure 3-56. **Important:** For LPDDR3, preset configuration is supported for 1333 at 666.666 MHz (minimum tCK $\geq$ 1.5 ns). Figure 3-56. LPDDR3 Configurator—Preset Configuration User configuration—Allows manual configuration of all memory initialization and timing parameters. This can be saved as a preset configuration, as shown in Figure 3-56. See the DDR vendor datasheets before configuring DDR parameters. The LPDDR3 Configurator provides the following options. - 3.9.3. General - 3.9.3. Memory Initialization - 3.9.3. Memory Timing - 3.9.3. Controller - 3.9.3. Miscellaneous #### General The following figure shows the **General** tab. Figure 3-57. LPDDR3 General Tab As shown in the preceding figure, the **General** tab provides the following options. - Top - **Protocol**: By default, this is set to LPDDR3. - **Generate PHY only**: Enable this option to create a PHY-only LPDDR3 subsystem. - **Clock**: To set the LPDDR3 memory clock frequency and user logic clock rate. The user clock frequency is automatically populated. - Memory Clock Frequency (MHz): Valid range is 400–667 MHz - CCC PLL Clock Multiplier: Valid range 1 to 49 - CCC PLL Reference Clock Frequency (MHz): Depends on the setting of CCC PLL Clock Multiplier. - User Logic Clock Rate: QUAD - User Clock Frequency: 166.5 MHz - **CK/CA Additive Offset**: The value must be within [0, 7] range - **Topology**: To set the topology of the memory system, including the address bits, bank configurations, DM modes, and ECC. - Memory Format: COMPONENT - **DQ Width**: Select any one of the following options: - 16 - 32 - SDRAM Number of Ranks: 1 - DQ/DQS group size: 8 - **Row Address width**: The value must be within [14, 15] range - Column Address Width: The value must be within [10, 12] range - Bank Address width: 3 - Enable DM: Select any one of the following options: - OFF - DM - Number of clock outputs: 1 #### **Memory Initialization** The following figure shows the **Memory Initialization** tab. Figure 3-58. LPDDR3 Memory Initialization Tab As shown in the preceding figure, the **Memory Initialization** tab provides the following options. - Mode Register 2 - **Data Latency (#RL, #WL)**: Select any one of the following options: - (6, 3) - (8, 4) - (9, 5) - (10, 6) - (11, 6) - (12, 6) By Default, (10, 6) is selected. - Mode Register 3 - **Output Drive Strength**: Select any one of the following options: - 34.3 typical pull-down/pull-up - 40 typical pull-down/pull-up - 48 typical pull-down/pull-up - 34.3 typical pull-down, 40 typical pull-up - 40 typical pull-down, 48 typical pull-up - 34.3 typical pull-down, 48 typical pull-up - Mode Register 11 - **DQ ODT**: Select any one of the following options: - Disable - RZQ/4 - RZQ/2 - RZQ/1 - **Power down ODT**: Disabled or Enabled # **Memory Timing** The following figure shows the **Memory Timing** tab. Figure 3-59. LPDDR3 Memory Timing Tab As shown in the preceding figure, the **Memory Timing** tab provides the following options. ## · Timing parameters dependents on speed bin - **tRAS (ns)**: Preset values for different configurations according to the JEDEC standard are supported. See the speed bin and its corresponding value in the vendor datasheet. - **tRCD (ns)**: Preset values for different configurations according to the JEDEC standard are supported. See the speed bin and its corresponding value in the vendor datasheet. - **tRP (ns)**: Preset values for different configurations according to the JEDEC standard are supported. See the speed bin and its corresponding value in the vendor datasheet. - **tRC (ns)**: Preset values for different configurations according to the JEDEC standard are supported. See the speed bin and its corresponding value in the vendor datasheet. - **tWR (ns)**: Preset values for different configurations according to the JEDEC standard are supported. See the speed bin and its corresponding value in the vendor datasheet. - **tRP (ns)**: Preset values for different configurations according to the JEDEC standard are supported. See the speed bin and its corresponding value in the vendor datasheet. - **tFAW(ns)**: Preset values for different configurations according to the JEDEC standard are supported. See the speed bin and its corresponding value in the vendor datasheet. - **tMRR (cycles)**: Preset values for different configurations according to the JEDEC standard are supported. See the speed bin and its corresponding value in the vendor datasheet. - **tMRW (cycles)**: Preset values for different configurations according to the JEDEC standard are supported. See the speed bin and its corresponding value in the vendor datasheet. ## Timing parameters dependents on speed bin and clock frequency - **tWTR (cycles)**: Preset values for different configurations according to the JEDEC standard are supported. See the speed bin and its corresponding value in the vendor datasheet. - **tRRD (ns)**: Preset values for different configurations according to the JEDEC standard are supported. See the speed bin and its corresponding value in the vendor datasheet. - **tRTP (ns)**: Preset values for different configurations according to the JEDEC standard are supported. See the speed bin and its corresponding value in the vendor datasheet. #### Timing parameters dependents on operating condition - **tREFI (us)**: Preset values for different configurations according to the JEDEC standard are supported. See the speed bin and its corresponding value in the vendor datasheet. #### Timing parameters dependents on speed bin and page size - **tRFC (ns)**: Preset values for different configurations according to the JEDEC standard are supported. See the speed bin and its corresponding value in the vendor datasheet. ## Others Timing parameters - tZQinit (us): By default, 1 is selected. The value must be within [1, 6] range. - **ZQ Calibration Type**: Select Short or Long. By default, Short is selected. - tZQCS (ns): By default, 90 is selected. The value must be within 2047. - tZQCL (ns): This value is fixed to 360. - **tZQRESET (ns)**: By default, 50 is selected. The value must be within 2047. - Enable User ZQ Calibration Controls: Select this check box for using user-defined ZQ Calibration Controls - Automatic ZQ Calibration Period (us): By default, 200 μs is selected. For every 200 μs, ZQ calibration is triggered based on the ZQCL or ZQCS cycles selected. ## Important: By default, preset configuration is selected. Preset values for different configurations according to the JEDEC standard are supported. See the speed bin and its corresponding value in the vendor datasheet. ## Controller The following figure shows the **Controller** tab. Figure 3-60. LPDDR3 Controller Tab As shown in the preceding figure, the **Controller** tab provides the following options. - Instance Select: Enables the option to select of the LPDDR3 subsystem instance number that is used to unify the controller instance. PolarFire family supports a maximum of six LPDDR3 subsystem instances. - Instance Number: - User Interface - **Fabric Interface**: Options are AMBA AXI3/4 and native interface. The following table lists the supported AXI bus data widths with respect to the DQ widths. Table-3-29. Supported AXI Bus Data Width | DQ Width | Supported AXI Data Bus Width | |----------|------------------------------| | x16 | 64, 128 | | x32 | 64, 128, 256 | | x64 | 512 | Data Bus Turnaround—provides additional bus turnaround time between different SDRAM ranks. The turnaround time is the number of clock cycles it takes to change the access from one rank to another rank for back-to-back memory accesses (read to read, read to write, write to write, and write to read). The AXI interface address is mapped based on the type of the Address Ordering selected in the PF\_LPDDR3 configurator. For example, if Chip-Row-Bank-Col is selected, and if a row address width and column address width is configured for 13 and 11, the AXI address is mapped as shown in the following table. Table-3-30. AXI Address Mapping | AXI<br>Address | 31 | 30 | 29 | 28 | 27 | 26 | 25 | 24 | 23 | 22 | 21 | 20 | 19 | 18 | 17 | 16 | 15 | 14 | 13 | 12 | 11 | 10 | 9 | 8 | 7 | 6 | 5 | 4 | 3 | 2 | 1 0 | |-------------------|----|----|----|-----|-----|-----|----|----|----|----|----|----|----|----|----|----|-----|-----|-----|-----|----|----|------------|----|----|----|----|----|----|----|-----| | Column<br>Address | | | | | | | | | | | | | | | | | | | | C10 | C9 | C8 | <b>C</b> 7 | C6 | C5 | C4 | С3 | C2 | C1 | C0 | | | Bank<br>Address | | | | | | | | | | | | | | | | | BA2 | BA1 | BA0 | | | | | | | | | | | | | | Row<br>Address | | | | R12 | R11 | R10 | R9 | R8 | R7 | R6 | R5 | R4 | R3 | R2 | R1 | R0 | | | | | | | | | | | | | | | | **Important:** The address mapping shown in the preceding table is applicable to Native interface also. - AXI Width: 64, 128, or 256 - AXI ID Width: Valid range is 1–8 - **Efficiency**: provides control over memory refresh, precharge, and address ordering options. Queue depth can be set to 3 or 4 for LPDDR3 using the Command queue depth option on **LPDDR3 Configurator > Controller** tab. - Enable Activate/Precharge look-head: Check box toggle option - Enable User Refresh Controls: Check box toggle option - **Address Ordering**: Any one of the following options is supported: - · Chip-Bank-Row-Col - Chip-Row-Bank-Col - Misc - Enable RE-INIT Controls: When selected, the configurator exposes the CTRLR\_INIT signal. Initialization begins when a low-to-high transition is detected on this signal. - ODT Activation Setting on Write: Provides options to enable ODT on write operations. - Enable Rank0 ODT0: Supported - Enable Rank0 ODT1: Not supported - Enable Rank1 ODT0: Not supported - Enable Rank1 ODT1: Not supported - **ODT Activation Setting on Read**: Provides options to enable ODT on read operations. - Enable Rank0 ODT0: Supported - Enable Rank0 ODT1: Not supported - Enable Rank1 ODT0: Not supported - Enable Rank1 ODT1: Not supported - Low Power - **Enable self request refresh control**: Use this option to save power. ## Miscellaneous The **Misc.** tab provides the following options. Figure 3-61. LPDDR3 Simulation Tab As shown in the preceding figure, the **Misc.** tab provides the following options. ## Simulation Options - Simulation mode: - Fast (skip training and settling time): In this mode, the LPDDR3 subsystem skips the LPDDR3 training sequence and asserts CTRLR\_READY without waiting for the settling time. CTRLR\_READY is asserted within 8 µs. - **Training (skip settling time)**: In this mode, the LPDDR3 subsystem performs the LPDDR3 training sequence but asserts CTRLR\_READY without waiting for settling time. CTRLR READY is asserted within 240 µs. - Full (training and settling time): In this mode, the LPDDR3 subsystem performs the LPDDR3 training sequence and also waits for settling time before asserting CTRLR\_READY. In this mode, CTRLR\_READY is asserted within 10 ms. #### Throughput Options - **Pipe Lining**: Adds pipeline registers to Training IP (TIP) write and read data path to improve static timing closure. #### 3.9.4 Configuring the DDR4 Subsystem (Ask a Question) The PF\_DDR4 macro located in the Libero IP catalog must be instantiated in SmartDesign to access the DDR memory from the FPGA fabric. The following figure shows the DDR4 Configurator that configures the DDR4 subsystem. It supports the following modes: Preset configuration—allows selection from a list of memory vendors and devices to preset all of the memory initialization and timing parameters into the DDR Configurator, as shown in Figure 3-62. **Important:** For DDR4, preset configuration is supported for 1600, 1866, 2133, 2400, 2666, 2933, and 3200 at 800 MHz (minimum tCK $\geq$ 1.25 ns). Figure 3-62. PF\_DDR4 Configurator—Preset Configuration User configuration—allows manual configuration of all memory initialization and timing parameters. This can be saved as a preset configuration, as shown in the preceding. See the DDR4 vendor datasheets before configuring DDR4 parameters. The DDR4 Configurator provides the following configuration. - 3.9.4. General - 3.9.4. Memory Initialization - 3.9.4. Memory Timing - 3.9.4. Controller - 3.9.4. Miscellaneous #### General The following figure shows the **General** tab. Figure 3-63. DDR4 General Tab As shown in the preceding figure, the **General** tab provides the following options. - Top - **Protocol**: By default, this is set to DDR4. - **Generate PHY only**: Enable this option to create a PHY-only DDR4 subsystem. - **Clock**: To set the DDR4 memory clock frequency and user logic clock rate. The user clock frequency is automatically populated. - Memory Clock Frequency (MHz): By default, the clock frequency is set to 800 MHz. The value must be within 0–800 MHz - CCC PLL Clock Multiplier: Valid range is 1–49 - CCC PLL Reference Clock Frequency (MHz): Depends upon Memory Clock Frequency (MHz) and CCC PLL Clock Multiplier. The value is 200 MHz when memory frequency is 800 MHz and clock multiplier is 4. - User Logic Clock Rate: QUAD - User Clock Frequency: Depends upon Memory Clock Frequency (MHz) and CCC PLL Clock Multiplier. The value is 200 MHz when memory frequency is 800 MHz and clock multiplier is 4. - **Topology**: To set the topology of the memory system, including the address bits, bank configurations, DM modes, and ECC. - Memory Format: COMPONENT or type of DIMM (UDIMM, RDIMM, LRDIMM, or SODIMM) - **DQ Width**: 16-bit, 32-bit, or 64-bit **Important:** AXI width must be larger than the data width and clock rate. - SDRAM Number of Ranks: 1 - Enable Address mirroring on odd ranks: Disabled - DQ/DQS group size: 8 - **Row Address width**: The value must be within [13, 17] range - Column Address Width: 10 - Bank Address width: 2 - Bank Group Address width: The value must be within [1, 2] range - Enable DM: OFF or DM (enabled) - **Enable Parity/Alert**: When the memory format is RDIMM or LRDIMM, this option can be selected to support the Alert signal from DDR memory when it detects a CA parity error. - Enable ECC: Check box toggle option - Number of clock outputs: 1 #### **Memory Initialization** The following figure shows the **Memory Initialization** tab. Figure 3-64. DDR4 Memory Initialization Tab As shown in the preceding figure, the **Memory Initialization** tab provides the following options. - Mode Register 0 - Burst length: Fixed BL8 or On-the-fly BC4 or BL8 **Important:** On-the-fly BC4 or BL8 option for burst length mode is not supported for QUAD clock rate. - Read Burst Type: Sequential or Interleaved - Memory CAS Latency: Value must be within 9–32 - Mode Register 1 - **ODT Rtt Nominal Value**: Any one of the following options can be selected: - Disabled - RZQ/4 - RZQ/2 - RZQ/6 - RZQ/1 - RZQ/5 - RZQ/3 - RZQ/7 - **Memory Additive CAS Latency**: Any one of the following option can be selected: - Disabled - CL-1 - CL-2 - Output Drive Strength: RZQ/7 or RZQ/5 - Mode Register 2 - **Low Power Auto Self Refresh**: Any one of the following options can be selected: - Manual Normal - · Manual Reduced - · Manual Extended - Automatic - **Memory Write CAS Latency**: Any one of the following options can be selected: - 9 - 10 - 11 - 12 - 14 - 16 - 18 - 20 - **Dynamic ODT (Rtt\_WR)**: Any one of the following option can be selected: - Disabled - RZQ/2 - RZQ/1 - High-Z - RZQ/3 - Mode Register 3 - **Fine Granularity Refresh Mode**: Any one of the following options can be selected: - Normal mode (Fixed 1x) - Fixed 2x - Fixed 4x - On-the-fly 1x/2x - On-the-fly 1x/4x - Mode Register 4 - Temperature Refresh Range: Normal or Extended - Temperature Refresh Mode: Disabled or Enabled - Internal VRef Monitor: Disabled or Enabled - **Self Refresh Abort Mode**: Disabled or Enabled - **Read Preamble**: 1CK or 2CK - Write Preamble: 1CK - Mode Register 5 - **CA Parity Latency Mode**: Any one of the following options can be selected: - Disabled - 4 clocks - 5 clocks - 6 clocks - 8 clocks - ODT Input Buffer for Power-down: Disabled or Enabled - Park ODT Value(Rtt\_Park): Any one of the following options can be selected: - Disabled - RZQ/4 - RZQ/2 - RZQ/6 - RZQ/1 - RZQ/5 - RZQ/3 - RZQ/7 - Mode Register 6 - **Vref Calibration Range**: Range1 (60%–92.5%) - **Vref Calibration Value**: 70.40 ## **Memory Timing** The following figure shows the **Memory Timing** tab. Figure 3-65. DDR4 Memory Timing Tab As shown in the preceding figure, the **Memory Timing** options provides the following options. #### Timing parameters dependents on speed bin - tRAS (ns): By default, preset configuration for the MT40A1G8WE-083E part is selected. Preset values for different configurations according to the JEDEC standard are supported. See the speed bin and its corresponding value in the vendor datasheet. - tRCD (ns): By default, preset configuration for the MT40A1G8WE-083E part is selected. Preset values for different configurations according to the JEDEC standard are supported. See the speed bin and its corresponding value in the vendor datasheet. - tRP (ns): By default, preset configuration for the MT40A1G8WE-083E part is selected. Preset values for different configurations according to the JEDEC standard are supported. See the speed bin and its corresponding value in the vendor datasheet. - tRC (ns): By default, preset configuration for the MT40A1G8WE-083E part is selected. Preset values for different configurations according to the JEDEC standard are supported. See the speed bin and its corresponding value in the vendor datasheet. - tWR (ns): By default, preset configuration for the MT40A1G8WE-083E part is selected. Preset values for different configurations according to the JEDEC standard are supported. See the speed bin and its corresponding value in the vendor datasheet. - tCCD\_L (cycles): By default, preset configuration for the MT40A1G8WE-083E part is selected. Preset values for different configurations according to the JEDEC standard are supported. See the speed bin and its corresponding value in the vendor datasheet. - tCCD\_S (cycles): By default, preset configuration for the MT40A1G8WE-083E part is selected. Preset values for different configurations according to the JEDEC standard are supported. See the speed bin and its corresponding value in the vendor datasheet. ## · Timing parameters dependents on operating condition tREFI (us): By default, preset configuration for the MT40A1G8WE-083E part is selected. Preset values for different configurations according to the JEDEC standard are supported. See the speed bin and its corresponding value in the vendor datasheet. ## · Timing parameters dependents on speed bin and page size - tRFC (ns): By default, preset configuration for the MT40A1G8WE-083E part is selected. Preset values for different configurations according to the JEDEC standard are supported. See the speed bin and its corresponding value in the vendor datasheet. - tFAW (ns): By default, preset configuration for the MT40A1G8WE-083E part is selected. Preset values for different configurations according to the JEDEC standard are supported. See the speed bin and its corresponding value in the vendor datasheet. #### Timing parameters dependents on speed bin and clock frequency - tWTR\_L (cycles): By default, preset configuration for the MT40A1G8WE-083E part is selected. Preset values for different configurations according to the JEDEC standard are supported. See the speed bin and its corresponding value in the vendor datasheet. - tWTR\_S (cycles): By default, preset configuration for the MT40A1G8WE-083E part is selected. Preset values for different configurations according to the JEDEC standard are supported. See the speed bin and its corresponding value in the vendor datasheet. - tRRD\_L (cycles): By default, preset configuration for the MT40A1G8WE-083E part is selected. Preset values for different configurations according to the JEDEC standard are supported. See the speed bin and its corresponding value in the vendor datasheet. - tRRD\_S (cycles): By default, preset configuration for the MT40A1G8WE-083E part is selected. Preset values for different configurations according to the JEDEC standard are supported. See the speed bin and its corresponding value in the vendor datasheet. tRTP (ns): By default, preset configuration for the MT40A1G8WE-083E part is selected. Preset values for different configurations according to the JEDEC standard are supported. See the speed bin and its corresponding value in the vendor datasheet. ## Others Timing parameters - tZQinit (cycles): By default, 1024 is selected. The value must be within [1024, 4095] range - **ZQ Calibration Type**: Short or Long - **tZQCS (cycles)**: By default, 128 is selected. The value must be within [128, 2047] - tZQoper (cycles): This value is fixed to 512. - **Enable User ZQ Calibration Controls**: Enable or Disable. By default, this option is disabled. - Automatic ZQ Calibration Period (us): By default, 200 μs is selected. For every 200 μs, ZQ calibration is triggered based on the ZQCL or ZQCS cycles selected. #### Important: By default, preset configuration for the MT40A1G8WE-083E part is selected. Preset values for different configurations according to the JEDEC standard are supported. See the speed bin and its corresponding value in the vendor datasheet. #### Controller The following figure shows the **Controller** tab. Figure 3-66. DDR4 Controller Tab As shown in the preceding figure, the **Controller** tab provides the following options. - **Instance Select**: Enables the option to select the DDR4 subsystem instance number that is used to unify the controller instance. PolarFire family supports a maximum of six DDR4 subsystem instances. - Instance Number: - User Interface - **Fabric Interface**: options are AMBA AXI3/4 and native interface. The following table lists the supported AXI bus data widths with respect to the DQ widths. Table-3-31. Supported AXI Bus Data Width | DQ Width | Supported AXI Data Bus Width | |----------|------------------------------| | x16 | 64, 128 | | x32 | 64, 128, 256 | | x64 | 512 | Data Bus Turnaround—provides additional bus turnaround time between different SDRAM ranks. The turnaround time is the number of clock cycles it takes to change the access from one rank to another rank for back-to-back memory accesses (read to read, read to write, write to write, and write to read). The AXI interface address is mapped based on the type of the Address Ordering selected in the PF\_DDR4 configurator. For example, if Chip-Row-Bank-Col is selected, and if a row address width and column address width is configured for 13 and 11, the AXI address is mapped as shown in the following table. Table-3-32. AXI Address Mapping | AXI<br>Address | 31 | 30 | 29 | 28 | 27 | 26 | 25 | 24 | 23 | 22 | 21 | 20 | 19 | 18 | 17 | 16 | 15 | 14 | 13 | 12 | 11 | 10 | 9 | 8 | 7 | 6 | 5 | 4 | 3 | 2 | 1 0 | |-------------------|----|----|----|-----|-----|-----|----|----|----|----|----|----|----|----|----|----|-----|-----|-----|-----|----|----|----|----|----|----|----|----|----|----|-----| | Column<br>Address | | | | | | | | | | | | | | | | | | | | C10 | C9 | C8 | C7 | C6 | C5 | C4 | С3 | C2 | C1 | C0 | | | Bank<br>Address | | | | | | | | | | | | | | | | | BA2 | BA1 | BA0 | | | | | | | | | | | | | | Row<br>Address | | | | R12 | R11 | R10 | R9 | R8 | R7 | R6 | R5 | R4 | R3 | R2 | R1 | R0 | | | | | | | | | | | | | | | | **Important:** The address mapping shown in the preceding table is applicable to Native interface also. - AXI Width: 64, 128, 256, or 512AXI ID Width: Valid range is 1-8 - **Efficiency**: Provides control over memory refresh, precharge, and address ordering options. Queue depth can be set to 3 or 4 for DDR4 using the Command queue depth option on **DDR4 Configurator > Controller** tab. - Enable Activate/Precharge look-head: Check box toggle option - Number of Banks to lookup: 4, 8, or 16 - **Address Ordering**: Any one of the following options can be selected: - · Chip-Row-BG-Bank-Col - · Chip-BG-Bank-Row-Col - Chip-Row-BG1-Bank-Col-BG0 **Important:** Number of columns to lookup must be must be 8 or 16 when address ordering is set toChip-Row-BG1-Bank-Col-BG0. - **ODT Activation Setting on Write**: provides options to enable ODT on write operations. - **Enable Rank0 ODT0**: Enabled by default. Can Disabled or enabled. - Enable Rank0 ODT1: Not supported - Enable Rank1 ODT0: Not supported - Enable Rank1 ODT1: Enabled (fixed). - **ODT Activation Setting on Read**: provides options to enable ODT on read operations. - Enable Rank0 ODT0: Enabled or Disabled - Enable Rank0 ODT1: Not supported - Enable Rank1 ODT0: Not supported - Enable Rank1 ODT1: Not supported - Low Power - Enable self request refresh control: When selected, this option saves power - Misc - Enable RE-INIT Controls: When selected, the configurator exposes the CTRLR\_INIT signal. Initialization begins when a low-to-high transition is detected on this signal. ## **DDR4 Additional Controller Options** For DDR4, the **Controller** tab includes the following additional settings under **Efficiency**. The Address Ordering includes the additional setting Chip-Row-BG1-Bank-Col-BG0. This selection will map the LSBs of an address to the BG0 field of the memory. This results in alternating Bank Groups during a burst operation based on the tCCD\_S timing parameter (see memory device datasheet for tCCD\_S settings). The tCCD\_S parameter is the timing parameter which results in a better efficient operation for the DDR IP Configurator in the DDR4 mode. Figure 3-67. Address Ordering—Chip-Row-BG1-Bank-Col-BG0 The Number of banks to look up option includes additional selections for 8 and 16. The default selection is 4. This must be set to the number of banks in a Bank Group of the DDR4 memory device being used. Selecting a higher value is more challenging in terms if timing closure on the controller. A setting of 8 works for all but the most congested designs. A setting of 16 works for designs that have 15% timing margin or more. Figure 3-68. Number of Banks to Look Up #### Miscellaneous The following figure shows the Misc. tab. Figure 3-69. DDR4 Simulation Tab As shown in the preceding figure, the **Misc.** options provides the following options. - Simulation Options - Simulation mode: - Fast (skip training and settling time): In this mode, the DDR4 subsystem skips the DDR4 training sequence and asserts CTRLR\_READY without waiting for the settling time. CTRLR\_READY is asserted within 8 µs. - Training (skip settling time): In this mode, the DDR4 subsystem performs the DDR4 training sequence but asserts CTRLR\_READY without waiting for settling time. CTRLR\_READY is asserted within 240 µs. • Full (training and settling time): In this mode, the DDR4 subsystem performs the DDR4 training sequence and also waits for settling time before asserting CTRLR\_READY. In this mode, CTRLR\_READY is asserted within 10 ms. #### Throughput Options - **Pipe Lining**: Adds pipeline registers to Training IP (TIP) write and read data path to improve static timing closure. ## 3.9.5 Simulating the DDR Subsystem (Ask a Question) Libero SoC provides a simulation model for the DDR subsystem, which includes a training sequence (three simulation modes are supported). For simulating the DDR subsystem, the DDR DRAM interface must be connected to a third-party DDR memory simulation model in a testbench. Memory vendors such as Micron, Samsung, and Hynix provide downloadable, JEDEC-compliant simulation models for memory devices. For information about setting up and running the simulation, see 3.10.1. Accessing DDR Subsystem Through AXI4 Interface. **Note:** SmartDesign Testbench does not support instantiation of SystemVerilog testbenches for DDR models. To use SystemVerilog, create an HDL testbench instead of a SmartDesign testbench, and select **System Verilog** in Libero Project settings. ## 3.9.6 Design Constraints (Ask a Question) This section describes physical constraints and timing constraints of the DDR subsystem. #### 3.9.6.1 Timing Constraints (Ask a Question) The REF\_CLK, HS\_IO\_CLK, HS\_IO\_CLK\_270, and SYS\_CLK clocks generated using the dedicated PLL require timing constraints for synthesis, place and route, and timing verification. To generate these timing constraints, select the **Timing** tab in **Constraint Manager**, and click **Derive Constraints** as shown in the following figure. Figure 3-70. Constraint Manager—Derive Constraints Tab When prompted, click **Yes** to apply the derived constraints for synthesis, place and route, and timing verification. #### 3.9.6.2 Physical Constraints (Ask a Question) DDR memory controllers can be placed only in the pre-defined locations that are optimized for maximum performance. The maximum width of the DDR memory varies based on die/package combination and placement. For information about the DDR placement, see the respective PPATs as follows: - PolarFire Package Pin Assignment Tables (PPAT) - PolarFire SoC Package Pin Assignment Tables (PPAT) The DDR subsystem location is selected using the Physical Design Constraint (PDC) or the I/O Editor. Libero assigns the I/Os based on the selection of the DDR memory location. Before placing the DDR subsystem to the required I/O Bank location, the VDDI and VREF of that I/O Bank must be set to the appropriate value. In I/O Editor, the VDDI and VREF are set using the I/O Bank Settings option. ### 3.9.6.2.1 Using PDC (Ask a Question) To select the DDR subsystem location, add the following PDC constraint to the floor planner constraints: ``` set_location -inst_name <memory component inst name> -location <edge>_<anchor> ``` For example, set location -inst name {DDR3 TOP/DDR3 0} -location {NORTH NE}. The number of I/Os per bank varies from package to package; therefore, the maximum DDR memory width supported by each bank is not same for all packages in the PolarFire family. For more information, see the respective user guides as follows: - UG0722: PolarFire FPGA Packaging and Pin Descriptions User Guide - UG0902: PolarFire SoC FPGA Packaging and Pin Descriptions User Guide ## 3.9.6.2.2 Using I/O Editor (Ask a Question) To select the DDR subsystem location, follow these steps: 1. Click Design Flow > Constraint Manager > I/O Attributes > Edit with I/O Editor. Figure 3-71. I/O Editor 2. Select the **Memory View** tab in **I/O Editor**. Select the appropriate **Memory Type** to get the valid locations for the memory type. Figure 3-72. I/O Editor—Memory View **Important:** The difference between DDR3 and DDR3L is the I/O standard, SSTL15 and SSTL135 I/O. I/O editor allows to select SSTL15I, SSTL135I, SSTL135I, SSTL135II I/O standards. User needs to select SSTL135 I/Os in the I/O editor for DDR3L configuration. 3. Select the appropriate DDR instance from the **Design View** window, and drag it onto **Port Function**. For the selected DDR instance, the I/O Editor shows the valid locations in green. Figure 3-73. I/O Editor—Valid DDR Locations In **I/O Editor**, **Design View** window, the DDR instances that are assigned and locked are shown in green. **I/O Editor** does not assign a location if the selected location is unavailable, has insufficient resources or is impacted by any other issue. If an issue is detected, **I/O Editor** displays an error. Figure 3-74. I/O Editor—Assigned DDR Location To turn ODT on for a DQ and DQS, select the **Port View** tab, select the signal, and select **ON** from the **On Die Termination** list, as shown in the following figure. **Important:** $60\Omega$ ODT is the recommended ODT setting for DDR4 DIMM/on-board DDR4 component. $30\Omega$ ODT is the recommended ODT setting for on-board DDR3 component. $60\Omega$ ODT is the recommended ODT setting for DDR3 DIMM. Figure 3-75. Setting ODT **Note:** If VREF is not specified, the internal VREF is used. When external VREF is used, any I/O pin available on the same bank can be selected as VREF using the I/O Editor. If the DDR Subsystem uses two banks, two external VREF pins are required (one per each bank). Microchip recommends using the internal VREF. If there is DDR, QDR, and DLL in a design, be aware of the following DLL placement rules: - If DDR/QDR is placed in NORTH pinouts, the DLL cannot be placed in the location 2,377 (DLL0\_NW) - If DDR/QDR is placed in SOUTH pinouts, the DLL cannot be placed in the location 2462,5 (DLL0\_SE) - If DDR/QDR is placed in WEST pinouts, the DLL cannot be placed in the location 2,5 (DLL0\_SW) ## 3.10 Functional Examples (Ask a Question) This section describes how to use the DDR subsystem in the design in both AXI4 and native interface modes. #### 3.10.1 Accessing DDR Subsystem Through AXI4 Interface (Ask a Question) In AXI4 interface mode, the AXI4 master implemented in the FPGA fabric accesses the DDR memory through the AXI4 interconnect and the DDR subsystem, as shown in the following figure. Figure 3-76. DDR Subsystem Accessed Through AXI4 Interface AXI4 interconnect functions in pass-through mode for a single-master-and-slave configuration. The AXI4 interconnect IP (CoreAXI4Interconnect) is available for download from the Libero SoC IP catalog. After successful DDR initialization, the AXI master initiates reads from and writes to the DDR memory. The following steps describe how to create a design to access the DDR3 memory from the AXI master in the FPGA fabric: 1. Create a SmartDesign, and instantiate the DDR3 macro. - 2. Configure the DDR3 subsystem as described in 3.9. Implementation, or apply the preset configuration (if any), as shown in Figure 3-49. The design shown in this figure is created to access the DDR3 memory with a 32-bit data width through the AXI4 interface. - 3. Instantiate the user AXI4 master logic in the SmartDesign canvas. Ensure that the AXI master logic accesses the DDR3 subsystem only after CTRLR\_READY is high. - 4. Instantiate the CoreAXI4Interconnect IP, and configure single master and slave, as shown in the following figure. Figure 3-77. CoreAXI4Interconnect IP Configurator - 5. The AXI master and CoreAXI4Interconnect IP clocks must be driven from the SYS\_CLK clock of the DDR3 subsystem. - 6. In the SmartDesign canvas, connect the blocks as shown in the following figure. Figure 3-78. SmartDesign Connection—AXI - 7. Create a new SmartDesign testbench to simulate the design. - 8. Instantiate the top-level design component and the DDR memory simulation models. - 9. Configure CLK\_GEN to generate the PLL reference clock, and connect to PLL\_REF\_CLK. - 10. Connect the blocks in SmartDesign testbench, as shown in the following figure. Figure 3-79. SmartDesign Testbench #### 11. Select **Simulate** from Libero **Design Flow** -> **Verify Pre-synthesized Design**. The DDR subsystem initializes the memory model, and sequence complete messages are displayed on the transcript window. The following figure shows the AXI read and write transactions and the corresponding SDRAM transactions. | Note Figure 3-80. AXI Read and Write Transactions ## 3.10.2 Accessing DDR Subsystem Through Native Interface (Ask a Question) The DDR subsystem can be used to access SDRAMs directly, as shown in the following figure. Figure 3-81. DDR Subsystem Accessed Through Native Interface User logic is connected directly to the DDR subsystem using the native interface. After successful DDR initialization, the native interface master initiates reads from or writes to the DDR memory. The following steps describe how to create a design to access the DDR3 memory from the native interface master in the FPGA fabric: - 1. Create a SmartDesign, and instantiate the DDR3 macro. - 2. Configure the DDR3 subsystem as described in 3.9. Implementation, or apply the preset configuration (if any), and select the native interface, as shown in the following figure. In this example, the design is created to access the DDR3 memory with a 16-bit data width through the native interface. Figure 3-82. Native Interface Selection - 3. Instantiate the user native interface master logic in the SmartDesign canvas. Ensure that the native interface master logic accesses the DDR3 subsystem only after CTRLR\_READY is high. - 4. In the SmartDesign canvas, connect the blocks, as shown in the following figure. Figure 3-83. SmartDesign Connection—Native Interface 5. Follow steps 7–11 of 3.10.1. Accessing DDR Subsystem Through AXI4 Interface to simulate the design and see the read and write transactions. # 3.10.3 Accessing DDR Memory Using a Third-Party DDR Controller and the DDR PHY-Only Solution (Ask a Question) The DDR PHY-only solution can be used to access the SDRAMs using a third-party DDR controller or an open-source DDR controller. A fabric master accesses the DDR memory through a third-party DDR controller and the DDR PHY-only solution, as shown in the following figure. Figure 3-84. DDR Memory Accessed Through Third-Party DDR Controller ## 4. Initialization Sequence (Ask a Question) After the asynchronous system reset (SYS\_RESET\_N) is deasserted and PLL\_LOCK is asserted, the DDR controller performs a JEDEC compliant initialization sequence for memory devices. Each DDR SDRAM device has a series of mode registers (MR) that are accessed using the mode-register-set (MRS) commands. These mode registers set various SDRAM behaviors, such as burst length, CAS latency, ODT value, additive latency, and so on. These registers can be configured using the DDR Configurator. The CTRLR\_READY signal is asserted to indicate that initialization is completed. This signal can be monitored from the fabric. The following sections explain the automatic initialization sequence for DDR3, DDR4, and LPDDR3 memory devices. ## 4.1 DDR3 (Ask a Question) Automatic initialization of DDR3 memory involves the following steps: - 1. Training logic performs HS\_IO\_CLK-to-SYS\_CLK training. - 2. Training logic performs CMD/ADDR to REF\_CLK training. - 3. DDR controller waits for the PHY to be ready. At this point, it is assumed that the PHY outputs stable clocks and signal levels. - 4. Controller asserts RESET N and CKE low. - 5. Controller deasserts RESET\_N after 200 μs, and then deasserts CKE after an additional 600 μs. - 6. Controller waits for tXPR. - 7. The MR2, MR3, MR1, and MR0 registers are configured (in that order) using the MRS command. - 8. Controller waits for 512 clock cycles. During this wait time, the ZQCL (long) command is sent to each rank independently to calibrate the RTT and RON values. - 9. Training logic performs write leveling. - 10. Training logic performs DQS gate training. - 11. Training logic aligns read DQ bits. - 12. Training logic aligns read DQS to DQ. - 13. Write calibration is performed. - 14. Normal operation begins. The controller reinitializes the SDRAM memory when a low-to-high transition is detected on the reinitialization control signal (CTRLR\_INIT). The following figure shows the DDR3 automatic initialization flow. Figure 4-1. Automatic Initialization Flow for DDR3 Memory **Note:** If the CTRLR\_READY signal is not asserted, find the state of the failure using the **DDR Debug** option of SmartDebug. ## 4.2 DDR4 (Ask a Question) Automatic initialization of DDR4 memory involves the following steps: - 1. Training logic performs HS\_IO\_CLK-to-SYS\_CLK training. - 2. Training logic performs CMD/ADDR to REF CLK training. - 3. DDR controller waits for the PHY to be ready. At this point, it is assumed that the PHY outputs stable clocks and signal levels. - 4. Controller asserts RESET\_N and CKE low. - 5. Controller deasserts RESET\_N after 200 μs, and then deasserts CKE after an additional 600 μs. - 6. Controller waits for tXPR. - 7. Controller writes to RCW and BCW if RDIMM/LRDIMM is used. - 8. The MR3, MR6, MR5, MR4, MR2, MR1, and MR0 registers are configured (in that order) using the MRS command. - 9. Controller waits for 128 clock cycles (geardown mode). Controller waits for an additional 512 clock cycles for the ZQCL (long) command to be sent to each rank independently to calibrate the RTT and RON values. - 10. Training logic performs write leveling. - 11. Training logic performs DQS gate training. - 12. Training logic aligns read DQ bits. - 13. Training logic aligns read DQS to DQ. - 14. Write calibration is performed. - 15. Normal operation begins. The controller reinitializes the SDRAM memory when a low-to-high transition is detected on the reinitialization control signal (CTRLR\_INIT). The following figure shows the DDR4 automatic initialization flow. Figure 4-2. Automatic Initialization Flow for DDR4 Memory ## 4.3 LPDDR3 (Ask a Question) Automatic initialization of LPDDR3 memory involves the following steps: - 1. Training logic performs HS\_IO\_CLK-to-SYS\_CLK training. - 2. Training logic performs CMD/ADDR to REF\_CLK training. - 3. DDR controller waits for the PHY to be ready. At this point, it is assumed that the PHY is outputs stable clocks and signal levels. - 4. CKE is held low for tINIT1. The LPDDR3 specification is 100 ns (minimum), but the controller waits for 100 clock cycles. - 5. CKE is held high for tINIT3. The LPDDR3 specification is 200 μs. - 6. MRW RESET command is issued. - 7. Controller waits for tINIT5. - 8. Controller sequentially issues the initialization calibration command to each active rank. This enables a configuration where all ranks share a ZQ resistor. - 9. Controller waits for tZQINIT after each calibration command is performed. - 10. The MR1, MR2, MR3, MR16, and MR17 registers are configured (in that order) using the MRS command. - 11. Training logic performs write leveling. - 12. Training logic performs DQS gate training. - 13. Training logic aligns read DQ bits. - 14. Training logic aligns read DQS to DQ. - 15. Write calibration is performed. - 16. Normal operation begins. The controller reinitializes the SDRAM memory when a low-to-high transition is detected on the reinitialization control signal (CTRLR INIT). The following figure shows the LPDDR3 automatic initialization flow. Figure 4-3. Automatic Initialization Flow for LPDDR3 Memory # 4.4 LPDDR4 (For PolarFire SoC FPGA Only) (Ask a Question) Automatic initialization of LPDDR4 memory (for embedded MSS DDR controller only) involves the following steps: - 1. Training logic performs HS\_IO\_CLK-to-SYS\_CLK training. - 2. DDR controller waits for the PHY to be ready. At this point, it is assumed that the PHY outputs stable clocks and signal levels. - 3. Controller asserts RESET\_N, and CKE is held LOW. - 4. Controller deasserts RESET\_N after 200 μs, and then deasserts CKE after an additional 2 ms. - 5. Controller waits for tXPR. - 6. The MR13, MR1, MR2, MR3, MR4, MR11, MR16, MR17, MR22, and MR13 registers are configured (in that order) using the MRS command. - 7. Controller waits for 512 clock cycles. During this wait time, the ZQCL (long) command is sent to each rank independently to calibrate the RTT and RON values. - 8. Training logic performs CMD/ADDR to REF\_CLK training. - 9. Training logic performs write leveling. - 10. Training logic performs Read Gate (Read DQS gate) training. - 11. Training logic aligns read data (DQ) bits. - 12. Training logic aligns read data strobe (DQS) to DQ. - 13. Write calibration is performed. - 14. Sanity check is performed. - 15. Normal operation begins. The following figure shows the LPDDR4 automatic initialization flow. Figure 4-4. Automatic Initialization Flow for LPDDR4 Memory # 5. Training Logic (Ask a Question) The training logic manages the training requests between the integrated PHY and DDR controller modules, and performs the following operations. - Clock training - · Write leveling - Read leveling - I/O calibration The CTRLR\_READY signal is asserted to indicate the completion of initialization and training. This signal can be monitored from the fabric. #### **Clock Training** - HS\_IO\_CLK to SYS\_CLK training—aligns HS\_IO\_CLK to the DDR controller clock. When it is aligned, the data can be transferred to or from the user logic. - CMD/ADDR to REF\_CLK training—aligns the rising edge of REF\_CLK to the center of the address and command buses of the DDR memory. When the rising edge is aligned, DDR commands can be written to the SDRAM. - CMD/ADDR to REF\_CLK training (for fabric LPDDR3 memory controller) - Offset of the REF\_CLK to CMD/ADDR buses relies on internal silicon delays and can be affected by board signal integrity. - The user can include an additional phase offset of the signals via the memory configurator on the general tab using the CK/CA additive offset field. Each offset increment equates to 45 degrees. - The default setting of 0 was derived from test results using a DIMM based validation platform. - The user may need to adjust this parameter based on the specific board requirements. It is recommended to derive the accurate values of skew and termination based on simulations. #### Write Leveling Write leveling is a training mode used during DRAM initialization. The write leveling process identifies the delay when the write DQS rising edge aligns with the rising edge of REF\_CLK. By identifying this delay, the system can accurately align the write DQS within REF\_CLK. When it is aligned, data can be written to the SDRAM. This alignment is at the memory chip, not at the I/O Lane. The fly-by topology introduced in DDR3 improves signal integrity, but it creates variable skew between the CLK and DQS in every DRAM device. Long traces on DIMM can create skew greater than one MEM\_CLK cycle. However, the JEDEC DDR TDQSS specification requires DQS and CLK to align ±0.25 MEM\_CLK. Write leveling compensates for this skew. It is the first in a sequence of training steps. The memory controller has a programmable DQS delay. The DRAM provides feedback to indicate if the DQS leads or lags the CLK. As shown in the following figure, the training logic sends out widely spaced DQS pulses. The DRAM uses CLK as D input and DQS as CLK to the flip-flop. The Q output of the flip-flop is fed back to the prime DQ (the DQ bit on which feedback is provided). The prime DQ could be different for different vendors. The objective is to detect a 0-to-1 transition on the CLK with a DQS rising edge. This is done by moving the DQS in small steps until the sampled CLK changes from 0-to-1. When the transition is detected, REF\_CLK is aligned with DQS. Figure 5-1. DQS Delay #### **Read Leveling** - DQS gate training—The training logic has a DQS gate signal for capturing the correct DQS strobe during read operations and for removing the DQS glitches. The purpose of gate training is to determine the optimum delay that can be applied to the DQS gate for it to function properly. - As shown in the following figure, when the bus is idle, DQ/DQS are at VDD/2. The output of the DQS receiver is undefined. Read DQS is internally used to clock FIFO read data. The write pointer for read data must correspond to every incoming piece of data. This means incoming DQS must be qualified with DQS GATE cover pulse, which goes high during pre-amble and goes low during post-amble. For board design guidelines for DQ and DQS pins, see 7. PolarFire Board Design Recommendations and 8. PolarFire SoC Board Design Recommendations. - The memory controller generates the read commands required for DQS gate training. The training logic places the DQS gate in the middle of the burst. It detects a 0-to-1 transition of DQS at the rising edge of the DQS gate. After the transition is detected, the DQS gate is advanced by 90° (of MEM\_CLK) to place the rising edge in the middle of the DQS high pulse. Then, the rising edge is walked back one MEM\_CLK at a time. The rising edge detects a value of 1 until it hits pre-amble. - When the rising edge of the DQS gate detects a value of 0 for the first time during this walk back process, it is in the pre-amble area of the DQS burst. Only the rising edge of the DQS gate needs to be trained. The training logic runs internal timers to place the falling edge of the gate in the post-amble area. Figure 5-2. DQS Gate Training • Align read DQ bits—aligns the read DQ edges for a lane to increase the size of the data window. • Align read DQS to DQ—aligns the DQS strobe to the center of the data window to increase its size. DQ/DQS centering is a two-step process. First, all DQ bits are aligned to maximize the data valid window. Then, the DQS strobe is placed at the center. Because of variables such as package routing, parasitics (controller and device), and board layout, the performance of all DQ bits is not identical. Each DQ bit in a byte lane are impacted differently. Consequently, there is always a best case and worst case DQ bit within a byte lane. Placement of DQS at the center of data valid window is done using known patterns. When the read data matches with the data written to a location, the DQS is moved both to the left and right until it fails. After both the left and right margins are identified, the mean of the left and right margins [(left + right)/2] is computed, and the DQS is placed at the newly computed mean value. The following figures show DQ/DQS before and after centering. Figure 5-3. DQ/DQS Before Centering Figure 5-4. DQ/DQS After Centering ## I/O Calibration The Fabric DDR subsystem performs DDR memory I/O calibration after device power-up. For more information about I/O calibration, see PolarFire Family Power-Up and Resets User Guide. # 6. QDR Memory Controller (Ask a Question) The PolarFire QDR subsystem addresses memory solution requirements for communication applications. The subsystem can be configured to support QDR II+ and QDR II+ Xtreme memory devices. The subsystem is intended for accessing QDR memories for applications that require high-speed data transfer. QDR SRAMs are a family of SRAMs with separate read and write channels, each operating at double date rate, optimized for high-performance communication applications (Example: Data Plane Memory of Routers performing functions such as packet buffering, statistics counting, and flow rate control). The PolarFire QDR subsystem is made up of the following soft and hard blocks as shown in the following figure: - QDRC (Soft Block) - I/O Lane (Hard Block) - Phase-Locked Loop (PLL) (Hard Block) Figure 6-1. QDR Controller Block Diagram ## 6.1 Features (Ask a Question) The QDR subsystem provides the following features: - Integrated Hard PHY - Supports QDR II+ and QDR II+ Xtreme interface - Up to 1100 Mbps data rate (550 MHz clock) - Separate read and write channels, supporting concurrent transactions - Single address channel - Burst of 2 and burst of 4 support (configurable) - Configurable QDR data width (9, 18, and 36 bits) - Configurable address width (18–22 bits) **Important:** The 22-bit address width is supported only for 18-bit data width. ## 6.2 Performance (Ask a Question) The following table lists the performance of QDR families. Table 6-1. Performance of QDR Families | Parameters | | QDR II+ | QDR II+ Xtreme | |------------|--------------|-----------------------|----------------| | Frequency | 2-Word Burst | 333 MHz | 400 MHz | | | 4-Word Burst | 500 MHz | 550 MHz | | Density | | 18, 36, 72, or 144 Mb | 36 or 72 Mb | ## 6.2.1 Read Latency (Ask a Question) The following table lists the QDR Read latency. Table 6-2. Read Latency | Data Rate | Sys Clock<br>Frequency (MHz) | User Interface<br>Data Width | , | Latency in Cycles<br>(User Interface) | Latency in ns | |-----------|------------------------------|------------------------------|----|---------------------------------------|---------------| | 1100 Mbps | 137.5 | 288 | 36 | 15 cycles | 109.09 | | 800 Mbps | 100 | 288 | 36 | 14/15 cycles | 140 – 150 | | 500 Mbps | 62.5 | 288 | 36 | 14 cycles | 224 | ## 6.3 Functional Description (Ask a Question) The QDR Subsystem translates the User Interface requests into command sequences required by QDR SRAM devices using the QDR Controller and PHY subsystem modules. #### 6.3.1 QDR Controller (Ask a Question) QDR Controller is a Soft IP Core that consists of the following blocks: - Data Control: Accepts the Write and Read commands from User Interface and convert them to PHY commands. It controls the PHY to generate QDR memory interface signals such that, the memory device look into data with the correct latency-with respect to Address and Command (WPS\_N and RPS\_N). - Training Logic: Controls PHY Delay Lines and De-skews (due to board and internal delays) Input data, Input Clocks, and Input Valid Signals. Figure 6-2. QDR Memory Controller #### 6.3.1.1 Integrated PHY (Ask a Question) The integrated PHY, which consists of the I/O lane provides a physical interface to QDR II+ and QDR II+ Xtreme SRAM devices. It receives commands from the QDR controller and generates the QDR memory signals required to access the external QDR memory. In QDR subsystem, the I/O Lane gear ratio is configured to 4:1. ### 6.3.1.1.1 I/O Lane (Ask a Question) An I/O Lane contains: Twelve I/Os, a Lane Controller, I/O Gearing Logic, and a set of High-speed and Low-skew clock resources. The I/O Gearing Logic in a lane enables easy data transfer between the high-speed I/O pad and the Lower-speed FPGA core. The logic is used to either gear up the data rate from the FPGA Fabric to the memory device or gear down the data rate from the memory device to the FPGA fabric. For information about non-memory interface usage, see PolarFire FPGA and PolarFire SoC FPGA User I/O User Guide. The Lane Controller contains the logic for managing the Read and Write signals and provides the Lane Clocks. Each Output Data (D) lane (x9) uses one I/O lane with 12 I/O pads each: Nine of the I/O pads are used for the Data (D) bits, and one for BWSx\_N and the remaining I/O pads are left as spare. Each Input Data (Q) lane(x9) uses one I/O lane with 12 I/O pads each-Nine of the I/O pads are used for the Data (D) bits and the remaining I/O pads are left as spare. Address, Clock and Read/Write command (RPS\_N and WPS\_N) uses 3 I/O lanes. **Important:** Depending on the configuration of the QDR memory, few of the I/Os in Address/Command Lanes are used. Some I/Os in these Lanes can be reused as normal I/Os. For Data Lanes, all 12 I/Os in the Lane are used. #### 6.3.1.2 Training Logic (Ask a Question) The training logic is implemented as a soft FPGA core that de-skews read data and center aligns the clock using IOD capabilities in the read direction. A known training pattern, generated by the QDR controller, is used for training. The IOD PHY receives the read data and performs a serial to parallel conversion. In the following figure, the receive path for the QDR data width = 36 case is shown in detail. The IOD PHY collects the bytes for a total of 72 bits received per lane. The bytes are then passed to the respective SHIM block. The training logic shifts the pattern by full bit times in the SHIM if required. This is called data framing and it is performed iteratively until the received pattern matches the expected pattern. The training logic compares the bytes to the expected pattern and adjusts delay taps in the IOD PHY to center align the data ("Q") around memory clock CQP. This is called Q alignment. This operation is performed sequentially for the entire 288-bit word (4 lanes x 72 bits) byte by byte (lane by lane). The following figure shows the receive path. Figure 6-3. Receive Path Note: The above example is applicable for DQ Width=36 bits. For DQ Width=18 bits, two I/O Lanes are used. For DQ Width =9 bits, one I/O Lanes is used. **Important:** QDR training is performed on receive path only. Following QDR vendor-specific PCB guidelines ensures proper write operations. The QDR RAM uses the CQP and CQN clock pair to capture data in the PHY. IODs in the PHY only use CQP for sampling data. The Lane Controller adjusts delay taps to center CQP within the valid data eye. This tuning occurs during CQ alignment. ### **Training Pattern** - Q alignment– Implemented to adjust the delay taps of the serial bits received. A pseudo random generator is used with a standard PRBS8 sequence having a period of 256 words. - Math formula is $x^8 + x^6 + x^5 + x^4 + 1$ . - Seed value is 0xAA. - CQ Alignment– Used to center align the clock with respect to the received data. Training pattern is fixed to 0xA5. #### **Training FSM** The FSM is designed to iterate across lanes and across each IOD data pad. Data framing, Q Alignment, CQ Alignment operations are all performed using the same methodology. A sequence is written into memory and read back; received data is compared with the transmitted data. Based on this comparison, some actions are taken. What differentiates the three phases is the block where these tuning actions are performed. - Data framing– It is used to identify the beginning of the data frame. There is a data valid signal that is a delayed version of the read command and is used as a gate to sample the data. In Figure 6-3, it applies to Q0 (Lane0), Q9 (Lane 1), Q18 (Lane2), and Q27 (Lane4). The CQP clock is used for sampling the data, looking for a match between write and read sequences. All the IOD delay taps are tried as it searches for a match. If there is no match, the read sequence and/or the data valid signal are shifted by an integer number of bits using the shift register in the read SHIM layer. This process continues until a there is a match, and by definition, data valid is now framing the data. - Q alignment– It is used to find the delay tap setting for each bit that results in the widest data valid window across the data byte. Only the bits sampled on the falling edge are used to compare the written and read data. - CQ alignment– After the delays taps for all IODs are set in a lane during Q alignment, then CQ alignment is performed. This step fine tunes the delay taps of the CQP. Since we only use CQP to capture the data, we only adjust the delay taps on the CQP IOD. Only the bits sampled on the rising edge are used to compare written and read data. The algorithm pseudo code of the whole process (data framing, Q alignment, CQ alignment) is as follows: ``` for k=1:num_lanes for g=1:num IODs in lane(9) for j=1:delay taps (256) for i=1:byte len(8) // 1. Wri\overline{\text{te}} a test pattern to memory generated from the PRBS8 generator. // 2. Read it back and compare it with bits 7,5,3,1 (falling edge bits) in the byte to the corresponding bits of the test pattern. // 3. Generate the next pattern from the PRBS8 generator. END for // byte len 1. Compare the eight generated patterns. 2. Increment IOD delay taps as the training logic searches for all 8 patterns to compare. 3. Once found, it indicates the start of the valid window, and the delay value (delay left) is recorded. 4. Continue to increment the delay taps as the patterns match. When 7(*8 patterns) consecutive matches and there is no longer a match, save this second delay value (delay right) calculate mid value (left+right)/2 and exit loop. END for //delay taps // 1. After trying all delay values (or exited the loop): // 2. Once a solution is found, the IOD delay tap is set to the midpoint of the valid window. // 3. If (a solution is not found on the first IOD of the lane), increase the offset (see data framing). Reload default delay value and set j=1 and retry to delay. // 4. If out of range and there is no solution. Increase fail number and try again from beginning. If number of failures is greater than 15, declare ERROR. END for //num IODs in lane Perform CQ alignment END for //num_lanes ``` The timing relationships between CQP, CQN, data (Q), and data valid (before training begins) is shown in the following figure. Figure 6-4. Data Capture Before Training Begins At first, the falling edge data alignment is done concurrently with the data\_valid alignment. As a result, the data\_valid properly frames the 8-bit burst coming from the QDR device and the CQN centers the falling edge data (f0, f1, f2, f3). **Important:** CQN is not moved because it cannot be moved in the specified architecture; instead, Q is moved to align with CQN by performing write and read of 8 different patterns and selecting a match if and only if all patterns match. This is done on a per-Q basis, with the exception of data framing, which is only performed on Q<0>. A high-level timing diagram of alignment is shown in the following figure. Figure 6-5. Data Capture After Aligning Data Valid and Falling Edge Data to CQN Next, CQP is aligned with the rising edge data by performing write and read of 8 different patterns and selecting a match if and only if all patterns match. Figure 6-6. Data Capture After Moving CQP to Center Rising Edge Data #### 6.3.2 Clocking Structure (Ask a Question) The QDR subsystem require a dedicated PLL to generate the clocks, which are then distributed throughout the subsystem using HS\_IO\_CLK routes, dedicated pads, and fabric clock routing. This PLL generates aligned clocks for all sub-blocks for smooth operation and synchronous communication with the user logic. The PLL generates three required clocks-HS\_IO\_CLK, HS\_IO\_CLK phase shifted by 90° (HS\_IO\_CLK\_90), and controller clock (SYS\_CLK). The two HSIO clocks (HS\_IO\_CLK and HS\_IO\_CLK\_90) are routed to the PHY using HS\_IO\_CLK routing resources (for low skew) and multiplexers. These two clocks generate the QDR SRAM interface signals. SYS\_CLK is routed to the QDR controller and user logic in the fabric. The QDR Memory Clock to SYS\_CLK ratio is 4:1. The PLL generates the QDR memory clocks (KP and KN) as shown in the following figure. Figure 6-7. QDR Clocking Structure The dedicated Clock Input Pad (CCC\_xx\_CLKIN\_N\_m) and Output Pad (CCC\_xx\_PLLy\_OUTz) is selected depending on the QDR subsystem placement constraint (where xx = SE/NE/NW/SW, y = 0/1, z = 0/1 and m=0/1/2/3). **Note:** Timing constraints are not required for output signals (Address, Command and Data). Because, these output signals are shifted 90° from the K clock. ## 6.3.3 Port List (Ask a Question) The following table lists the QDR ports. **Table 6-3.** QDR Signals and Descriptions<sup>1</sup> | Name | Width | Direction | Description | |------------------------|-------|-----------|---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------| | <b>Generic Signals</b> | | | | | SYS_CLK | 1 | Output | Clock for User Logic generated by the embedded PLL. All User Interface signals are synchronous to this clock. Always on the global clock network. | | RESET_N | 1 | Input | Active-low asynchronous system reset. <b>Note:</b> The RESET_N signal must be generated by ANDing the DEVICE_INIT_DONE and BANK_X_CALIB_STATUS signals of the INIT_MONITOR IP core. BANK_X refers to the BANK where the QDR controller is placed. | | PLL_REF_CLK | | Input | Reference clock to the PLL. | | PLL_LOCK | 1 | Output | Lock signal. This signal is generated by the PLL to indicate that the PLL is locked on to the PLL_REF_CLK signal. | | TRAINING_COMPLETE | 1 | Output | Indicates that Training is complete. The user logic must check for TRAINING_COMPLETE = 1 and TRAINING_ERROR = 0 before accessing the QDR memory. | | continued | | | | |------------------------------|--------------------------------------------------|-----------|--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------| | Name | Width | Direction | Description | | TRAINING_ERROR[2:0] | | Output | - Bit 0: Reserved<br>- Bits [2:1] | | | | | • 0: No error. | | | | | • 1: Timeout error, data VALID is received during Q training. | | | | | • 2: No solution found during training. | | | | | If no solution is found, restart the Training process described in 6.3.1.2. Training Logic. | | <b>User Interface Signal</b> | S | | | | READ_N | 4 | Input | <ul><li>Read request with address specified on RADDR bus.</li><li>4-bit Wide if Burst Size is 2.</li><li>2-bit Wide if Burst Size is 4.</li></ul> | | WRITE_N | 4 | Input | <ul><li>Write request with address specified on WADDR bus.</li><li>4-bit Wide if Burst Size is 2.</li><li>2-bit Wide if Burst Size is 4.</li></ul> | | WSTRB_N | SRAM_DWIDTH/9 * 8 | Input | Write strobe for write transaction. | | WADDR | SRAM_AWIDTH*4 | Input | Write addresses for each beat. Width is (SRAM_AWIDTH)*4 if burst size is 2, (SRAM_AWIDTH)*2 if burst size is 4 In Burst of 2: {ADDR3, ADDR2, ADDR1, ADDR0} In Burst of 4: {ADDR1, ADDR0} | | RADDR | SRAM_AWIDTH*4 | Input | Read addresses for each beat. Width is (SRAM_AWIDTH)*4 if burst size is 2, (SRAM_AWIDTH)*2 if burst size is 4 In Burst of 2: {ADDR3, ADDR2, ADDR1, ADDR0} In Burst of 4: {ADDR1, ADDR0} | | WDATA | (SRAM_DWIDTH)*8 | Input | Write data | | RDATA | (SRAM_DWIDTH)*8 | Output | Read data, valid when RVALID is asserted | | RVALID | 1 | Output | Read data valid | | QDR SRAM Interface | | | | | D[X:0] | 9/18/36<br>(X=8, 17, or 35) | Output | Data Output Bus | | WPS_N | 1 | Output | Write port select—active-low | | BWS0_N | 1 | Output | Byte write select (BWS) 0—active-low | | BWS1_N | 1 | Output | Byte write select (BWS) 1—active-low | | BWS2_N | 1 | Output | Byte write select (BWS) 2—active-low | | BWS3_N | 1 | Output | Byte write select (BWS) 3—active-low | | A[X:0] <sup>2</sup> | 18, 19, 20, 21, or 22<br>(X=17,18,19, 20, or 21) | Output | Address inputs | | Q[X:0] | 9,18, or 36<br>(X=8,17, or 35) | Input | Data input bus | | RPS_N | 1 | Output | Read port select - active-low | | QVALID | 1 | Input | Valid output indicator. Not used in current QDR subsystem | | KP | 1 | Output | Positive Clock Output | | KN | 1 | Output | Negative Clock Output | | CQP | 1 | Input | Synchronous echo Clock input | | continued | | | | |-----------|-------|-----------|--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------| | Name | Width | Direction | Description | | DOFF_N | 1 | Output | Initially, DOFF_N is 0. After calibration, DOFF_N is set to 1. If <b>Expose DOFF_N port</b> option is selected in the configurator, the DOFF_N signal is exposed and taken care by the QDR controller. If <b>Expose DOFF_N port</b> option is not selected in the configurator, the DOFF_N signal does not get exposed and must be taken care on the board using a pull-up resistor (check the memory model data sheet). | #### Notes: - 1. In the QDR configurator, the **QDR data width** field is same as SRAM\_DWIDTH. In the QDR configurator, the **address width** field is same as SRAM\_AWIDTH. - 2. The 22-bit address width is supported only for the 18-bit data width. ## 6.4 Functional Timing Diagrams (Ask a Question) This section describes the read/write rules and sequences in the User Interface. For information about User Interface signals, see 6.3.3. Port List. #### 6.4.1 Write Strobe Mapping (Ask a Question) Mapping of the write strobe signal WSTRB to write data WDATA is done on a phased basis, by mapping WDATA to the QDR data bus D, as shown in Figure 6-10 and Figure 6-11. However, it is also aligned to BWSx\_N signals on the QDR interface as follows. Note that the following example is for SRAM\_DWIDTH of 36, or x36 mode. Table 6-4. WSTRB to BWS Mapping | WSTRB Index | BWS Index | |-------------|-----------| | 7:0 | 0 | | 15:8 | 1 | | 23:16 | 2 | | 31:24 | 3 | Within each WSTRB byte, each bit corresponds to a clock cycle (phase) and either rising or falling edge. A complete mapping for write data bus WDATA to WSTRB is listed in the following tables, for x36, x18, and x9 modes. **Table 6-5.** Write Data to Write Strobe Mapping (x36) | WSTRB_MAPPI<br>NG_INDEX | WDATA IN | IDEX | | | | | | | | RISING/<br>FALLING | CLOCK<br>CYCLE | |-------------------------|----------|------|-----|-----|-----|-----|-----|-----|-----|--------------------|----------------| | 0 | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | RISING | 1 | | 8 | 36 | 37 | 38 | 39 | 40 | 41 | 42 | 43 | 44 | FALLING | 1 | | 16 | 72 | 73 | 74 | 75 | 76 | 77 | 78 | 79 | 80 | RISING | 2 | | 24 | 108 | 109 | 110 | 111 | 112 | 113 | 114 | 115 | 116 | FALLING | 2 | | 1 | 144 | 145 | 146 | 147 | 148 | 149 | 150 | 151 | 152 | RISING | 3 | | 9 | 180 | 181 | 182 | 183 | 184 | 185 | 186 | 187 | 188 | FALLING | 3 | | 17 | 216 | 217 | 218 | 219 | 220 | 221 | 222 | 223 | 224 | RISING | 4 | | 25 | 252 | 253 | 254 | 255 | 256 | 257 | 258 | 259 | 260 | FALLING | 4 | | 2 | 9 | 10 | 11 | 12 | 13 | 14 | 15 | 16 | 17 | RISING | 1 | | 10 | 45 | 46 | 47 | 48 | 49 | 50 | 51 | 52 | 53 | FALLING | 1 | | 18 | 81 | 82 | 83 | 84 | 85 | 86 | 87 | 88 | 89 | RISING | 2 | | 26 | 117 | 118 | 119 | 120 | 121 | 122 | 123 | 124 | 125 | FALLING | 2 | | 3 | 153 | 154 | 155 | 156 | 157 | 158 | 159 | 160 | 161 | RISING | 3 | | continue | continued | | | | | | | | | | | |-------------------------|-----------|-------------|-----|-----|-----|-----|-----|-----|-----|---------|----------------| | WSTRB_MAPPI<br>NG_INDEX | WDATA IN | WDATA INDEX | | | | | | | | | CLOCK<br>CYCLE | | 11 | 189 | 190 | 191 | 192 | 193 | 194 | 195 | 196 | 197 | FALLING | 3 | | 19 | 225 | 226 | 227 | 228 | 229 | 230 | 231 | 232 | 233 | RISING | 4 | | 27 | 261 | 262 | 263 | 264 | 265 | 266 | 267 | 268 | 269 | FALLING | 4 | | 4 | 18 | 19 | 20 | 21 | 22 | 23 | 24 | 25 | 26 | RISING | 1 | | 12 | 54 | 55 | 56 | 57 | 58 | 59 | 60 | 61 | 62 | FALLING | 1 | | 20 | 90 | 91 | 92 | 93 | 94 | 95 | 96 | 97 | 98 | RISING | 2 | | 28 | 126 | 127 | 128 | 129 | 130 | 131 | 132 | 133 | 134 | FALLING | 2 | | 5 | 162 | 163 | 164 | 165 | 166 | 167 | 168 | 169 | 170 | RISING | 3 | | 13 | 198 | 199 | 200 | 201 | 202 | 203 | 204 | 205 | 206 | FALLING | 3 | | 21 | 234 | 235 | 236 | 237 | 238 | 239 | 240 | 241 | 242 | RISING | 4 | | 29 | 270 | 271 | 272 | 273 | 274 | 275 | 276 | 277 | 278 | FALLING | 4 | | 6 | 27 | 28 | 29 | 30 | 31 | 32 | 33 | 34 | 35 | RISING | 1 | | 14 | 63 | 64 | 65 | 66 | 67 | 68 | 69 | 70 | 71 | FALLING | 1 | | 22 | 99 | 100 | 101 | 102 | 103 | 104 | 105 | 106 | 107 | RISING | 2 | | 30 | 135 | 136 | 137 | 138 | 139 | 140 | 141 | 142 | 143 | FALLING | 2 | | 7 | 171 | 172 | 173 | 174 | 175 | 176 | 177 | 178 | 179 | RISING | 3 | | 15 | 207 | 208 | 209 | 210 | 211 | 212 | 213 | 214 | 215 | FALLING | 3 | | 23 | 243 | 244 | 245 | 246 | 247 | 248 | 249 | 250 | 251 | RISING | 4 | | 31 | 279 | 280 | 281 | 282 | 283 | 284 | 285 | 286 | 287 | FALLING | 4 | <sup>(1)</sup> The mapping is not sequential. First bit of write\_strobe corresponds to the first 9 wdata bits. The second bit does not correspond to the next 9 wdata bits. Table 6-6. Write Data to Write Strobe Mapping (x18) | WSTRB_MAPPING_INDEX | WDAT | A IND | | 6 ( | | | | | | RISING/FALLING | CLOCK CYCLE | |---------------------|------|-------|-----|-----|-----|-----|-----|-----|-----|----------------|-------------| | 0 | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | RISING | 1 | | 8 | 36 | 37 | 38 | 39 | 40 | 41 | 42 | 43 | 44 | FALLING | 1 | | 1 | 72 | 73 | 74 | 75 | 76 | 77 | 78 | 79 | 80 | RISING | 2 | | 9 | 108 | 109 | 110 | 111 | 112 | 113 | 114 | 115 | 116 | FALLING | 2 | | 2 | 9 | 10 | 11 | 12 | 13 | 14 | 15 | 16 | 17 | RISING | 3 | | 10 | 45 | 46 | 47 | 48 | 49 | 50 | 51 | 52 | 53 | FALLING | 3 | | 3 | 81 | 82 | 83 | 84 | 85 | 86 | 87 | 88 | 89 | RISING | 4 | | 11 | 117 | 118 | 119 | 120 | 121 | 122 | 123 | 124 | 125 | FALLING | 4 | | 4 | 18 | 19 | 20 | 21 | 22 | 23 | 24 | 25 | 26 | RISING | 1 | | 12 | 54 | 55 | 56 | 57 | 58 | 59 | 60 | 61 | 62 | FALLING | 1 | | 5 | 90 | 91 | 92 | 93 | 94 | 95 | 96 | 97 | 98 | RISING | 2 | | 13 | 126 | 127 | 128 | 129 | 130 | 131 | 132 | 133 | 134 | FALLING | 2 | | 6 | 27 | 28 | 29 | 30 | 31 | 32 | 33 | 34 | 35 | RISING | 3 | | 14 | 63 | 64 | 65 | 66 | 67 | 68 | 69 | 70 | 71 | FALLING | 3 | | 7 | 99 | 100 | 101 | 102 | 103 | 104 | 105 | 106 | 107 | RISING | 4 | | 15 | 135 | 136 | 137 | 138 | 139 | 140 | 141 | 142 | 143 | FALLING | 4 | <sup>(1)</sup> The mapping is not sequential. First bit of write\_strobe corresponds to the first 9 wdata bits. The second bit does not correspond to the next 9 wdata bits. **Table 6-7.** Write Data to Write Strobe Mapping (x9) | WSTRB_MAPPING<br>_INDEX | WDATA IN | WDATA INDEX | | | | | | | | | | |-------------------------|----------|-------------|----|----|----|----|----|----|----|---------|---| | 0 | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | RISING | 1 | | 1 | 36 | 37 | 38 | 39 | 40 | 41 | 42 | 43 | 44 | FALLING | 2 | | 2 | 9 | 10 | 11 | 12 | 13 | 14 | 15 | 16 | 17 | RISING | 3 | | 3 | 45 | 46 | 47 | 48 | 49 | 50 | 51 | 52 | 53 | FALLING | 4 | | 4 | 18 | 19 | 20 | 21 | 22 | 23 | 24 | 25 | 26 | RISING | 1 | | 5 | 54 | 55 | 56 | 57 | 58 | 59 | 60 | 61 | 62 | FALLING | 2 | | 6 | 27 | 28 | 29 | 30 | 31 | 32 | 33 | 34 | 35 | RISING | 3 | | 7 | 63 | 64 | 65 | 66 | 67 | 68 | 69 | 70 | 71 | FALLING | 4 | #### 6.4.1.1 Burst of 2 (Ask a Question) In burst of 2 mode (SRAM\_BURST=1), all bits of both WADDR/RADDR and WRITE\_N/READ\_N (the corresponding control signals) are used. A sample write transfer with SRAM\_DWIDTH=18 and SRAM\_AWIDTH=20 is shown in the following figure. In burst of 2 mode, 4 write transactions (WRITE\_N[0,1,2,3]) are completed in one SYS\_CLK cycle. In each transaction data is 18-bit wide and address is 20-bit wide. The 4 write transactions must be triggered with respect to KP clock, which is derived from SYS\_CLK internally. The first write (when WRITE\_N[0] = 0) occurs at the first falling edge of the WPS\_N write enable signal. The second write (when WRITE\_N[1] = 0) occurs at the next falling edge of WPS\_N. Similarly, the remaining writes are triggered. Write transaction is performed as follows: - When the WPS\_N enable signal is '0', four 18-bit data are written in the specified address 'A' in a single system clock. - A1= WADDR[19:0], A2= WADDR[39:20], A3= WADDR[59:40], A4= WADDR[79:60] - During each write enable (WPS\_N), 2 write transactions are performed as follows: - A1 address D0 = WDATA[17:0] and D1=WDATA[35:18] - A2, D2 = WDATA[53:36] and D3=WDATA[71:54] - A3, D4 = WDATA[89:72] and D5=WDATA[107:90] - A4, D6 = WDATA[125:108] and D7=WDATA[143:126] Read transactions are not shown, but are similar, as each RPS\_N assertion and the corresponding A bus values are derived from the four slices of READ\_N and RADDR, respectively. Figure 6-8. Burst of 2 Local Interface Timing #### 6.4.1.2 Burst of 4 (Ask a Question) In the burst of 4 mode (SRAM\_BURST = 0), only half the WADDR/RADDR and WRITE\_N/READ\_N are utilized, because each SYS\_CLK cycle transaction corresponds to only 2 (instead of 4) K clock transactions. A sample write transaction in burst of 4 mode with SRAM\_AWIDTH=20 and SRAM\_DWIDTH=36 is show in the following figure. In burst of 4 mode, 2 write transactions (WRITE\_N[0,1]) are completed in one SYS\_CLK cycle. In each transaction data is 36-bit wide and address is 20-bit wide. The 4 write transactions must be triggered with respect to KP clock, which is derived from SYS\_CLK internally. The first write (when WRITE\_N[0] = 0) occurs at the first falling edge of the WPS\_N write enable signal. The second write (when WRITE\_N[1] = 0) occurs at the next falling edge of WPS\_N. Write transaction is performed as follows: • When the WPS\_N enable signal is '0', 2 36-bit data are written in the specified address 'A' in a single system clock. - A1= WADDR[19:0], A2= WADDR[39:20]. - During each write enable, 4 write transactions are performed in each burst, as follows: - A1 address D0=WDATA[35:0], D1=WDATA[71:36], D2=WDATA[107:72] and D3=WDATA[143:108]. - A2, D4=WDATA[179:144] and D5=WDATA[215:180], D6=WDATA[251:216] and D7=WDATA[287:252] Figure 6-9. Burst of 4 Local Interface Timing ## 6.4.2 QDR SRAM Write (Ask a Question) QDR SRAM Writes are requested by asserting the WRITE\_N[x] signal low, and by driving the Address on the WADDR. The following rules are applicable for the write requests in the user interface: - 1. A Write is accepted by the Subsystem on any SYS\_CLK clock cycle, where the WRITE\_N[x] signal is asserted to Low. Each Write Transaction performs the 2/4 Burst Write operations as per the QDR configuration. - 2. Multiple Writes can be initiated by asserting the multiple WRITE\_N[x] signals to low. The address WADDR[SRAM\_AWIDTH\*(x+1) -1: 0] and DATA[SRAM\_DWIDTH\*(x+1)\*burst\_mode -1:0] will be the corresponding Write Address and Data for the Write Transaction associated with WRITE\_N[x]. - When QDR is configured for 2 Burst mode, four Burst Writes are initiated in single SYS\_CLK clock cycle by using WRITE\_N[3:0]. - When QDR is configured for 4 Burst mode, two Burst Writes are initiated in single SYS\_CLK clock cycle by using WRITE\_N[1:0]. - 3. Maximum of 8 SRAM\_DWIDTH Bit writes to QDR SRAM are initiated in a single SYS\_CLK clock cycle. Figure 6-10. Write in 4 Burst Mode ### 6.4.3 QDR SRAM Read (Ask a Question) QDR SRAM Reads are requested by asserting the READ\_N[x] signal low, and by driving the address on the RADDR. QDR controller asserts the RVALID to high when valid data is available on RDATA. The following rules are applicable for the read requests in the user interface: - 1. A Read is accepted by the Subsystem on any SYS\_CLK clock cycle where the READ\_N[x] signal is asserted to Low. Each Read Transaction performs the 2/4 burst Read Operations as per the QDR configuration. - 2. Multiple Reads can be initiated by asserting the multiple READ\_N[x] signals to Low. The address RADDR[SRAM\_AWIDTH\*(x+1) -1 : 0] and RDATA[SRAM\_DWIDTH\*(x+1)\*burst\_mode 1:0] will be the corresponding Read Address and Data for the Read Transaction associated with READ\_N[x]. - When QDR is configured for 2 Burst Mode, four burst Reads are initiated in single SYS\_CLK clock cycle by using READ\_N[3:0]. - When QDR is configured for 4 Burst Mode, two burst Reads are initiated in single SYS\_CLK clock cycle by using READ\_N[1:0]. Figure 6-11. Read in 4 Burst Mode ## 6.5 Implementation (Ask a Question) This section describes how to use QDR Subsystem in the design. ## 6.5.1 Configuring QDR Memory Controller IP (Ask a Question) The QDR Subsystem macro (PolarFire QDR) located in the Libero IP catalog must be instantiated in SmartDesign to access the QDR memory from the FPGA fabric through the subsystem. The QDR Configurator shown in the following figure configures the QDR Subsystem. Figure 6-12. QDR Configurator The following fields are available for configuring the QDR Subsystem: - Data rate: Allows you to set the data rate of QDR SRAM memory. The K/CQ Clock frequencies are calculated using data rate value (divide by 2). The FPGA logic (SYS\_CLK) frequency is automatically populated (data rate/8). - **CCC PLL clock Multiplier**: Allows you to set the QDR Subsystem PLL Clock multiplier. The PLL Reference Clock frequency must be K/CQ frequency / CCC PLL Clock multiplier. - **QDR data width**: Allows you to set 9,18,36-bit data width. - Address width: Allows you to set 18 to 21-bit address width. - Burst size: Allows you to select burst size of 2 or 4. - Simulation mode: PolarFire QDR Subsystem supports following simulation modes: - **BFM**: In this mode, the QDR subsystem skips the training sequence and asserts the TRAINING\_COMPLETE signal. QDR memory model is not required for BFM simulation. - RTL\_Fast: In this mode, the QDR subsystem loads the training delays as per the selected configuration, skips the training sequence and asserts the TRAINING\_COMPLETE signal. QDR memory model is required for RTL\_Fast simulation. - RTL\_Full: In this mode, the QDR subsystem performs the training sequence and asserts the TRAINING\_COMPLETE signal. TRAINING\_COMPLETE is asserted within 10 ms. QDR memory model is required for RTL\_FULL simulation. - Additional pipelining levels: This option is used for static timing closure, if necessary. Enabling this option increases the read latency by 2 3 SCLK cycles. #### 6.5.1.1 Pipelining Latency (Ask a Question) This section describes the latency with pipelining enabled and disabled. The following figure shows the QDR read latency with pipelining disabled. Figure 6-13. Latency without Pipelining #### QDR READ LATENCY WITH NO PIPELINING (LOW LATENCY) MEMORY INTERFACE **USER INTERFACE** IOD INTERFACE 2.5 clock cycles 0.5 clock cycles control READ TX IOD Asynchronous Logic CONTROL QDR Memory Read data Synchronizatio READ DATA n and RX IOD Alignment Logic 1-2 clock cycles 3 clock cycles The following figure shows the QDR read latency with pipelining enabled. Figure 6-14. Latency with Pipelining ## 6.5.2 Simulating QDR Memory Controller (Ask a Question) Libero SoC supports simulation for the QDR Subsystem, which includes a training sequence. For simulating this Subsystem, the QDR SRAM interface must be connected to the vendor-specific QDR memory simulation model in a testbench. Memory vendors (like Cypress Semiconductor) provide downloadable simulation models for memory devices. For more information on Simulations: Set-up and Running, see 6.6. Functional Example. ### 6.5.3 Design Constraints (Ask a Question) This section describes Timing Constraints and Physical Constraints of the QDR Subsystem. ### 6.5.3.1 Timing Constraints (Ask a Question) The HS\_IO\_CLK, HS\_IO\_CLK\_90, and SYS\_CLK Clocks generated using the dedicated PLL requires timing constraints for synthesis, place and route, and timing verification. To generate these timing constraints, select the **Timing** tab in **Constraint Manager**, and click **Derive Constraints**, as shown in the following figure. It also generates the required multi cycle/false path constraints. Figure 6-15. Constraint Manager-Derive Constraints Tab When prompted, Click **Yes** to apply the derived constraints for synthesis, place and route, and timing verification. #### 6.5.3.2 Physical Constraints (Ask a Question) QDR subsystem can be placed only in the pre-defined locations that are optimized for maximum performance. The maximum width of the QDR subsystem varies based on die/package combination and placement. For information about the QDR placement, see the PolarFire, PolarFire SoC, and RT PolarFire Package Pin Assignment Tables. The QDR subsystem location is selected using the I/O Editor. Libero assigns I/Os based on the selection of DDR memory location. The following steps explains the selection of the QDR Subsystem Location: 1. Click Design Flow > Constraint Manager > I/O Attributes > Edit with I/O Editor. Figure 6-16. Design Flow Window: I/O Editor - 2. Select the **Memory View** tab in I/O Editor. Select the appropriate **Memory Type** to get the valid locations for the chosen memory. - 3. Select the appropriate QDR instance from the **Design View** window, and drag it onto **Port Function**. For the selected QDR instance, I/O Editor shows the valid locations in green. Figure 6-17. Design View Window: Port Function ## 6.5.4 QDR Resource Utilization (Ask a Question) This section provides the post-Synthesis QDR resource utilization reports from Libero® SoC for the following QDR configurations. - For data width = 18-bit, address width = 22-bit, burst size = 4, and Simulation mode = RTL—Fast, see Figure 6-18. - For data width = 18-bit, address width = 21-bit, burst size = 4, and Simulation mode = RTL—Full, see Figure 6-19. - For data width = 36-bit, address width = 21-bit, burst size = 4, and Simulation mode = RTL—Full, see Figure 6-20. - For data width = 9-bit, address width = 21-bit, burst size = 4, and Simulation mode = RTL—Full, see Figure 6-21. - For data width = 9-bit, address width = 21-bit, burst size = 4, and Simulation mode = BFM, see Figure 6-22. - For data width = 18-bit, address width = 21-bit, burst size = 4, and Simulation mode = BFM, see Figure 6-23. - For data width = 36-bit, address width = 21-bit, burst size = 4, and Simulation mode = BFM, see Figure 6-24. The following figure shows the QDR resource utilization report for 18-bit data width and RTL—Fast mode. Figure 6-18. QDR Resource Utilization 1 | Туре | Used | Total | Percentage | |------------------------|------|--------|------------| | 4LUT | 335 | 299544 | 0.11 | | DFF | 90 | 299544 | 0.03 | | I/O Register | 0 | 1536 | 0.00 | | User I/O | 311 | 512 | 60.74 | | Single-ended I/O | 309 | 512 | 60.35 | | Differential I/O Pairs | 1 | 256 | 0.39 | | uSRAM | 0 | 2772 | 0.00 | | LSRAM | 0 | 952 | 0.00 | | Math | 0 | 924 | 0.00 | | H-Chip Global | 2 | 48 | 4.17 | | PLL | 1 | 8 | 12.50 | | DLL | 1 | 8 | 12.50 | | Transceiver Lanes | 0 | 16 | 0.00 | | Transceiver PCle | 0 | 2 | 0.00 | | ICB_CLKINT | 1 | 72 | 1.39 | The following figure shows the QDR resource utilization report for 18-bit data width and RTL—Full mode. Figure 6-19. QDR Resource Utilization 2 | Туре | Used | Total | Percentage | |------------------------|------|--------|------------| | 4LUT | 1562 | 299544 | 0.52 | | DFF | 911 | 299544 | 0.30 | | I/O Register | 0 | 1536 | 0.00 | | User I/O | 466 | 512 | 91.02 | | Single-ended I/O | 464 | 512 | 90.62 | | Differential I/O Pairs | 1 | 256 | 0.39 | | uSRAM | 0 | 2772 | 0.00 | | LSRAM | 0 | 952 | 0.00 | | Math | 0 | 924 | 0.00 | | H-Chip Global | 3 | 48 | 6.25 | | PLL | 1 | 8 | 12.50 | | DLL | 1 | 8 | 12.50 | | Transceiver Lanes | 0 | 16 | 0.00 | | Transceiver PCIe | 0 | 2 | 0.00 | | ICB_CLKINT | 1 | 72 | 1.39 | The following figure shows the QDR resource utilization report for 36-bit data width and RTL—Full mode. Figure 6-20. QDR Resource Utilization 3 | Туре | Used | Total | Percentage | |------------------------|------|--------|------------| | 4LUT | 2754 | 299544 | 0.92 | | DFF | 1570 | 299544 | 0.52 | | I/O Register | 0 | 1536 | 0.00 | | User I/O | 488 | 512 | 95.31 | | Single-ended I/O | 486 | 512 | 94.92 | | Differential I/O Pairs | 1 | 256 | 0.39 | | uSRAM | 0 | 2772 | 0.00 | | LSRAM | 0 | 952 | 0.00 | | Math | 0 | 924 | 0.00 | | H-Chip Global | 3 | 48 | 6.25 | | PLL | 1 | 8 | 12.50 | | DLL | 1 | 8 | 12.50 | | Transceiver Lanes | 0 | 16 | 0.00 | | Transceiver PCIe | 0 | 2 | 0.00 | | ICB_CLKINT | 1 | 72 | 1.39 | The following figure shows the QDR resource utilization report for 9-bit data width and RTL—Full mode. Figure 6-21. QDR Resource Utilization 4 | Туре | Used | Total | Percentage | |------------------------|------|--------|------------| | 4LUT | 926 | 299544 | 0.31 | | DFF | 484 | 299544 | 0.16 | | I/O Register | 0 | 1536 | 0.00 | | User I/O | 295 | 512 | 57.62 | | Single-ended I/O | 293 | 512 | 57.23 | | Differential I/O Pairs | 1 | 256 | 0.39 | | uSRAM | 0 | 2772 | 0.00 | | LSRAM | 0 | 952 | 0.00 | | Math | 0 | 924 | 0.00 | | H-Chip Global | 2 | 48 | 4.17 | | PLL | 1 | 8 | 12.50 | | DLL | 1 | 8 | 12.50 | | Transceiver Lanes | 0 | 16 | 0.00 | | Transceiver PCIe | 0 | 2 | 0.00 | | ICB_CLKINT | 1 | 72 | 1.39 | The following figure shows the QDR resource utilization report for 9-bit data width and BFM mode. Figure 6-22. QDR Resource Utilization 5 | Туре | Used | Total | Percentage | |------------------------|------|--------|------------| | 4LUT | 339 | 299544 | 0.11 | | DFF | 128 | 299544 | 0.04 | | I/O Register | 0 | 1536 | 0.00 | | User I/O | 295 | 512 | 57.62 | | Single-ended I/O | 293 | 512 | 57.23 | | Differential I/O Pairs | 1 | 256 | 0.39 | | uSRAM | 0 | 2772 | 0.00 | | LSRAM | 0 | 952 | 0.00 | | Math | 0 | 924 | 0.00 | | H-Chip Global | 2 | 48 | 4.17 | | PLL | 1 | 8 | 12.50 | | DLL | 1 | 8 | 12.50 | | Transceiver Lanes | 0 | 16 | 0.00 | | Transceiver PCIe | 0 | 2 | 0.00 | | ICB_CLKINT | 1 | 72 | 1.39 | The following figure shows the QDR resource utilization report for 18-bit data width and BFM mode. Figure 6-23. QDR Resource Utilization 6 | Туре | Used | Total | Percentage | |------------------------|------|--------|------------| | 4LUT | 491 | 299544 | 0.16 | | DFF | 170 | 299544 | 0.06 | | I/O Register | 0 | 1536 | 0.00 | | User I/O | 466 | 512 | 91.02 | | Single-ended I/O | 464 | 512 | 90.62 | | Differential I/O Pairs | 1 | 256 | 0.39 | | uSRAM | 0 | 2772 | 0.00 | | LSRAM | 0 | 952 | 0.00 | | Math | 0 | 924 | 0.00 | | H-Chip Global | 2 | 48 | 4.17 | | PLL | 1 | 8 | 12.50 | | DLL | 1 | 8 | 12.50 | | Transceiver Lanes | 0 | 16 | 0.00 | | Transceiver PCle | 0 | 2 | 0.00 | | ICB_CLKINT | 1 | 72 | 1.39 | The following figure shows the QDR resource utilization report for 36-bit data width and BFM mode. Figure 6-24. QDR Resource Utilization 7 | Туре | Used | Total | Percentage | |------------------------|------|--------|------------| | 4LUT | 475 | 299544 | 0.16 | | DFF | 95 | 299544 | 0.03 | | I/O Register | 0 | 1536 | 0.00 | | User I/O | 488 | 512 | 95.31 | | Single-ended I/O | 486 | 512 | 94.92 | | Differential I/O Pairs | 1 | 256 | 0.39 | | uSRAM | 0 | 2772 | 0.00 | | LSRAM | 0 | 952 | 0.00 | | Math | 0 | 924 | 0.00 | | H-Chip Global | 2 | 48 | 4.17 | | PLL | 1 | 8 | 12.50 | | DLL | 1 | 8 | 12.50 | | Transceiver Lanes | 0 | 16 | 0.00 | | Transceiver PCIe | 0 | 2 | 0.00 | | ICB_CLKINT | 1 | 72 | 1.39 | ## 6.6 Functional Example (Ask a Question) This section describes how to perform the RTL\_FULL, BFM, and RTL\_Fast simulations. ## 6.6.1 RTL FULL Simulation (Ask a Question) The QDR subsystem is used to access QDR SRAMs directly, as shown in the following figure. User logic is connected directly to the QDR subsystem using the user interface. Figure 6-25. QDR Subsystem accessed through User Interface After successful QDR initialization, the User Interface Master initiates Read or Write to the QDR memory. The following steps describe how to create a design for accessing the QDR memory from the user interface master in the FPGA fabric: - 1. Create a SmartDesign, and instantiate the QDR component. - 2. Configure the QDR Subsystem as described in 6.5. Implementation. In the following example, a design is created to access the QDR memory with 36-Bit Data Width, 19-Bit Address Width and Burst size of 4. Figure 6-26. QDR Configuration- RTL\_FULL - 3. Instantiate the User Interface Master logic in the SmartDesign canvas. Ensure that the User Interface Master logic accesses the QDR subsystem only after TRAINING\_COMPLETE is high. - 4. In the SmartDesign canvas, connect the blocks, as shown in the following figure. Figure 6-27. SmartDesign Connections - 5. Create a new SmartDesign Testbench to simulate the design. - 6. Instantiate the top-level design component and the QDR memory simulation model. - 7. Configure CLK\_GEN to generate the PLL reference clock and connect to PLL\_REF\_CLK. - 8. Connect the blocks in SmartDesign Testbench, as shown in the following figure. Figure 6-28. QDR SmartDesign Testbench - 9. On Libero Design Flow > Verify Pre-synthesized Design, select Simulate. - 10. The QDR subsystem performs the training and asserts the TRAINING\_COMPLTETE. The following figure shows the User interface read and writes transactions and the corresponding QDR SRAM transactions. Figure 6-29. QDR Read and Write Transactions- RTL FULL ## 6.6.2 BFM Simulation (Ask a Question) In the BFM mode, memory model is a part of the QDR BFM simulation model. To run the BFM simulation: - 1. Follow step 1 to 6 in 6.6.1. RTL\_FULL Simulation. - 2. Select **Simulate** from Libero **Design Flow** -> **Verify Pre-synthesized Design**. 3. The QDR subsystem skips training and asserts the TRAINING\_COMPLTETE signal. The following figure shows the User interface read and writes transactions and the corresponding QDR SRAM transactions. Figure 6-30. QDR Read and Write Transactions- BFM ## 6.6.3 RTL Fast Simulation (Ask a Question) RTL\_fast simulation is same as RTL\_full simulation except that QDR subsystem loads the training delays as per selected configuration and skips the training sequence to assert the TRAINING\_COMPLETE signal. In the following example, a design is created to access the QDR memory with 18-bit data width, 22-bit address width, and burst size of 4. Figure 6-31. QDR Configuration- RTL\_Fast Simulation **Note:** The waveform output of the QDR RTL-Fast simulation will be provided in the next revision of this document. # 6.7 PCB Recommendations (Ask a Question) The routing board signals from FPGA to QDR II+ or QDR II+ Extreme devices, adhere to the constraints mentioned in the following tables. Table 6-8. Maximum Length and Delay Constraints | Signal | Maximum Length (inches) | Routing Delay (ns) | |---------------------|-------------------------|--------------------| | Address and Command | 6.0 | 1 | | Data | 6.0 | 1 | Table 6-9. Skew Constraints | Signal | Skew Constraints | |----------------------------------------|------------------| | Data to Clock<br>(D to K and Q to CQP) | ±25 ps | | Data Bus (D or Q) | ±25 ps | | Address and Command | ±25 ps | | Address, Command and to Clock (KP) | ±25 ps | | KP to KN | ±10 ps | ## 7. PolarFire Board Design Recommendations (Ask a Question) The GPIO and HSIO of PolarFire devices support various memory interfaces. - GPIO supports DDR3. - HSIO supports DDR4, DDR3, DDR3L, and LPDDR3. The following table lists the various types of memories supported in PolarFire devices. Table 7-1. DDR3, LPDDR3, DDR3L, and DDR4 Parameters | Parameter | DDR3 | LPDDR3 | DDR3L | DDR4 | |---------------------------------------------|---------------------------|----------------|---------------------------|-----------------------| | VDDIO | 1.5V | 1.2V | 1.35V | 1.2V | | VTT, VREF | 0.75V | 0.6V | 0.675V | 0.6V | | Clock, Address, and Command (CAC)<br>Layout | Daisy Chained<br>(Fly-by) | Point to point | Daisy Chained<br>(Fly-by) | Daisy Chained (Flyby) | | Data Strobe | Differential | Differential | Differential | Differential | | ODT | Dynamic | Dynamic | Dynamic | Dynamic | | Match Addr/CMD/Ctrl to Clock Tightly | Yes | Yes | Yes | Yes | | Match DQ/DM/DQS Tightly | Yes | Yes | Yes | Yes | | Match DQS to Clock Loosely | Not Required | Not Required | Not Required | Not Required | | I/O Standard | SSTL_15 | HSUL-12 | SSTL_135 | POD-12 <sup>1</sup> | | RZQ | 240_1% | 240_1% | 240_1% | 240_1% | #### Note: 1. HSTL\_12 IO standard is used for DDR4 address, command, and control signals. For more information on DDR memories, see the following documents: - JESD79-3F JEDEC standard for DDR3 SDRAM specifications - JESD79-4 JEDEC standard for DDR4 SDRAM specifications - JESD209-3 JEDEC standard for LPDDR3 SDRAM specifications **Important:** DQ bit swapping within a byte lane is supported on the PCB between the DDR controller and DDR3, DDR3L, LPDDR3, and DDR4 memories for routing flexibility. ## 7.1 DDR3 (Ask a Question) The following are the guidelines for connecting the device to the DDR3 memory: - DDR3 data nets have dynamic ODT built into the controller and SDRAM. The configurations are $60\Omega$ , and $120\Omega$ . DQ lines do not need VTT termination. However, VTT termination resistors need to be placed at the end of address and control lines on the PCB. - Characteristic impedance: $Z_O$ is typically $50\Omega$ , and $Z_{diff}$ (differential) is $100\Omega$ . The following table lists the DDR3 memory interface features supported in PolarFire devices. Table 7-2. Interfaces Supported in DDR3 Memory Devices | Interface | DDR3 | |--------------|-----------------------------------| | Voltage | 1.5V | | I/O standard | SSTL_15 | | Data rate | 1333 MT/S-HSIO and 1066 MT/S-GPIO | | continued | | | |----------------------------------------------|-----------------------------------------------------------------------|--| | Interface | DDR3 | | | Termination | ODT for data group; VTT termination for address, command, and control | | | Routing topology (CK, ADDR/CMD, and CONTROL) | Fly-by | | | Data transmission | Point-to-point | | The following figure shows the connectivity between DDR3 devices and PolarFire FPGAs. Figure 7-1. DDR3 Interface Example #### 7.1.1 DDR3 Layout Guidelines (Ask a Question) This section describes the routing guidelines for DDR3 interface for PolarFire Family. The guidelines are with reference to maximum x72 data width from signal integrity perspective. All the guidelines are provided considering maximum data rate supported. It is recommended to evaluate the interface by performing system level signal integrity simulations. The user is assumed to have the knowledge of the memory interface guidelines. Table 7-3. DDR3 Interface Signals | ChaleSianal | Description | | | |-----------------|----------------------------|--|--| | Clock Signal | Description | | | | CK0:1 P/N | Differential clock signals | | | | Control signals | | | | | CS[1:0] | Chip select | | | | CKE[1:0] | Clock Enable | | | | ODT[1:0] | On die termination enable | | | | Reset_n | Reset signal | | | | Address Signal | | | | | A[15:0] | Address Signals | | | | BA[1:0] | Bank address | | | | Command Signal | | | | | RAS_n | Row address select | | | | CAS_n | Column address select | | | | WE_n | Write enable | | | | Data signals | | | | | | | | | | continued | | | |--------------|-------------|--| | Clock Signal | Description | | | DQ[71:0] | Data bit | | | DQM[8:0] | Data mask | | | DQS_P/N[8:0] | Data strobe | | Due to high speed signaling, DDR3 uses fly-by routing topology for Address, Command, Control signals to achieve best performance. As shown in the following figure, the ADDR/CMD are routed as single-ended in fly-by topology. It is recommended to have all the signals being routed in one signal layer to control the skew within the signal groups. The DQ/DQS signals being point to point signals, uses on die termination on memory and FPGA side. DQ bit swapping within a byte lane is supported on the PCB between the DDR controller and the DDR3 memory for routing flexibility. ### 7.1.1.1 DDR3 Routing Topology (Ask a Question) The following figure shows the FlyBy termination of the DDR3 address, command, and control signals (ACC). Figure 7-2. DDR3 ACC Signals Fly By Termination The following figure shows the FlyBy termination of the DDR3 clock signal. Figure 7-3. DDR3 Clock Signal FlyBy Termination The following figure shows the Point-to-Point routing of the DDR3 data signal. Figure 7-4. DDR3 Data Signal Point-to-Point #### **DDR3 Data Signal Point-to-Point** #### 7.1.1.2 Skew Constraints For Signal Groups (Ask a Question) - 1. The address and command groups must be skew matched with respect to CK signals within ±100 ps. - 2. The DQ, DM signals must be skew matched with respect to DQS within ±25 ps. - 3. The read DQ/DQS training adjusts the DQS signal to optimal sampling point based on the board skew between DQ and DQS. - 4. The write level training ensures that the tDQSS specification for memory is matched for each of the DQ/DQS byte groups and offset the skew introduced due to fly-by routing. The characteristic impedance of $40\Omega$ – $50\Omega$ must be used for single ended signals and $90\Omega$ – $100\Omega$ for differential signals. ## 7.2 LPDDR3 (Ask a Question) The following are the guidelines for connecting the device to the LPDDR3 memory: - LPDDR3 data nets have dynamic ODT built into the controller and SDRAM. The configurations are $120\Omega$ and $240\Omega$ . - DQ lines do not need VTT termination. - The characteristic impedance, $Z_0$ is typically $50\Omega$ , and $Z_{diff}$ (differential) is $100\Omega$ . The following table lists the LPDDR3 memory interface features supported in PolarFire devices. Table 7-4. Interfaces Supported in LPDDR3 Memory Devices | Interface | LPDDR3 | | |------------------|--------------------|--| | Voltage | 1.2V | | | I/O standard | HSUL-12 | | | Data rate | 1333 MT/S-HSIO | | | Termination | ODT for data group | | | Routing topology | Point-to-point | | The following figure shows the connectivity between LPDDR3 devices and PolarFire FPGAs. Figure 7-5. LPDDR3 Interface Example ## 7.2.1 LPDDR3 Layout Guidelines (Ask a Question) The LPDDR3 is a point-to-point interface, where the data group signals and address/command signals are routed from point to point. As LPDDR3 is used in power sensitive designs, there is no termination used for point to point interface. PolarFire supports up to 1333 MT/s for LPDDR3 interface on HSIO only. It is recommended to tightly length match the signals CK, ADDR/CMD signals in one group and DQS, DQ, DM signals in one group together to reduce the skew. The characteristic impedance of $40\text{-}50\Omega$ should be used for single-ended signals and $90\text{-}100\Omega$ for differential signals. Microchip recommends simulating the interface to tightly match the characteristic impedance of trace with driver impedance to reduce reflections and for any possible electrical and timing violations in meeting JEDEC requirements. DQ bit swapping within a byte lane is supported on the PCB between the DDR controller and the LPDDR3 memory for routing flexibility. #### 7.2.1.1 LPDDR3 Routing Topology (Ask a Question) The following figure shows the routing of the LPDDR3 address, control, DQ, and DM signals. Figure 7-6. LPDDR3 Address/Control/DQ/DM Signal Routing LPDDR3 ADDR, Control, DQ, DM signals Routing The following figure shows the routing of LPDDR3 clock and DQS signals. Figure 7-7. LPDDR3 Clock and DQS signal Routing #### 7.2.1.2 Skew Matching Constraints (Ask a Question) - 1. Within the data group DQS, DQ, and DM signals must be matched with ±25 ps. - 2. Skew between CK, CA, and CS signals within the group must not be more than ±30 ps. - 3. Skew between CA and CK group and data lane group must not be more than ±350 ps. **Note:** It is recommended to derive the accurate values of skew and termination based on simulations. #### 7.3 DDR4 (Ask a Question) The following are the guidelines for connecting the device to the DDR4 memory: - DDR4 data nets have dynamic ODT built into the controller and SDRAM. The configurations are $80\Omega$ , $120\Omega$ , and $240\Omega$ . DQ lines do not need VTT termination. However, VTT termination resistors need to be placed at the end of address and control lines on the PCB. - Characteristic impedance: $Z_0$ is typically $50\Omega$ and $Z_{diff}$ (differential) is $100\Omega$ . The following figure shows the features supported by PolarFire FPGA in the DDR4 memory interface. Figure 7-8. DDR4 Interface Example #### 7.3.1 DDR4 Layout Guidelines (Ask a Question) This section describes the routing guidelines for DDR4 interface for PolarFire Family. The guidelines are with reference to maximum x72 data width from signal integrity perspective. All the guidelines are provided considering maximum data rate supported. It is recommended to evaluate the interface by performing system level signal integrity simulations. The user is assumed to have the knowledge of the memory interface guidelines. Table 7-5. DDR4 Interface Signals | Clock Signals | Description | |-----------------------------|----------------------------| | CK[1:0]_P\N | Differential Clock signals | | Address and Command Signals | | | A[15:0] | Address signals | | BA[1:0] | Bank Address signals | | continued | | | |-----------------|----------------------------|--| | Clock Signals | Description | | | BG[1:0] | Bank Group signals | | | WE | Write Enable | | | RAS_n | Row address strobe | | | CAS_n | Column address strobe | | | ACT_n | Activation Command Input | | | PAR | Command and address parity | | | Control Signals | | | | CKE | Clock Enable | | | CS_n[1:0] | Chip Select | | | ODT | On die termination | | | Reset | Reset | | | Data Group | | | | DQ[71:0] | Data signals | | | DQS[8:0]_P/N | Differential data strobe | | | M_N[8:0] | Data mask | | DDR4 interface on PolarFire supports maximum data rate of 1600 MT/s on HSIO banks. The ADDR/CMD signals are routed in fly-by topology and terminated at the last memory component with respect to VDD/2. CK signals are also routed in fly-by topology and terminated at the last memory component with respect to VDD. The data signals are routed as point to point. #### 7.3.1.1 DDR4 Routing Topology (Ask a Question) The following figure shows the Flyby termination of DDR4 ADDR/CMD signals. Figure 7-9. DDR4 ACC Signals FlyBy Termination The following figure shows the FlyBy termination of the DDR4 clock signal. Figure 7-10. DDR4 Clock Signal FlyBy Termination The following figure shows the point-to-point routing of the DDR4 data signal. Figure 7-11. DDR4 Data Signal Point-to-Point #### 7.3.1.2 Skew matching constraints (Ask a Question) - 1. The address and command groups should be skew matched with respect to CK signals within $\pm 100$ ps. - 2. The DQ, DM signals should be skew matched with respect to DQS within ±25 ps. - 3. The read DQ/DQS training will adjust DQS signal to optimal sampling point based on the board skew between DQ and DQS. ## 8. PolarFire SoC Board Design Recommendations (Ask a Question) The GPIO and HSIO of PolarFire SoC devices support various memory interfaces. - GPIO supports DDR3. - HSIO supports DDR4, DDR3, DDR3L, LPDDR3, and LPDDR4. The following table lists the various types of memories supported in PolarFire SoC devices. Table 8-1. DDR3, LPDDR3, DDR3L, LPDDR4, and DDR4 Parameters | Parameter | DDR3 | LPDDR3 | DDR3L | DDR4 | LPDDR4 | |------------------------------------------|---------------------------|----------------|---------------------------|---------------------------|-----------------------------| | VDDIO | 1.5V | 1.2V | 1.35 | 1.2V | 1.1V | | VTT, VREF | 0.75V | 0.6V | 0.675V | 0.6V | 0.55V | | Clock, Address, and Command (CAC) Layout | Daisy Chained<br>(Fly-by) | Point to point | Daisy Chained<br>(Fly-by) | Daisy Chained<br>(Fly-by) | Point-to-Point <sup>2</sup> | | Data Strobe | Differential | Differential | Differential | Differential | Differential | | ODT | Dynamic | Dynamic | Dynamic | Dynamic | Dynamic | | Match Addr/CMD/Ctrl to Clock<br>Tightly | Yes | Yes | Yes | Yes | Yes | | Match DQ/DM/DQS Tightly | Yes | Yes | Yes | Yes | Yes | | Match DQS to Clock Loosely | Not Required | Not Required | Not Required | Not Required | Not Required | | I/O Standard | SSTL_15 | HSUL-12 | SSTL_135 | POD-12 <sup>1</sup> | LVSTL 1.1V | | RZQ | 240_1% | 240_1% | 240_1% | 240_1% | 240_1% | #### Notes: - 1. HSTL\_12 IO standard is used for DDR4 address, command, and control signals. - 2. For dual channel, LPDDR4 needs to go with T-topology routing. For more information, see 8.4. LPDDR4. For more information on DDR memories, see the following documents: - JESD79-3F JEDEC standard for DDR3 SDRAM specifications - JESD79-4 JEDEC standard for DDR4 SDRAM specifications - JESD209-3 JEDEC standard for LPDDR3 SDRAM specifications **Tip:** DQ bit swapping within a byte lane is supported on the PCB between the DDR controller and DDR3, DDR3L, LPDDR3, and DDR4 memories for routing flexibility. ## 8.1 DDR3 (Ask a Question) The following are the guidelines for connecting the device to the DDR3 memory: - DDR3 data nets have dynamic ODT built into the controller and SDRAM. The configurations are $60\Omega$ and $120\Omega$ . DQ lines do not need VTT termination. However, VTT termination resistors must be placed at the end of address and control lines on the PCB. - Characteristic impedance: $Z_0$ is typically $50\Omega$ , and $Z_{diff}$ (differential) is $100\Omega$ . The following table lists the DDR3 memory interface features supported in PolarFire SoC devices. Table 8-2. Interfaces Supported in DDR3 Memory Devices | Interface | DDR3 | |-----------|------| | Voltage | 1.5V | | continued | | | |----------------------------------------------|-----------------------------------------------------------------------|--| | Interface | DDR3 | | | I/O standard | SSTL_15 | | | Data rate | 1333 MT/S-HSIO and 1066 MT/S-GPIO | | | Termination | ODT for data group; VTT termination for address, command, and control | | | Routing topology (CK, ADDR/CMD, and CONTROL) | Fly-by | | | Data transmission | Point-to-point | | The following figure shows the connectivity between DDR3 devices and PolarFire SoC FPGAs. Figure 8-1. DDR3 Interface Example #### 8.1.1 DDR3 Layout Guidelines (Ask a Question) This section describes the routing guidelines for DDR3 interface for PolarFire SoC Family. The guidelines are with reference to maximum x72 data width from signal integrity perspective. All the guidelines are provided considering maximum data rate supported. It is recommended to evaluate the interface by performing system level signal integrity simulations. The user is assumed to have the knowledge of the memory interface guidelines. Table 8-3. DDR3 Interface Signals | Clock Signal | Description | | |-----------------|----------------------------|--| | CK[1:0] P/N | Differential clock signals | | | Control Signals | | | | CS[1:0] | Chip select | | | CKE[1:0] | Clock Enable | | | ODT[1:0] | On die termination enable | | | Reset_n | Reset signal | | | Address Signals | | | | A[12:0] | Address Signals | | | BA[2:0] | Bank address | | | Command Signals | | | | RAS_n | Row address select | | | CAS_n | Column address select | | | continued | | |--------------|--------------| | Clock Signal | Description | | WE_n | Write enable | | Data Signals | | | DQ[31:0] | Data bit | | DM[3:0] | Data mask | | DQS_P/N[8:0] | Data strobe | Due to high speed signaling, DDR3 uses fly-by routing topology for Address, Command, Control signals to achieve best performance. As shown in the following section, the ADDR/CMD are routed as single ended in fly-by topology. It is recommended to have all the signals being routed in one signal layer to control the skew within the signal groups. The DQ/DQS signals being point to point signals, uses on die termination on memory and FPGA side. ## 8.1.1.1 DDR3 Routing Topology (Ask a Question) The following figure shows the FlyBy termination of the DDR3 Address, Command, and Control signals (ACC). Figure 8-2. DDR3 ACC Signals Fly By Termination The following figure shows the FlyBy termination of the DDR3 clock signal. Figure 8-3. DDR3 Clock Signal FlyBy Termination The following figure shows the Point-to-Point routing of the DDR3 data signal. Figure 8-4. DDR3 Data Signal Point-to-Point #### DDR3 Data Signal Point-to-Point #### 8.1.1.2 Skew Constraints For Signal Groups (Ask a Question) - The address and command groups must be skew matched with respect to CK signals within ±100 ps. - 2. The DQ, DM signals must be skew matched with respect to DQS within ±25 ps. - 3. The read DQ/DQS training adjusts the DQS signal to optimal sampling point based on the board skew between DQ and DQS. - 4. The write level training ensures that the tDQSS specification for memory is matched for each of the DQ/DQS byte groups and offset the skew introduced due to fly-by routing. The characteristic impedance of $40\Omega$ – $50\Omega$ must be used for single-ended signals and $90\Omega$ – $100\Omega$ for differential signals. ## 8.2 LPDDR3 (Ask a Question) The following are the guidelines for connecting the device to the LPDDR3 memory: - LPDDR3 data nets have dynamic ODT built into the controller and SDRAM. The configurations are $120\Omega$ and $240\Omega$ . - DQ lines do not need VTT termination. - The characteristic impedance, $Z_0$ is typically $50\Omega$ and $Z_{diff}$ (differential) is $100\Omega$ . The following table lists the LPDDR3 memory interface features supported in PolarFire SoC devices. Table 8-4. Interfaces Supported in LPDDR3 Memory Devices | Interface | LPDDR3 | |------------------|--------------------| | Voltage | 1.2V | | I/O standard | HSUL-12 | | Data rate | 1333 MT/S-HSIO | | Termination | ODT for data group | | Routing topology | Point to point | The following figure shows the connectivity between LPDDR3 devices and PolarFire SoC FPGAs. Figure 8-5. LPDDR3 Interface Example ### 8.2.1 LPDDR3 Layout Guidelines (Ask a Question) The LPDDR3 is a point-to-point interface, where the data group signals and address/command signals are routed from point to point. As LPDDR3 is used in power sensitive designs, there is no termination used for point to point interface. PolarFire SoC supports up to 1333 MT/s for LPDDR3 interface on HSIO only. It is recommended to tightly length match the signals CK, ADDR/CMD signals in one group and DQS, DQ, DM signals in one group together to reduce the skew. The characteristic impedance of $40\Omega$ – $50\Omega$ must be used for single ended signals and $90\Omega$ – $100\Omega$ for differential signals. Microchip recommends simulating the interface to tightly match the characteristic impedance of trace with driver impedance to reduce reflections and for any possible electrical and timing violations in meeting JEDEC requirements. #### 8.2.1.1 LPDDR3 Routing Topology (Ask a Question) The following figure shows the routing of the LPDDR3 address, control, DQ, and DM signals. Figure 8-6. LPDDR3 Address/Control/DQ/DM Signal Routing LPDDR3 ADDR, Control, DQ, DM signals Routing The following figure shows the routing of LPDDR3 clock and DQS signals. Figure 8-7. LPDDR3 Clock and DQS signal Routing #### 8.2.1.2 Skew Matching Constraints (Ask a Question) 1. Within the data group DQS, DQ, and DM signals must be matched with ±25 ps. - 2. Skew between CK, CA, and CS signals within the group must not be more than ±30 ps. - 3. Skew between CA and CK group and data lane group must not be more than ±350 ps. **Note:** It is recommended to derive the accurate values of skew and termination based on simulations. #### 8.3 DDR4 (Ask a Question) The following are the guidelines for connecting the device to the DDR4 memory: - DDR4 data nets have dynamic ODT built into the controller and SDRAM. The configurations are $80\Omega$ , $120\Omega$ , and $240\Omega$ . DQ lines do not need VTT termination. However, VTT termination resistors need to be placed at the end of address and control lines on the PCB. - Characteristic impedance: $Z_o$ is typically $50\Omega$ and $Z_{diff}$ (differential) is $100\Omega$ . The following figure shows the features supported by PolarFire SoC FPGA in the DDR4 memory interface. Figure 8-8. DDR4 Interface Example #### 8.3.1 DDR4 Layout Guidelines (Ask a Question) This section describes the routing guidelines for DDR4 interface for PolarFire SoC Family. The guidelines are with reference to maximum x72 data width from signal integrity perspective. All the guidelines are provided considering maximum data rate supported. It is recommended to evaluate the interface by performing system level signal integrity simulations. The user is assumed to have the knowledge of the memory interface guidelines. Table 8-5. DDR4 Interface Signals | Clock Signals | Description | |-----------------------------|----------------------------| | CK[1:0]_P\N | Differential Clock signals | | Address and Command Signals | | | A[12:0] | Address signals | | BA[1:0] | Bank Address signals | | BG[1:0] | Bank Group signals | | WE | Write Enable | | continued | | |-----------------|----------------------------| | Clock Signals | Description | | RAS_n | Row address strobe | | CAS_n | Column address strobe | | ACT_n | Activation Command Input | | PAR | Command and address parity | | Control Signals | | | CKE | Clock Enable | | CS_n[1:0] | Chip Select | | ODT | On die termination | | Reset | Reset | | Data Group | | | DQ[31:0] | Data signals | | DQS[3:0]_P/N | Differential data strobe | | DM_N[3:0] | Data mask | DDR4 interface on PolarFire SoC supports maximum data rate of 1600 MT/s on HSIO banks. The ADDR/CMD and CK signals are routed in fly-by topology and they are terminated at the last memory component with respect to VDDI/2. The data signals are routed as point to point. #### 8.3.2 DDR4 Routing Topology (Ask a Question) The following figure shows the Flyby termination of DDR4 ADDR/CMD signals. Figure 8-9. DDR4 ACC Signals FlyBy Termination The following figure shows the FlyBy termination of the DDR4 clock signal. Figure 8-10. DDR4 Clock Signal FlyBy Termination The following figure shows the point-to-point routing of the DDR4 data signal. Figure 8-11. DDR4 Data Signal Point-to-Point #### DDR3 Data Signal Point-to-Point ### 8.3.2.1 Skew Matching Constraints (Ask a Question) - 1. The address and command groups should be skew matched with respect to CK signals within ±100 ps. - 2. The DQ, DM signals should be skew matched with respect to DQS within ±25 ps. The read DQ/DQS training will adjust DQS signal to optimal sampling point based on the board skew between DQ and DQS. ### 8.4 LPDDR4 (Ask a Question) The following are the guidelines for connecting the device to the LPDDR4 memory: - LPDDR4 nets (Data and CA) have dynamic ODT built into the controller and SDRAM. The ODT configurations available at memory are 40, 48, 60, 80, 120, and 240 and on controller side options are 34, 40, 48, and 60. - Characteristic impedance for traces: $Z_0$ (single-ended) typically can be either $50\Omega$ or $40\Omega$ and $Z_{diff}$ (differential) can be either $100\Omega$ or $80\Omega$ . LPDDR4 memory comes with different options in terms of channel and rank depending upon the memory size used in the design. Commonly available options are: - Single-channel, single rank - · Dual-channel, single rank - Dual-channel, dual ranks PolarFire SoC LPDDR4 controller supports single rank and single channel. Dual-channel LPDDR4 is supported by concatenating two single-channel on the controller side within FPGA design, and on-board needs to short the Channel-A address of LPDDR4 with the Channel-B address for full memory access. Table 8-6. Supported LPDDR4 Use Models in PolarFire SoC FPGA | LPDDR4 Memory Configuration (Commonly Available) | PolarFire® SoC FPGA | |--------------------------------------------------|---------------------| | Single-channel, single rank (x16) | Supported | | Dual-channel, single rank (x32) | Supported | | Dual-channel, dual rank (x32) | Not supported | The following figure shows the PolarFire SoC interface with single-channel, single-rank LPDDR4. Figure 8-12. PolarFire SoC Interface with Single-Channel, Single-Rank LPDDR4 The following figure shows the PolarFire SoC interface with dual-channel, single-rank LPDDR4. Figure 8-13. PolarFire SoC Interface with Dual-Channel, Single-Rank LPDDR4 The following table lists the LPDDR4 memory interface features supported in PolarFire SoC FPGA. **Table 8-7.** Interfaces Supported in LPDDR4 Memory Devices | Interface | LPDDR4 | |------------------|-------------------------------| | Voltage | 1.1V | | I/O standard | LVSTL 1.1V | | Data rate | 1600MT/S—MDDR I/Os (Bank 6) | | Termination | ODT in Ω (34, 40, 48, and 60) | | Routing Topology | Point-to-point | ### 8.4.1 LPDDR4 Layout Guidelines (Ask a Question) This section describes the routing guidelines for LPDDR4 interface for PolarFire SoC Family. The guidelines are with reference to maximum x32 data width from a signal integrity perspective. It is recommended to evaluate the interface by performing system-level signal integrity simulations. The user is assumed to have knowledge of the memory interface guidelines. Table 8-8. LPDDR4 Interface Signals | Clock Signal | Description | |-----------------------------|----------------------------| | CK_c/t | Differential clock signals | | Address and Control Signals | | | CA [5:0] | Address signals | | CKE | Clock enable | | CS | Chip select | | Reset_n | Memory reset signal | | ODT | On die termination | | Data Group Signals | | | DQ [15:0] | Data signals | | DMI[1:0] | Data mask signal | | DQS_c/t [0:1] | Data strobe signals | #### 8.4.2 LPDDR4 Routing Topology (Ask a Question) LPDDR4 routing can be performed in following channels depending upon the memory size used in the design. - 8.4.2.1. Single-channel LPDDR4 Interface Routing - 8.4.2.2. Dual-Channel LPDDR4 Interface Routing #### 8.4.2.1 Single-channel LPDDR4 Interface Routing (Ask a Question) The following figure shows LPDDR4 routing with Single-channel and single-rank devices. The topology is used to point-to-point for single-channel LPDDR4. All single-ended signals (CA/Data/Cntrl) are routed point-to-point, as shown in the following figure. Figure 8-14. Single-Channel LPDDR4 Interface Routing ## Point-to-point topology #### 8.4.2.2 Dual-Channel LPDDR4 Interface Routing (Ask a Question) For dual-channel LPDDR4 routing, balanced T-topology routing is preferred for CA and Control bus routing. Data group signals are always point-to-point routing. Figure 8-15. Dual-Channel LPDDR4 Interface Routing ## 8.4.2.3 Routing Recommendation (Ask a Question) Following are the routing recommendations for LPDDR4. - Trace impedance can be 40/80 or 50/100. - All CA signals and CNTRL signals should length match with CLK. - DQ/DQS signal should length match. - · Try to make branches short. - Equal branch length gives better signal integrity performance. - Reduce the number of VIA on overall DDR lines. - All DQ/DQS and CLK/CA/Control group signal routing should be similar and transition should be same. For net layer transitions, Microchip recommends routing ground vias next to signal vias for good shielding and to maintain constant impedance. - Typically, the trunk and the branches are the same impedance. If the branches are kept short, the reflection that returns to the component from the mismatch in impedance between the trunk and branches occurs during the rise time and is not observed as a separate event. - Slew rate is dependent on the capacitance at the load and the source impedance. The source impedance is the driver impedance plus the transmission line impedance (mostly the trunk). To improve slew rate, consider shading the driver impedance to and the trunk impedance on the low side. - Ensure that the signals do not cross the split plane and have a reference plane throughout. - Limit the overall trace length to below two inches. #### 8.4.2.4 Skew Machine Constraints (Ask a Question) - The address and control signals group to be tightly length match, skew between them must not exceed 25 ps–50 ps. - DQ/DQS group routing is point-to-point and the skew between DATA/DQS also must not exceed 50 ps. - Skew between CLK/CA/control group and DQ/DQS group should not exceed 100 ps. - Good to have a longer clock/CA trace than DQS. ### 8.4.3 Designing LPDDR4 Interface with Polarfire SoC FPGA (Ask a Question) The following figure shows an example design for PolarFire SoC and LPDDR interface. In the example, the LPDDR4 memory is used from Micron and has dual-channel and single-rank with x32 data width. Figure 8-16. Schematic of PolarFire SoC and LPDDFR4 interface The following figure shows the DQ Bus Routing (Point-to-Point). Figure 8-17. DQ Bus Routing (Point-to-point) The following figure shows the Clk routing (Balance Topology). Figure 8-18. Clk Routing (Balance Topology) The following figure shows the CA bus routing (balance topology). Figure 8-19. CA Bus Routing (Balance Topology) ## 9. Debugging Techniques (Ask a Question) Although the DDR subsystem eliminates the complexity in DDR SDRAM interface design, debugging memory failures is a challenge. Tracking down functional issues (at the FPGA or system level), functional system interaction problems, system timing issues, and signal fidelity issues between FPGA and memory devices (such as noise, crosstalk, or reflections) becomes much more complex. Care must be taken right from the DDR subsystem configuration phase to the PCB layout phase to achieve the expected performance. This section describes the tools and processes used to debug the DDR subsystem interface. ## 9.1 Debug Tools (Ask a Question) This section provides an overview of the tools available for debugging DDR memory controller subsystem. #### 9.1.1 SmartDebug (Ask a Question) SmartDebug Debug DDR Memory GUI capability is automatically provided when a Microchip Fabric DDR Subsystem is hosted in the FPGA fabric and DDR I/O. Microchip's SmartDebug supports the capability of analyzing and displaying the DDR training data including DDR I/O margin data for all DDR memory controller instances used in a design. The information gathered by the SmartDebug tool provides a quick status of the DDR interface initialization between the FPGA DDR IO and the memory device. The information can be used to make design updates to better match the system interface to the FPGA. For more information, see SmartDebug User Guide. ### 9.1.2 Identify RTL Debugger (Ask a Question) Identify is an RTL debugger that allows you to probe the DDR subsystem (native interface, AXI3/4 slave interface, and DFI interface) interface signals and view the signals in the original RTL code or waveform viewer. Identify consists of two tools—the Instrumentor and the Debugger. The Instrumentor compiles and inserts the necessary logic into the original RTL to create an instrumented design that allows for the probing of internal signals. The Debugger communicates with the FlashPro device via a JTAG interface to first retrieve the values for probed signals from the programmed FPGA and then display the retrieved signals. #### 9.1.3 Hardware Probes (Ask a Question) PolarFire family of devices have built-in probe points that enhance the ability to debug design logic within a device through the live and active probe features. These enhanced debug features embedded into the device hardware provide access to logic elements and enable real-time monitoring of inputs and outputs without the need to recompile the design. - Live probes: Two dedicated probes can be configured to observe a probe point, which is the output of a register. After configuring, the probe data can be sent to two dedicated pins (PROBE A and PROBE B), and then to an oscilloscope. - Active probes: Active probes allow dynamic asynchronous read and write to/from a flip-flop or probe point. This capability helps you to observe the output of the logic and perform quick experiments to determine how the logic is affected if a probe point is written to. In many cases, the hardware probe capabilities such as live probe can be used along with the Identify RTL debugger and external test techniques to debug issues in a subsystem. For debugging LPDDR4 and DDR4 training issues, use Memory Log Analyzer GUI. This is only applicable for embedded DDR memory controller. ## 9.2 Design Debug During Simulation (Ask a Question) Before a design can be debugged in the hardware, functional simulation must be performed to rule out any RTL issues. Functional issues are independent of process, voltage, temperature, speed grade, and PCB design. ## 9.3 Hardware Debug (Ask a Question) After the design works correctly in simulation, hardware debugging can be started. The first step is to ensure that the guidelines specified in *PolarFire FPGA Board Design User Guide and PolarFire SoC FPGA Board Design User Guide* are followed, see the . References section. This section lists general checks that can be performed during hardware debug: - Verify that the parameters (timing, signal width, memory parts, and so on) entered in the configurator are correct. - Create a simple design that replicates the issue. - Ensure that all voltages on the board are correctly set and the noise is within the specified limits. - Ensure that all terminations are adequate. - Ensure that clock and reset signals are clean and correct. - Perform a Signal Integrity (SI) analysis. Key items to be observed during the analysis are: - Setup and hold time between data signals (DQ) and their respective DQS - Setup and hold time between control/command/address signals and the clock - Overshoot and undershoot of all signals with respect to JEDEC specifications - DC threshold multi-crossing because of excessive ringing - Cool and heat the device to understand whether temperature variations are affecting the functionality. Temperature fluctuations may uncover a timing-related issue. - Run the device at a lower speed. Ensure that the DDR clock frequency is set within the minimal operating frequency of the device. - Verify that the Libero SoC version and the DDR subsystem IP version are compatible. - Try debugging the design on a different board of the same revision. ## 10. Appendix: Supported Memory Configurations (Ask a Question) This section provides details on the available configurations for the different memory types supported in PolarFire devices. #### DDR4 The PolarFire DDR Subsystem supports non-3DS DDR4 devices. DDR4 SDRAM is available in Dual In-Line Memory Modules (DIMM), Small Outline DIMMs (SO-DIMM), and as discrete devices. For information about the number of rows, columns, and bank bits required for DIMM or SO-DIMM configuration, refer to the data sheet associated with the DIMM module connected to the PolarFire device. The following table lists the number of row bits, column bits, bank bits, and chip selects required for standard discrete DDR4 SDRAM devices in non-3DS configurations. Table 10-1. Non-3DS DDR4 SDRAM Device Configurations | Stack/Chip Size | Configuration | Row Bits | Column Bits | Bank Group Bits | Bank Bits | |-----------------|---------------|----------|-------------|-----------------|-----------| | 2 | 128M × 16 | 14 | 10 | 1 | 2 | | | 64M × 32 | 13 | 10 | 1 | 2 | #### **DDR3 and LPDDR3** The following table lists the number of row bits, column bits, and bank bits required for standard discrete DDR3, LPDDR3, and SDRAM devices. Table 10-2. Standard DDR3, LPDDR3, and SDRAM Device Configurations | Memory Type | Chip Size | Configuration | Row Bits | Column Bits | Bank Bits | |-------------|-----------|---------------|----------|-------------|-----------| | DDR3 | 512 Mb | 32M × 16 | 12 | 10 | 3 | | | 1024 Mb | 64M × 16 | 13 | 10 | 3 | | | 2048 Mb | 128M × 16 | 14 | 10 | 3 | | | 4096 Mb | 256M × 16 | 15 | 10 | 3 | | | 8192 Mb | 512M × 16 | 16 | 10 | 3 | | LPDDR3 | 4 Gb | 128M × 32 | 14 | 10 | 3 | | | 6 Gb | 192M × 32 | 15 | 10 | 3 | | | 8 Gb | 128M × 32 | 15 | 10 | 3 | | | 12 Gb | 384M × 32 | 15 | 11 | 3 | | | 16 Gb | 512M × 32 | 15 | 11 | 3 | # 11. Appendix: Timing Parameters (Ask a Question) The following table lists the timing parameters used in this document. **Table 11-1.** Timing Parameters | Parameter | Description | |-----------|-------------------------------------------------------| | tXPR | Exit reset from CKE HIGH to a valid command | | tINIT1 | Minimum CKE LOW time after completion of voltage ramp | | tINIT3 | Minimum idle time after the first CKE assertion | | tINIT5 | Maximum duration of device auto initialization | | tZQINIT | ZQ initial calibration | | tRP | PRECHARGE command period | | tRFC | REFRESH-to-ACTIVATE or REFRESH command period | # 12. Appendix: Fabric DDR Placement Locations (Ask a Question) This section lists the DDR placement locations available on PolarFire devices. **Important:** Migration between FPGA device and package must be considered prior to final pin out determination for memory interfaces. For DDR interface migration guidance, see the PolarFire PPATs. Table 12-1. NORTH\_NE Anchor (HSIO) | IUDIC IL | <b>1110</b> 10 | III_INE AIICII | 01 (11310) | | | | | | | | |----------|----------------|----------------|------------------|-----------|---------------------|-------------------------------|---|------------|------------|------------| | Standard | Package | Edge_Anchor | Max Pin<br>Count | Max Width | ECC AT Max<br>Width | Add<br>Cmd<br>Parity<br>Check | | MPF200T/TS | MPF300T/TS | MPF500T/TS | | DDR3 | FC1152 | NORTH_NE | 138 | 64 | YES | YES | _ | _ | <b>√</b> | 1 | | | FC784 | NORTH_NE | 126 | 64 | NO | YES | _ | 1 | 1 | 1 | | | FC484 | NORTH_NE | 88 | 40 | YES | NO | 1 | 1 | ✓ | _ | | | FCV484 | NORTH_NE | 88 | 40 | YES | NO | 1 | ✓ | ✓ | _ | | | FCS536 | NORTH_NE | 88 | 40 | YES | NO | _ | 1 | ✓ | _ | | | FCS325 | NORTH_NE | 76 | 32 | NO | NO | ✓ | ✓ | _ | _ | | DDR4 | FC1152 | NORTH_NE | 139 | 64 | YES | YES | _ | _ | 1 | ✓ | | | FC784 | NORTH_NE | 127 | 64 | NO | YES | _ | ✓ | ✓ | ✓ | | | FC484 | NORTH_NE | 91 | 40 | YES | YES | 1 | 1 | ✓ | _ | | | FCV484 | NORTH_NE | 91 | 40 | YES | YES | ✓ | ✓ | ✓ | _ | | | FCS536 | NORTH_NE | 91 | 40 | YES | YES | _ | 1 | ✓ | _ | | | FCS325 | NORTH_NE | 79 | 32 | NO | YES | ✓ | ✓ | _ | _ | | LPDDR3 | FC1152 | NORTH_NE | 63 | 32 | na | na | _ | _ | ✓ | 1 | | | FC784 | NORTH_NE | 63 | 32 | na | na | _ | ✓ | ✓ | 1 | | | FC484 | NORTH_NE | 63 | 32 | na | na | 1 | 1 | 1 | _ | | | FCV484 | NORTH_NE | 63 | 32 | na | na | 1 | 1 | ✓ | _ | | | FCS536 | NORTH_NE | 63 | 32 | na | na | _ | 1 | 1 | _ | | | FCS325 | NORTH_NE | 63 | 32 | na | na | 1 | 1 | _ | _ | | QDR X9 | FC1152 | NORTH_NE | 48 | 9 | na | na | _ | _ | ✓ | 1 | | | FC784 | NORTH_NE | 48 | 9 | na | na | _ | ✓ | ✓ | 1 | | | FC484 | NORTH_NE | 48 | 9 | na | na | 1 | 1 | ✓ | _ | | | FCV484 | NORTH_NE | 48 | 9 | na | na | ✓ | ✓ | ✓ | _ | | | FCS536 | NORTH_NE | 48 | 9 | na | na | _ | 1 | ✓ | _ | | | FCS325 | NORTH_NE | 48 | 9 | na | na | 1 | 1 | _ | _ | | QDR X18 | FC1152 | NORTH_NE | 67 | 18 | na | na | _ | _ | ✓ | 1 | | | FC784 | NORTH_NE | 67 | 18 | na | na | _ | 1 | 1 | 1 | | | FC484 | NORTH_NE | 67 | 18 | na | na | 1 | 1 | 1 | _ | | | FCV484 | NORTH_NE | 67 | 18 | na | na | 1 | 1 | 1 | _ | | | FCS536 | NORTH_NE | 67 | 18 | na | na | _ | 1 | 1 | _ | | | FCS325 | NORTH_NE | 67 | 18 | na | na | 1 | 1 | _ | _ | | | | | | | | | | | | | | con | continued | | | | | | | | | | | |----------|-----------|-------------|------------------|-----------|---------------------|-------------------------------|---|------------|------------|------------|--| | Standard | Package | Edge_Anchor | Max Pin<br>Count | Max Width | ECC AT Max<br>Width | Add<br>Cmd<br>Parity<br>Check | | MPF200T/TS | MPF300T/TS | MPF500T/TS | | | QDR X36 | FC1152 | NORTH_NE | 105 | 36 | na | na | _ | _ | ✓ | ✓ | | | | FC784 | NORTH_NE | 105 | 36 | na | na | _ | ✓ | ✓ | ✓ | | | | FC484 | na | na | na | na | na | _ | _ | _ | _ | | | | FCV484 | na | na | na | na | na | _ | _ | _ | _ | | | | FCS536 | na | na | na | na | na | _ | _ | _ | _ | | | | FCS325 | na | na | na | na | na | _ | _ | _ | _ | | **Important:** If a third party DDR Controller is used that supports data in place of ECC bits, the Max Width includes an additional 8 data bits. For example, a Max Width of 32 with an entry of "Yes" for ECC becomes a Max Width of 40 data bits. Table 12-2. NORTH\_NE\_OPT Anchor (HSIO) | Standard | Package | Edge_Anchor | Max<br>Pin Count | Max Width | ECC<br>AT<br>Max Width | Add<br>Cmd<br>Parity<br>Check | MPF100T/TS | MPF200T/TS | MPF300T/TS | MPF500T/TS | |----------|---------|--------------|------------------|-----------|------------------------|-------------------------------|------------|------------|------------|------------| | DDR3 | FC1152 | NORTH_NE_OPT | 138 | 64 | YES | YES | _ | _ | _ | 1 | | | FC784 | NORTH_NE_OPT | 138 | 64 | YES | YES | _ | _ | 1 | ✓ | | DDR4 | FC1152 | NORTH_NE_OPT | 139 | 64 | YES | YES | _ | _ | _ | ✓ | | | FC784 | NORTH_NE_OPT | 139 | 64 | YES | YES | _ | _ | 1 | ✓ | | LPDDR3 | FC784 | NORTH_NE_OPT | 63 | 32 | na | na | _ | _ | 1 | ✓ | | QDR X18 | FC784 | NORTH_NE_OPT | 67 | 18 | na | na | _ | _ | 1 | ✓ | | QDR X36 | FC1152 | NORTH_NE_OPT | 105 | 36 | na | na | _ | _ | _ | 1 | | | FC784 | NORTH_NE_OPT | 105 | 36 | na | na | _ | _ | 1 | ✓ | Table 12-3. NORTH\_NW Anchor (HSIO) | Standard | Package | Edge_Anchor | Max<br>Pin Count | Max Width | ECC<br>AT<br>Max Width | Add<br>Cmd<br>Parity<br>Check | MPF100T/TS | MPF200T/TS | MPF300T/TS | MPF500T/TS | |----------|---------|-------------|------------------|-----------|------------------------|-------------------------------|------------|------------|------------|------------| | DDR3 | FC1152 | NORTH_NW | 138 | 64 | YES | YES | _ | _ | 1 | 1 | | | FC784 | NORTH_NW | 126 | 64 | NO | YES | _ | 1 | ✓ | ✓ | | | FC484 | NORTH_NW | 88 | 40 | NO | NO | 1 | 1 | 1 | _ | | | FCV484 | NORTH_NW | 88 | 40 | NO | NO | ✓ | 1 | ✓ | _ | | | FCS536 | NORTH_NW | 88 | 40 | NO | NO | _ | 1 | ✓ | _ | | | FCS325 | NORTH_NW | 76 | 32 | NO | NO | ✓ | 1 | _ | _ | | DDR4 | FC1152 | NORTH_NW | 139 | 64 | YES | YES | _ | _ | ✓ | ✓ | | | FC784 | NORTH_NW | 127 | 64 | NO | YES | _ | 1 | ✓ | ✓ | | | FC484 | NORTH_NW | 91 | 40 | NO | YES | ✓ | 1 | 1 | _ | | | FCV484 | NORTH_NW | 91 | 40 | NO | YES | ✓ | 1 | ✓ | _ | | | FCS536 | NORTH_NW | 91 | 40 | NO | YES | _ | 1 | 1 | _ | | | FCS325 | NORTH_NW | 79 | 32 | NO | YES | 1 | 1 | _ | _ | | continued | | | | | | | | | | | |-----------|---------|-------------|------------------|-----------|------------------------|-------------------------------|------------|------------|------------|------------| | Standard | Package | Edge_Anchor | Max<br>Pin Count | Max Width | ECC<br>AT<br>Max Width | Add<br>Cmd<br>Parity<br>Check | MPF100T/TS | MPF200T/TS | MPF300T/TS | MPF500T/TS | | LPDDR3 | FC1152 | NORTH_NW | 63 | 32 | na | na | _ | _ | 1 | ✓ | | | FC784 | NORTH_NW | 63 | 32 | na | na | _ | 1 | 1 | ✓ | | | FC484 | NORTH_NW | 63 | 32 | na | na | ✓ | ✓ | ✓ | _ | | | FCV484 | NORTH_NW | 63 | 32 | na | na | 1 | 1 | 1 | _ | | | FCS536 | NORTH_NW | 63 | 32 | na | na | _ | ✓ | 1 | _ | | | FCS325 | NORTH_NW | 63 | 32 | na | na | 1 | 1 | _ | _ | | QDR X9 | FC1152 | NORTH_NW | 48 | 9 | na | na | _ | _ | ✓ | 1 | | | FC784 | NORTH_NW | 48 | 9 | na | na | _ | ✓ | 1 | ✓ | | | FC484 | NORTH_NW | 48 | 9 | na | na | 1 | 1 | 1 | _ | | | FCV484 | NORTH_NW | 48 | 9 | na | na | 1 | ✓ | 1 | _ | | | FCS536 | NORTH_NW | 48 | 9 | na | na | _ | 1 | 1 | _ | | | FCS325 | NORTH_NW | 48 | 9 | na | na | ✓ | 1 | _ | _ | | QDR X18 | FC1152 | NORTH_NW | 67 | 18 | na | na | _ | _ | 1 | 1 | | | FC784 | NORTH_NW | 67 | 18 | na | na | _ | 1 | 1 | ✓ | | | FC484 | NORTH_NW | 67 | 18 | na | na | ✓ | 1 | 1 | _ | | | FCV484 | NORTH_NW | 67 | 18 | na | na | 1 | 1 | 1 | _ | | | FCS536 | NORTH_NW | 67 | 18 | na | na | _ | 1 | 1 | _ | | | FCS325 | NORTH_NW | 67 | 18 | na | na | ✓ | 1 | _ | _ | | QDR X36 | FC1152 | NORTH_NW | 105 | 36 | na | na | _ | _ | 1 | ✓ | | | FC784 | NORTH_NW | 105 | 36 | na | na | _ | 1 | 1 | ✓ | | | FC484 | na | na | na | na | na | _ | _ | _ | _ | | | FCV484 | na | na | na | na | na | _ | _ | _ | _ | | | FCS536 | na | na | na | na | na | _ | _ | _ | _ | | | FCS325 | na | na | na | na | na | _ | _ | _ | _ | ## Table 12-4. NORTH\_NW\_OPT Anchor (HSIO) | Standard | Package | Edge_Anchor | Max<br>Pin Count | Max Width | ECC<br>AT<br>Max Width | Add<br>Cmd<br>Parity<br>Check | | MPF200T/TS | MPF300T/TS | MPF500T/TS | |----------|---------|--------------|------------------|-----------|------------------------|-------------------------------|---|------------|------------|------------| | DDR3 | FC1152 | NORTH_NW_OPT | 138 | 64 | YES | YES | _ | _ | _ | ✓ | | | FC784 | NORTH_NW_OPT | 138 | 64 | YES | YES | _ | _ | 1 | ✓ | | DDR4 | FC1152 | NORTH_NW_OPT | 139 | 64 | YES | YES | _ | _ | _ | 1 | | | FC784 | NORTH_NW_OPT | 139 | 64 | YES | YES | _ | _ | ✓ | ✓ | | QDR X18 | FC784 | NORTH_NW_OPT | 67 | 18 | na | na | _ | _ | 1 | 1 | | QDR X36 | FC1152 | NORTH_NW_OPT | 105 | 36 | na | na | _ | _ | | ✓ | | | FC784 | NORTH_NW_OPT | 105 | 36 | na | na | _ | _ | 1 | 1 | Table 12-5. SOUTH\_SE Anchor (HSIO) | Table 12- | <b>5.</b> 30011 | H_SE Anchor | (11310) | | | | | | | | |-----------|-----------------|-------------|------------------|-----------|------------------------|-------------------------------|------------|------------|------------|------------| | Standard | Package | Edge_Anchor | Max<br>Pin Count | Max Width | ECC<br>AT<br>Max Width | Add<br>Cmd<br>Parity<br>Check | MPF100T/TS | MPF200T/TS | MPF300T/TS | MPF500T/TS | | DDR3 | FC1152 | SOUTH_SE | 52 | 16 | NO | NO | _ | _ | 1 | 1 | | | FC784 | na | na | na | na | na | _ | _ | _ | _ | | | FC484 | na | na | na | na | na | _ | _ | _ | _ | | | FCV484 | na | na | na | na | na | _ | _ | _ | _ | | | FCS536 | na | na | na | na | na | _ | _ | _ | _ | | | FCS325 | na | na | na | na | na | _ | _ | _ | _ | | DDR4 | FC1152 | SOUTH_SE | 55 | 16 | NO | YES | _ | _ | 1 | 1 | | | FC784 | na | na | na | na | na | _ | _ | _ | _ | | | FC484 | na | na | na | na | na | _ | _ | _ | _ | | | FCV484 | na | na | na | na | na | _ | _ | _ | _ | | | FCS536 | na | na | na | na | na | _ | _ | _ | _ | | | FCS325 | na | na | na | na | na | _ | _ | _ | _ | | LPDDR3 | FC1152 | SOUTH_SE | 63 | 32 | na | na | _ | _ | ✓ | 1 | | | FC784 | na | na | na | na | na | _ | _ | _ | _ | | | FC484 | na | na | na | na | na | _ | _ | _ | _ | | | FCV484 | na | na | na | na | na | _ | _ | _ | _ | | | FCS536 | na | na | na | na | na | _ | _ | _ | _ | | | FCS325 | na | na | na | na | na | _ | _ | _ | _ | | QDR X9 | FC1152 | SOUTH_SE | 48 | 9 | na | na | _ | _ | 1 | 1 | | | FC784 | na | na | na | na | na | _ | _ | _ | _ | | | FC484 | na | na | na | na | na | _ | _ | _ | _ | | | FCV484 | na | na | na | na | na | _ | _ | _ | _ | | | FCS536 | na | na | na | na | na | _ | _ | _ | _ | | | FCS325 | na | na | na | na | na | _ | _ | _ | _ | | QDR X18 | FC1152 | na | na | na | na | na | _ | _ | _ | _ | | | FC784 | na | na | na | na | na | _ | _ | _ | _ | | | FC484 | na | na | na | na | na | _ | _ | _ | _ | | | FCV484 | na | na | na | na | na | _ | _ | _ | _ | | | FCS536 | na | na | na | na | na | _ | _ | _ | _ | | | FCS325 | na | na | na | na | na | _ | _ | _ | _ | | QDR X36 | FC1152 | na | na | na | na | na | _ | _ | _ | _ | | | FC784 | na | na | na | na | na | - | - | - | _ | | | FC484 | na | na | na | na | na | _ | _ | _ | _ | | | FCV484 | na | na | na | na | na | _ | _ | _ | _ | | | FCS536 | na | na | na | na | na | _ | _ | _ | _ | | | FCS325 | na | na | na | na | na | _ | _ | _ | - | Table 12-6. SOUTH\_SE\_OPT Anchor (HSIO) | Standard | Package | Edge_Anchor | Max<br>Pin Count | Max Width | AT<br>Max Width | Add<br>Cmd<br>Parity<br>Check | | MPF200T/TS | MPF300T/TS | MPF500T/TS | |----------|---------|--------------|------------------|-----------|-----------------|-------------------------------|---|------------|------------|------------| | DDR3 | FC1152 | SOUTH_SE_OPT | 88 | 40 | NO | NO | _ | _ | _ | 1 | | DDR4 | FC1152 | SOUTH_SE_OPT | 91 | 40 | NO | YES | _ | _ | _ | ✓ | | cont | inued | | , | | | | | | | | |----------|---------|--------------|------------------|-----------|------------------------|-------------------------------|---|------------|------------|------------| | Standard | Package | Edge_Anchor | Max<br>Pin Count | Max Width | ECC<br>AT<br>Max Width | Add<br>Cmd<br>Parity<br>Check | | MPF200T/TS | MPF300T/TS | MPF500T/TS | | QDR X18 | FC1152 | SOUTH_SE_OPT | 67 | 18 | na | na | _ | _ | _ | ✓ | Table 12-7. SOUTH SW Anchor (GPIO) | DDR3 | | | | r (GPIO) | | | | | | | | |--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|----------|---------|-------------|----------|-----------|----|---------------|------------|------------|------------|------------| | FC784 SOUTH_SW 88 40 NO NO - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - | Standard | Package | Edge_Anchor | 1 | Max Width | AT | Cmd<br>Parity | MPF100T/TS | MPF200T/TS | MPF300T/TS | MPF500T/TS | | FC484 SOUTH_SW 76 32 NO NO V V V — FCV484 SOUTH_SW 88 40 NO NO V V — FC5536 SOUTH_SW 40 8 NO NO V V — FC3325 SOUTH_SW 40 8 NO NO V V — DDR4 FC1152 na na na na na — — — — — — — — — — — — — — — — — — — — — — — — — — — — — — — — — — — — — — — — — — — — — — — — — — — — — — — | DDR3 | FC1152 | SOUTH_SW | 88 | 40 | NO | NO | _ | _ | 1 | 1 | | FCV484 SOUTH_SW 88 40 NO NO V V V — FCS536 SOUTH_SW 88 40 NO NO V V — FCS325 SOUTH_SW 40 8 NO NO V V — FC325 SOUTH_SW 40 8 NO NO V V — DDR4 FC1152 na na na na na na na na — FC784 na na na na na na na na — FC784 na na na na na na na na — FCV484 na na na na na na na na — FCS536 na na na na na na na — FCS536 na na na na na na na — FCS325 na na na na na na na — FC784 na na na na na na na — FC784 na na na na na na na — FC784 na na na na na na na — FC784 na na na na na na na — FC784 na na na na na na na — FC7856 na na na na na na na — FCV4884 na na na na na na na na — FCV484 na na na na na na na na — FCS536 na na na na na na na na — FCS536 na na na na na na na na — FCS536 na na na na na na na na na — FCS536 na na na na na na na na na — FCS536 na na na na na na na na na — FCS536 na na na na na na na na na — FCS536 na — FCS536 na — FCS536 na | | FC784 | SOUTH_SW | 88 | 40 | NO | NO | _ | 1 | 1 | 1 | | FCS536 SOUTH_SW 88 40 NO NO — | | FC484 | SOUTH_SW | 76 | 32 | NO | NO | 1 | 1 | 1 | _ | | FCS325 SOUTH_SW 40 8 NO NO | | FCV484 | SOUTH_SW | 88 | 40 | NO | NO | 1 | 1 | 1 | _ | | DDR4 FC1152 na | | FCS536 | SOUTH_SW | 88 | 40 | NO | NO | _ | 1 | 1 | _ | | FC784 na na na na na na na — — — — — — — — FC484 na na na na na na na na na — — — — — — | | FCS325 | SOUTH_SW | 40 | 8 | NO | NO | 1 | 1 | _ | _ | | FC484 na | DDR4 | FC1152 | na | na | na | na | na | _ | _ | _ | _ | | FCV484 na <td< td=""><td></td><td>FC784</td><td>na</td><td>na</td><td>na</td><td>na</td><td>na</td><td>_</td><td>_</td><td>_</td><td>_</td></td<> | | FC784 | na | na | na | na | na | _ | _ | _ | _ | | FCS536 na | | FC484 | na | na | na | na | na | _ | _ | _ | _ | | FCS325 na | | FCV484 | na | na | na | na | na | _ | _ | _ | _ | | LPDDR3 FC1152 na na na na na na - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - | | FCS536 | na | na | na | na | na | _ | _ | _ | _ | | FC784 na na na na na na - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - <td< td=""><td></td><td>FCS325</td><td>na</td><td>na</td><td>na</td><td>na</td><td>na</td><td>_</td><td>_</td><td>_</td><td>_</td></td<> | | FCS325 | na | na | na | na | na | _ | _ | _ | _ | | FC484 na na na na na na - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - <td< td=""><td>LPDDR3</td><td>FC1152</td><td>na</td><td>na</td><td>na</td><td>na</td><td>na</td><td>_</td><td>_</td><td>_</td><td>_</td></td<> | LPDDR3 | FC1152 | na | na | na | na | na | _ | _ | _ | _ | | FCV484 na | | FC784 | na | na | na | na | na | _ | _ | _ | _ | | FCS536 na | | FC484 | na | na | na | na | na | _ | _ | _ | _ | | FCS325 na | | FCV484 | na | na | na | na | na | _ | _ | _ | _ | | QDR X9 FC1152 SOUTH_SW 48 9 na na — — ✓ FC784 SOUTH_SW 48 9 na na — ✓ ✓ ✓ FC484 SOUTH_SW 48 9 na na ✓ ✓ ✓ — | | FCS536 | na | na | na | na | na | _ | _ | _ | _ | | FC784 SOUTH_SW 48 9 na na — ✓ ✓ ✓ ✓ — FC484 SOUTH_SW 48 9 na na ✓ ✓ ✓ — | | FCS325 | na | na | na | na | na | _ | _ | _ | _ | | FC484 SOUTH_SW 48 9 na na 🗸 🗸 — | QDR X9 | FC1152 | SOUTH_SW | 48 | 9 | na | na | _ | _ | 1 | 1 | | | | FC784 | SOUTH_SW | 48 | 9 | na | na | _ | 1 | 1 | 1 | | FCV484 SOUTH SW 48 9 na na / / / - | | FC484 | SOUTH_SW | 48 | 9 | na | na | 1 | 1 | 1 | _ | | | | FCV484 | SOUTH_SW | 48 | 9 | na | na | 1 | 1 | 1 | _ | | FCS536 SOUTH_SW 48 9 na na — 🗸 🗸 — | | FCS536 | SOUTH_SW | 48 | 9 | na | na | _ | 1 | 1 | _ | | FCS325 na na na na na — — — — — | | FCS325 | na | na | na | na | na | _ | _ | _ | _ | | QDR X18 FC1152 SOUTH_SW 67 18 na na − − ✓ ✓ | QDR X18 | FC1152 | SOUTH_SW | 67 | 18 | na | na | _ | _ | 1 | 1 | | FC784 SOUTH_SW 67 18 na na — 🗸 🗸 | | FC784 | SOUTH_SW | 67 | 18 | na | na | _ | 1 | 1 | 1 | | FC484 SOUTH_SW 67 18 na na 🗸 🗸 🗸 — | | FC484 | SOUTH_SW | 67 | 18 | na | na | 1 | 1 | 1 | _ | | FCV484 SOUTH_SW 67 18 na na 🗸 🗸 🗸 — | | FCV484 | SOUTH_SW | 67 | 18 | na | na | 1 | 1 | 1 | _ | | FCS536 SOUTH_SW 67 18 na na — 🗸 🗸 — | | FCS536 | SOUTH_SW | 67 | 18 | na | na | _ | 1 | 1 | _ | | FCS325 na na na na na — — — — — | | FCS325 | na | na | na | na | na | _ | _ | _ | _ | | QDR X36 FC1152 na na na na na — — — — — | QDR X36 | FC1152 | na | na | na | na | na | _ | _ | _ | _ | | FC784 na na na na na — — — — — | | FC784 | na | na | na | na | na | _ | _ | _ | _ | | FC484 na na na na na — — — — — | | FC484 | na | na | na | na | na | _ | _ | _ | _ | | FCV484 na na na na na — — — — — | | FCV484 | na | na | na | na | na | _ | _ | _ | _ | | FCS536 na na na na na — — — — — | | FCS536 | na | na | na | na | na | _ | _ | _ | _ | | | | | | | | | | | | | | Table 12-8. WEST\_NW Anchor (GPIO) | Table 12- | <b>6.</b> WEST | _INVV Anchor | (GPIO) | | | | | | | | |-----------|----------------|--------------|------------------|-----------|------------------------|-------------------------------|------------|------------|------------|------------| | Standard | Package | Edge_Anchor | Max<br>Pin Count | Max Width | ECC<br>AT<br>Max Width | Add<br>Cmd<br>Parity<br>Check | MPF100T/TS | MPF200T/TS | MPF300T/TS | MPF500T/TS | | DDR3 | FC1152 | WEST_NW | 136 | 64 | YES | NO | _ | _ | 1 | 1 | | | FC784 | WEST_NW | 124 | 64 | NO | NO | _ | 1 | 1 | 1 | | | FC484 | na | na | na | na | na | _ | _ | _ | _ | | | FCV484 | WEST_NW | 52 | 16 | NO | NO | 1 | 1 | ✓ | _ | | | FCS536 | WEST_NW | 76 | 32 | NO | NO | _ | 1 | 1 | _ | | | FCS325 | na | na | na | na | na | _ | _ | _ | _ | | DDR4 | FC1152 | na | na | na | na | na | _ | _ | _ | _ | | | FC784 | na | na | na | na | na | _ | _ | _ | _ | | | FC484 | na | na | na | na | na | _ | _ | _ | _ | | | FCV484 | na | na | na | na | na | _ | _ | _ | _ | | | FCS536 | na | na | na | na | na | _ | _ | _ | _ | | | FCS325 | na | na | na | na | na | _ | _ | _ | _ | | LPDDR3 | FC1152 | na | na | na | na | na | _ | _ | _ | _ | | | FC784 | na | na | na | na | na | _ | _ | _ | _ | | | FC484 | na | na | na | na | na | _ | _ | _ | _ | | | FCV484 | na | na | na | na | na | _ | _ | _ | _ | | | FCS536 | na | na | na | na | na | _ | _ | _ | _ | | | FCS325 | na | na | na | na | na | _ | _ | _ | _ | | QDR X9 | FC1152 | WEST_NW | 48 | 9 | na | na | _ | _ | 1 | ✓ | | | FC784 | WEST_NW | 48 | 9 | na | na | _ | 1 | 1 | ✓ | | | FC484 | na | na | na | na | na | _ | _ | _ | _ | | | FCV484 | WEST_NW | 48 | 9 | na | na | 1 | 1 | ✓ | _ | | | FCS536 | WEST_NW | 48 | 9 | na | na | _ | 1 | 1 | _ | | | FCS325 | na | na | na | na | na | _ | _ | _ | _ | | QDR X18 | FC1152 | WEST_NW | 67 | 18 | na | na | _ | _ | 1 | ✓ | | | FC784 | WEST_NW | 67 | 18 | na | na | _ | ✓ | 1 | ✓ | | | FC484 | na | na | na | na | na | _ | _ | _ | _ | | | FCV484 | na | na | na | na | na | _ | _ | _ | _ | | | FCS536 | WEST_NW | 67 | 18 | na | na | _ | ✓ | ✓ | _ | | | FCS325 | na | na | na | na | na | _ | _ | _ | _ | | QDR X36 | FC1152 | WEST_NW | 105 | 36 | na | na | _ | _ | 1 | ✓ | | | FC784 | WEST_NW | 105 | 36 | na | na | _ | 1 | 1 | 1 | | | FC484 | na | na | na | na | na | _ | _ | _ | _ | | | FCV484 | na | na | na | na | na | _ | _ | _ | _ | | | FCS536 | na | na | na | na | na | _ | _ | _ | _ | | | FCS325 | na | na | na | na | na | _ | _ | _ | _ | Table 12-9. WEST\_NW\_OPT Anchor (GPIO) | Standard | Package | Edge_Anchor | Max<br>Pin Count | Max Width | ECC<br>AT<br>Max Width | Cmd | | MPF200T/TS | MPF300T/TS | MPF500T/TS | |----------|---------|-------------|------------------|-----------|------------------------|-----|---|------------|------------|------------| | DDR3 | FC1152 | WEST_NW_OPT | 136 | 64 | YES | NO | _ | _ | _ | 1 | | QDR X36 | FC1152 | WEST_NW_OPT | 105 | 36 | na | na | _ | _ | _ | ✓ | Table 12-10. WEST\_SW Anchor (GPIO) | lable 12- | TU. VVES | I_SW Ancho | r (GPIO) | | | | | | | | |-----------|----------|-------------|------------------|-----------|------------------------|-------------------------------|------------|------------|------------|------------| | Standard | Package | Edge_Anchor | Max<br>Pin Count | Max Width | ECC<br>AT<br>Max Width | Add<br>Cmd<br>Parity<br>Check | MPF100T/TS | MPF200T/TS | MPF300T/TS | MPF500T/TS | | DDR3 | FC1152 | WEST_SW | 126 | 64 | NO | YES | _ | _ | 1 | 1 | | | FC784 | WEST_SW | 88 | 40 | NO | NO | _ | 1 | 1 | ✓ | | | FC484 | na | na | na | na | na | _ | _ | _ | _ | | | FCV484 | na | na | na | na | na | _ | _ | _ | _ | | | FCS536 | WEST_SW | 52 | 16 | NO | NO | _ | 1 | 1 | _ | | | FCS325 | na | na | na | na | na | _ | _ | _ | _ | | DDR4 | FC1152 | na | na | na | na | na | _ | _ | _ | _ | | | FC784 | na | na | na | na | na | _ | _ | _ | _ | | | FC484 | na | na | na | na | na | _ | _ | _ | _ | | | FCV484 | na | na | na | na | na | _ | _ | _ | _ | | | FCS536 | na | na | na | na | na | _ | _ | _ | _ | | | FCS325 | na | na | na | na | na | _ | _ | _ | _ | | LPDDR3 | FC1152 | na | na | na | na | na | _ | _ | _ | _ | | | FC784 | na | na | na | na | na | _ | _ | _ | _ | | | FC484 | na | na | na | na | na | _ | _ | _ | _ | | | FCV484 | na | na | na | na | na | _ | _ | _ | _ | | | FCS536 | na | na | na | na | na | _ | _ | _ | _ | | | FCS325 | na | na | na | na | na | _ | _ | _ | _ | | QDR X9 | FC1152 | WEST_SW | 48 | 9 | na | na | _ | _ | ✓ | 1 | | | FC784 | WEST_SW | 48 | 9 | na | na | _ | 1 | 1 | ✓ | | | FC484 | na | na | na | na | na | _ | _ | _ | _ | | | FCV484 | na | na | na | na | na | _ | _ | _ | _ | | | FCS536 | WEST_SW | 48 | 9 | na | na | _ | 1 | 1 | _ | | | FCS325 | na | na | na | na | na | _ | _ | _ | _ | | QDR X18 | FC1152 | WEST_SW | 67 | 18 | na | na | _ | _ | 1 | 1 | | | FC784 | WEST_SW | 67 | 18 | na | na | _ | 1 | 1 | ✓ | | | FC484 | na | na | na | na | na | _ | _ | _ | _ | | | FCV484 | na | na | na | na | na | _ | _ | _ | _ | | | FCS536 | na | na | na | na | na | _ | _ | _ | _ | | | FCS325 | na | na | na | na | na | _ | _ | _ | _ | | QDR X36 | FC1152 | WEST_SW | 105 | 36 | na | na | _ | _ | ✓ | 1 | | | FC784 | na | na | na | na | na | _ | _ | _ | _ | | | FC484 | na | na | na | na | na | _ | _ | _ | _ | | | FCV484 | na | na | na | na | na | _ | _ | _ | _ | | | FCS536 | na | na | na | na | na | _ | _ | _ | _ | | | FCS325 | na | na | na | na | na | _ | _ | _ | _ | **Table 12-11.** WEST\_SW\_OPT Anchor (GPIO) | Standard | Package | Edge_Anchor | Max<br>Pin Count | Max Width | ECC<br>AT<br>Max Width | Add<br>Cmd<br>Parity<br>Check | MPF100T/TS | MPF200T/TS | MPF300T/TS | MPF500T/TS | |----------|---------|-------------|------------------|-----------|------------------------|-------------------------------|------------|------------|------------|------------| | DDR3 | FC1152 | WEST_SW_OPT | 138 | 64 | YES | YES | _ | _ | _ | ✓ | | | FC784 | WEST_SW_OPT | 88 | 40 | NO | NO | _ | _ | ✓ | ✓ | | | FC484 | WEST_SW_OPT | 52 | 16 | NO | NO | _ | ✓ | ✓ | _ | | | FCV484 | WEST_SW_OPT | 52 | 16 | NO | NO | _ | ✓ | _ | _ | | QDR X9 | FC1152 | WEST_SW_OPT | 48 | 9 | na | na | _ | _ | _ | ✓ | | | FC784 | WEST_SW_OPT | 48 | 9 | na | na | _ | _ | ✓ | ✓ | | | FC484 | WEST_SW_OPT | 48 | 9 | na | na | _ | ✓ | 1 | _ | | | FCV484 | WEST_SW_OPT | 48 | 9 | na | na | _ | ✓ | ✓ | _ | | QDR X18 | FC1152 | WEST_SW_OPT | 67 | 18 | na | na | _ | _ | _ | ✓ | | | FC784 | WEST_SW_OPT | 67 | 18 | na | na | _ | _ | ✓ | ✓ | | QDR X36 | FC1152 | WEST_SW_OPT | 105 | 36 | na | na | _ | _ | _ | ✓ | | | FC784 | WEST_SW_OPT | 105 | 36 | na | na | _ | _ | 1 | ✓ | # 13. Revision History (Ask a Question) The revision history table describes the changes that were implemented in the document. The changes are listed by revision, starting with the most current publication. | | | I | |----------|---------|------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------| | Revision | Date | Description | | E | 06/2023 | The following is a summary of the changes made in this revision: | | | | Changed the document title to PolarFire Family Memory Controller User Guide. | | | | Added RT PolarFire support. | | | | <ul> <li>Updated 2. MSS DDR Memory Controller and 9.1.3. Hardware Probes<br/>mentioning that both LPDDR4 and DDR4 training issues are debugged using<br/>Memory Log Analyzer.</li> </ul> | | | | <ul> <li>Added a note under Table 3-8 mentioning the data limit to be ensured during<br/>DDR4 memory transfers using the AXI interface.</li> </ul> | | | | • Updated 3.8. Octal DDR PHY-Only Solution by correcting that the Octal DDR PHY configurator supports up to four ranks per interface instead of two ranks. | | | | • Updated the description of the I{INTERFACE#}R{RANK#}_{CLOCK}_DATA output port. | | | | <ul> <li>Added information regarding the default value in the description of the Automatic ZQ Calibration Period (us) parameter in the DDR3, LPDDR3, and DDR4 configurator's Memory Timing tab. Also, added a note describing that this default value is based on the preset configuration. See 3.9.2. Configuring the DDR3 Subsystem, 3.9.3. Configuring the LPDDR3 Subsystem, and 3.9.4. Configuring the DDR4 Subsystem.</li> </ul> | | | | • Added a note in the description of the RESET_N signal. See Table 6-3. | | D | 10/2022 | <ul> <li>The following is a summary of the changes made in this revision:</li> <li>Added information about LPDDR4 initialization sequence. See 4.4. LPDDR4 (For PolarFire SoC FPGA Only).</li> <li>Added board design recommendation for LPDDR4. See 8.4. LPDDR4.</li> <li>Changed the QDR SRAM Address Width and SRAM_DATA_WIDTH naming as per the configurator. See Table 6-3.</li> <li>Updated the QDR address width information, see 6.1. Features.</li> <li>In 6.5.4. QDR Resource Utilization, removed the note that mentioned about the fixed address width, data width, and burst size in QDR RTL - Fast simulation mode.</li> </ul> | | С | 08/2022 | <ul> <li>The following is a summary of the changes made in this revision:</li> <li>Added a note regarding SHIELD signals in Table 3-14.</li> <li>Added CMD/ADDR to REF_CLK training information specific to LPDDR3 in 5. Training Logic.</li> <li>Added 3.9.3. Configuring the LPDDR3 Subsystem and 3.9.4. Configuring the DDR4 Subsystem to describe the PF_LPDDR3 and PF_DDR4 Configurator options separately.</li> <li>Updated QDR training information, see 6.3.1.2. Training Logic.</li> <li>Added 6.5.4. QDR Resource Utilization.</li> <li>Added information about the DQ bit swapping support, see 7. PolarFire Board Design Recommendations and 8. PolarFire SoC Board Design Recommendations.</li> <li>Added a note about DDR interface migration, see 12. Appendix: Fabric DDR</li> </ul> | | | | Placement Locations. | | cont | inued | | |----------|---------|---------------------------------------------------------------------------------------------------------------------------------------------------------------------------| | Revision | Date | Description | | В | 11/2021 | The following is a summary of the changes made in this revision: | | | | • Updated the clock frequency value of PolarFire Octal DDR PHY IP from 200 MHz to 250 MHz. See 3.8. Octal DDR PHY-Only Solution and Table 3-24. | | | | <ul> <li>Updated the description of the output signal DOFF_N of the QDR Controller,<br/>see 6.3.3. Port List.</li> </ul> | | | | Added RTL_Fast simulation information for QDR memory, see 6.5.1. Configuring QDR Memory Controller IP and 6.6.3. RTL_Fast Simulation. | | | | <ul> <li>Updated Figure 3-43 to show the latest configuration parameter Enable</li> <li>Differential DQS and added this configuration parameter to Table 3-24.</li> </ul> | | | | <ul> <li>Added the ALERT_N signal and updated the description of the<br/>stat_ca_parity_error signal in Table 3-7.</li> </ul> | | | | • Updated the description of the ALERT_N signal in Table 3-14. | | | | <ul> <li>Added the information about the Memory Format and Enable Parity/Alert<br/>options in 3.9.2. General.</li> </ul> | | А | 08/2021 | The first publication of this document. This user guide was created by merging the following documents: | | | | UG0676: PolarFire FPGA Memory Controller User Guide | | | | UG0906: PolarFire SoC FPGA Memory Controller User Guide | | | | For more information, see Table 13-1 and Table 13-2 respectively. | The following revision history table describes the changes that were implemented in the *UG0676: PolarFire FPGA Memory Controller User Guide* document. The changes are listed by revision. **Note:** *UG0676: PolarFire FPGA Memory Controller User Guide* document is now obsolete and the information in the document has been migrated to *PolarFire® FPGA and PolarFire SoC FPGA Memory Controller User Guide*. Table 13-1. Revision History of UG0676: PolarFire FPGA Memory Controller User Guide | Revision | Date | Description | |--------------|-------|------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------| | Revision 8.0 | 04/21 | <ul> <li>The following is a summary of the changes made in this revision.</li> <li>Updated that the DDR Controller complies with the AXI3/4 protocol.</li> <li>Added DDR4 Additional Controller Options.</li> <li>Added DDR4 Optimization Guidelines.</li> <li>Added the prerequisite of setting VDDI and VREF before placing the DDR subsystem.</li> <li>Removed DQ bit swapping information because it is not supported.</li> <li>Added information about the minimum operating frequency of LPDDR3.</li> <li>Updated for Libero SoC v2021.1.</li> </ul> | | Revision 7.0 | 10/20 | <ul> <li>The following is a summary of the changes made in this revision.</li> <li>Added information about Octal DDR PHY-Only Solution.</li> <li>Updated support for ECC at maximum width for various device variants.</li> <li>Added brief information about SmartDebug capability of displaying DDR training data.</li> <li>Updated for Libero SoC v12.5.</li> </ul> | | Revision 6.0 | 05/20 | <ul> <li>The following is a summary of the changes made in this revision.</li> <li>Updated the description of the QDR output port DOFF_N.</li> <li>Updated the QDR Training information.</li> <li>Updated QDR related information for Burst of 2.</li> </ul> | | conti | | | |--------------|-------|---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------| | Revision | Date | Description | | Revision 5.0 | 04/19 | The following is a summary of the changes made in this revision. | | | | Updated the document for Libero SoC v12.0. | | | | <ul> <li>Added simulated timing waveforms for write and read sequence with and<br/>without lookahead precharge.</li> </ul> | | | | Added simulated timing waveforms for sample multi-burst write and read sequence. | | | | <ul> <li>Added simulated timing waveforms for sample SDRAM write and read<br/>sequence.</li> </ul> | | | | Updated the information about the DQ bit swapping supported. | | | | • Updated the table that lists the Write Data to Write Strobe Mapping for QDR x36 mode. | | | | <ul> <li>Added tables that list the Write Data to Write Strobe Mapping for QDR x18 ar<br/>x9 modes.</li> </ul> | | | | <ul> <li>Updated "Burst of 2 local interface timing" and "Burst of 4 local interface<br/>timing" diagrams in QDR Memory Controller.</li> </ul> | | | | Updated the QDR simulation options. | | | | Added information about BFM simulation. | | | | <ul> <li>Added L_D_REQ_LAST_P0 and L_R_VALID_LAST_P0 signals in "Native Interface<br/>Signals" table.</li> </ul> | | Revision 4.0 | 09/18 | The following is a summary of the changes made in this revision. | | | | • The document was updated for Libero SoC PolarFire v2.3. | | | | Added information about the DDR PHY-only solution. | | | | Removed the VREFDQ leveling step from the initialization sequence of DDR3 and DDR4. | | Revision 3.0 | 04/18 | The following is a summary of the changes made in this revision. | | Nevision 5.0 | | <ul> <li>Updated the title of the document and added the QDR memory controller<br/>information. Now, both DDR and QDR information is available in this<br/>document.</li> </ul> | | | | • Updated the Libero SoC PolarFire version to v2.1 throughout the document. | | | | Updated the DDR3 and DDR4 resource utilization data. | | | | <ul> <li>Updated the information about connectivity between DDR3 and PolarFire devices.</li> </ul> | | | | <ul> <li>Updated the information about the native interface signals L_DATAIN[N:0],<br/>L_DATAOUT[N:0], and L_DM_IN[N:0].</li> </ul> | | | | <ul> <li>Updated the data widths and I/O standard information for various types of<br/>memories.</li> </ul> | | | | <ul> <li>Added the information about the simulation options available in the DDR3 configurator.</li> </ul> | | | | Updated the DDR3 and DDR4 parameters for PolarFire devices. | | | | <ul> <li>Updated the information about the standard LPDDR3 and DDR3 device<br/>configurations.</li> </ul> | | | | Added the DDR3, LPDDR3, and DDR4 layout guidelines. | | | | Added the list of DDR placement location for each device package. | | | | Removed the instances that mention the LPDDR2 support throughout the | | | | document. | | conti | .continued | | |--------------|------------|-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------| | Revision | Date | Description | | Revision 2.0 | 06/17 | <ul> <li>The following is a summary of the changes made in this revision.</li> <li>Information about DDR memory interface data lanes was updated.</li> <li>Information about training logic was updated.</li> <li>Information about diagnostic options was updated.</li> <li>Information about physical constraints of the DDR subsystem was updated.</li> <li>Information about accessing DDR Subsystem using the native interface was updated.</li> </ul> | | Revision 1.0 | 02/17 | The first publication of UG0676: PolarFire FPGA Memory Controller User Guide. | The following revision history table describes the changes that were implemented in the *UG0906: PolarFire SoC FPGA Memory Controller User Guide* document. The changes are listed by revision. **Note:** *UG0906: PolarFire SoC FPGA Memory Controller User Guide* document is now obsolete and the information in the document has been migrated to *PolarFire® FPGA and PolarFire SoC FPGA Memory Controller User Guide*. Table 13-2. Revision History of UG0906: PolarFire SoC FPGA Memory Controller User Guide | Revision | Date | Description | |--------------|-------|---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------| | Revision 3.0 | 04/21 | <ul> <li>The following is a summary of the changes made in this revision.</li> <li>Updated that DDR Controller complies with the AXI3/4 protocol.</li> <li>Added DDR4 Additional Controller Options.</li> <li>Added DDR4 Optimization Guidelines.</li> <li>Added the prerequisite of setting VDDI and VREF before placing the DDR subsystem.</li> <li>Removed the DQ bit swapping information because it is not supported.</li> </ul> | | | | Updated for Libero SoC v2021.1. | | Revision 2.0 | 09/20 | Updated for Libero SoC v12.5. | | Revision 1.0 | 04/20 | The first publication of the UG0906: PolarFire SoC FPGA Memory Controller User Guide. | ## Microchip FPGA Support (Ask a Question) Microchip FPGA products group backs its products with various support services, including Customer Service, Customer Technical Support Center, a website, and worldwide sales offices. Customers are suggested to visit Microchip online resources prior to contacting support as it is very likely that their queries have been already answered. Contact Technical Support Center through the website at <a href="https://www.microchip.com/support">www.microchip.com/support</a>. Mention the FPGA Device Part number, select appropriate case category, and upload design files while creating a technical support case. Contact Customer Service for non-technical product support, such as product pricing, product upgrades, update information, order status, and authorization. - From North America, call 800.262.1060 - From the rest of the world, call 650.318.4460 - Fax, from anywhere in the world, 650.318.8044 ## Microchip Information (Ask a Question) ### The Microchip Website (Ask a Question) Microchip provides online support via our website at <a href="www.microchip.com/">www.microchip.com/</a>. This website is used to make files and information easily available to customers. Some of the content available includes: - **Product Support** Data sheets and errata, application notes and sample programs, design resources, user's guides and hardware support documents, latest software releases and archived software - **General Technical Support** Frequently Asked Questions (FAQs), technical support requests, online discussion groups, Microchip design partner program member listing - Business of Microchip Product selector and ordering guides, latest Microchip press releases, listing of seminars and events, listings of Microchip sales offices, distributors and factory representatives #### Product Change Notification Service (Ask a Question) Microchip's product change notification service helps keep customers current on Microchip products. Subscribers will receive email notification whenever there are changes, updates, revisions or errata related to a specified product family or development tool of interest. To register, go to www.microchip.com/pcn and follow the registration instructions. #### Customer Support (Ask a Question) Users of Microchip products can receive assistance through several channels: - Distributor or Representative - Local Sales Office - Embedded Solutions Engineer (ESE) - Technical Support Customers should contact their distributor, representative or ESE for support. Local sales offices are also available to help customers. A listing of sales offices and locations is included in this document. Technical support is available through the website at: www.microchip.com/support ## Microchip Devices Code Protection Feature (Ask a Question) Note the following details of the code protection feature on Microchip products: - Microchip products meet the specifications contained in their particular Microchip Data Sheet. - Microchip believes that its family of products is secure when used in the intended manner, within operating specifications, and under normal conditions. - Microchip values and aggressively protects its intellectual property rights. Attempts to breach the code protection features of Microchip product is strictly prohibited and may violate the Digital Millennium Copyright Act. - Neither Microchip nor any other semiconductor manufacturer can guarantee the security of its code. Code protection does not mean that we are guaranteeing the product is "unbreakable". Code protection is constantly evolving. Microchip is committed to continuously improving the code protection features of our products. #### Legal Notice (Ask a Question) This publication and the information herein may be used only with Microchip products, including to design, test, and integrate Microchip products with your application. Use of this information in any other manner violates these terms. Information regarding device applications is provided only for your convenience and may be superseded by updates. It is your responsibility to ensure that your application meets with your specifications. Contact your local Microchip sales office for additional support or, obtain additional support at <a href="https://www.microchip.com/en-us/support/design-help/client-support-services">www.microchip.com/en-us/support/design-help/client-support-services</a>. THIS INFORMATION IS PROVIDED BY MICROCHIP "AS IS". MICROCHIP MAKES NO REPRESENTATIONS OR WARRANTIES OF ANY KIND WHETHER EXPRESS OR IMPLIED, WRITTEN OR ORAL, STATUTORY OR OTHERWISE, RELATED TO THE INFORMATION INCLUDING BUT NOT LIMITED TO ANY IMPLIED WARRANTIES OF NON-INFRINGEMENT, MERCHANTABILITY, AND FITNESS FOR A PARTICULAR PURPOSE, OR WARRANTIES RELATED TO ITS CONDITION, QUALITY, OR PERFORMANCE. IN NO EVENT WILL MICROCHIP BE LIABLE FOR ANY INDIRECT, SPECIAL, PUNITIVE, INCIDENTAL, OR CONSEQUENTIAL LOSS, DAMAGE, COST, OR EXPENSE OF ANY KIND WHATSOEVER RELATED TO THE INFORMATION OR ITS USE, HOWEVER CAUSED, EVEN IF MICROCHIP HAS BEEN ADVISED OF THE POSSIBILITY OR THE DAMAGES ARE FORESEEABLE. TO THE FULLEST EXTENT ALLOWED BY LAW, MICROCHIP'S TOTAL LIABILITY ON ALL CLAIMS IN ANY WAY RELATED TO THE INFORMATION OR ITS USE WILL NOT EXCEED THE AMOUNT OF FEES, IF ANY, THAT YOU HAVE PAID DIRECTLY TO MICROCHIP FOR THE INFORMATION. Use of Microchip devices in life support and/or safety applications is entirely at the buyer's risk, and the buyer agrees to defend, indemnify and hold harmless Microchip from any and all damages, claims, suits, or expenses resulting from such use. No licenses are conveyed, implicitly or otherwise, under any Microchip intellectual property rights unless otherwise stated. #### Trademarks (Ask a Question) The Microchip name and logo, the Microchip logo, Adaptec, AVR, AVR logo, AVR Freaks, BesTime, BitCloud, CryptoMemory, CryptoRF, dsPIC, flexPWR, HELDO, IGLOO, JukeBlox, KeeLoq, Kleer, LANCheck, LinkMD, maXStylus, maXTouch, MediaLB, megaAVR, Microsemi, Microsemi logo, MOST, MOST logo, MPLAB, OptoLyzer, PIC, picoPower, PICSTART, PIC32 logo, PolarFire, Prochip Designer, QTouch, SAM-BA, SenGenuity, SpyNIC, SST, SST Logo, SuperFlash, Symmetricom, SyncServer, Tachyon, TimeSource, tinyAVR, UNI/O, Vectron, and XMEGA are registered trademarks of Microchip Technology Incorporated in the U.S.A. and other countries. AgileSwitch, APT, ClockWorks, The Embedded Control Solutions Company, EtherSynch, Flashtec, Hyper Speed Control, HyperLight Load, Libero, motorBench, mTouch, Powermite 3, Precision Edge, ProASIC, ProASIC Plus, ProASIC Plus logo, Quiet- Wire, SmartFusion, SyncWorld, Temux, TimeCesium, TimeHub, TimePictra, TimeProvider, TrueTime, and ZL are registered trademarks of Microchip Technology Incorporated in the U.S.A. Adjacent Key Suppression, AKS, Analog-for-the-Digital Age, Any Capacitor, AnyIn, AnyOut, Augmented Switching, BlueSky, BodyCom, Clockstudio, CodeGuard, CryptoAuthentication, CryptoAutomotive, CryptoCompanion, CryptoController, dsPICDEM, dsPICDEM.net, Dynamic Average Matching, DAM, ECAN, Espresso T1S, EtherGREEN, GridTime, IdealBridge, In-Circuit Serial Programming, ICSP, INICnet, Intelligent Paralleling, IntelliMOS, Inter-Chip Connectivity, JitterBlocker, Knob-on-Display, KoD, maxCrypto, maxView, memBrain, Mindi, MiWi, MPASM, MPF, MPLAB Certified logo, MPLIB, MPLINK, MultiTRAK, NetDetach, Omniscient Code Generation, PICDEM, PICDEM.net, PICkit, PICtail, PowerSmart, PureSilicon, QMatrix, REAL ICE, Ripple Blocker, RTAX, RTG4, SAM-ICE, Serial Quad I/O, simpleMAP, SimpliPHY, SmartBuffer, SmartHLS, SMART-I.S., storClad, SQI, SuperSwitcher, SuperSwitcher II, Switchtec, SynchroPHY, Total Endurance, Trusted Time, TSHARC, USBCheck, VariSense, VectorBlox, VeriPHY, ViewSpan, WiperLock, XpressConnect, and ZENA are trademarks of Microchip Technology Incorporated in the U.S.A. and other countries. SQTP is a service mark of Microchip Technology Incorporated in the U.S.A. The Adaptec logo, Frequency on Demand, Silicon Storage Technology, and Symmcom are registered trademarks of Microchip Technology Inc. in other countries. GestIC is a registered trademark of Microchip Technology Germany II GmbH & Co. KG, a subsidiary of Microchip Technology Inc., in other countries. All other trademarks mentioned herein are property of their respective companies. © 2023, Microchip Technology Incorporated and its subsidiaries. All Rights Reserved. ISBN: 978-1-6683-2640-4 #### Quality Management System (Ask a Question) For information regarding Microchip's Quality Management Systems, please visit www.microchip.com/quality. # **Worldwide Sales and Service** | AMERICAS | ASIA/PACIFIC | ASIA/PACIFIC | EUROPE | |---------------------------|-----------------------|-------------------------|-----------------------| | Corporate Office | Australia - Sydney | India - Bangalore | Austria - Wels | | 2355 West Chandler Blvd. | Tel: 61-2-9868-6733 | Tel: 91-80-3090-4444 | Tel: 43-7242-2244-39 | | Chandler, AZ 85224-6199 | China - Beijing | India - New Delhi | Fax: 43-7242-2244-393 | | el: 480-792-7200 | Tel: 86-10-8569-7000 | Tel: 91-11-4160-8631 | Denmark - Copenhagen | | ax: 480-792-7277 | China - Chengdu | India - Pune | Tel: 45-4485-5910 | | echnical Support: | Tel: 86-28-8665-5511 | Tel: 91-20-4121-0141 | Fax: 45-4485-2829 | | www.microchip.com/support | China - Chongqing | Japan - Osaka | Finland - Espoo | | Veb Address: | Tel: 86-23-8980-9588 | Tel: 81-6-6152-7160 | Tel: 358-9-4520-820 | | www.microchip.com | China - Dongguan | Japan - Tokyo | France - Paris | | Atlanta | Tel: 86-769-8702-9880 | Tel: 81-3-6880- 3770 | Tel: 33-1-69-53-63-20 | | ouluth, GA | China - Guangzhou | Korea - Daegu | Fax: 33-1-69-30-90-79 | | el: 678-957-9614 | Tel: 86-20-8755-8029 | Tel: 82-53-744-4301 | Germany - Garching | | ax: 678-957-1455 | China - Hangzhou | Korea - Seoul | Tel: 49-8931-9700 | | ustin, TX | Tel: 86-571-8792-8115 | Tel: 82-2-554-7200 | Germany - Haan | | el: 512-257-3370 | China - Hong Kong SAR | Malaysia - Kuala Lumpur | Tel: 49-2129-3766400 | | Soston | Tel: 852-2943-5100 | Tel: 60-3-7651-7906 | Germany - Heilbronn | | Vestborough, MA | China - Nanjing | Malaysia - Penang | Tel: 49-7131-72400 | | el: 774-760-0087 | Tel: 86-25-8473-2460 | Tel: 60-4-227-8870 | Germany - Karlsruhe | | ax: 774-760-0088 | China - Qingdao | Philippines - Manila | Tel: 49-721-625370 | | Chicago | Tel: 86-532-8502-7355 | Tel: 63-2-634-9065 | Germany - Munich | | tasca, IL | China - Shanghai | Singapore | Tel: 49-89-627-144-0 | | el: 630-285-0071 | Tel: 86-21-3326-8000 | Tel: 65-6334-8870 | Fax: 49-89-627-144-44 | | ax: 630-285-0075 | China - Shenyang | Taiwan - Hsin Chu | Germany - Rosenheim | | Pallas | Tel: 86-24-2334-2829 | Tel: 886-3-577-8366 | Tel: 49-8031-354-560 | | ddison, TX | China - Shenzhen | Taiwan - Kaohsiung | Israel - Ra'anana | | el: 972-818-7423 | Tel: 86-755-8864-2200 | Tel: 886-7-213-7830 | Tel: 972-9-744-7705 | | ax: 972-818-2924 | China - Suzhou | Taiwan - Taipei | Italy - Milan | | Petroit | Tel: 86-186-6233-1526 | Tel: 886-2-2508-8600 | Tel: 39-0331-742611 | | lovi, MI | China - Wuhan | Thailand - Bangkok | Fax: 39-0331-466781 | | el: 248-848-4000 | Tel: 86-27-5980-5300 | Tel: 66-2-694-1351 | Italy - Padova | | louston, TX | China - Xian | Vietnam - Ho Chi Minh | Tel: 39-049-7625286 | | el: 281-894-5983 | Tel: 86-29-8833-7252 | Tel: 84-28-5448-2100 | Netherlands - Drunen | | ndianapolis | China - Xiamen | | Tel: 31-416-690399 | | loblesville, IN | Tel: 86-592-2388138 | | Fax: 31-416-690340 | | el: 317-773-8323 | China - Zhuhai | | Norway - Trondheim | | ax: 317-773-5453 | Tel: 86-756-3210040 | | Tel: 47-72884388 | | el: 317-536-2380 | | | Poland - Warsaw | | os Angeles | | | Tel: 48-22-3325737 | | lission Viejo, CA | | | Romania - Bucharest | | el: 949-462-9523 | | | Tel: 40-21-407-87-50 | | ax: 949-462-9608 | | | Spain - Madrid | | el: 951-273-7800 | | | Tel: 34-91-708-08-90 | | aleigh, NC | | | Fax: 34-91-708-08-91 | | el: 919-844-7510 | | | Sweden - Gothenberg | | lew York, NY | | | Tel: 46-31-704-60-40 | | el: 631-435-6000 | | | Sweden - Stockholm | | an Jose, CA | | | Tel: 46-8-5090-4654 | | el: 408-735-9110 | | | UK - Wokingham | | el: 408-436-4270 | | | Tel: 44-118-921-5800 | | anada - Toronto | | | Fax: 44-118-921-5820 | | el: 905-695-1980 | | | | | ax: 905-695-2078 | | | |