

# Integrating IpGBT links into the Common Readout Units (CRU) of the ALICE Experiment

Erno DAVID on behalf of the ALICE collaboration Wigner Research Centre for Physics, HUN-REN

This work was supported by OTKA K-135515 and NKFI 2021-4.1.2-NEMZ\_KI-2022-00009 grants

> 3 Oct 2023 TWEPP 2023, Geremeas, Sardinia, Italy



## IpGBT Coming in ALICE...

- During the LS2 upgrade a brand new DAQ and trigger system had been developed for ALICE for Run 3 and Run 4
- The upgrading sub-detectors are now connected to the DAQ and Trigger systems with rad-hard GBT links through the CRU
- This enables the delivery of timing & control with deterministic latency and taking of data through a single fiber connection
- The GBTx ASIC is not available any more and the new lpGBT supersedes it for new developments or system additions



- During the coming LS3 upgrades, the new FE systems (e.g. ITS3, FoCal) will (have to) use lpGBT links to connect to the CRUs
- The lpGBT links have to be integrated into the existing CRU FW while keeping the compatibility with the existing O2, TRIGGER, and DCS systems

#### Main features of the present GBT links

- 4.8 Gb/s downlink
- 4.8 Gb/s uplink
- Front-end components:
  - GBTx ASIC
  - external slow-control (I2C, SPI, etc.) controller ASIC (SCA)
  - Versatile Link (VL) optical components

#### Main features of the new IpGBT with VTRX+

- 2.56 Gb/s downlink,
- 5.12/10.24 Gb/s uplink
- Front-end components:
  - IpGBT ASIC
  - internal slow-control controllers (I2C, SPI, GPIO, ADC, etc.) (and optional external SCA)
  - VL+ optical components



3 Oct 2023 TWEPP 2023

VS.



The goal of this *IpGBT in CRU* project is a *feasibility study* with the following consecutive steps:

- 1. Implementing the *IpGBT-FPGA IP* developed by the CERN EP-ESE team in the CRU *Arria10 GX FPGA* in a single-link **standalone design** and test the functionality at the lowest level (links only with pattern generators / checkers)
- **2. Integrating** the *lpGBT-FPGA IP* in the common CRU FW by removing the GBT interfaces and
  - adding 1 lpGBT interface with full clock & trigger integration
  - adding 12 IpGBT interface (2 TRX banks) with full clock & trigger integration
  - optional: adding 24 IpGBT interface with full clock, trigger, and data integration





 The aim is not a production firmware, but to study and demonstrate that the replacement of the GBT links with IpGBT links in the CRU is possible while keeping the compatibility with the connected systems (O2, TTS, DCS)



### Test Setup (simplified)

The setup is the same at Wigner RCP and in the CERN CRU-TRIG integration lab





### GBT / IpGBT Differences

| DOWNLINK                                                                               |                               | 4.8 Gb/s (120 bits @ 40 MHz) |    |    |                              |           |     |              |
|----------------------------------------------------------------------------------------|-------------------------------|------------------------------|----|----|------------------------------|-----------|-----|--------------|
| Header, internal and external control channels, data channel, forward error correction | GBT downlink                  | 4b                           | 2b | 2b | 80b                          |           | 32b | <b>ALTCE</b> |
|                                                                                        | (CRO FILL) GBT<br>frames:     | Header                       | IC | EC | User Data                    | User Data |     | HLICL        |
| differences:                                                                           |                               |                              |    |    |                              |           |     |              |
| <ul> <li>64-bit @ 40 MHz vs 120-bit @ 40 MHz</li> </ul>                                |                               |                              |    |    | 2.56 Gb/s (64 bits @ 40 MHz) |           |     |              |
| <ul> <li>32-bit payload vs 80-bit payload</li> </ul>                                   | IpGBT downlink                | 4b                           | 2b | 2b | 32b                          |           |     | _            |
| <ul> <li>2.56 Gb/s vs 4.8 Gb/s</li> </ul>                                              | (CRU -> FEE) IpGBT<br>frames: | Header                       | IC | EC | User Data                    | F         | EC  |              |

• TX parallel clock 320 MHz vs 240 MHz

| UPLINK<br>Header, internal and external control channels,<br>data channel, optional forward error correction |                                                  | GBT uplink   |                  | 4b     | 2b                             | 2b | 4.8 Gb/s (120 bits @ 40 MHz)<br>80b | 32b          |  |  |
|--------------------------------------------------------------------------------------------------------------|--------------------------------------------------|--------------|------------------|--------|--------------------------------|----|-------------------------------------|--------------|--|--|
|                                                                                                              |                                                  | (FEE -> CRU) | GBT<br>frames:   | Header | IC                             | EC | User Data                           | FEC<br>/DATA |  |  |
| diff                                                                                                         | erences:                                         |              |                  |        |                                |    |                                     |              |  |  |
| •                                                                                                            | 128/256-bit @ 40 MHz vs 120-bit @ 40 MHz         |              |                  |        |                                |    | 5.12 Gb/s (128 bits @ 40 MHz)       |              |  |  |
| •                                                                                                            | 96/112/192/224-bit payload vs 80/112-bit payload | IpGBT uplink |                  | 2b     | 2b                             | 2b | 96 / 112b                           | 5/12b        |  |  |
| •                                                                                                            | 5.12 Gb/s or 10.24 Gb/s vs 4.8 Gb/s              |              | IpGBT<br>frames: | Header | IC                             | EC | User Data                           | FEC          |  |  |
| •                                                                                                            | RX parallel clock 320 MHz vs 240 MHz             |              |                  |        |                                |    |                                     |              |  |  |
|                                                                                                              |                                                  |              |                  | 4      | 10.24 Gb/s (256 bits @ 40 MHz) |    |                                     |              |  |  |
|                                                                                                              |                                                  | IpGBT uplink |                  | 2b     | 2b                             | 2b | 192 / 224b                          | 5/12b        |  |  |
|                                                                                                              |                                                  | (FEE -> CRU) | lpGBT<br>frames: | Header | IC                             | EC | User Data                           | FEC          |  |  |



### ALICE Common Read-out Unit (CRU)



## The Common Read-out Units (CRU) are PCIe add-on cards installed in the First Level Processor (FLP) nodes of the ALICE DAQ system. Main tasks of the CRU:

- Deliver the trigger, timing and read-out control information to the Front-End Electronics
- Deliver detector data to the O2 (FLP Servers) with and/or without processing in the CRU FPGA
- Transport detector control information between the DCS and the FEE
- Take part of the Busy / Drop / Throttle mechanism of the detectors read-out







### CRU – Clock and Trigger (GBT version)



#### **GBT CRU**







#### **GBT CRU**



![](_page_9_Picture_3.jpeg)

#### **GBT CRU**

![](_page_10_Figure_2.jpeg)

#### solution (concept)

We have to go down to the common 40 MHz clock domain and align data at both sides to the common 40 MHz clock!

![](_page_10_Picture_5.jpeg)

#### **GBT CRU**

![](_page_11_Figure_2.jpeg)

#### solution (concept)

We have to go down to the common 40 MHz clock domain and align data at both sides to the common 40 MHz clock!

![](_page_11_Figure_5.jpeg)

- The PON module recovers the 240 MHz reference clock and a data valid bit with deterministic constant delay to the 40 MHz LHC clock rising edge.
- The IOPLL recovers the 40 MHz dividing the 240 MHz with six. At the output of the IOPLL the two clocks can randomly have six different phase relations.
- If the 40 MHz rising edge is not aligned with the data valid bit then the Control FSM resets the IOPLL.
- Within a reasonable time (practically within a few tries) the data valid bit and the 40 MHz clock will be aligned.

![](_page_12_Figure_5.jpeg)

![](_page_12_Figure_6.jpeg)

![](_page_12_Figure_7.jpeg)

![](_page_12_Picture_12.jpeg)

## Aligning IpGBT Data Write Enable to the 40 MHz Clock

ALICE

- The IpGBT data write enable bit is generated in the 320 MHz clock domain with a 40 MHz rate.
- We sample this bit with our 40 MHz clock which is in sync with the 320 MHz clock.
- If it is not aligned with the 40 MHz clock then the Control FSM shifts the write enable bit (bit slip).
- In a maximum of eight steps the write enable bit will be aligned with the 40 MHz clock.

![](_page_13_Figure_6.jpeg)

![](_page_13_Figure_7.jpeg)

![](_page_13_Picture_8.jpeg)

### Clock and Trigger Integration / Jitter Cleaning

![](_page_14_Figure_1.jpeg)

### First Test Results (Downlink Clock and Trigger Delay Stability I.)

#### LHC clock & trigger delivery measurements: Delay and stability from LTU to FEE (VLDB+)

- IN1 (yellow): DATA VALID (1:6 tick): PON link output (CRU CLK-OUT SMA conn.)
- IN2 (green): VLDB+ 40 MHz: lpGBT link output (VLDB+ E0CLK\_P SMA conn.)
- TRIGGER: external, LTU 40 MHz output (SCOPEA conn.)

![](_page_15_Figure_5.jpeg)

![](_page_15_Picture_6.jpeg)

Visualizing the 40 MHz LHC clock, DATA VALID, and IpGBT TX WRITE\_ENABLE flags via the clock chain (1 or 12 links being implemented) with infinite persistence for a few hours while challenging the stability with: **optical link disconnections**, **power cycling**, and **PLL forced initializations** 

- DATA\_VALID (1 link, 12 links) → stable with the same delay (no glitches, phase jumps observed, continuous 10<sup>12</sup> clock cycle)
- TX Write\_Enable (1 link, 12 links) → stable with the same delay (no glitches, phase jumps observed, continuous 10<sup>12</sup> clock cycle)
- VLDB+ ECLK0 (1 link, 12 link, internal feeding) → stable with the same delay (no glitches, phase jumps observed, continuous 10<sup>12</sup> clock cycle)

![](_page_15_Picture_11.jpeg)

### First Test Results (Downlink Clock and Trigger Delay Stability II.)

Two options for the clock integration (external/internal) has been tested for delay stability and with simple jitter measurements. (1 link only.)

![](_page_16_Figure_2.jpeg)

IpGBT TX\_REF: externally distributed to the IpGBT transceiver banks through one of the CRU's two on-board jitter cleaner PLL. (The other filters the incoming 240 MHz PON RX\_CLK.)

![](_page_16_Figure_4.jpeg)

IpGBT TX\_REF: internally distributed to the IpGBT transceiver banks ("Only" the incoming 240 MHz PON RX\_CLK is cleaned with one of the CRU's two on-board jitter cleaner PLLs.)

No instabilities or significant difference in jitter has been observed. Further jitter analysis may follow.

![](_page_16_Picture_7.jpeg)

3 Oct 2023 TWEPP 2023

![](_page_17_Picture_1.jpeg)

| Link<br>ID                      | GBT Mode<br>Tx/Rx                                              | Loopback                                             | GBT MUX                                                                              | Datapath<br>mode                                                                        | Datapath<br>status                                                        | RX freq<br>(MHz)                                                   | TX freq<br>(MHz)                                                   | Status                                       | Optical<br>power (uW)                                                | System<br>ID                                                | FEE<br>ID                                            |
|---------------------------------|----------------------------------------------------------------|------------------------------------------------------|--------------------------------------------------------------------------------------|-----------------------------------------------------------------------------------------|---------------------------------------------------------------------------|--------------------------------------------------------------------|--------------------------------------------------------------------|----------------------------------------------|----------------------------------------------------------------------|-------------------------------------------------------------|------------------------------------------------------|
| 0<br>1<br>2<br>3<br>4<br>5<br>6 | GBT/GBT<br>GBT/GBT<br>GBT/GBT<br>GBT/GBT<br>GBT/GBT<br>GBT/GBT | None<br>None<br>None<br>None<br>None<br>None<br>None | TTC:CTP<br>TTC:CTP<br>TTC:CTP<br>TTC:CTP<br>TTC:CTP<br>TTC:CTP<br>TTC:CTP<br>TTC:CTP | Streaming<br>Streaming<br>Streaming<br>Streaming<br>Streaming<br>Streaming<br>Streaming | Enabled<br>Enabled<br>Enabled<br>Enabled<br>Enabled<br>Enabled<br>Enabled | 259.55<br>320.44<br>321.17<br>320.02<br>317.92<br>313.63<br>320.63 | 320.63<br>320.63<br>320.63<br>320.63<br>320.63<br>320.63<br>320.63 | DOWN<br>DOWN<br>DOWN<br>DOWN<br>DOWN<br>DOWN | 0.0<br>0.0<br>0.0<br>0.0<br>0.0<br>0.0<br>0.0<br>0.0<br>0.0<br>900.0 | 0x3<br>0x3<br>0x3<br>0x3<br>0x3<br>0x3<br>0x3<br>0x3<br>0x0 | 0x0<br>0x0<br>0x0<br>0x0<br>0x0<br>0x0<br>0x0<br>0x0 |
| 7<br>8<br>9<br>10<br>11         | GBT/GBT<br>GBT/GBT<br>GBT/GBT<br>GBT/GBT<br>GBT/GBT            | None<br>None<br>None<br>None<br>None                 | TTC:CTP<br>TTC:CTP<br>TTC:CTP<br>TTC:CTP<br>TTC:CTP                                  | Streaming<br>Streaming<br>Streaming<br>Streaming<br>Streaming                           | Enabled<br>Enabled<br>Enabled<br>Enabled<br>Enabled                       | 319.88<br>298.17<br>254.86<br>257.52<br>249.85                     | 320.63<br>320.63<br>320.63<br>320.63<br>320.63<br>320.63           | DOWN<br>DOWN<br>DOWN<br>DOWN<br>DOWN         | 0.0<br>0.0<br>0.0<br>0.0<br>0.0                                      | 0x0<br>0x0<br>0x0<br>0x0<br>0x0<br>0x0                      | 0x0<br>0x0<br>0x0<br>0x0<br>0x0<br>0x0               |

#### Test results:

- The x12 link lpGBT module is recognized by the O2 software (link status, TX/RX frequency, etc.)
- The IpGBT ASIC "Loopback Downlink Group Data Source" mode was used with built in checkers in the IpGBT module
- The links were tested one-by-one with a single VLDB+ for 1 hour / link (10<sup>14</sup> bits)  $\rightarrow$  no errors
- Integration with the Datapath wrapper is ongoing

#### Summary:

- x12 link lpGBT module (with 10.24 Gbps / FEC12 uplink mode) implemented inside the CRU-FW
- No visible phase jumps in the VLDB+ 40 MHz clock (10<sup>12</sup> clock cycle)
- Stable data loopback through the IpGBT ASIC ("Loopback Downlink Group Data Source" mode) (10<sup>14</sup> bits per link)

#### Further work:

- Connect the IpGBT module with the DMA and Slow Control modules
- Improve stability between rebuilt: floor planning and design lock
- Characterize skew between links
- Clock and trigger analysis
- Test with detector specific user logic at full 10.24 Gbps speed

![](_page_18_Picture_11.jpeg)

![](_page_19_Picture_0.jpeg)

# **Thank You for Your Attention!**

![](_page_19_Picture_2.jpeg)

![](_page_20_Picture_0.jpeg)

### BACK-UP

![](_page_20_Picture_2.jpeg)

![](_page_21_Figure_1.jpeg)

![](_page_21_Picture_2.jpeg)

![](_page_21_Picture_7.jpeg)

![](_page_22_Figure_1.jpeg)

![](_page_22_Picture_2.jpeg)

![](_page_22_Picture_7.jpeg)

### Clock and Trigger Integration / Jitter Cleaning (Option B, alternative)

![](_page_23_Figure_1.jpeg)

### **Uplink Integration**

![](_page_24_Figure_1.jpeg)

ALICE

![](_page_25_Picture_1.jpeg)

#### For the uplinks testing we have the following options:

- Use the built-in pattern generators in the IpGBT ASIC and check the received payload in the IpGBT module.
- Use the CRU firmware's Datapath Wrapper which is able to record the incoming raw data. Currently it supports normal GBT (80-bit) and wide bus mode (112-bit) mode, so it can check only part of the IpGBT data
- Later: write a dedicated user logic to process the full IpGBT payload and creates the standard ALICE data packets (SAP) and sends them to the PCIe DMA (through the Datapath Wrapper).

## For our uplinks testing we have used the built in pattern generators and checkers with x12 lane 10.24 Gbps (FEC12) links. Test results so far:

- We have tested 1 standalone link for a few hours → no errors
- We have tested a 12-link implementation, the links one-by-one with a single VLDB+ for 1 hour /link → no errors
- Integration with the Datapath wrapper is ongoing (may evolve till the conference)

### Testing the IpGBT Control Channels

#### • The IpGBT ASIC supports both EC and IC 2-bit control channels:

- EC channel is backward compatible. It can connected to an GBT SCA on the FEE cards
- IC channel requires upgrade to a newer GBT-SC firmware module provided by CERN Electronics group

![](_page_26_Figure_4.jpeg)

- For control channel testing we can use the low level IC and EC field tester Python scripts used during the original CRU FW development. We can read and write the ASICs registers with them in the same way.
- To do this we have started to update the CRU FW with the new slow control module (GBT-SC IP) of the EP\_ESE team and modify our software scripts to support the new IpGBT ASIC control registers (IC).
- The aim is to test reading out a few registers only as a demonstration.

![](_page_26_Picture_8.jpeg)

![](_page_26_Picture_12.jpeg)

![](_page_26_Picture_13.jpeg)