Zynq UltraScale+ MPSoC VCU TRD 2020.1 - PL DDR HDMI Video Capture and Display

Zynq UltraScale+ MPSoC VCU TRD 2020.1 - PL DDR HDMI Video Capture and Display

This page provides all the information related to Design Module 10 - VCU TRD PL DDR HDMI design.

Table of Contents

1 Overview

This module enables the capture of video from an HDMI-Rx Subsystem implemented in the PL. The video can be displayed through HDMI-Tx through the PL and recorded in SD cards or USB/SATA drives. The module can Stream-in or Stream-out encoded data through an Ethernet interface. This module supports single-stream and multi-stream for XV20 format. It also supports DCI 4k (4096 x 2160) resolution at 60 FPS.

This is the new design approach proposed to use PL_DDR for decoding and PS_DDR for encoding so that DDR bandwidth would be enough to support high bandwidth VCU applications requiring simultaneous encoder and decoder operations and transcoding at 4k@60 FPS. This approach makes the most effective use of limited AXI4 read/write issuance capability in minimizing latency for the decoder. DMA buffer sharing requirements determine how capture, display, and intermediate processing stages should be mapped to the PS or PL DDR.

This design supports the following video interfaces:
Sources:

  • HDMI-Rx capture pipeline implemented in the PL.

  • File source (SD card, USB storage, SATA hard disk).

  • Stream-In from network or internet.

Sinks:

  • HDMI-Tx display pipeline implemented in the PL.

VCU Codec:

  • Video Encode/Decode capability using VCU hard block in PL 

    • AVC/HEVC encoding

    • Encoder/decoder parameter configuration.

Video format:

  • XV20

Supported Resolution:

The table below provides the supported resolution from the command line app only in this design.

Resolution

Command Line

Single Stream

Multi-stream

DCI-4kp60

NA

4kp60

NA

4kp30

√ (Max 2)

1080p60

√ (Max 4)

√ - Supported
NA – Not applicable
x – Not supported

The below table gives information about the features supported in this design. 

Pipeline

Input source

Format

Output Type

Resolution

VCU codec

Pipeline

Input source

Format

Output Type

Resolution

VCU codec

Serial pipeline

HDMI-Rx

XV20

HDMI-Tx

DCI-4kp60/4kp60/4kp30/1080p60

HEVC/AVC

Record/Stream-Out pipeline

HDMI-Rx

XV20

File Sink/ Stream-Out

DCI-4kp60/4kp60/4kp30/1080p60

HEVC/AVC

File/Streaming Playback pipeline

File Source/ Stream-In

XV20

HDMI-Tx

DCI-4kp60/4kp60/4kp30/1080p60

HEVC/AVC


The below figure shows the PL DDR HDMI design hardware block diagram.

 

The below figure shows the PL DDR HDMI design software block diagram.

1.1 Board Setup

Refer below link for Board Setup

1.2 Run Flow

The TRD package is released with the source code, Vivado project, Petalinux BSP, and SD card image that enables the user to run the demonstration. It also includes the binaries necessary to configure and boot the ZCU106 board. Prior to running the steps mentioned in this wiki page, download the TRD package and extract its contents to a directory referred to as TRD_HOME which is the home directory.

TRD package contents are placed in the following directory structure. The user needs to copy all the files from the $TRD_HOME/images/vcu_hdmi_multistream_xv20/ to FAT32 formatted SD card directory.

rdf0428-zcu106-vcu-trd-2020-1 ├── apu │   └── vcu_petalinux_bsp ├── images │   ├── vcu_10g │   ├── vcu_audio │   ├── vcu_hdmi_multistream_xv20 │   ├── vcu_hdmi_rx │   ├── vcu_hdmi_tx │   ├── vcu_llp2_hdmi_nv12 │   ├── vcu_llp2_hdmi_nv16 │   ├── vcu_llp2_hdmi_xv20 │   ├── vcu_llp2_sdi_xv20 │   ├── vcu_multistream_nv12 │   ├── vcu_pcie │   ├── vcu_sdirx │   ├── vcu_sditx │   └── vcu_sdi_xv20 ├── pcie_host_package │   ├── COPYING │   ├── include │   ├── libxdma │   ├── LICENSE │   ├── readme.txt │   ├── RELEASE │   ├── tests │   ├── tools │   └── xdma ├── pl │   ├── constrs │   ├── designs │   ├── prebuild │   ├── README.md │   └── srcs └── README.txt

TRD package contents specific to VCU PL DDR HDMI design are placed in the following directory structure.

rdf0428-zcu106-vcu-trd-2020-1 ├── apu │   └── vcu_petalinux_bsp │   └── xilinx-vcu-zcu106-v2020.1-final.bsp ├── images │   ├── vcu_hdmi_multistream_xv20 │   │   ├── autostart.sh │   │   ├── BOOT.BIN │   │   ├── boot.scr │   │   ├── config │   │   ├── image.ub │   │   ├── system.dtb │   │   └── vcu ├── pcie_host_package ├── pl │   ├── constrs │   ├── designs │   │   ├── zcu106_plddr_hdmi │   ├── prebuild │   │   ├── zcu106_plddr_hdmi │   ├── README.md │   └── srcs │   ├── hdl │   └── ip └── README.txt

Configuration files(input.cfg) for various resolutions are placed in the following directory structure in /media/card.

config ├── 2-4kp30 │   ├── Display │   ├── Record │   ├── Stream-in │   └── Stream-out ├── 4-1080p60 │   ├── Display │   ├── Record │   ├── Stream-in │   └── Stream-out ├── 4kp60 │   ├── Display │   ├── Record │   ├── Stream-in │   └── Stream-out ├── DCI-4kp60 │   ├── Display │   ├── Record │   ├── Stream-in │   └── Stream-out └── input.cfg

1.2.1 GStreamer Application (vcu_gst_app)

The vcu_gst_app is a command-line multi-threaded Linux application. The command-line application requires an input configuration file (input.cfg) to be provided in plain text.

Run below modetest command to set CRTC configurations for DCI-4kp60:

$ modetest -D a00c0000.v_mix -s 40:4096x2160-60@BG24

Run below modetest command to set CRTC configurations for 4kp60:

$ modetest -D a00c0000.v_mix -s 40:3840x2160-60@BG24

Run below modetest command to set CRTC configurations for 4kp30:

$ modetest -D a00c0000.v_mix -s 40:3840x2160-30@BG24

Execution of the application is shown below:

$ vcu_gst_app < path to *.cfg file>

Example:

4kp60 XV20 HEVC_HIGH Display Pipeline execution

$ vcu_gst_app /media/card/config/4kp60/Display/Single_4kp60_HEVC_HIGH.cfg

4kp60 XV20 HEVC_HIGH Record Pipeline execution

$ vcu_gst_app /media/card/config/4kp60/Record/Single_4kp60_HEVC_HIGH.cfg

4kp60 XV20 HEVC_HIGH Stream-out Pipeline execution

$ vcu_gst_app /media/card/config/4kp60/Stream-out/Single_4kp60_HEVC_HIGH.cfg

4kp60 XV20 HEVC_HIGH Stream-in Pipeline execution

$ vcu_gst_app /media/card/config/4kp60/Stream-in/input.cfg

Make sure HDMI-Rx should be configured to 4kp60 mode

To measure the latency of the pipeline, run the below command. The latency data is huge, so dump it to a file.

$ GST_DEBUG="GST_TRACER:7" GST_TRACERS="latency" GST_DEBUG_FILE=/run/latency.log vcu_gst_app /media/card/config/input.cfg

Refer below link for detailed run flow steps

1.3 Build Flow

Refer below link for detailed build flow steps


2 Other Information

2.1 Known Issues

2.2 Limitations

2.3 Optimum VCU Encoder parameters for use-cases.

Video streaming:

  • Video streaming use-case requires a very stable bitrate graph for all pictures.

  • It is good to avoid periodic large Intra pictures during the encoding session

  • Low-latency rate control (hardware RC) is the preferred control-rate for video streaming, it tries to maintain equal amount frame sizes for all pictures.

  • Good to avoid periodic Intra frames instead use low-delay-p (IPPPPP…)

  • VBR is not a preferred mode of streaming.

Performance: AVC Encoder settings:

  • It is preferred to use 8 or higher slices for better AVC encoder performance.

  • AVC standard does not support Tile mode processing which results in the processing of MB rows sequentially for entropy coding.

Quality: Low bitrate AVC encoding:

  • Enable profile=high and use qp-mode=auto for low-bitrate encoding use-cases.

  • The high profile enables 8x8 transform which results in better video quality at low bitrates.

2.4 Max Bit-rate Benchmarking

The following tables summarize the maximum bit rate achievable for 3840x2610p60 resolution, XV20 pixel format at GStreamer level. The maximum supported target bit rate values vary based on what elements and type of input used in the pipeline.

Maximum Bit Rate support for Record Use case with 4kp60 resolution

The table below provides Encoder Maximum Bit Rate Tests with XV20 format and Variable Rate Control Mode.

Video Recording ( Live video capture → VCU encoder → parser → muxer → filesink )

Format 

Codec

Entropy Mode

Rate Control Mode

B-Frames = 4

DDR Mode

Max Target Bitrate



4:2:2, 10 bit


H.264 (AVC)

CABAC

VBR

IBBBBP

PS-DDR

160 Mb/s

CAVLC

VBR

IBBBBP

PS-DDR

160 Mb/s

H.265 (HEVC)

-

VBR

IBBBBP

PS-DDR

267 Mb/s

Example pipeline used for measurement:

  • Run the following gst-launch-1.0 command to record the XV20 video using the GStreamer pipeline. where, video0 indicates a video node for the input source.

$ ENC_EXTRA_IP_BUFFERS=5 gst-launch-1.0 -e v4l2src device=/dev/video0 io-mode=4 num-buffers=7320 ! video/x-raw, width=3840, height=2160, format=NV16_10LE32, framerate=60/1 ! omxh265enc control-rate=variable target-bitrate=267000 max-bitrate=320400 gop-mode=basic gop-length=60 b-frames=4 num-slices=8 prefetch-buffer=TRUE cpb-size=1000 initial-delay=500 ! video/x-h265 , profile=main-422-10 ! queue max-size-bytes=0 ! h265parse ! mp4mux ! fpsdisplaysink name=fps1sink text-overlay=false fps-update-interval=1000 video-sink="filesink location=/run/test_4kp60_xv20_hevc_267000.mp4"
  • It is recommended to store output file in RAM or SATA

  • Achieved target bit rate depends on complexity of video content

  • Above data is captured by recording a file in mp4 container format

Maximum Bit Rate support for Playback Use case with 4kp60 resolution

The table below provides Decoder Maximum Bit Rate Tests with XV20 format and Variable Rate Control Mode.

Playback ( Filesrc → Decoder → Display )

Format 

Codec

Entropy Mode

Rate Control Mode

B-Frames = 4

DDR Mode

Max Target Bitrate



4:2:2, 10 bit


H.264 (AVC)

CABAC

VBR

IBBBBP

PL_DDR

120 Mb/s

CAVLC

VBR

IBBBBP

PL_DDR

160 Mb/s

H.265 (HEVC)

-

VBR

IBBBBP

PL_DDR

267 Mb/s

Example pipeline used for measurement:

  • Run the following gst-launch-1.0 command to play XV20 recorded file on HDMI-Tx using the GStreamer pipeline. where, a00c0000.v_mix indicates bus-id for DRM mixer.

$ gst-launch-1.0 filesrc location=/run/test_4kp60_xv20_hevc_267000.mp4 ! qtdemux ! h265parse ! omxh265dec internal-entropy-buffers=5 ! queue max-size-bytes=0 ! fpsdisplaysink name=fps1sink text-overlay=false video-sink="kmssink bus-id="a00c0000.v_mix" fullscreen-overlay=1" -v

It is recommended to keep input file in RAM or SATA

Maximum Bit Rate support for Streaming Use case with 4kp60 resolution.

The table below provides Encoder/Decoder Maximum Bit Rate Tests with XV20 format (For Streaming).

Video Streaming ( Live video capture → VCU encoder → Parser → rtppay → Stream-out
Stream-in → rtpdepay → Decoder → Display )

Format 

Codec

Rate Control Mode

Latency Mode

B-Frames = 0

DDR Mode

Max Target Bitrate







4:2:2, 10 bit




 


H.264 (AVC)


LOW_LATENCY

Normal







IPPP









Encoder (PS_DDR),
Decoder (PL_DDR)



90 Mb/s

Reduced

90 Mb/s


CBR  + max-picture-size

Normal

90 Mb/s

Reduced

90 Mb/s



H.265 (HEVC)




LOW_LATENCY

Normal

130 Mb/s

Reduced

130 Mb/s


CBR  + max-picture-size

Normal

110 Mb/s

Reduced

110 Mb/s

Example pipeline used for measurement:

H.265 (HEVC) with Low-latency rate control mode and Normal latency-mode

  • Run the following gst-launch-1.0 command to stream-out the XV20 video using the GStreamer pipeline. where, video0 indicates a video node for the input source.

$ gst-launch-1.0 -v v4l2src device=/dev/video0 io-mode=4 ! video/x-raw,format=NV16_10LE32, width=3840, height=2160, framerate=60/1 ! queue ! omxh265enc num-slices=8 gop-length=120 periodicity-idr=120 control-rate=low-latency prefetch-buffer=true target-bitrate=130000 gop-mode=low-delay-p ! video/x-h265, alignment=au ! rtph265pay ! udpsink buffer-size=60000000 host=192.168.25.89 port=5004 max-lateness=-1 qos-dscp=60 async=false max-bitrate=120000000 -v
  • Run the following gst-launch-1.0 command to display XV20 stream-in video on HDMI-Tx using the GStreamer pipeline where 5004 is port number.

$ gst-launch-1.0 udpsrc port=5004 buffer-size=60000000 caps="application/x-rtp, media=video, clock-rate=90000, payload=96, encoding-name=H265" ! rtpjitterbuffer latency=1000 ! rtph265depay ! h265parse ! video/x-h265, alignment=au ! omxh265dec low-latency=0 ! queue max-size-bytes=0 ! fpsdisplaysink name=fpssink text-overlay=false video-sink="kmssink bus-id=a00c0000.v_mix fullscreen-overlay=1" sync=true -v

H.265 (HEVC) with CBR rate control mode and Normal latency mode

  • Run the following gst-launch-1.0 command to stream-out the XV20 video using the GStreamer pipeline. Where video0 indicates a video node for the input source.

$ gst-launch-1.0 -v v4l2src device=/dev/video0 io-mode=4 ! video/x-raw,format=NV16_10LE32,width=3840,height=2160,framerate=60/1 ! queue ! omxh265enc num-slices=8 gop-length=120 periodicity-idr=120 control-rate=constant max-picture-size=2017 prefetch-buffer=true target-bitrate=110000 gop-mode=low-delay-p ! video/x-h265, alignment=au ! rtph265pay ! udpsink buffer-size=60000000 host=192.168.25.89 port=5004 max-lateness=-1 qos-dscp=60 async=false max-bitrate=120000000 -v
  • Run the following gst-launch-1.0 command to display XV20 stream-in video on HDMI-Tx using the GStreamer pipeline where 5004 is port number.

$ gst-launch-1.0 udpsrc port=5004 buffer-size=60000000 caps="application/x-rtp, media=video, clock-rate=90000, payload=96, encoding-name=H265" ! rtpjitterbuffer latency=1000 ! rtph265depay ! h265parse ! video/x-h265, alignment=au ! omxh265dec low-latency=0 ! queue max-size-bytes=0 ! fpsdisplaysink name=fpssink text-overlay=false video-sink="kmssink bus-id=a00c0000.v_mix fullscreen-overlay=1" sync=true -v
  • Above data is captured by streaming elementary stream over RTP.

  • 192.168.25.89 is host/client IP address and 5004 is port number.

Maximum Bit Rate support for Serial Use case with 4kp60 resolution.

The table below provides Encoder/Decoder Maximum Bit Rate Tests with XV20 format.

Serial ( Live video capture → VCU encoder → VCU decoder → Display )

Format 

Codec

Rate Control Mode

Latency Mode

B-Frames = 0 or 4

DDR Mode

Max Target Bitrate




4:2:2, 10 bit


H.264 (AVC)


CBR  + max-picture-size

Normal

IBBBBP



Encoder (PS_DDR),
Decoder (PL_DDR)

90 Mb/s

Reduced

IPPP

200 Mb/s


H.265 (HEVC)


CBR  + max-picture-size

Normal

IBBBBP

120 Mb/s

Reduced

IPPP

200 Mb/s

Example pipeline used for measurement:

H.265 (HEVC) with CBR rate control mode and Normal latency mode

  • Run the following gst-launch-1.0 command to display the XV20 video on HDMI-Tx using the GStreamer pipeline (capture → encode → decode → display). where, video0 indicates a video node for the input source.

$ gst-launch-1.0 -v v4l2src device=/dev/video0 io-mode=4 ! video/x-raw, format=NV16_10LE32, width=3840, height=2160, framerate=60/1 ! omxh265enc num-slices=8 gop-length=60 b-frames=4 control-rate=2 prefetch-buffer=true target-bitrate=120000 max-picture-size=2200 ! video/x-h265, alignment=au ! queue ! video/x-h265, profile=main-422-10, alignment=au ! omxh265dec low-latency=0 ! queue max-size-bytes=0 ! fpsdisplaysink name=fpssink text-overlay=false "video-sink=kmssink bus-id=a00c0000.v_mix fullscreen-overlay=1" sync=true -v

3 Appendix A - Input Configuration File (input.cfg)

The example configuration files are stored at /media/card/config/ folder.

Common Configuration:
It is the starting point of common configuration.
 
Num of Input:
1,2,3 or 4

Output:
Select the video interface.
Options: HDMI

Out Type:
Options: display, record, and stream

Display Rate:
Pipeline frame rate.
Options: 30 FPS or 60 FPS for each stream

Exit:
It indicates to the application that the configuration is over

Input Configuration:
It is the starting point of the input configuration.

Input Num:
Starting Nth input configuration.
Options: 1, 2, 3, or 4

Input Type:
Input source type.
Options: HDMI, File, Stream

Uri:
File path or Network URL. Applicable for file playback and stream-in pipeline only. Supported file formats for playback are ts, mp4, and mkv.
Options: file:///media/usb/abc.ts (for file path), udp://192.168.25.89:5004/ (for Network streaming, Here 192.168.25.89 is IP address and 5004 is port no)

Raw:
To tell the pipeline is processed or pass-through.
Options: False

The raw use-case is not supported with this design as mixer is not connected to PS DDR.


Width:
The width of the live source.
Options: 4096, 3840, 1920Height:The height of the live source.Options: 2160, 1080

© 2025 Advanced Micro Devices, Inc. Privacy Policy