Skip to main content

7.3.1 System Overview

Overview

The multimedia interface provided by D-Robotics includes system control, video input, video processing (ISP image processor, cropping, scaling, rotation, correction), H.264/H.265/JPEG/MJPEG encoding and decoding, video output display, etc. The interface is abstracted and encapsulated, which is a collection of low-level interfaces. These low-level interfaces support more flexible application development by better controlling the underlying hardware modules.

Multimedia System Architecture

The main internal processing flow of D-Robotics's multimedia processing is shown in the following figure, which is mainly divided into video input (VIN), video processing (VPS), video encoding (VENC), video decoding (VDEC), video output (VOT), region processing (REGION), AI algorithm inference (BPU), and other modules.

X3-ss_mm_system_topology

Terminology Conventions

AbbreviationFull NameExplanation
VINVideo INIncludes video processing access, image signal processor, distortion correction, and anti-shake processing. It receives and processes data from sensors, and can also directly receive image data from memory.
VPSVideo Process SystemIncludes image rotation, image cropping, and scaling functions, which can output images of different resolutions from the same input source. The input source can be the VIN module or image data in memory.
VENCVideo EncodeThe VENC encoding module supports H.264/H.265/JPEG/MJPEG encoding. The data processed by the VPS module can be encoded into a bitstream output according to different protocols by the encoding module.
VDECVideo DecodeThe VDEC decoding module supports H.264/H.265/JPEG/MJPEG decoding. It can decode the encoded bitstream and hand it over to the VPS module for further processing, and then output it to the VOT module for display.
VPUVideo Processing UnitVideo processing unit, responsible for video encoding and decoding functions.
JPUJPEG Processing UnitJPEG image processing unit, responsible for JPEG and MJPEG encoding and decoding functions.
VOTVideo OutputThe video output module receives image data from VPS and VDEC and can output to a display device.
VIOVideo IN/OUTVideo input and output, including VIN and VOT modules.
MIPIMobile Industry Processor InterfaceMobile industry processor interface.
CSICamera Serial InterfaceCamera serial interface. The CSI interface and the DSI interface belong to the same family and are interface specifications formulated by MIPI (Mobile Industry Processor Interface Alliance).
DVPDigital Video PortDigital video port.
SIFSensor InterfaceSensor interface used to receive MIPI, DVP, or image data from memory.
ISPImage Signal ProcessorImage signal processor, responsible for fine-tuning the image effect.
LDCLens Distortion CorrectionLens distortion correction.
DISDigital Image StabilizerDigital image stabilization.
DWEDewarp EngineDewarp engine, which integrates LDC and DIS, including LDC distortion correction and DIS statistical results.
IPUImage Process UnitImage signal processing unit, supports image rotation, image cropping, and scaling functions.
GDCGeometrical Distortion CorrectionGeometrical distortion correction.
PYMPyramidImage pyramid.
OSDOn Screen DisplayVideo image overlay display.
BPUBrain Process UnitProgrammable AI acceleration engine independently developed by D-Robotics.
HALHardware Abstraction LayerHardware abstraction layer.
FWFirmwareFirmware.
SensorSensorUnless otherwise specified, it refers to CMOS image sensors.