MIPI Camera Real-time Detection

Example Introduction

The MIPI camera real-time detection example is a Python interface development code example located in /app/pydev_demo/08_mipi_camera_sample. It demonstrates how to use the onboard MIPI camera for real-time object detection. This example uses the YOLOv5x object detection model to perform real-time inference on the video stream captured by the MIPI camera, and displays the detection results via HDMI, outputting bounding box information.

Included examples:

root@ubuntu:/app/pydev_demo/08_mipi_camera_sample$ tree
.
├── 01_mipi_camera_yolov5s.py
├── 02_mipi_camera_dump.py
├── 03_mipi_camera_scale.py
├── 04_mipi_camera_crop_scale.py
├── 05_mipi_camera_streamer.py
└── coco_classes.names

Effect Demonstration

01 Real-time Detection Effect

Visualization of Detection Results

To view the real-time camera feed and visualization of detection results on a display, you need to:

Connect an external display: Connect the development board to a monitor using an HDMI cable
Special handling for Desktop version: If using the Desktop version system, first execute the following command to stop the desktop service:
```
sudo systemctl stop lightdm
```
Remote connection: Connect to the board via SSH
Run the code: After executing the example program, you will see the real-time detection results on the connected display

output-img

02 Image Capture and Save Effect

After running, multiple YUV format image files will be saved in the same directory as the script, with a default resolution of 1920x1080.

output-img

03 Image Scaling Effect

After running, scaled YUV image files will be saved in the same directory as the script, with a default resolution of 640x360.

output-img

04 Image Cropping and Scaling Effect

After running, cropped and scaled YUV image files (NV12 format) will be saved in the same directory as the script. By default, the center of the image is cropped and scaled. Adjusting the cropping position yields the following YUV image.

output-img

05 Real-time Streaming Effect

After running, the camera feed is displayed in real-time on the HDMI screen (streaming test). Note that for the Desktop version, you need to first execute sudo systemctl stop lightdm to stop the desktop service.

output-img

Hardware Preparation

Hardware Connection

Prepare an RDK development board
Connect the officially adapted MIPI camera
Connect the monitor and development board via an HDMI cable
Connect the power cable and network cable

connect-img

Quick Start

Code and Board Location

The example files are located at /app/pydev_demo/08_mipi_camera_sample

Compilation and Execution

Python examples do not require compilation; they can be run directly:

Running 01_mipi_camera_yolov5s.py:

cd /app/pydev_demo/08_mipi_camera_sample
python 01_mipi_camera_yolov5s.py

Running 02_mipi_camera_dump.py:

cd /app/pydev_demo/08_mipi_camera_sample
python 02_mipi_camera_dump.py -f 30 -c 10 -w 1920 -h 1080

Running 03_mipi_camera_scale.py:

cd /app/pydev_demo/08_mipi_camera_sample

# This example requires input.yuv as input. Here we use output0.yuv from the previous example as input, execute the copy command
cp output0.yuv input.yuv

# Then run the example
python 03_mipi_camera_scale.py -i input.yuv -o output_640x360.yuv -w 640 -h 360 --iwidth 1920 --iheight 1080

Running 04_mipi_camera_crop_scale.py:

cd /app/pydev_demo/08_mipi_camera_sample
python 04_mipi_camera_crop_scale.py -i input.yuv -o output_640x480.yuv -w 640 -h 480 --iwidth 1920 --iheight 1080 -x 304 -y 304 --crop_w 896 --crop_h 592

Running 05_mipi_camera_streamer.py:

cd /app/pydev_demo/08_mipi_camera_sample
python 05_mipi_camera_streamer.py -w 1920 -h 1080

Detailed Introduction

Example Program Parameter Options

Parameter Description for 01_mipi_camera_yolov5s.py Example

The MIPI camera real-time detection example does not require command-line parameters; just run it directly. The program will automatically detect and use the onboard MIPI camera.

Parameter Description for 02_mipi_camera_dump.py Example

Parameter	Description	Type	Example
`-f`	Frame rate (FPS)	int	`30`
`-c`	Number of frames to capture (count)	int	`10`
`-w`	Image width	int	`1920`
`-h`	Image height	int	`1080`

Parameter Description for 03_mipi_camera_scale.py Example

Parameter	Description	Type	Example
`-i`	Input YUV file path	str	`input.yuv`
`-o`	Output file path	str	`output_scale.yuv`
`-w`	Output image width	int	`640`
`-h`	Output image height	int	`360`
`--iwidth`	Input image width	int	`1920`
`--iheight`	Input image height	int	`1080`

Parameter Description for 04_mipi_camera_crop_scale.py Example

Parameter	Description	Type	Example
`-i`	Input YUV file path	str	`input.yuv`
`-o`	Output file path	str	`output_crop_scale.yuv`
`-w`	Output image width	int	`640`
`-h`	Output image height	int	`480`
`--iwidth`	Original input image width	int	`1920`
`--iheight`	Original input image height	int	`1080`
`-x`	X coordinate of the top-left corner of the crop area	int	`304`
`-y`	Y coordinate of the top-left corner of the crop area	int	`304`
`--crop_w`	Width of the crop area	int	`896`
`--crop_h`	Height of the crop area	int	`592`

Parameter Description for 05_mipi_camera_streamer.py Example

Parameter	Description	Type	Example
`-w`	Output image width	int	`1920`
`-h`	Output image height	int	`1080`

Software Architecture Description

This section describes the software architecture and workflow of the MIPI camera real-time detection examples, explaining the complete execution process of each example program from initialization to completion, helping to understand the overall code structure and data flow.

Real-time Object Detection Example Software Architecture

software_arch

Model Loading - Load the model file using the hbm_runtime module
Camera Initialization - Initialize the MIPI camera
Display Initialization - Initialize the HDMI display
Device Binding - Bind the camera output to the display
Image Capture - Obtain video frames from the MIPI camera
Image Preprocessing - Scale the image to the model input size, format conversion
Model Inference - Perform YOLOv5x forward inference on the BPU
Result Post-processing - Decode output, filter low-confidence results, NMS deduplication, coordinate mapping
Result Visualization - Draw bounding boxes and labels on the original image
Display Output - Output results via HDMI, display FPS and detection information on the console

Image Capture and Save Example Software Architecture

software_arch

Camera Initialization - Initialize the MIPI camera
Parameter Configuration - Set capture frame rate, resolution, number of frames to capture
Image Capture - Continuously capture the specified number of image frames
File Saving - Save captured images as YUV format files

Image Scaling Example Software Architecture

software_arch

Parameter Retrieval - Parse command-line parameters to get input/output file paths and image dimensions
VPS Initialization - Create a VPS object, open a hardware scaling channel
File Reading - Read the input YUV file
Hardware Scaling - Complete image scaling via hardware VPS
Result Saving - Save the scaled image as a new YUV file
Resource Cleanup - Close VPS, release hardware resources

Image Cropping and Scaling Example Software Architecture

software_arch

Parameter Retrieval - Parse command-line parameters to get input/output file paths, image dimensions, and crop area coordinates
VPS Initialization - Create a VPS object, open a hardware cropping and scaling channel
File Reading - Read the input YUV file
Hardware Processing - Complete image cropping via hardware VPS
Result Saving - Save the processed image as a new YUV file
Resource Cleanup - Close VPS, release hardware resources

Real-time Streaming Display Example Software Architecture

software_arch

Parameter Retrieval - Parse command-line parameters to get display resolution
Display Initialization - Create a display object, initialize the HDMI display layer
Camera Initialization - Create a camera object, open the MIPI camera
Device Binding - Use hardware binding to directly connect the camera data stream to the display
Real-time Streaming - The camera feed is continuously output to the HDMI display via the hardware path
Device Unbinding - Unbind the camera from the display
Resource Cleanup - Close the display and camera, release hardware resources

API Flow Description

This section lists the main API interfaces used in the example programs, describing the functionality, input parameters, and return values of each interface, helping developers quickly understand code implementation details and interface calling methods.

Main Interfaces for Real-time Object Detection:

API_Flow

srcampy.Camera()

Create a MIPI camera object
get_display_res()

Get the HDMI display resolution, returns: width, height
cam.open_cam(pipe_id, video_index, fps, width, height, raw_height, raw_width)

Open the camera, inputs: pipeline channel number corresponding to the camera, host number corresponding to the camera, frame rate, output width, output height, raw width, raw height
srcampy.Display()

Create an HDMI display object
disp.display(layer, width, height)

Initialize the display layer, inputs: display layer number, width, height
srcampy.bind(camera, display)

Bind the camera and display, inputs: camera object, display object
hbm_runtime.HB_HBMRuntime(model_path)

Load the model, input: model file path
model.set_scheduling_params(priority, bpu_cores)

Set model scheduling parameters, inputs: priority (0-255), list of BPU cores

load_class_names(class_file)

Load class names, input: class file path, returns: list of class names
cam.get_img(chn, width, height)

Get a camera image frame, inputs: channel number (default is 2), width, height, returns: NV12 format image data
split_nv12_bytes(img, width, height)

Split the Y and UV components of an NV12 image, inputs: NV12 image data, width, height, returns: Y component, UV component
resize_nv12_yuv(y, uv, target_w, target_h)

Scale an NV12 image, inputs: Y component, UV component, target width, target height, returns: scaled Y and UV components
model.run(input_tensor)

Perform model inference, input: preprocessed input tensor dictionary, returns: model output dictionary
dequantize_outputs(outputs, output_quants)

Dequantize the results, inputs: model output dictionary, output quantization parameters, returns: float32 type data
decode_outputs(output_names, fp32_outputs, strides, anchors, classes_num)

Decode YOLO model outputs, inputs: list of output names, dequantized outputs, strides, anchors, number of classes, returns: predictions
filter_predictions(predictions, score_threshold)

Filter predictions, inputs: predictions, confidence threshold, returns: bounding boxes, confidences, classes
NMS(boxes, scores, classes, iou_threshold)

Perform non-maximum suppression, inputs: bounding boxes, confidences, classes, IoU threshold, returns: kept indices
scale_coords_back(boxes, orig_w, orig_h, model_w, model_h, resize_type)

Scale bounding boxes back to original image dimensions, inputs: bounding boxes, original width/height, model input width/height, resize type, returns: scaled bounding boxes
draw_detections_on_disp(display, boxes, cls_ids, scores, class_names, color_map, chn)

Draw detection results on the display layer, inputs: display object, bounding boxes, class IDs, confidences, class list, color map, channel number
srcampy.unbind(camera, display)

Unbind the camera and display
cam.close_cam()

Close the camera
disp.close()

Close the display

Main Interfaces for Image Capture and Save:

API_Flow

libsrcampy.Camera()

Create a MIPI camera object
cam.open_cam(pipe_id, video_index, fps, width, height,)

Open the camera, inputs: pipeline channel number corresponding to the camera, host number corresponding to the camera, frame rate, width, height
cam.get_img(chn)

Get an image, input: module for image acquisition, returns: YUV format image data
file.write(data)

Write file data, input: image data
cam.close_cam()

Close the camera

Main Interfaces for Image Scaling:

API_Flow

libsrcampy.Camera()

Create a VPS (Video Processing System) object
vps.open_vps(grp_id, chn_id, input_w, input_h, output_w, output_h)

Open a VPS channel, inputs: group ID, channel ID, input width/height, output width/height
file.read()

Read file data, returns: image data (byte stream)
vps.set_img(img_data)

Set input image data, input: YUV image data (byte stream)
vps.get_img(chn, width, height)

Get the processed image, inputs: channel number, width, height, returns: processed YUV image data (byte stream)
file.write(data)

Write the processed image data, input: image data (byte stream)
vps.close_cam()

Close VPS

Main Interfaces for Image Cropping and Scaling:

API_Flow

libsrcampy.Camera()

Create a VPS (Video Processing System) object
vps.open_vps(grp_id, chn_id, input_w, input_h, output_w, output_h, crop_rect)

Open a VPS channel and set the crop area, inputs: group ID, channel ID, input width/height, output width/height, crop area [x, y, w, h]
file.read()

Read file data, returns: image data (byte stream)
vps.set_img(img_data)

Set input image data, input: YUV image data (byte stream)
vps.get_img(chn, width, height)

Get the processed image, inputs: channel number, width, height, returns: processed YUV image data (NV12 format, byte stream)
file.write(data)

Write the processed image data, input: image data (byte stream)
vps.close_cam()

Close VPS

Main Interfaces for Real-time Streaming Display:

API_Flow

libsrcampy.Display()

Create an HDMI display object
disp.display(layer, width, height)

Initialize the display layer, inputs: display layer number, width, height
libsrcampy.Camera()

Create a MIPI camera object
cam.open_cam(pipe_id, video_index, fps, width, height,)

Open the camera, inputs: pipeline channel number corresponding to the camera, host number corresponding to the camera, frame rate, width, height
libsrcampy.bind(camera, display)

Bind the camera and display, inputs: camera object, display object, returns: binding result
libsrcampy.unbind(camera, display)

Unbind the camera and display
disp.close()

Close the display
cam.close_cam()

Close the camera

FAQ

Q: What should I do if the example prompts camera initialization failure?
A: Please check if the MIPI camera is properly connected and ensure the camera driver is loaded correctly. Try restarting the device.

Q: What should I do if the HDMI display is abnormal or has no output?
A: Please check the HDMI connection and ensure the display service has been stopped (e.g., using systemctl stop lightdm).

Q: How can I adjust the detection threshold?
A: Modify the value of --score-thres in the code; for example, changing it to 0.5 can increase detection sensitivity.

Q: How can I change the display resolution?
A: Modify the sensor_width and sensor_height variables in the code, but note whether the display device supports that resolution.

Q: What should I do if the frame rate is very low when running the example?
A: Try using a lighter model or adjust the camera's capture resolution.

Q: How can I save the detection result images?
A: You can add image saving logic to the code, such as using cv2.imwrite() to save the processed image.

Example Introduction​

Effect Demonstration​

01 Real-time Detection Effect​

02 Image Capture and Save Effect​

03 Image Scaling Effect​

04 Image Cropping and Scaling Effect​

05 Real-time Streaming Effect​

Hardware Preparation​

Hardware Connection​

Quick Start​

Code and Board Location​

Compilation and Execution​

Detailed Introduction​

Example Program Parameter Options​

Parameter Description for 01_mipi_camera_yolov5s.py Example​

Parameter Description for 02_mipi_camera_dump.py Example​

Parameter Description for 03_mipi_camera_scale.py Example​

Parameter Description for 04_mipi_camera_crop_scale.py Example​

Parameter Description for 05_mipi_camera_streamer.py Example​

Software Architecture Description​

Real-time Object Detection Example Software Architecture​

Image Capture and Save Example Software Architecture​

Image Scaling Example Software Architecture​

Image Cropping and Scaling Example Software Architecture​

Real-time Streaming Display Example Software Architecture​

API Flow Description​

Main Interfaces for Real-time Object Detection:​

Main Interfaces for Image Capture and Save:​

Main Interfaces for Image Scaling:​

Main Interfaces for Image Cropping and Scaling:​

Main Interfaces for Real-time Streaming Display:​

FAQ​

Example Introduction

Effect Demonstration

01 Real-time Detection Effect

02 Image Capture and Save Effect

03 Image Scaling Effect

04 Image Cropping and Scaling Effect

05 Real-time Streaming Effect

Hardware Preparation

Hardware Connection

Quick Start

Code and Board Location

Compilation and Execution

Detailed Introduction

Example Program Parameter Options

Parameter Description for 01_mipi_camera_yolov5s.py Example

Parameter Description for 02_mipi_camera_dump.py Example

Parameter Description for 03_mipi_camera_scale.py Example

Parameter Description for 04_mipi_camera_crop_scale.py Example

Parameter Description for 05_mipi_camera_streamer.py Example

Software Architecture Description

Real-time Object Detection Example Software Architecture

Image Capture and Save Example Software Architecture

Image Scaling Example Software Architecture

Image Cropping and Scaling Example Software Architecture

Real-time Streaming Display Example Software Architecture

API Flow Description

Main Interfaces for Real-time Object Detection:

Main Interfaces for Image Capture and Save:

Main Interfaces for Image Scaling:

Main Interfaces for Image Cropping and Scaling:

Main Interfaces for Real-time Streaming Display:

FAQ