5.1.6 Version Release Notes
tros-humble
Version: 2.4.6 (2026-02-02)
Feature Changes (RDK S100 platform):
- Adapted to system version
V4.0.5andOEversionV3.7.0.
Version: 2.4.5 (2025-10-28)
Bug Fixes (RDK X5 platform):
- Fixed a bug in the image processing acceleration module
hobot_cvwhere VSE-accelerated image resizing failed. - Fixed incorrect inference latency statistics in the on-device model inference framework
dnn_node. - Fixed runtime loading failures caused by incorrect installation paths of
ros component sofiles for MIPI image capture and algorithm modules. - Optimized image codec configurations: removed invalid parameters and added debug-specific configuration parameters.
Version: 2.4.4 (2025-10-24)
New Features (RDK S100 platform):
- The on-device algorithm inference framework
dnn_nodenow supportsROIinference mode, where the number of output tensors equalsoutput_size × roi_size. - Added a new algorithm example: Human Detection and Tracking (Ultralytics YOLO Pose).
- Added a new algorithm example: Human Instance Tracking, which extracts human features using a
reidmodel and stores, manages, and queries these features via anSQLitedatabase. The launch file disables theROI-based human tracking mode in themono2d_body_detectionnode and restricts inputROIdimensions to less than 3.5× the actual model input size. - Added a new algorithm example: Hand Keypoint and Gesture Recognition (MediaPipe). This implements basic palm detection, pre/post-processing, and
ai msgpublishing; supports MIPI cameras, USB cameras, and local image replay; and supports both zero-copy and non-zero-copy image acquisition methods. - Added a new algorithm example: DeepSeek Large Language Model, supporting human-robot dialogue. The
RDK S100andRDK S100Pplatforms now support theDeepSeek_R1_Distill_Qwen_1.5BandDeepSeek_R1_Distill_Qwen_7Bmodels.
Version: 2.4.3 (2025-09-15)
New Features (RDK X5 platform):
- MIPI Image Capture now supports multi-channel image acquisition.
- Stereo MIPI Image Capture now supports the
sc132gsstereo camera.
Version: 2.4.2 (2025-08-29)
New Features (RDK S100 platform):
- MIPI Image Capture now supports the
230aistereo module. - YOLO Object Detection now supports
yolo11andyolov12algorithms; added scripts to launch viacomponentmode, supporting 4K image inference frommipi cam. - The EdgeSAM Everything Segmentation algorithm now uses
edgesamfor model inference, supportsnv12format input, and adds padding to segmentation results for joint visualization in stereo depth examples. - Added a new algorithm: Text-to-Image Feature Retrieval for extracting and retrieving text-image features.
- Added a new algorithm: DOSOD, including Diguas self-developed open-vocabulary detection DOSOD edge deployment package and a reparameterized quantization method for customizing detection categories in DOSOD models.
- Added a new algorithm: Stereo OCC, integrating Diguas stereo
OCCnetwork. - The Vision-Language Model algorithm now supports the
smolvlm2model, with image replay and subscription modes; added capability to publish complete topics afterllmmodel inference completes. - Added a new algorithm: Stereo Depth Estimation.
Version: 2.4.1 (2025-07-30)
New Features (RDK X5 platform):
- MIPI Image Capture now supports the
imx415module. - The EdgeSAM Everything Segmentation algorithm now uses
edgesamfor model inference, supportsnv12format input, and adds padding to segmentation results for joint visualization in stereo depth examples. - Added a new algorithm: Human Instance Tracking, which extracts human features using a
reidmodel and stores, manages, and queries these features via anSQLitedatabase. - Added a new algorithm: Stereo OCC, integrating Diguas stereo
OCCnetwork. - The Vision-Language Model now supports the
smolvlm2model, with image replay and subscription modes; added capability to publish complete topics afterllmmodel inference completes.
Version: 2.4.0 (2025-05-12)
New Features:
- Added support for the
RDK S100platform.
Version: 2.3.3 (2025-04-30)
New Features:
- Added support for the
RDK X5 Moduleplatform. - Added an open-source ASR solution based on
sensevoice_cpp, supporting keyword and ASR data streaming. - Optimized post-processing time for the Stereo Depth Estimation algorithm and added the V2.3 model version.
- Added a new edge-side Vision-Language Model algorithm example based on
llama.cpp.
Version: 2.3.2 (2025-01-15)
Feature Changes:
- Updated the stereo model in Stereo Depth Estimation to improve depth estimation accuracy.
- Optimized the workflow and web-based visualization effects in the Multi-Stream Video Analysis algorithm example.
- Removed some non-functional launch files from the Stereo Utility Package.
New Features:
- Added ZED Camera Image Capture to acquire stereo images as input for stereo depth estimation algorithms.
- Added the DOSOD algorithm, including Diguas self-developed open-vocabulary detection DOSOD edge deployment package.
Bug Fixes:
- Fixed a crash in the post-processing of the yolov8-seg Image Segmentation algorithm caused by bounding boxes exceeding image boundaries.
- Fixed incorrect frame rate statistics in Image Codec.
- Fixed I2C detection issues in Stereo MIPI Image Capture and added LPWM switch configuration.
Version: 2.3.1 (2024-11-20)
Feature Changes:
- Upgraded the dependent
opencvversion from 3.4.5 to 4.5.4 (the latest release version used in Ubuntu 22.04).
New Features:
- The Image Publisher Tool now supports publishing
bgr/rgbformat message data and configuring theframe_idof published messages. - The Human Detection and Tracking Algorithm now supports configurable subscription topics,
componentmode execution, input image scaling during preprocessing, and launch scripts that support compressed image replay with configurable image paths. - The On-Device Algorithm Model Inference and Deployment Framework fixed incorrect inference time calculation in multi-threaded inference and added support for configuring task counts in config files.
- The Image Codec Node now uses the
frame_idfrom subscribed image messages as the output messageframe_idand supports frame-dropping control. - The Gesture Recognition Algorithm now supports configurable post-processing thresholds at startup and dynamic gesture recognition.
- Added a new algorithm: Face Age Detection for estimating human age.
- Added a new algorithm: 106-Face Landmark Detection for detecting 106 facial landmarks.
- Added a new Perception Message Fusion Node to subscribe to multiple topics of type PerceptionTargets, perform time alignment and deduplication, and publish a fused topic. See Multi-Algorithm Inference for usage reference.
- Added a new Perception Message Filtering Node using the OneEuroFilter strategy to smooth points and bounding boxes, correcting jitter in detection results for humans, faces, hands, etc. See Multi-Algorithm Inference for usage reference.
- Added a new Stereo Utility Package for capturing stereo and depth images.
- Added a new Multi-Stream Video Analysis algorithm example that pulls multiple H.264/H.265 streams via RTSP, performs inference, and visualizes perception results on the web.
Bug Fixes:
- Fixed startup failure of the
imx219module in MIPI Image Capture. - Added hand bounding box expansion in the preprocessing of the Hand Keypoint Detection Algorithm to resolve incorrect keypoint outputs.
Version: 2.3.0 (2024-09-19)
New Features:
- Added support for the
RDK X5platform. - Added Stereo MIPI Image Capture to data acquisition.
- Added reference algorithms to the algorithm repository:
yolov8andyolov10for Object Detection, andyolov8-segfor Image Segmentation. - Added the YOLO-World Algorithm for open-vocabulary object detection.
- Added the Optical Flow Estimation Algorithm for optical flow detection.
- Added the Everything Segmentation Algorithm for universal segmentation.
- Added the Text-to-Image Feature Retrieval Algorithm for text-image feature extraction and retrieval.
- Added the Stereo Depth Estimation Algorithm for vision-based depth estimation.
Version: 2.2.0 (2024-04-11)
Feature Changes:
- Based on TROS Foxy 2.1.3, adapted to Ubuntu 22.04 and ROS2 Humble.
- Changed the TROS installation path from
/opt/trosto/opt/tros/humbleto align with ROS2’s directory structure and naming convention. - Discontinued the
tros-ros-baseinstallation package (which included core ROS2 packages like rclcpp, rclpy, ros2cli); standard ROS2 distribution packages are now used, and ROS2 Humble dependencies are automatically installed when installing TROS Humble. - Enabled ROS2 FastDDS zero-copy communication for modules handling image data, including data acquisition, image codec, and algorithm examples.
- Changed the QoS Reliability policy for zero-copy communication from
RMW_QOS_POLICY_RELIABILITY_RELIABLE(rclcpp::QoS()) toRMW_QOS_POLICY_RELIABILITY_BEST_EFFORT(rclcpp::SensorDataQoS()) to mitigate potential stability risks with zero-copy. - Refactored
hobot_dnnto use the lower-level on-device inference frameworklibdnn, replacingeasydnn. - Upgraded the speech algorithm SDK in
hobot_audioto use the lower-level on-device inference frameworklibdnn, replacingeasydnn. - Adapted
hobot_triggerto ROS2 Humble’s rosbag2.
New Features:
- Added Bloom compilation and packaging scripts in
robot_dev_configfor compiling and packaging TROS on ARM platforms. - Added a
frame_ts_typeconfiguration option to thehobot_mipi_camnode, supportingrealtime(for communication latency measurement) andsensor(default, for sensor timestamp synchronization). - Added a new
hobot_shmnode for configuring the ROS2 zero-copy environment.
Bug Fixes:
- Fixed compatibility issues introduced by compiler upgrades.
- Fixed path dependency issues in on-device compilation of certain ROS2 packages.
tros-foxy
Version: 2.1.3 (2024-03-11)
Feature Changes:
- Changed the data type for JPEG-compressed images from
sensor_msgs::msg::Imageto the standardsensor_msgs::msg::CompressedImage, enabling tools like Foxglove and ROS2 rqt to view JPEG images published by TROS. Affects modules: hobot_websocket, hobot_codec, hobot_image_publisher, hobot_usb_cam. - Unified JPEG/MJPEG configuration options for publishing/subscribing JPEG-compressed images; removed
jpeg-compressed/mjpeg-compressedoptions. Affects modules: hobot_codec and hobot_usb_cam. - Introduced the environment variable
TROS_DISTROto indicate the TROS distribution. After runningsource /opt/tros/setup.bashorsource /opt/tros/local_setup.bash,TROS_DISTROis empty. Configuration file paths for modules like hobot_codec, hobot_audio, hobot_mipi_cam, and hobot_usb_cam changed from/opt/tros/libto/opt/tros/${TROS_DISTRO}/lib.
Version: 2.1.2 (2024-01-19)
New Features:
- Refactored
hobot_usb_camto support more format configurations and transcoding. - Updated the voice SDK in
hobot_audioto support both 2-mic and 4-mic microphone boards; addedmicphone_nameconfiguration for device ID.
Bug Fixes:
- Fixed incorrect
stepfield settings in data messages sent by thehobot_rgbd_camnode. - Updated audio playback function calls in
hobot_ttsto resolve playback failures on newer system versions. - Removed the config device tree file from
hobot_llmand updated the README; ION memory size can now be set via command-line tools on newer system versions.
Version: 2.1.1 (2023-11-03)
New Features:
- Added the
hobot_chatbotnode, integrating smart voice, large language models, and text-to-speech modules to enable on-device voice chat functionality.
Bug Fixes:
- Fixed an issue in the text-to-speech
hobot_ttsnode where certain characters caused the application to exit unexpectedly.
Version: 2.1.0 (2023-09-14)
Feature Changes:
- Updated
tros-ros-baseto the latest ROS2 Foxy source code, ensuring compatibility with the newest ROS2 Foxy packages. - When using ROS2 Foxy packages, simply run
source /opt/tros/setup.bash; creating symbolic links via scripts is no longer required.
New Features:
- The text-to-speech node
hobot_ttsnow supports a new parameter to specify the audio playback device. - Added a large language model (LLM) node
hobot_llm, enabling on-device LLM experiences. - The image codec node
hobot_codecnow includes a newin_formatconfiguration option:jpeg-compressed. The subscribed topic data type is automatically selected based on this configuration.
Bug Fixes:
- Fixed an incorrect
stepfield setting in RGB format data messages sent by the MIPI camera nodehobot_mipi_cam.
Version: 2.0.2 (2023-08-28)
Feature Changes:
- During tros.b installation, the configured ROS2 apt source (
/etc/apt/sources.list.d/ros2.list) has been switched to Tsinghua University’s mirror, resolving slow download speeds and installation failures for ROS2 packages.
New Features:
- Added permission checks when sourcing tros.b environment setup scripts (
source /opt/tros/setup.bashandsource /opt/tros/local_setup.bash). If the current user lacks root privileges, the system will automatically prompt and switch to a root account, preventing tros.b usage failures due to insufficient permissions. - The intelligent audio algorithm node
hobot_audionow supports configuring an audio device ID parameter, facilitating secondary development. - The event-trigger node
hobot_triggernow supports task assignment to the Trigger module via standardstd_msgstopics, standardizing Trigger configuration methods.
Bug Fixes:
- Fixed an issue where the accelerated image processing node
hobot_cvfailed during simultaneous crop & resize operations. - Resolved erroneous error logs output by the MIPI camera node
hobot_mipi_camupon startup. - Fixed invalid launch file configurations for the data visualization message conversion node
hobot_visualization.
Version: 2.0-Release (2.0.1) (2023-06-10)
Feature Changes:
- Upgraded voice algorithms to improve ASR (Automatic Speech Recognition) performance.
- Optimized the
model_nameconfiguration in algorithm examples:model_nameis now automatically parsed from the model file, preventing model loading failures caused by incorrect manual configuration and improving ease of secondary development. - Removed the Nav2 package from the tros.b installer. Users should now directly install the latest ROS2 Nav2 package on RDK using
apt, resolving stability issues present in older Nav2 versions.
New Features:
- Added support for the
RDK Ultraplatform. - Introduced new nodes including
hobot_triggerandhobot_visualizationfor triggering events, capturing, and visualizing rosbag data—helping users diagnose, reproduce, and visualize perception and planning/control issues in robotic scenarios. Users can also extend these nodes for custom data-triggering, recording, and real-time transmission features. - The USB camera capture node now auto-detects USB camera device IDs, lowering the barrier to using USB cameras.
- Added a Visual Inertial Odometry (VIO) algorithm node, enabling low-cost, robust, high-precision robot localization based on visual inputs.
- Added a text-to-speech node
hobot_ttsto convert text into spoken audio. - Added a LiDAR-based object detection algorithm node
hobot_centerpoint. - Added a Bird’s-Eye View (BEV) perception algorithm node
hobot_bev. - Added a stereo depth estimation algorithm node
hobot_stereonet.
Bug Fixes:
- Upgraded
easydnn(v1.6.1) anddnn(v1.18.4) onRDK X3, fixing operator crashes and adding support for more operators. - Fixed incorrect depth data published by the RGBD image capture node.
Other Updates:
- Optimized the human detection and tracking algorithm node to adaptively scale output coordinates based on input image resolution.
- Fixed a compilation failure in the ORB-SLAM3 algorithm caused by an incorrect script path.
Version: 2.0-Beta (2.0.0) (2023-05-29)
Version 2.0-Beta (2.0.0) is the first release in the 2.x series of tros.b. Users of tros.b 1.x are recommended to upgrade to the 2.x series.
Feature Changes:
- Migrated the code hosting platform from GitLab to GitHub to facilitate broader community participation in secondary development.
- Integrated a more efficient package management mechanism to accelerate version upgrades and simplify robot application deployment.
New Features:
- Added support for the new core board development kit: RDK X3 Module.
- Enhanced
hobot_audioto output ASR recognition results, simplifying voice application development.
Bug Fixes:
- Fixed a crash in the post-processing stage of the built-in MobileNet_SSD model within
dnn_nodeunder multi-threaded conditions. - Resolved model inference failures in
dnn_nodeon x86 platforms when using DDR as input. - Fixed compilation failures of
hobot_codecandhobot_image_publisheron x86 platforms.
Other Updates:
- Updated example launch scripts to reference dependent modules’ launch files and configure parameters accordingly.
- Updated the D-Robotics logo displayed on the WebSocket frontend.