
Hiwonder TonyPi Pro AI Humanoid Robot with Raspberry Pi 5 – Integrated Multimodal AI Model (ChatGPT), AI Vision Tracking, Voice Interaction, and Hand-Eye Coordination (Raspberry Pi 5 16GB Kit)
Description TonyPi Pro uses Raspberry Pi 5, OpenCV, and inverse kinematics for AI education, offering open-source support and a flexible development environment. With ChatGPT at its core and AI vision support, TonyPi Pro delivers intelligent perception, reasoning, and action for smooth human-machine interaction. Equipped with 18 high-voltage servos for precise motion, fast response, and stable humanoid movements, enabling accurate multi-joint coordination. Features upgraded robotic hands and AI vision for flexible object handling, plus dynamic gait for tasks like climbing stairs or crossing hurdles. Includes tutorials on motion control, OpenCV, AI models, voice interaction, and sensors to help users build and program their own AI robot. Product Description TonyPi Pro is an AI-powered humanoid robot built on the Raspberry Pi 5. It features high-voltage intelligent serial bus servos and an HD camera. With support for Python programming, TonyPi Pro can carry out tasks such as color recognition, target tracking, ball shooting, line following, somatosensory control, and a variety of creative AI-interactive games. TonyPi Pro also leverages a Multimodal AI Large Model to support more advanced embodied AI applications. To help you unlock its full potential, we offer comprehensive tutorials designed to inspire and support your AI-driven creative projects. 1) 20DOF AI Humanoid Robot TonyPi features 18 degrees of freedom (DOF), with an additional 2 DOF provided by open-close hands. Its 2-DOF head design allows for precise motion control and flexible visual exploration, supporting more expressive and responsive interactions. 2) Hand-Eye Coordination Smart Grasping and Sorting TonyPi Pro is powered by the Raspberry Pi 5 and equipped with high-performance hardware, supporting open-close robotic hand expansion. This enables intelligent grasping, sorting, and a wide range of interactive AI functions. 1. AI Vision, Unlimited Creativity TonyPi Pro is equipped with a HD wide-angle camera on its head, enabling real-time image acquisition and processing using the OpenCV vision library. It can detect and extract parameters such as the color and position of target objects within its field of view. The system supports a range of vision-based functions, including video streaming, color recognition, tag identification, and visual line following. By applying a PID control algorithm, TonyPi achieves real-time target locking, enabling advanced AI applications such as target tracking and autonomous ball kicking. 1) Object Tracking Powered by the OpenCV vision library, TonyPi Pro can detect and locate objects of a specific color in real-time. Using PID control, its head can actively track moving targets with precision. 2) Tag Recognition Using OpenCV algorithms, TonyPi Pro can recognize and interpret tag codes within its field of view. It can also calculate each tag’s position and orientation, allowing users to program customized interactive movements. 3) Face Detection TonyPi Pro features a built-in MediaPipe deep learning algorithm that works with a high-definition camera to accurately detect and lock onto human face. Users can program TonyPi to perform responsive actions based on facial detection. 4) Visual Line Following With AI vision and PID motion control, TonyPi Pro can identify colored lines in its view and autonomously adjust its gait to follow the path smoothly and stably. 2. Upgraded Hand-Eye Coordination & Dynamic Gait TonyPi Pro supports expandable open-close robotic hands, allowing it to flexibly grasp and transport small objects. Integrated with AI vision, it can autonomously assess the distance to target objects and intelligently adjust its speed. This results in smoother, more natural movement—enabling advanced AI functions such as autonomous hurdle-crossing and stair climbing for innovative applications. 1) Open-Close Robotic Hand Thanks to its innovative design, the robotic hand can open up to 66mm. When integrated with the vision system, it enables seamless hand-eye coordination, allowing the robot to accurately and stably grasp target objects based on various user commands. 2) Upgraded Gait Unlimited AI Creativity Using the OpenCV vision library, TonyPi Pro can seamlessly detect hurdles and stairs in real-time during its line-following process. It captures essential details like coordinates and geometric contours, enabling the robot to autonomously navigate over obstacles and climb stairs with ease. 3. Multimodal Models Deployment TonyPi Pro integrates a Multimodal Large AI Model and supports online deployment via OpenAI's API, enabling real-time access to advanced AI capabilities. It also allows seamless switching to alternative models, such as those available through OpenRouter, to support Vision Language Model applications. At its core, TonyPi Pro is designed as an all-in-one interaction hub built around ChatGPT, enabling sophisticated embodied AI use cases and creating a smooth, intuitive human-machine interaction experience! 1) Large Language Model With the integration of the ChatGPT Large Model, TonyPi Pro operates like a"super brain"-capable of comprehending diverse user commands and responding intelligently and contextually. 2) Large Speech Model With the integration of the Al voice interaction box, TonyPi Pro is equipped with speech input and output capabili- ties-functionally giving it'ears' and a'mouth.' Utilizing advanced end-to-end speech-language models and naturall anguage processing (NLP) technologies, TonyPi Pro can perform real-time speech recognition and generate natural, human-like responses, enabling seamless and intuitive voice-based human-machine interaction. 3) Vision Language Model TonyPi Pro integrates with OpenRouter's Vision Large Model, enabling advanced image understanding and analysis. It can accurately identify and locate objects within complex visual scenes, while also delivering detailed descriptions that cover object names, characteristics, and other relevant attributes. 4. Large Model Embodied AI Applications TonyPi Pro is equipped with a high-performance AI voice interaction module. Unlike conventional AI systems that operate on unidirectional command-response mechanisms, TonyPi Pro leverages ChatGPT to enable a cognitive transition from semantic understanding to physical execution, significantly enhancing the fluidity and naturalness of human-machine interaction. Combined with machine vision, TonyPi Pro exhibits advanced capabilities in perception, reasoning, and autonomous action—paving the way for more sophisticated embodied AI applications. 1) Voice Control Powered by ChatGPT, TonyPi Pro is capable of semantic understanding and executing corresponding actions, enabling smooth and natural voice control. 2) Scene Understanding Leveraging OpenAI's ChatGPT model, TonyPi Pro is capable of understanding user commands and performing semantic analysis of visual scenes within its field of view. It can interpret image content and features, delivering contextual feedback via both text and speech. 3) Ball Tracking and Shooting With semantic understanding powered by a large language model, TonyPi Pro can lock onto a target based on commands, adjust its posture in real time, and precisely execute ball tracking and kicking actions. 4) Autonomous Patrolling Utilizing semantic understanding from a large language model, TonyPi Pro can accurately detect and track lines of various colors in real time while autonomously navigating obstacles, ensuring smooth and efficient patrolling. 5) Object Transport Powered by the OpenRouter vision language model, TonyPi Pro can identify target objects within its view, assess their relative positions, and transport them to a designated location based on user commands. 6) Post Detection TonyPi Pro continuously reads IMU sensor data, which is analyzed by a large model to determine its current posture. Based on commands, it can adjust its stance, allowing the robot to stand up or lie down as needed. 7) Smart Home Assistant Leveraging the multimodal model deployed on its body, TonyPi Pro is capable of recognizing and analyzing objects within its field of view. Combined with ChatGPT, it can understand user commands and execute corresponding actions and responses. 8) Temperature Reporting Equipped with a temperature and humidity sensor, TonyPi Pro can continuously monitor environmental conditions, gather real-time data, and use semantic understanding powered by a large language model to report the current temperature and humidity. 9) Upgraded MediaPipe Human-Robot Interaction TonyPi Pro continuously captures human body features within its visual field and processes them using MediaPipe detection models.Based on real-time analysis, the robot executes corresponding actions, enabling advanced Al capabilities such as face recognition, gesture control, and somatosensory motion control. 10) Going Up and Down Stair TonyPi Pro leverages the OpenCV vision library to precisely detect the spatial position and geometric contours of stairs within its view. With autonomous decision-making, it can efficiently and steadily climb stairs on its own. 11) Autonomous Hurdling Using the OpenCV vision library, TonyPi Pro can detect obstacles in real time while following a path, capture their coordinates, and dynamically adjust its posture. It then autonomously makes decisions to smoothly navigate and overcome obstacles. 12) Auto Shooting Perform image processing through OpenCV to obtain the ball's position, and then use PID algorithm to track and kick it automatically. 13) Intelligent Transport TonyPi Pro can visually identify the distance of the item, and finally move the target item to the designated tag. 5. Sensor Expansion for Enhanced Functionality TonyPi Pro is equipped with a comprehensive sensor expansion pack, including a temperature and humidity sensor, ultrasonic sensor, touch sensor, and a range of additional electronic modules. This flexible architecture enables seamless integration into diverse AI applications, supports deep learning tasks, and provides a robust foundation for advanced development and creative exploration. 1) Intelligent Ranging With an ultrasonic sensor, TonyPi Pro can detect the distance to obstacles ahead with high accuracy. 2) Fan Tracking Integrated with the fan module, TonyPi Pro can execute a face-triggered smart fan application. 3) Touch Sensing By touching the metal plate on the touch sensor, TonyPi Pro can perform corresponding responsive actions. 4) Temperature and Humidity Display Using the temperature and humidity sensor, TonyPi Pro can acquire real-time environmental data and display it on the dot matrix display. 6. Support Diverse Control Methods ① APP Control WonderPi APP supports Android and iOS. Switch game modes easily and quickly to experience various AI games. ② PC Remote Control We can quickly connect the WonderPi configuration to the wireless LAN, which is more convenient for you to remotely connect and control TonyPi Pro. ③ PC Software Control With the graphical PC software, you can control the rotation of the robot servo by dragging the slider without code, and can edit the robot action group. View more What's Included 1* TonyPi 1* 12.6V 2A battery charger 3* Tag + EVA ball 1* Card Reader 1* Accessary bag 1* User Manual 1* WonderEcho Pro AI voice interaction box 1* Type*C cable 1* TonyPi hands 1* Wireless handle + handle receiver 1* Glowy ultrasonic sensor 1* Touch sensor 1* Light sensor 1* Dot matrix display 1* Temperature and humidity sensor 1* Fan module 1* Brackets 1* Map 3* Sponge cube (10*10cm) 1* Line map 1* Stair 1* Hurdle 3* EVA Block (3.5 * 3.5cm) 1* Accessories Dimensions 373*186*106mm (14.69x7.32x4.17inches) Multimedia docReady(function() {$('button[aria-controls=unique-tab-5]').one('click',function() {$("#iframe-video-1").html('');})}); Specifications Size: 373* 186* 106mm(14.69x7.32x4.17inches) Weight: About 1800g Camera pixel: 480P Pan-tilt DOF: 2DOF Battery: 11.1V 2000mAh 10C lithium battery Working hour: About 60mins Hardware: Raspberry Pi 5 and Raspberry Pi expansion board Software: APP + PC software Communication: WiFi, Ethernet Servo: LX-824HV bus serve/ LFD-01M anti-blocking servo Control method: App/ PC software/ wireless controller control Package size: 56*36*31cm Package weight: 4.5kg