ARTEMIS2 & ARTEMIS2-Mini: From Full-Scale to Distilled Real-Time Animal Behavior Recognition in a Robotic Dog
Fazzari, Edoardo ; Romano, Donato ; Falchi, Fabrizio ; Stefanini, Cesare
Fazzari, Edoardo
Romano, Donato
Falchi, Fabrizio
Stefanini, Cesare
Supervisor
Department
Robotics
Embargo End Date
Type
Conference proceeding
Date
License
Language
Collections
Research Projects
Organizational Units
Journal Issue
Abstract
Recent progress in pre-trained vision-language models has greatly advanced multimodal learning by enabling robust and generalizable feature extraction. However, animal action recognition remains a challenging task due to high intraspecies variability, subtle motion cues, and the need for fine-grained spatio-temporal reasoning. In this work, we present ARTEMIS2, an improved version of the ARTEMIS framework, featuring enhanced frame selection, more powerful visual and textual encoders, and a refined spatio-temporal captioning module for stronger alignment between visual content and textual descriptions. Evaluated on the Animal Kingdom benchmark, ARTEMIS2 achieves 82.4 mAP, setting a new state-of-the-art. To support real-time deployment on robotic platforms, we propose ARTEMIS2-Mini, a distilled unimodal variant based on the TimeSformer architecture. Despite relying solely on video input, it achieves 77.91 mAP and enables real-time inference onboard a Unitree Go2 quadruped robot. Field experiments demonstrate its effectiveness in recognizing feline behaviors across indoor and outdoor environments, showcasing its potential for embodied AI in animal monitoring and interaction tasks. The code of ARTEMIS2 is available at https://github.com/edofazza/artemis2.
Citation
E. Fazzari, D. Romano, F. Falchi, C. Stefanini, "ARTEMIS2 & ARTEMIS2-Mini: From Full-Scale to Distilled Real-Time Animal Behavior Recognition in a Robotic Dog," 2026, pp. 318-325.
Source
2025 5th International Conference on Robotics, Automation, and Artificial Intelligence (RAAI)
Conference
International Conference on Robotics, Automation, and Artificial Intelligence (RAAI)
Keywords
46 Information and Computing Sciences, 4602 Artificial Intelligence, 4608 Human-Centred Computing, 4611 Machine Learning
Subjects
Source
International Conference on Robotics, Automation, and Artificial Intelligence (RAAI)
Publisher
IEEE
