
AIC 2025 (ICCVW Workshop): Warehouse Spatial Intelligence
SmolRGPT achieved 3rd place in the AI City Challenge 2025 Track 3 with a final S1 score of 90.68, demonstrating that a 600M parameter model can compete with much larger architectures. It excelled in spatial relationship understanding, reaching 99.8% accuracy for left-right directional tasks, and achieved strong results in counting (92.76% accuracy, RMSE 0.0750) thanks to effective depth integration. For multiple-choice spatial reasoning, smolRGPT scored 88.02% accuracy, showing robust scene comprehension. The most challenging aspect was distance estimation (82.13% accuracy, RMSE 0.4740), but this still surpassed expectations for a model of this size, highlighting the impact of using both RGB and depth refiners.
| Rank | Team Name | Score |
| 1 | UWIPL_ETRI | 96.0789 |
| 2 | HCMUT.VNU | 91.9735 |
| 3 | Embia (smolRGPT) | 90.6772 |
| 4 | MIZSU | 73.0606 |
| 5 | HCMUS_HTH | 66.8861 |
| 6 | MealsRetrieval | 56.6352 |
| 7 | BKU22 | 50.3662 |
| 8 | Smart Lab | 31.9245 |
| 9 | AICV | 28.2993 |