xlang-ai/FineVLA
Scalable annotation pipeline for action-aglined fine-grained instruciton for Visual-language-Action model
GitHub repository with 19 stars and 0 forks.
Language: Python
Topics: benchmark, caption, caption-generation, fine-grained, roboitcs, vision-language-action-model, vla, vlm, steerable