facebookresearch/VLM3
Official implementation of the paper "VLM³: Vision Language Models Are Native 3D Learners".
GitHub repository with 207 stars and 8 forks.
Language: Jupyter Notebook
Topics: 3d-foundation-model, camera-pose-estimation, depth-estimation, image-matching, large-language-models, object-level-3d, vlms